Data Science

Python is one of the most widely used languages for data science and scientific computing. NumPy provides the foundational n-dimensional array type, vectorized operations, and linear algebra routines, while pandas builds on it with labeled data structures (Series and DataFrame) for tabular data analysis. Together, they underpin much of the Python data ecosystem.

This section covers essential patterns for numerical computing and data manipulation.

Topics

  • NumPy — Array creation, shape manipulation, broadcasting, universal functions, linear algebra, random sampling
  • Pandas — Series/DataFrame, CSV/Excel I/O, groupby, merge/join, pivot tables, missing data, datetime handling
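
As a quick taste of two of the topics listed above, here is a minimal sketch of NumPy broadcasting and a pandas groupby. The array values, the column names `team` and `score`, and the variable names are illustrative assumptions, not taken from the linked pages.

```python
import numpy as np
import pandas as pd

# NumPy broadcasting: center each column by subtracting its mean.
# data has shape (3, 2); data.mean(axis=0) has shape (2,), so the
# subtraction is broadcast across all three rows.
data = np.array([[1.0, 10.0],
                 [2.0, 20.0],
                 [3.0, 30.0]])
centered = data - data.mean(axis=0)

# pandas groupby: average a numeric column within each group.
# The DataFrame contents here are made up for illustration.
df = pd.DataFrame({"team": ["a", "a", "b"],
                   "score": [1.0, 3.0, 5.0]})
mean_by_team = df.groupby("team")["score"].mean()

print(centered)
print(mean_by_team)
```

Both operations avoid explicit Python loops: the array arithmetic and the per-group aggregation are carried out by the libraries themselves, which is the pattern most of the material in this section relies on.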

References