Python for Data Analysis: Difference between revisions

From NovaOrdis Knowledge Base
Jump to navigation Jump to search
Line 20: Line 20:
visualization libraries:
visualization libraries:
* [[matplotlib]]
* [[matplotlib]]
* [[plotly]])
* [[plotly]]
and other tools:
and other tools:
* [[Jupyter Notebook]]
* [[Jupyter Notebook]]
* [[Jupyter Lab]])
* [[Jupyter Lab]])
and Python's overall strength for general-purpose software engineering.
and Python's overall strength for general-purpose software engineering.

Revision as of 23:17, 14 May 2024

External

Internal

Overview

This article is loosely based on Python for Data Analysis: Data Wrangling with pandas, NumPy, and Jupyter 3rd Edition by Wes McKinney.

The set of packages referred from this article focus on structured data, which includes tabular or spreadsheet-like data, in which each column may be a different type (relational database data, spreadsheets and CSV files), multidimensional arrays (matrices), multiple tables or related data joined by key columns, and evenly and unevenly spaced time series.

Python is uniquely positioned for use in data analysis because of the availability of many specialized data processing libraries:

visualization libraries:

and other tools:

and Python's overall strength for general-purpose software engineering.