Python for Data Analysis: Difference between revisions
Jump to navigation
Jump to search
Line 14: | Line 14: | ||
The set of packages referred from this article focus on structured data, which includes tabular or spreadsheet-like data, in which each column may be a different type (relational database data, spreadsheets and CSV files), multidimensional arrays (matrices), multiple tables or related data joined by key columns, and evenly and unevenly spaced time series. | The set of packages referred from this article focus on structured data, which includes tabular or spreadsheet-like data, in which each column may be a different type (relational database data, spreadsheets and CSV files), multidimensional arrays (matrices), multiple tables or related data joined by key columns, and evenly and unevenly spaced time series. | ||
Python is uniquely positioned for use in data analysis because of many specialized data processing libraries ([[Numpy]], [[Pandas]], [[scikit-learn]]), visualization libraries ([[matplotlib]], [[plotly]]) and other tools ([[Jupyter Notebook]], [[Jupyter Lab]]). | Python is uniquely positioned for use in data analysis because of many specialized data processing libraries ([[Numpy|numpy]], [[Pandas|pandas]], [[scikit-learn]]), visualization libraries ([[matplotlib]], [[plotly]]) and other tools ([[Jupyter Notebook]], [[Jupyter Lab]]). | ||
=C= | =C= |
Revision as of 23:05, 14 May 2024
External
- Python for Data Analysis: Data Wrangling with pandas, NumPy, and Jupyter 3rd Edition by Wes McKinney
Internal
Overview
This article is loosely based on Python for Data Analysis: Data Wrangling with pandas, NumPy, and Jupyter 3rd Edition by Wes McKinney.
The set of packages referred from this article focus on structured data, which includes tabular or spreadsheet-like data, in which each column may be a different type (relational database data, spreadsheets and CSV files), multidimensional arrays (matrices), multiple tables or related data joined by key columns, and evenly and unevenly spaced time series.
Python is uniquely positioned for use in data analysis because of many specialized data processing libraries (numpy, pandas, scikit-learn), visualization libraries (matplotlib, plotly) and other tools (Jupyter Notebook, Jupyter Lab).