Excel is not enough to handle data? The founder of pandas teaches you how to use Python for data analysis

Why use Python for data analysis?

For data analyst practitioners, it is often necessary to engage in: database operations, report writing, data visualization, and data mining. These tasks can be operated without writing codes. Using Excel for data visualization and using some people-friendly platform tools such as SPSS for data mining, although the use of platform tools is highly manipulative, there will inevitably be repetitive mechanical labor, thereby reducing Your own work efficiency, but if you can write code in Python, you have a higher degree of freedom in operation and greater potential for development.

How to use Python for data analysis?

This book "Data Analysis Using Python" is recommended here, which is suitable for beginners who are new to Python data analysis. Using the interactive shell of IPython as your primary development environment, it tells about starting to use high-performance tools from the data analysis tools of the pandas library.

Let's show it directly:

Chapter 1 Preparations

1.1 Contents of this book

1.2 Why use Python for data analysis

1.3 Important Python libraries

1.4 Installation and Setup

img

Chapter 2 Basics of Python Syntax, IPython and Jupyter Notebooks

2.1 Python interpreter

2.2 Basics of IPython

2.3 Basics of Python syntax

img

Chapter 3 Python Data Structures, Functions, and Files

3.1 Data Structures and Sequences

3.2 Functions

3.3 Files and Operating System

3.4 Conclusion

img

Chapter 4 NumPy Basics: Array and Vector Computing

4.1 NumPy's ndarray: a multidimensional array object

4.2 Universal Functions: Fast Element-wise Array Functions

4.3 Using arrays for data processing

4.4 File input and output for arrays

img

Chapter 5 Getting Started with Pandas

5.1 Introduction to data structure of pandas

5.2 Basic functions

5.3 Summarizing and computing descriptive statistics

5.4 Summary

img

Chapter 6 Data Loading, Storage, and File Formats

6.1 Reading and writing data in text format

6.2 Binary data format

6.3 Interaction with Web APIs

6.4 Database Interaction

6.5 Summary

img

Chapter 7 Data Cleaning and Preparation

7.1 Handling missing data

7.2 Data Conversion

7.3 String manipulation

7.4 Summary

img

Chapter 8 Data Wrangling: Aggregating, Merging, and Reshaping

8.1 Hierarchical Index

8.2 Merging datasets

8.3 Reshaping and axial rotation

8.4 Summary

img

Chapter 9 Plotting and Visualization

9.1 Getting started with the matplotlib API

9.2 Plotting with pandas and seaborn

9.3 Other Python visualization tools

9.4 Summary

img

Chapter 10 Data Aggregation and Grouping Operations

10.1 GroupBy mechanism

10.2 Data Aggregation

10.3 apply: general "split-apply-merge"

10.4 Pivot tables and crosstabs

10.5 Summary

img

Chapter 11 Time Series

11.1 Date and time data types and tools

11.2 Time Series Basics

11.3 Range, frequency and movement of dates

11.4 Time Zone Handling

img

Chapter 12 Advanced Applications of Pandas

12.1 Categorical data

12.2 Advanced Application of GroupBy

12.3 Chain programming technology

12.4 Summary

img

Chapter 13 Introduction to the Python Modeling Library

13.1 Interface between pandas and model code

13.2 Creating Model Descriptions with Patsy

13.3 Introduction to statsmodels

13.4 Introduction to scikit-learn

13.5 Continue learning

img

Chapter 14 Data Analysis Cases

14.1 USA.gov data from Bitly

14.2 MovieLens 1M Dataset

14.3 National Baby Names 1880-2010

14.4 USDA Food Database

14.5 2012 Federal Election Commission Database

14.6 Summary

img

Space is limited, here is just a display

Friends, if you need this complete information, you can scan the QR code of CSDN official certification below on WeChat to get it for free [guaranteed 100% free]

Guess you like

Origin blog.csdn.net/WANGJUNAIJIAO/article/details/130760585