The pandas documentation is quite hairy and long-winded. I am listing down the main topics from the pandas documentation.
Bold indicates I have tried out this feature e2e myself. I intend to to keep track of progress, as I try out the different features. Hence this list for bookkeeping.
- 10 Minutes to Pandas:
- Essential Basic Functionality.
- Indexing and Selecting data
- Different Choices for Indexing
- Basics
- Attribute Access
- Slicing ranges
- Selection By Label
- Selection By Position
- Selecting Random Samples
- Setting With Enlargement
- Fast scalar value getting and setting
- Boolean indexing
- Indexing with isin
- The where() Method and Masking
- The query() Method (Experimental)
- Duplicate Data
- Dictionary-like get() method
- The select() Method
- The lookup() Method
- Index objects
- Set / Reset Index
- Returning a view versus a copy
- MultiIndex
- Working With Missing Data.
- Group By
- Splitting an object into groups
- Iterating through groups
- Selecting a group
- Aggregation
- Transformation
- Filtration
- Dispatching to instance methods
- Flexible apply
- Other useful features
- Automatic exclusion of “nuisance” columns
- NA and NaT group handling
- Grouping with ordered factors
- Grouping with a Grouper specification
- Taking the first rows of each group
- Taking the nth row of each group
- Enumerate group items
- Plotting
- Examples
- Regrouping by factor
- Returning a Series to propagate names
- Merge, join, concatenate
- Reshaping And Pivot Tables
- Reshaping by pivoting DataFrame objects
- Reshaping by stacking and unstacking
- Multiple Levels
- Missing Data
- With a MultiIndex
- Reshaping by Melt
- Combining with stats and GroupBy
- Pivot tables and cross-tabulations
- Cross tabulations
- Adding margins (partial aggregates)
- Tiling
- Computing indicator / dummy variables
- Factorizing values
- Time Series / Date Functionality.