15. General Functions

15. General Functions

Created
Sep 1, 2025 10:14 PM
Multi-select
Status
Not started
image

1. Reshaping & Pivoting Data

  • pd.melt() - Unpivot data from wide to long format
  • df.pivot() - Reshape data (index to columns)
  • pd.pivot_table() - Create a pivot table with aggregation
  • pd.crosstab() - Compute a cross-tabulation (frequency table)
  • pd.wide_to_long() - Convert wide-format data to long-format

2. Combining & Merging Data

  • pd.merge() - Merge DataFrames (SQL-style join)
  • pd.merge_ordered() - Merge with ordered data (like SQL outer join)
  • pd.merge_asof() - Merge based on nearest key (as of join)
  • pd.concat() - Concatenate DataFrames along an axis

3. Handling Categorical & Dummy Data

  • pd.get_dummies() - Convert categorical variables into dummy/indicator variables
  • pd.from_dummies() - Convert dummy variables back to categorical
  • df['col'].factorize() - Encode categorical data as numerical labels
  • df['col'].unique() - Return unique values
  • pd.cut(df['col'], bins) - Bin values into discrete intervals
  • pd.qcut(df['col'], q) - Bin values into quantile-based intervals

4. Missing Data Handling

  • df['col'].isna() / df['col'].isnull() - Detect missing values
  • df['col'].notna() / df['col'].notnull() - Detect non-missing values

5. Type Conversion & Parsing

  • pd.to_numeric(df['col']) - Convert to numeric dtype
  • pd.to_datetime(df['col']) - Convert to datetime
  • pd.to_timedelta(df['col']) - Convert to timedelta
  • pd.eval() - Evaluate string expressions

6. Date & Time Handling

  • pd.date_range() - Generate date-time range
  • pd.bdate_range() - Generate business day range
  • pd.period_range() - Generate period range
  • pd.timedelta_range() - Generate timedelta range
  • df['datetime_col'].infer_freq() - Infer frequency
  • pd.interval_range() - Generate interval ranges
  • pd.tseries.api.guess_datetime_format() - Guess datetime format

7. Hashing & Interoperability

  • pd.util.hash_array() - Hash a NumPy array
  • pd.util.hash_pandas_object() - Hash a pandas object
  • pd.api.interchange.from_dataframe() - Convert to interchange object

Pandas General Functions