Welcome to etl_toolkit’s documentation!#

The etl_toolkit contains many utilities to write better pyspark to simplify pipelines on Databricks. Navigate to each module of the etl_toolkit to learn about the functions provided and how to use them.

  • expressions: Contains functions for deriving complex pyspark.Columns

  • analyses: Contains functions for deriving complex pyspark.DataFrames that apply many data transformations.

Contents:

Indices and tables#