Welcome to etl_toolkit’s documentation!#
The etl_toolkit contains many utilities to write better pyspark to simplify pipelines on Databricks.
Navigate to each module of the etl_toolkit to learn about the functions provided and how to use them.
expressions: Contains functions for deriving complex
pyspark.Columnsanalyses: Contains functions for deriving complex
pyspark.DataFramesthat apply many data transformations.
Contents:
Emodule (“expressions”)Amodule (“analyses”)- Table creation functions
- Mutable Table functions