data:image/s3,"s3://crabby-images/1807a/1807a903851adefed5161fe304654717aca1e3b2" alt="blogpost-figure"
Overlapping intervals in real life
Feb 3, 2025
Leetcode problem encountered in the wild
data:image/s3,"s3://crabby-images/8ba01/8ba0152458fd9d3473758c8df922bf6f894302dd" alt="blogpost-figure"
Using XCom in Airflow
Jan 20, 2025
How to pass data between Airflow tasks
data:image/s3,"s3://crabby-images/c35cc/c35cc9e132889900a720d00f8a0449331c0aa54d" alt="blogpost-figure"
Unit testing Airflow DAGs
Jan 6, 2025
How to unit test Airflow DAGs and tasks
data:image/s3,"s3://crabby-images/676be/676be8d80719f0efc6379391cf8adf558c9df835" alt="blogpost-figure"
Airflow on local machine
Dec 16, 2024
How to set up Airflow locally
data:image/s3,"s3://crabby-images/39932/399325a22ccd32df99afd73fb55e376dce2e6d0b" alt="blogpost-figure"
Orchestration Wars - Airflow
Dec 2, 2024
What you need to know before using Airflow
data:image/s3,"s3://crabby-images/c17a8/c17a8364e8bee3605504f79a2af4392f8015177b" alt="blogpost-figure"
Data Orchestration via Events
Oct 7, 2024
Orchestrating data pipelines across systems and teams without relying on a platform
Python Transactions
Sep 2, 2024
How to write a utility Python decorator for a transaction
data:image/s3,"s3://crabby-images/b7c54/b7c5468fe08a6f7751a44ff0fa8369f8cc05b01c" alt="blogpost-figure"
How to extract a graph from your Python function
Aug 5, 2024
How to statically analyze the operations done by a Python function
data:image/s3,"s3://crabby-images/28c1e/28c1ea148e19d4108b4a8bc8bbd67c7a32b228e4" alt="blogpost-figure"
Data Lineage via OpenLineage - The Easy Way
Jul 1, 2024
Data Lineage and emitting OpenLineage events via Python decorator
data:image/s3,"s3://crabby-images/9e2bc/9e2bc7daf46a50023b5f4770f020488ecd5deedc" alt="blogpost-figure"
Highlights from PyCon DE & PyData Berlin 2024
May 6, 2024
The Engineer's Highlights from PyCon DE & PyData Berlin 2024
data:image/s3,"s3://crabby-images/abe50/abe5012a3da466894462fde1000a792f289aa75a" alt="blogpost-figure"
The Lost Art of Documentation
Apr 1, 2024
How and when to document your code
data:image/s3,"s3://crabby-images/777cd/777cdad486de1d852b9bbaa6b9a2964265fd8c17" alt="blogpost-figure"
PySpark jar dependencies
Mar 18, 2024
How to add jar dependencies to pyspark
data:image/s3,"s3://crabby-images/63812/638128b7d183b5411efc5de7d0dd0434ce9b3f3f" alt="blogpost-figure"
PySpark by download
Mar 4, 2024
How to set up pyspark by downloading
data:image/s3,"s3://crabby-images/1dcbc/1dcbcc8679aaa309cad1148b407a805c5d7a8e99" alt="blogpost-figure"
PySpark with Docker
Feb 19, 2024
How to set up pyspark with Docker
data:image/s3,"s3://crabby-images/caa68/caa6804e57620ec009f57f02b2d8544ffb696b14" alt="blogpost-figure"
Spark UDFs
Feb 5, 2024
The different ways to use UDFs in pyspark
data:image/s3,"s3://crabby-images/9725d/9725d298e185664ec1c549a45fa1e549ef1de868" alt="blogpost-figure"
Spark and all the versions
Jan 15, 2024
Spark, Python, Java, pyarrow, which versions work together?
data:image/s3,"s3://crabby-images/481b9/481b9e5e2058d4c9c7ea99c059070527dbcbc4fb" alt="blogpost-figure"
PySpark with pip
Jan 8, 2024
How to set up pyspark with pip
data:image/s3,"s3://crabby-images/75fb5/75fb5a4fc633751615d1ea0ef3b31d61caccc7ae" alt="blogpost-figure"
PySpark with conda
Jan 1, 2024
How to set up pyspark with conda