Thoughts on data engineering, system design, lessons learned from production, and best practices I've picked up along the way.
January 15, 2026 · 9 min read
A deep dive into designing fault-tolerant, exactly-once streaming pipelines that move data from Kafka through Dataflow into BigQuery at scale.
Read more December 10, 2025 · 10 min read
Practical patterns for building reliable batch pipelines from Oracle and Snowflake into BigQuery using Cloud Dataflow and GCS as the staging layer.
Read more November 5, 2025 · 8 min read
Key architectural decisions for building a robust, cost-efficient data platform on GCP — covering BigQuery design, Dataflow pipeline patterns, and Airflow orchestration.
Read more