January 15, 2026
Building End-to-End Streaming Pipelines with Kafka and Google Dataflow
A deep dive into designing fault-tolerant, exactly-once streaming pipelines that move data from Kafka through Dataflow into BigQuery at scale.
Hi, my name is Gnana.
I'm a Senior Data Engineer with 8+ years of experience designing and building scalable, reliable data infrastructure. I specialise in batch and streaming systems that power analytics and machine learning at scale.
Hello! I'm Gnana, a Senior Data Engineer based in Chennai, India. I enjoy creating things that live in the cloud and process massive amounts of data. My interest in data engineering started back in 2017 when I realised the power of turning raw data into actionable insights.
Fast-forward to today, and I've had the privilege of working on complex data platforms at companies like PayPal, Ford Motors, and Tredence Analytics — processing terabytes of data daily. My main focus is building reliable, scalable data infrastructure and mentoring engineers on best practices.
Here are a few technologies I've been working with recently:
Languages
Big Data
Cloud — GCP
Databases
Orchestration & DevOps
Other Tools
December 2024 – Present
June 2024 – December 2024
June 2021 – June 2024
October 2017 – June 2021
Featured Project
An end-to-end streaming pipeline that ingests events from Kafka, transforms them with Cloud Dataflow (Apache Beam), and lands them in BigQuery with exactly-once semantics. Handles 100K+ events/sec with autoscaling and sub-second latency.
An enterprise batch ingestion pipeline that extracts data from Oracle, transforms it with Dataflow, stages it in GCS, and loads it into BigQuery as analytics-ready datasets.
I write about data engineering, system design, lessons learned from production systems, and best practices I've discovered along the way.
December 10, 2025
Practical patterns for building reliable batch pipelines from Oracle and Snowflake into BigQuery using Cloud Dataflow and GCS as the staging layer.
November 5, 2025
Key architectural decisions for BigQuery design, Dataflow pipeline patterns, and Airflow orchestration — lessons from production at Ford and Tredence.
What's Next?
I'm currently open to new opportunities and interesting data engineering challenges. Whether you have a question, want to discuss a project, or just want to say hi — my inbox is always open!