About Me
Transforming complex data challenges into scalable, efficient solutions

My Journey
With almost 15 years of experience in the data space, I've transitioned from analyst roles into architecting scalable batch and real-time data infrastructure that supports millions of users and billions of events. I specialize in building modern data platforms powered by Airflow, Spark, Kafka, Flink, and cloud-native tools.
My work focuses on optimizing ETL workflows, enhancing data accessibility, and driving cost-efficiency for high-impact teams across marketing, product, and engineering. I'm passionate about designing systems that turn complex data into clean, reliable insights—and thrive on tackling infrastructure challenges that unlock business growth.
Technical Expertise
Data Processing & ETL
Cloud & Storage
Languages & Frameworks
Data Visualization & BI
Analytics & Monitoring
Professional Experience
Lead Data Engineer
- Optimized GCS, Composer, and BigQuery ETL pipelines by refactoring legacy workflows, reducing processing time, cutting data costs by over $100K, and enhancing scalability across 20+ content products.
- Developed and optimized batch data pipelines using Apache Airflow, Spark, SQL, and Python, supporting 10M daily active users and over 3B monthly ad impressions. Experimented with Flink for enhanced real-time streaming performance.
- Integrated Looker with BigQuery and other data sources to create interactive dashboards, improving data accessibility for stakeholders.
- Developed API-based data ingestion pipelines, improving ETL efficiency and reducing both processing time and data maintenance complexity.
- Maintained Databricks workflows, working with notebooks to troubleshoot Spark-based pipeline issues and ensure 100% uptime for critical data operations.
Associate Lead Analyst
- Managed analytics for multiple HHS and NIH government websites under the Digital Analytics Program (DAP), driving performance and user engagement improvements through SEO audits, A/B testing, and goal funneling strategies.
- Developed and managed marketing tag strategies using Google Analytics and Google Tag Manager, ensuring 100% data accuracy and alignment with client objectives.
- Spearheaded the development of data warehousing solutions to understand key trends, enabling data-driven decision-making and actionable insights.
- Regularly analyzed website metrics and delivered comprehensive analytics reports that shaped and enhanced client strategies, aligning with organizational goals.
Data Analyst
- Oversaw daily operations of the ACS Web Stats System, supporting marketing and sales analytics. Created monthly and annual reports to deliver strategic insights for editorial and marketing teams.
- Analyzed ad performance across Google Search, YouTube, Google Analytics, BusinessObjects, and external platforms to optimize campaign effectiveness.
- Implemented metrics dashboards with Tableau and Google Data Studio for real-time web traffic monitoring.
- Automated monthly and quarterly analytics reporting, reducing manual effort by 50%.
- Migrated analytics infrastructure from legacy Google Analytics to Universal Analytics, ensuring seamless tracking and improved reporting.
Education & Certifications
B.S.B.A. in Business Administration
Western New England University
Featured Projects
Explore my data engineering projects and technical solutions

BigQuery ETL Pipelines for Digital Turbine
Designed and optimized BigQuery ETL pipelines for scalable, high-performance data processing, supporting over 10M daily users and enabling analytics for 3B+ monthly ad impressions.

Streaming and Batch Experiments
Designing real-time streaming and batch workflows for soccer match data using Kafka, Flink, Spark, and Airflow to enable high-velocity data ingestion, transformation, and analytics at scale.
Data Engineering Consultancy
Turn your complex data challenges into strategic business advantages
Data Solutions
With nearly 15 years of data engineering experience, I help businesses build scalable, efficient data pipelines and infrastructure that deliver real value and solve complex data challenges.
Core Technologies
Ready to transform your data infrastructure?
Let's discuss how my expertise can help your organization build scalable, efficient solutions.
Get in Touch
Data Pipeline Architecture
Custom-designed batch and streaming data pipelines that scale with your business needs and optimize for cost efficiency.
Data Infrastructure Optimization
Refactoring and optimizing existing data workflows to reduce costs, improve reliability, and enhance performance.
Analytics & Visualization
Implementation of comprehensive analytics solutions and interactive dashboards to unlock actionable insights.
The Inner Join
Build Scalable Data Pipelines.
Less NULLs. More value.
Streaming ETL is no longer a niche; it's the foundation of real-time, event-driven systems. In this post, I break down when to use streaming pipelines and how Kafka and Flink fit together, then walk through a real-world example.
More Stories
What is ETL in 2025? Moving Beyond Extract, Transform, Load
ETL has evolved, and fast. Here's a clear, thoughtful guide to modern ETL vs. ELT, highlighting real-world use cases, tooling insights, and best practices for data engineers.
From Pipelines to Purpose: Why I’m Sharing My Journey in Data Engineering
A senior data engineer's story of building real-time and batch pipelines—and why I'm sharing my journey to land my dream role.
Let's Connect
Interested in working together? I'm always open to discussing new projects, creative ideas or opportunities to be part of your vision.
Get In Touch
Feel free to reach out for collaborations or just a friendly hello
Currently
Location: Washington, DC Metro Area
Availability: Open to Work
Looking for: Full-time opportunities