Data Engineer — 3+ years

Building scalable data platforms for analytics + cloud.

I build reliable ETL/ELT pipelines, optimize Snowflake performance, and design clean data models for reporting and decision-making. I also support cloud data architecture across AWS and Azure with observability, automation, and CI/CD-ready workflows.

Snowflake Warehousing SQL + Python Engineering Spark Processing dbt + Airflow Modern Stack Cloud AWS • Azure
GitHub
Profile photo
Sai Jahnavi Kolli
Data Engineer • Cloud & Analytics
Snowflake • SQL • Python • Spark • AWS • Azure
Data Pipeline Ingest → Transform → Validate → Curate → Serve
running

Skills

Core Data Engineering

Snowflake Advanced SQL Python Apache Spark ETL/ELT Pipelines Dimensional Modeling Data Warehousing

Modern Data Stack

dbt Apache Airflow Data Quality Testing Data Validation Data Observability CI/CD for Data

Cloud & Platforms

AWS (S3, EC2) Azure Azure Data Factory GCP (Basics) BigQuery

Databases

PostgreSQL MySQL

Analytics & BI

Power BI Tableau KPI Dashboards

Delivery

Agile / Scrum Stakeholder Collaboration

Projects

Automotive Loan Risk Analytics Platform Snowflake ELT pipelines + curated datasets for portfolio performance and risk reporting.
Built an analytics-ready warehouse layer in Snowflake with clean dimensional modeling, standardized transformations, and automated validation to support risk KPIs and BI dashboards.
SnowflakeSQLPythonDimensional Modelingdbt
Manufacturing Operations Performance Analytics Integrated production + quality + supply chain data to power KPI dashboards and ops decisions.
Unified multiple sources into a consistent analytics layer, improved data reliability with profiling/validation, and enabled KPI-based operational reporting for leadership.
SQLPythonData ModelingAnalytics
Pipeline Reliability & Performance Optimization Tuning, validation frameworks, and monitoring to strengthen data trust.
Reduced processing time through query/Spark optimization, standardized reconciliation and error handling, and strengthened observability for production stability.
SparkAirflowData QualitySnowflake

Experience

GM Financial — Data Engineer

2025 – Present
  • Designed and maintained scalable ETL/ELT pipelines using SQL, Python, and Spark to process high-volume financial datasets in Snowflake data warehouse environments.
  • Developed cloud-based data architectures supporting analytics workloads across AWS and Azure services, ensuring secure and efficient data movement.
  • Built dimensional data models and optimized Snowflake storage and compute performance for reporting and business intelligence applications.
  • Implemented automated data quality validation, monitoring, and error-handling mechanisms to improve reliability and observability of production pipelines.
  • Collaborated with DevOps and cloud teams to support CI/CD deployments, environment configuration, and infrastructure optimization in Agile environments.

Reliance Industries — Data Engineer

2022 – 2024
  • Engineered end-to-end data pipelines integrating operational and supply chain datasets using Python, advanced SQL, and cloud-based processing frameworks.
  • Designed and maintained scalable data warehouse solutions enabling KPI dashboards and enterprise analytics reporting.
  • Optimized distributed data processing workflows to improve performance, scalability, and resource utilization.
  • Implemented data validation, transformation, and governance standards to ensure data consistency across cloud environments.
  • Worked closely with cross-functional teams to translate business requirements into secure, scalable cloud data solutions.

Contact

Send me a message

available
Prefer direct email? jahnavikolli09@gmail.com