Hi, I am Ramachandra Nalam!

Data Engineer |

Building large-scale data pipelines, real-time analytics & data warehousing solutions

Ramachandra Nalam - Data Engineer
Name: Ramachandra Nalam
Profile: Data Engineer — Real-time Analytics & Data Pipelines
Current Location: Seattle, WA 98104

Technical Skills

Python & R
Apache Spark & PySpark
Apache Kafka & Streaming
AWS (Glue, S3, Redshift)
Snowflake & Data Warehousing
SQL & Database Management
Tableau & Power BI
Apache Airflow & Orchestration
dbt & Data Modeling
Machine Learning & TensorFlow

Technologies & Tools

Python R Scala SQL JavaScript TypeScript Apache Spark Apache Hadoop Apache Kafka Apache Airflow Apache NiFi AWS Glue Azure Data Factory Google BigQuery Snowflake Redshift Tableau Power BI Looker dbt Great Expectations TensorFlow Git Jenkins Docker Databricks CI/CD Azure DevOps Jira NLP OpenAI

Professional Summary

Result-oriented professional with 7+ years in building large-scale data pipelines, real-time analytics, and data warehousing across technology, retail, and financial domains.

Expertise in Apache Spark, Kafka, Airflow, and Snowflake, designing pipelines, and reducing data lag by 35%. Proficient in data modeling, automation, dashboard development, and achieving cost savings and workload reduction.

Proven success in partnering with cross-functional teams, delivering high-impact data solutions, and driving measurable business outcomes. Currently working at Meta as a Data Engineer, handling 50B+ daily events and creating multi-dimensional data models for user engagement tracking.

Core Competencies: Real-time Analytics, Data Pipeline Architecture, ETL/ELT Development, Data Warehouse Design, Stream Processing, Data Quality Framework, Business Intelligence, Machine Learning Integration

Professional Experience

Sep 2024 - Present

Data Engineer

Meta

  • Created a real-time messaging analytics pipeline handling 50B+ daily events using Apache Kafka and Spark, applying dbt transformations, simplified data modeling by 60% and supported self-serve metrics
  • Developed multi-dimensional data models for user engagement tracking, creating reusable dbt semantic layers powering 15+ executive dashboards in Looker and reducing report generation time
  • Implemented data quality framework with Great Expectations and Python validators, maintaining 99.9% data accuracy and preventing 12+ business-critical incidents through anomaly detection automation
  • Coordinated cross-functional analytics projects with Product, Growth, and Safety teams, building Airflow ETL pipelines processing 2TB+ daily, contributing to an 8% improvement in message delivery success rate
  • Designed Tableau dashboards for marketing and sales teams, tracking key performance indicators, achieving a 40% increase in data accessibility for business stakeholders
  • Designed data warehouse solutions on Snowflake for Messenger analytics, applying query optimizations and partitioning strategies, lowering costs by 40% while supporting 500+ monthly active users
Jul 2022 - Aug 2024

Data Engineer

Amazon

  • Led ETL pipeline development and optimization in AWS using AWS Glue, processing 750GB daily from S3 to Aurora DB with 98% data accuracy
  • Managed Apache Kafka streaming pipelines handling 500,000 events per second, shortening data lag by 35% and enabling real-time analytics for business reporting
  • Developed PySpark and Spark SQL applications, transforming 1TB daily from CSV, Parquet, and JSON files, increasing data aggregation precision
  • Presented weekly data visualizations and actionable insights to stakeholders, integrating SQL, Python, and BI tools, automating reports, improving strategic planning timeliness, increasing campaign reach by 20%
  • Optimized Redshift queries for reporting pipelines, reducing CPU utilization by 52% and saving 9 hours weekly in monitoring, troubleshooting, and on-call tasks for analysts
Jan 2021 - Jun 2022

Data Engineer

University at Buffalo (UB)

  • Consolidated 20,000+ student records, identifying graduation timelines and patterns, improving predictive accuracy for academic interventions by 28%
  • Orchestrated different machine learning models, including logistic regression, decision tree, and random forest, enhancing risk prediction for 3,500 at-risk students
  • Achieved 88.74% accuracy with Random Forest, contributing to a strategic initiative, increasing 4-year graduation rates by 23% for targeted student groups
  • Deployed Snowflake and Redshift data warehouses, partitioning tables and optimizing queries, leading to a 40% boost in resource consumption and supporting internal users
  • Managed cloud storage on AWS S3 and Azure Data Lake for 50TB+ data, enabling standardized access, governance, and data reliability across departments
Aug 2018 - Dec 2020

Data Engineer

Nike India

  • Orchestrated ETL pipelines for 5 queues, ingesting over 6 million records daily from different sources, achieving 98% data availability and consistency for business intelligence teams
  • Automated PySpark jobs processing 20TB+ daily datasets, transforming all records per BI requirements, minimizing data processing errors while supporting cross-functional analytics needs
  • Executed Apache NiFi workflows transferring data across shared folders, triggering automated notifications, and ensuring 95% of Hive table loads completed each business day
  • Engineered Python scripts aggregating login activities for 150,000 users, transferring processed data to Snowflake, and achieving a 75% reduction in potential security incidents for the organization
Jul 2017 - Dec 2020

Analytics Engineer

Freelancer

  • Deployed a TensorFlow-based load forecasting system for households, leading to a 10% reduction in energy costs and minimizing customer disruptions by 8%
  • Led a cross-functional team to prototype system deployment, monitored outcomes, and implemented CI/CD with GitHub Actions and AWS, achieving a 30% acceleration in deployment speed

Featured Projects

Click on any card to reveal project details

🤖

Competitive Marketing Intelligence

GenAI & NLP

Click to reveal details →

Gen-AI Marketing Intelligence

Feb 2022 – Jul 2022

• Directed GenAI chatbot with LLMs, NLP, and vector search

• Extracted intelligence from earnings calls - 30% better data access

• Automated financial page synthesis - 20% workload reduction

• Accelerated insight extraction for executive teams

GenAI LLM NLP Vector Search
📊

Real-time Messaging Analytics

Real-time Analytics

Click to reveal details →

Messaging Analytics Pipeline

Meta - Current

• Kafka + Spark pipelines for 50B+ events/day

• dbt semantic layers powering 15+ Looker dashboards

• 99.9% data quality with Great Expectations

• Real-time analytics for business decisions

Kafka Spark dbt Looker
☁️

AWS Streaming & ETL

Cloud Engineering

Click to reveal details →

AWS ETL Optimization

Amazon

• Kafka streaming handling 500k events/sec

• AWS Glue ETL processing 750GB daily

• Redshift optimization - 52% CPU reduction

• Campaign reach increased by 20%

AWS Glue Aurora Kafka Redshift
🎓

Student Success Analytics

Machine Learning

Click to reveal details →

Academic Risk Prediction

University at Buffalo

• ML models for 3,500 at-risk students

• 88.74% accuracy with Random Forest

• 23% improvement in graduation rates

• Predictive interventions system

Machine Learning Python Snowflake

Nike ETL Platform

Data Engineering

Click to reveal details →

Enterprise ETL System

Nike India

• 5 queues processing 6M+ records/day

• 98% data availability achieved

• PySpark jobs for 20TB+ daily datasets

• Apache NiFi workflow automation

PySpark Apache NiFi Hive
🔮

Load Forecasting System

Time Series & Forecasting

Click to reveal details →

Energy Load Prediction

Freelance Project

• TensorFlow-based forecasting model

• 10% energy cost reduction achieved

• 8% fewer customer disruptions

• CI/CD with GitHub Actions & AWS

TensorFlow Forecasting CI/CD

Education

🎓

Master of Science in Data Science

University at Buffalo

Jan 2021 – Aug 2022

Specialized in machine learning, big data analytics, and statistical modeling. Developed expertise in building scalable data solutions and predictive analytics systems.

🎓

Bachelor of Technology in Computer Science

K L University

Jun 2014 – Aug 2018

Strong foundation in computer science fundamentals, algorithms, data structures, and software engineering principles.

Get in Touch

Let's discuss how we can work together on your next data engineering project.