Ramachandra Nalam - Data Engineer Portfolio

Professional Summary

Result-oriented professional with 7+ years in building large-scale data pipelines, real-time analytics, and data warehousing across technology, retail, and financial domains.

Expertise in Apache Spark, Kafka, Airflow, and Snowflake, designing pipelines, and reducing data lag by 35%. Proficient in data modeling, automation, dashboard development, and achieving cost savings and workload reduction.

Proven success in partnering with cross-functional teams, delivering high-impact data solutions, and driving measurable business outcomes. Currently working at Meta as a Data Engineer, handling 50B+ daily events and creating multi-dimensional data models for user engagement tracking.

Core Competencies: Real-time Analytics, Data Pipeline Architecture, ETL/ELT Development, Data Warehouse Design, Stream Processing, Data Quality Framework, Business Intelligence, Machine Learning Integration

Professional Experience

Sep 2024 - Present

Data Engineer

Amazon

Led ETL pipeline development and optimization in AWS using AWS Glue, processing 750GB daily from S3 to Aurora DB with 98% data accuracy
Managed Apache Kafka streaming pipelines handling 500,000 events per second, shortening data lag by 35% and enabling real-time analytics for business reporting
Developed PySpark and Spark SQL applications, transforming 1TB daily from CSV, Parquet, and JSON files, increasing data aggregation precision
Presented weekly data visualizations and actionable insights to stakeholders, integrating SQL, Python, and BI tools, automating reports, improving strategic planning timeliness, increasing campaign reach by 20%
Optimized Redshift queries for reporting pipelines, reducing CPU utilization by 52% and saving 9 hours weekly in monitoring, troubleshooting, and on-call tasks for analysts

Jan 2021 - Jun 2022

Data Engineer

University at Buffalo (UB)

Consolidated 20,000+ student records, identifying graduation timelines and patterns, improving predictive accuracy for academic interventions by 28%
Orchestrated different machine learning models, including logistic regression, decision tree, and random forest, enhancing risk prediction for 3,500 at-risk students
Achieved 88.74% accuracy with Random Forest, contributing to a strategic initiative, increasing 4-year graduation rates by 23% for targeted student groups
Deployed Snowflake and Redshift data warehouses, partitioning tables and optimizing queries, leading to a 40% boost in resource consumption and supporting internal users
Managed cloud storage on AWS S3 and Azure Data Lake for 50TB+ data, enabling standardized access, governance, and data reliability across departments

Aug 2018 - Dec 2020

Data Engineer

Nike India

Orchestrated ETL pipelines for 5 queues, ingesting over 6 million records daily from different sources, achieving 98% data availability and consistency for business intelligence teams
Automated PySpark jobs processing 20TB+ daily datasets, transforming all records per BI requirements, minimizing data processing errors while supporting cross-functional analytics needs
Executed Apache NiFi workflows transferring data across shared folders, triggering automated notifications, and ensuring 95% of Hive table loads completed each business day
Engineered Python scripts aggregating login activities for 150,000 users, transferring processed data to Snowflake, and achieving a 75% reduction in potential security incidents for the organization

Jul 2017 - Dec 2020

Analytics Engineer

Freelancer

Deployed a TensorFlow-based load forecasting system for households, leading to a 10% reduction in energy costs and minimizing customer disruptions by 8%
Led a cross-functional team to prototype system deployment, monitored outcomes, and implemented CI/CD with GitHub Actions and AWS, achieving a 30% acceleration in deployment speed

Featured Projects

Click on any card to reveal project details

🤖

Competitive Marketing Intelligence

GenAI & NLP

Click to reveal details →

Gen-AI Marketing Intelligence

Feb 2022 – Jul 2022

• Directed GenAI chatbot with LLMs, NLP, and vector search

• Extracted intelligence from earnings calls - 30% better data access

• Automated financial page synthesis - 20% workload reduction

• Accelerated insight extraction for executive teams

GenAI LLM NLP Vector Search

📊

Real-time Messaging Analytics

Real-time Analytics

Click to reveal details →

Messaging Analytics Pipeline

Meta - Current

• Kafka + Spark pipelines for 50B+ events/day

• dbt semantic layers powering 15+ Looker dashboards

• 99.9% data quality with Great Expectations

• Real-time analytics for business decisions

Kafka Spark dbt Looker

☁️

AWS Streaming & ETL

Cloud Engineering

Click to reveal details →

AWS ETL Optimization

Amazon

• Kafka streaming handling 500k events/sec

• AWS Glue ETL processing 750GB daily

• Redshift optimization - 52% CPU reduction

• Campaign reach increased by 20%

AWS Glue Aurora Kafka Redshift

🎓

Student Success Analytics

Machine Learning

Click to reveal details →

Academic Risk Prediction

University at Buffalo

• ML models for 3,500 at-risk students

• 88.74% accuracy with Random Forest

• 23% improvement in graduation rates

• Predictive interventions system

Machine Learning Python Snowflake

⚡

Nike ETL Platform

Data Engineering

Click to reveal details →

Enterprise ETL System

Nike India

• 5 queues processing 6M+ records/day

• 98% data availability achieved

• PySpark jobs for 20TB+ daily datasets

• Apache NiFi workflow automation

PySpark Apache NiFi Hive

🔮

Load Forecasting System

Time Series & Forecasting

Click to reveal details →

Energy Load Prediction

Freelance Project

• TensorFlow-based forecasting model

• 10% energy cost reduction achieved

• 8% fewer customer disruptions

• CI/CD with GitHub Actions & AWS

TensorFlow Forecasting CI/CD

Hi, I am Ramachandra Nalam!

Technical Skills

Technologies & Tools

Professional Summary

Professional Experience

Data Engineer

Meta

Data Engineer

Amazon

Data Engineer

University at Buffalo (UB)

Data Engineer

Nike India

Analytics Engineer

Freelancer

Featured Projects

Competitive Marketing Intelligence

Gen-AI Marketing Intelligence

Real-time Messaging Analytics

Messaging Analytics Pipeline

AWS Streaming & ETL

AWS ETL Optimization

Student Success Analytics

Academic Risk Prediction

Nike ETL Platform

Enterprise ETL System

Load Forecasting System

Energy Load Prediction

Education

Master of Science in Data Science

University at Buffalo

Bachelor of Technology in Computer Science

K L University

Get in Touch