Avatar

Sumith Reddi Baddam

Applied Scientist

Amazon

Biography

I am an Applied Scientist at Amazon Ads, where I develop and optimize machine learning models to improve advertising performance and customer experience. My work involves building scalable algorithms for ad relevance, ranking, and targeting, conducting rigorous A/B tests to evaluate new features, and performing statistical analyses to identify patterns and improve system accuracy. Prior to my current role, I worked at Amazon Alexa, focusing on enhancing speech recognition for the AI Shopping Agent through conversational modeling and at Amazon Web Services (AWS), where I developed end-to-end Machine Learning pipelines for Infrastructure as a Code service (CloudFormation) to estimate resource provisioning times. I hold two master’s degrees in Data Science from Indiana University Bloomington and in Information Technology from IIIT Bangalore, and previously spent 2.5 years as a Data Scientist at Cisco Systems India, building large-scale deep learning models to improve product quality, engineer productivity, and customer experience. During my time at Cisco, I published research paper on a customer-centric bug prioritization system with predictive modeling using deep learning. I have also delivered keynote talks at the International Conference of Business Analytics and Intelligence at IIM Bangalore (2017) and at IISc Bangalore (2018).

Interests

  • Deep Learning
  • Large Language Models
  • Artificial Intelligence
  • Agentic AI
  • Machine Learning
  • Natural Language Processing
  • Predictive modeling
  • Statistics

Education

  • Master of Science in Data Science, 2020

    Indiana University Bloomington

  • Master of Technology in Information Technology, 2017

    International Institute of Information Technology Bangalore

  • Bachelor of Technology in Information Technology, 2017

    International Institute of Information Technology Bangalore

Skills

Programming Languages

Python, R, Java, C++, C, MATLAB

Machine Learning

TensorFlow, Keras, OpenCV, AWS SageMaker, Scikit-learn, Tableau

Security

Software Security, WebApp Security

Web development

Django, Flask, React, AngularJS, Javascript, HTML

Database

AWS, SQL, MongoDB, Google Cloud, JDBC, NoSQL, ZoDB

Cloud Platforms

Amazon Web Services (AWS), Google Cloud Platform (GCP)

Experience

 
 
 
 
 

Applied Scientist

Amazon

Oct 2023 – Present Virginia, United States
I am an Applied Scientist at Amazon Ads, where I develop and optimize machine learning models to improve advertising performance and customer experience. My work involves building scalable algorithms for ad relevance, ranking, and targeting, conducting rigorous A/B tests to evaluate new features, and performing statistical analyses to identify patterns and improve system accuracy.
 
 
 
 
 

Data Scientist

Amazon Alexa

Apr 2021 – Oct 2023 Seattle, United States
I worked at Amazon Alexa focusing on enhancing speech recognition for the AI Shopping Agent through conversational modeling.
 
 
 
 
 

Software Development Engineer (ML)

Amazon Web Services (AWS)

Jun 2020 – Apr 2021 Seattle, United States
Built machine learning applications and pipelines for AWS CloudFormation service (Infrastructure as a code). I built end-to-end machine learning applications to estimate the resource provision time for deploying the cloud infrastructure on AWS. The pipeline consists of weekly jobs for data extraction from S3 buckets, pre-processing using lambda functions and the prediction models built and hosted on Amazon SageMaker. I also productionized this application into the AWS CloudFormation service’s workflow.
 
 
 
 
 

Data Scientist

Cisco Systems

Jan 2017 – Aug 2019 Bengaluru, India
Built machine learning models to improve the quality of Cisco products and its internal workflow:

  • Recommendation engine for identifying peer reviewers for testing on Cisco’s code review platform using NLP.
  • Keywords extraction and document classification of service request cases using unsupervised LDA modeling.
  • Classification of Cisco products into various categories to help the sales teams improve their revenue generation.
  • Identification of files that get impacted when set of files are committed to repository using Association Mining.
  • Clustering the features of products based on the text data and summary fields with NLP and K-means clustering.
  • Software upgrade recommendations to customers using random forest and data mining.
 
 
 
 
 

Data Semantics Intern

DataWeave Software Pvt. Ltd.

May 2016 – Jul 2016 Bengaluru, India
I implemented an algorithm that performs clustering of the products from various e-commerce websites and provides pricing insights to our customers. This product was built to scale to 10 Million concurrent users using distributed scheduling of jobs. I also built an automation engine that classifies the products into various categories using SVM, random forest and neural networks. I managed to improve the accuracy of the classification models from 81% to 90%.
 
 
 
 
 

Big Data Analytics Intern

Zettamine Labs (Apple Inc. client)

May 2015 – Jul 2015 Hyderabad, India
I built an end-to-end product that performs web-scrapping of data, analyse the customer reviews and provides insights to the manufacturers (Apple Inc.). These insights range from “what issues are the customers facing in iPad?” to “What extent is battery drain issue affecting the customers?” My research publication on building this product using Natural Language Processing was selected at the MongoDB Conference - New York, 2015.

Patent & Publications

NeuralCook – Image2Ingredients and cooking recommendation using Deep Learning

Deep learning application to identify ingredients from cooking dishes images and recommend dishes to cook, given a set of ingredients. This application leverages NLP and Computer Vision to learn semantic knowledge using joint embeddings.

Intelligent defect creation system using Siamese CNN LSTM techniques

Implemented a duplicate bug detector that identifies whether a newly created bug is a duplicate of an existing bug in the Cisco Defect Tracking System and then retrieves all similar bugs from the database with an accuracy close to 90%.

Customer Success Using Deep Learning

Explored using Deep Learning models to predict and enhance customer success metrics in business environments. Implemented models that analyze customer data to identify success patterns and recommend improvements, with applications for corporate decision-making.

Prediction of issues customers face in a software using unsupervised learning

Implemented Deep Neural Network model in TensorFlow which predicts the issues customers might face in a Cisco product post its release, helping developer teams fix them prior with an accuracy of 95% on Cisco’s Next-Gen devices.

Projects

NeuralCook – Image2Ingredients and cooking recommendation using Deep Learning

Deep learning application to identify ingredients from cooking dishes images and recommend dishes to cook, given a set of ingredients. This application leverages NLP and Computer Vision to learn semantic knowledge using joint embeddings.

Human Computer Interaction

Virtual agent that acts as a receptionist. A 3 layered architecture that has dialogue management, video analysis, speech to text and text to speech models. Built deep learning model for dialogue management. Video analysis involved fate detection and recognition. Speech synthesis was using API.

Object Recognition using Deep Neural Networks

Visual categorization of objects using Convolution Neural Networks in Python.

Automated Essay Grading System

The students in an interview were asked to write essays on specific topics and the task is to grade those essays. I have built a POS Tagger module using SCRDR algorithm and used this as a feature along with other word features. The model was trained using neural networks.

Visual Categorization with Bags of Key-points

Classification of objects in an image using SIFT descriptor and Support Vector Machine classifier. Implemented the paper by Xerox Research institute.

Data Analytics on Karnataka State Government Education data

Association rule mining, classification, clustering and statistical analysis on the Karnataka state secondary high school dataset to find the insights and suggest the government to function better.

Carpooling Web Application in NodeJS with Object Oriented Programming paradigm

Built a web application using crowd sourcing where people traveling for same destination can share a ride saving money, fuel and pollution. Django web framework was used for building the application.

Smart Canteen System

The project was implemented for our hostel canteen system. It aims at reducing the waiting time of students and professors in the queue by analyzing the queue length using image processing and estimates the time to be taken by using the previous historical data.

Cloud data analytics for efficient usage of water for a smart city initiative

Storage of data regarding the environmental conditions of the plants along the road dividers using Broadcom WICED sensor and doing an analysis on the data to predict the amount of water usage required for the plants and the time at which they need to be watered.

Object Graph Database

Building an object graph for cricket using Object Oriented Database (OODB) model using JDBC and MySQL storage and Spring web framework. The application was hosted on IBM Bluemix.

App Store using Object Oriented Programming in Java

Programmed database for online app store using Java, JDBC and MySql. A model similar to google play-store.

Object Oriented Programming in C++

Implemented a full-fledged Game of Scrabble in Java making use of Object Oriented Programming.

Contact