Vamshi Madala

vamchowdary72[at]gmail

Education

2021-2026
UCSB

PhD
- Understanding and designing DNN architectures from approximation theoretic perspective, with focus on efficincy and provable generalization.
- Advisor: Prof. Shivkumar Chandrasekaran
2019-2021
UCSB

Masters
- Thesis: A study of generalization in deep neural networks
- GPA: 4.0/4.0
2012-2016
IIT Roorkee

Bachelors in ECE
- Thesis: Low-cost display devices using nanoparticles
- Advisors: Prof. Brijesh Kumar and Prof. Sanjeev Manhas
- GPA: 8.09/10.0

Experience

2026-Present
Cartesia AI

Researcher
Summer 2025
Amazon AGI

Applied Scientist Intern
- Trained a novel Mixture of Experts (MoE) architecture to reduce inter-node communication costs.
Jan 2024-Sep 2024
Stealth startup

ML Researcher
- Part of the founding team, I led groundwork and architectural setup for ML based solutions, developing Vision, NLP, and speech-based models for tasks in the supply chain industry.
Summer 2022
Apple

Software Intern
- Developed physics based algorithms to improve the Fall Detection feature. Created data processing pipelines to efficiently handle hundreds of hours of time series data.
2020 - 2021
UCSF

Grad Researcher
- Developed Medviz - an AWS web portal and visualization tool for deploying machine learning models for processing of large PET/MRI datasets.
Summer 2020
Briteseed

ML Intern
- Trained CNNs on hyperspectral image data from surgical tools to detect tissues.
2016 - 2019
Samsung Research

Engineer
- Music Information Retrieval (MIR).
Summer 2015
VIOS Medical

Embedded Intern
- Characterized different wireless modules for energy consumption and connectivity.
- Developed software packaging and Linux distribution tools.

I completed my Ph.D. at the University of California, Santa Barbara (UCSB), where I was advised by Shiv Chandrasekaran. I am now a researcher at Cartesia AI.

My dissertation focused on approximation-theoretic approaches to designing neural network architectures that are efficient and have robust generalization. My recent work includes efficient neural network-based solvers for PDEs.