I came into machine learning from full-stack engineering, which shapes how I build AI systems. I treat models as production dependencies: versioned, evaluated, monitored, and improved with evidence instead of vibes. I read the eval results before celebrating the demo, and I write documentation because future me is usually the next engineer on call.
What I enjoy most is building the layer between an impressive prototype and a system people can actually use: clean APIs, reliable pipelines, retrieval quality, observability, and thoughtful user experience. I learn fastest by shipping projects slightly beyond my comfort zone, debugging the hard parts, and turning those lessons into better systems.
CurrentlyLLM serving & evaluation infrastructure at Qualcomm
Learninginference optimization — quantization, speculative decoding
Off the clocksoccer, fifa, day hikes, and over-engineering my home server