rishav's Website

I’m Rishav, an engineer at heart driven by the challenge of building machine learning systems that work reliably at scale. I’m at Mila, where my research focuses on real-time and explainable reinforcement learning. My long-term goal is to build trustworthy systems that can learn efficiently from feedback—unlike current ML models that require millions of samples for even basic tasks.

I previously co-founded Offside, where I built and scaled the product to 100k users and raised $300k from top-tier VCs and angels. Before that, I spent two enriching years at DFKI in Germany, developing real-time vision algorithms for precision farming—here’s a glimpse of that work: Spot Spraying for Precision Agriculture. I graduated from BITS Pilani in 2020 with a degree in Computer Science.

Outside of work, I enjoy reading about ancient civilizations, listening to classic rock, trekking, and strength training. I also write blogs about my projects and life learnings.

News

Mar 20, 2025	I’ve started a series of posts on CUDA programming, with the end goal of accelerating DQN using CUDA. The very first blog post is now live: link.
Jan 22, 2025	Handling delays in RL accepted at ICLR 2025.
Nov 15, 2024	KD-LoRA accepted at NeurIPS ENLSP Workshop.
Jun 12, 2024	Handling delays in real-time RL accepted at ICML Workshop and RLC Workshop.
Jan 1, 2024	back to research in AI, exploring reinforcement learning.