rishav


I’m Rishav, an engineer at heart, driven by the challenge of building machine learning systems that work reliably at scale. I’m at Mila, where my research focuses on real-time and explainable reinforcement learning. My long-term goal is to build trustworthy systems that learn efficiently from feedback, unlike current ML models that require millions of samples for even basic tasks.

I previously co-founded Offside, where I built and scaled the product to 100k users and raised $300k from top-tier VCs and angels. Before that, I spent two enriching years at DFKI in Germany, developing real-time vision algorithms for precision farming. Here’s a glimpse of that work: Spot Spraying for Precision Agriculture. I graduated from BITS Pilani in 2020 with a degree in Computer Science.

Outside of work, I enjoy reading about ancient civilizations, listening to classic rock, trekking, and strength training. I also write blog posts about my projects and life lessons.

News

Mar 20, 2025 I’ve started a series of posts on CUDA programming, with the end goal of accelerating DQN using CUDA. The very first blog post is now live: link.
Jan 22, 2025 Handling delays in RL accepted at ICLR 2025.
Nov 15, 2024 KD-LoRA accepted at NeurIPS ENLSP Workshop.
Jun 12, 2024 Handling delays in real-time RL accepted at ICML Workshop and RLC Workshop.
Jan 1, 2024 Back to research in AI, exploring reinforcement learning.