rishav

I’m Rishav, an engineer at heart driven by the challenge of building machine learning systems that work reliably at scale. I’m a graduate student at Mila, where my research focuses on real-time and explainable reinforcement learning. My long-term goal is to build trustworthy systems that can learn efficiently from feedback—unlike current ML models that require millions of samples for even basic tasks. I’m also a co-organizer of a workshop on explainability in deep learning at CRV.
I previously co-founded Offside, where I built and scaled the product to 100k users and raised $300k from top-tier VCs and angels. Before that, I spent two enriching years at DFKI in Germany, developing real-time vision algorithms for precision farming—here’s a glimpse of that work: Spot Spraying for Precision Agriculture. I graduated from BITS Pilani in 2020 with a degree in Computer Science.
Outside of work, I enjoy reading about ancient civilizations, listening to classic rock, trekking, and strength training. I also write blogs about my projects and life learnings.
News
Mar 20, 2025 | I’ve started a series of posts on CUDA programming, with the end goal of accelerating DQN using CUDA. The very first blog post is now live: link. |
---|---|
Jan 22, 2025 | Handling delays in RL accepted at ICLR 2025. |
Nov 15, 2024 | KD-LoRA accepted at NeurIPS ENLSP Workshop. |
Jun 12, 2024 | Handling delays in real-time RL accepted at ICML Workshop and RLC Workshop. |
Jan 1, 2024 | back to research in AI, exploring reinforcement learning. |