Rishav
  • Blog
  • Projects
  • Publications

Rishav’s Website

Graduate Researcher at Mila
Building Reliable Machine Learning Systems

GitHub Email LinkedIn Twitter Google Scholar

I’m a graduate researcher at Mila, broadly interested in building reliable machine learning systems that work at scale. At Mila, my research has focused on topics in RL, mainly offline (explainability and adaptive regularization), real-time RL, and benchmarking SSL methods for Atari games.

My larger goal is to develop safe, interpretable, real-time, and sample-efficient agents that scale, motivating my interests in areas like mechanistic interpretability, world models, and real-time distributed systems.

Before Mila, I co-founded Offside (scaled to 100k users), spent ~2 years at DFKI in Germany developing real-time vision algorithms for precision farming, and earned a BEng (Thesis) in Computer Science from BITS Pilani in 2020.

Latest Posts

The Identity Crisis: How DeepSeek Fixed the Flaw in Hyper-Connections

mHC

Jan 3, 2026

Real-Time Reinforcement Learning

Article on real-time reinforcement learning

Jun 20, 2025

GSPO vs GRPO - Theory, Practice, and the Limits of Approximation

GSPO vs GRPO

Nov 23, 2025
No matching items

All Posts »

Beyond Research

Outside of research, I enjoy reading about ancient civilizations, listening to classic rock, trekking, and strength training. I also write blogs reflecting on projects and life learnings.

 
Rishav ©