Bavish Kulur

I am a graduate student at the University of Alberta, part of the RLAI lab, advised by Prof. Marlos Machado. I’m currently working on exploration in continual reinforcement learning.

Previously, I was a research fellow (pre-doc) at Robert Bosch Centre for DS and AI (RBCDSAI) where I was advised by Prof. Balaraman Ravindran. Before that, I worked as a software engineer at Sprinklr. I completed my undergraduate studies at the Indian Institute of Technology Bombay in 2022.

Publications

PREFINe: Preference-Based Implicit Reward and Cost Fine-Tuning for Safety Alignment
Richa Verma, Bavish Kulur, Sanjay Chawla, Balaraman Ravindran
International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2026
Paper