Padmanaba Srinivasan
CV
Blog
Publications
Tag Index
machine learning (6)
reinforcement learning (6)
tech (6)
machine learning (6)
Intuition Explained: Behavioral Supervisor Tuning
March 5, 2025
Thoughts on DPO and Offline RL
June 22, 2024
Some Interesting Offline RL Methods (Early 2024)
February 22, 2024
An Introduction to Preference-Based RL
February 22, 2024
An Overview of Model-Based Offline RL Methods
February 22, 2024
An Intro to Offline Reinforcement Learning
December 15, 2023
reinforcement learning (6)
Intuition Explained: Behavioral Supervisor Tuning
March 5, 2025
Thoughts on DPO and Offline RL
June 22, 2024
Some Interesting Offline RL Methods (Early 2024)
February 22, 2024
An Introduction to Preference-Based RL
February 22, 2024
An Overview of Model-Based Offline RL Methods
February 22, 2024
An Intro to Offline Reinforcement Learning
December 15, 2023
tech (6)
Intuition Explained: Behavioral Supervisor Tuning
March 5, 2025
Thoughts on DPO and Offline RL
June 22, 2024
Some Interesting Offline RL Methods (Early 2024)
February 22, 2024
An Introduction to Preference-Based RL
February 22, 2024
An Overview of Model-Based Offline RL Methods
February 22, 2024
An Intro to Offline Reinforcement Learning
December 15, 2023