bhuv's notebook
About
Blogs
Publications
twitter
github
linkedin
#ML
Some technical intuition on RLHF and Direct Preference Optimisation
May 4, 2024
Some technical intuition on RLHF and Direct Preference Optimisation