Hacker News
Reinforcement Learning (I.e. Policy Gradient Algorithms) (rlhfbook.com)
2 points by vinhnx 18 days ago | past
Reinforcement Learning from Human Feedback (rlhfbook.com)
133 points by onurkanbkrc 56 days ago | past | 5 comments
RLHF Book (rlhfbook.com)
479 points by jxmorris12 on Feb 1, 2025 | past | 37 comments
