For the best experience on desktop, install the
Chrome extension
to track your reading on news.ycombinator.com
×
Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
history
|
from
register
Reinforcement Learning (I.e. Policy Gradient Algorithms)
(
rlhfbook.com
)
2 points
by
vinhnx
18 days ago
|
past
Reinforcement Learning from Human Feedback
(
rlhfbook.com
)
133 points
by
onurkanbkrc
56 days ago
|
past
|
5 comments
RLHF Book
(
rlhfbook.com
)
479 points
by
jxmorris12
on Feb 1, 2025
|
past
|
37 comments
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
×
HN For You
Display Mode
Highlight
Top
Only
Debug mode
Sign Out
API Key:
Connect
Create an account
to get your API key.