For the best experience on desktop, install the
Chrome extension
to track your reading on news.ycombinator.com
×
Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
history
|
from
register
Alignment is not free: How model upgrades can silence your confidence signals
(
variance.co
)
121 points
by
karinemellata
on May 6, 2025
|
past
|
67 comments
We used sparse autoencoders to explain LLM moderation flags of violent threats
(
variance.co
)
6 points
by
karinemellata
on April 21, 2025
|
past
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
×
HN For You
Display Mode
Highlight
Top
Only
Debug mode
Sign Out
API Key:
Connect
Create an account
to get your API key.