Hacker News
Why Is Anything Conscious? (arxiv.org)
3 points by dboreham 3 hours ago | past | discuss
Real Money, Fake Models: Deceptive Model Claims in Shadow APIs (arxiv.org)
2 points by cxplay 5 hours ago | past | discuss
Towards a Science of Scaling Agent Systems (arxiv.org)
1 point by Anon84 5 hours ago | past | discuss
Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks (arxiv.org)
1 point by lbeurerkellner 8 hours ago | past | 1 comment
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via CI (arxiv.org)
102 points by mpweiher 12 hours ago | past | 36 comments
Multimodal Coding Agents as In-Context Policy Learners for Robot Manipulation (arxiv.org)
1 point by vaishak2future 17 hours ago | past | 1 comment
Technological Folie à Deux (arxiv.org)
3 points by rglover 21 hours ago | past | discuss
When ChatGPT is gone: Creativity reverts and homogeneity persists (2024) (arxiv.org)
3 points by doener 21 hours ago | past | 2 comments
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via CI (arxiv.org)
2 points by stepri 23 hours ago | past | discuss
Let It Flow: Agentic Crafting on Rock and Roll (arxiv.org)
3 points by killerdhmo 1 day ago | past | discuss
Agents of Chaos (arxiv.org)
25 points by pagade 1 day ago | past | 6 comments
Specialization After Generalization: Towards Understanding Test-Time Training (arxiv.org)
1 point by teleforce 1 day ago | past | discuss
MLP Memory: A Retriever-Pretrained Memory for Large Language Models (arxiv.org)
1 point by teleforce 1 day ago | past | discuss
Semi-formal reasoning helps agents reason about code without executing the code (arxiv.org)
1 point by dnw 1 day ago | past | discuss
Why Language Models Hallucinate (2025) (arxiv.org)
2 points by doener 1 day ago | past | discuss
Nested Training for Mutual Adaptation in Human-AI Teaming (arxiv.org)
2 points by PaulHoule 2 days ago | past | discuss
Cybersecurity Data Extraction from Common Crawl (arxiv.org)
5 points by PaulHoule 2 days ago | past | discuss
Bootstrapping Fuzzers for Compilers of Low-Resource Language Dialects Using LLMs (arxiv.org)
3 points by matt_d 2 days ago | past | discuss
Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond) (arxiv.org)
2 points by mpweiher 2 days ago | past | discuss
Next Embedding Prediction Makes World Models Stronger (arxiv.org)
1 point by lucrbvi 2 days ago | past | discuss
Unlocking Python's Cores: Energy Implications of Removing the GIL (arxiv.org)
2 points by runningmike 2 days ago | past | 1 comment
V1: Unifying Generation and Self-Verification for Parallel Reasoners (ArXiv) (arxiv.org)
2 points by harman2607 2 days ago | past | 1 comment
Beyond Language Modeling: An Exploration of Multimodal Pretraining (arxiv.org)
1 point by gmays 2 days ago | past | discuss
Statistical Uncertainties in the Non-Gravitational Acceleration of 3I/Atlas (arxiv.org)
1 point by Jimmc414 2 days ago | past | discuss
Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory (arxiv.org)
2 points by simonpure 2 days ago | past | discuss
Generative Linguistics, LLMs, and the Social Nature of Scientific Success (arxiv.org)
3 points by 3willows 2 days ago | past | discuss
Dyson spheres on H-R diagram (arxiv.org)
4 points by zahrevsky 3 days ago | past | discuss
Asymmetric Goal Drift in Coding Agents Under Value Conflict (arxiv.org)
1 point by lrakster 3 days ago | past | discuss
Agentic Code Reasoning (arxiv.org)
3 points by gmays 3 days ago | past | discuss
General Agentic Memory via Deep Research (arxiv.org)
2 points by gmays 3 days ago | past | discuss
