Metacelsus's comments

The name "mythos" seems a bit too eldritch for my liking. Brings to mind Cthulhu.

>Legit password resets for example come from more random top level domains with "microsoft" in it, like microsoftonline.com

Or aka.ms


Yeah, missing the booster sep was a real bummer

Interestingly, in "The Island" Dr. Merrick pitched investors on growing brainless clones, but actually kept the brains in, because it worked better (and gave him a labor supply).

Anencephaly is a thing. Though those babies don't survive much past birth.

Most do not, which suggests that surviving to adulthood would require more of a brain.

Some anencephalic babies survive for months, even years, and function mentally: https://www.dailymail.co.uk/news/article-2226647/Nickolas-Co...

I am very sceptical that you could create a clone with enough of a brain to survive and guarantee the clone will have no awareness.


And if the price reflected the externalities of factory farming, eggs would be even more expensive!


IN MICE


Yayyyyyyyyyyyy


As a stem cell biologist: my guess is that it doesn't help much


I'm glad to see Dario and Anthropic showing some spine! A lot of other people would have caved.


According to benchmarks in the announcement, healthily ahead of Claude 4.6. I guess they didn't test ChatGPT 5.3 though.

Google has definitely been pulling ahead in AI over the last few months. I've been using Gemini and finding it's better than the other models (especially for biology where it doesn't refuse to answer harmless questions).


Google is way ahead in visual AI and world modelling. They're lagging hard in agentic AI and autonomous behavior.


The general-purpose ChatGPT 5.3 hasn't been released yet, just 5.3-codex.


It's ahead in raw power but not in function. Like it's got the world's fastest engine but only one gear! Trouble is some benchmarks only measure horsepower.


> Trouble is some benchmarks only measure horsepower.

IMO it's the other way around. Benchmarks only measure applied horsepower on a set plane, with no friction, where your elephant is a point sphere. Goog's models have always punched above what benchmarks said in real-world use at high context. They don't focus on "agentic this" or "specialised that", but the raw models, with good guidance, are workhorses. I don't know any other models where you can throw lots of docs at them and get proper context following and data extraction from wherever the data is to where you need it.


> especially for biology where it doesn't refuse to answer harmless questions

Usually, when you decrease false positive rates, you increase false negative rates.

Maybe this doesn't matter for models at their current capabilities, but if you believe that AGI is imminent, a bit of conservatism seems responsible.


Google's models and CLI harness feel behind in agentic coding compared to OpenAI and Anthropic


I gather that 4.6's strengths are in long-context agentic workflows? At least over Gemini 3 Pro preview, Opus 4.6 seems to have a lot of advantages


It's a giant game of leapfrog; shift or stretch time out a bit and they all look equivalent


The comparison should be with GPT 5.2 Pro, which has been used successfully to solve open math problems.

