More

minimaxir · 2026-04-16T15:45:08 1776354308

Wait what? Opus 4.6 currently has a 3x multiplier and Opus 4.7 does not require more compute.

minimaxir · 2026-04-16T15:23:49 1776353029

The more efficient tokenizer reduces usage by representing text more efficiently with fewer tokens. But the lack of transparancy does indeed mean Anthropic could still scale down limits to account for that.

minimaxir · 2026-04-14T17:26:38 1776187598

Given the alleged recent extreme reduction in Claude Code usage limits (https://news.ycombinator.com/item?id=47739260), how do these more autonomous tools work within that constraint? Are they effectively only usable with a 20x Max plan?

EDIT: This comment is apparently [dead] and idk why.

giancarlostoro · 2026-04-14T18:57:06 1776193026

I've been talking to friends about this extensively, and read all sorts of different social media posts on X where people deep dove things (I'm at work so I don't have any links handy - though I did submit one on HN, grain of salt, unsure how valid it is but it was interesting: https://news.ycombinator.com/item?id=47752049 ).

I think the real issue stems from the 1 Million token context window change. They did not anticipate the amount of load it would give you. That first few days after they released the new token window, I was making amazing things in one single session from nothing, to something (a new .NET based programming language inspired by Python, and a Virtual Actor framework in Rust). I think since then they've been trying too many things to tweak things, whilst irritating their users.

They even added a new "Max" thinking mode, and made "High" the old medium, which is ridiculous because you think you're using "High" but really you're not. There's a hidden config file to change their terrible defaults to let Claude be smarter still, and apparently you can toggle off the 1M tokens.

I think the real fix, and I'm surprised nobody there has done this yet, is to let the user trim down their context window.

Think about it, you used to have what? 350k tokens or so? Now Claude will keep sending your prompt from 30 minutes ago that's completely irrelevant to the back-end, whereas 3 months ago it would have been compacted by now.

Others have noted that similar prompting for some ungodly reason adds tens of thousands of extra garbage tokens (not sure why).

Edit looks like someone figured out that if you downgrade your version of Claude Code and change one single setting it unruins Claude:

https://news.ycombinator.com/item?id=47769879

SkyPuncher · 2026-04-14T21:03:23 1776200603

Yea, I've realized that if I stay under 200k tokens I basically don't have usage issues any more.

A bit annoying, but not the end of the world.

consumer451 · 2026-04-14T23:03:00 1776207780

super-edit: Sorry, this is not a usage related question, I have move it to: https://news.ycombinator.com/item?id=47772971

Here is the question for which I cannot find an answer, and cannot yet afford to answer myself:

In Claude Code, I use Opus 4.6 1M, but stay under 250k via careful session management to avoid known NoLiMa [0] / context rot [1] crap. The question I keep wanting answered though: at ~165k tokens used, does Opus 1M actually deliver higher quality than Opus 200k?

NoLiMa would indicate that with a ~165k request, Opus 200k would suck, and Opus 1M would be better (as a lower percentage of the context window was used)... but they are the same model. However, there are practical inference deployment differences that could change the whole paradigm, right? I am so confused.

Anthropic says it's the same model [2]. But, Claude Code's own source treats them as distinct variants with separate routing [3]. Closest test I found [4] asserts they're identical below 200K but it never actually A/B tests, correct?

Inside Claude Code it's probably not testable, right? According to this issue [5], the CLI is non-deterministic for identical inputs, and agent sessions branch on tool-use. Would need a clean API-level test.

The API level test is what I really want to know for the Claude based features in my own apps. Is there a real benchmark for this?

I have reached the limits of my understanding on this problem. If what I am trying to say makes any sense, any help would be greatly appreciated.

If anyone could help me ask the question better, that would also be appreciated.

[0] https://arxiv.org/abs/2502.05167

[1] https://research.trychroma.com/context-rot

[2] https://claude.com/blog/1m-context-ga

[3] https://github.com/anthropics/claude-code/issues/35545

[4] https://www.claudecodecamp.com/p/claude-code-1m-context-wind...

[5] https://github.com/anthropics/claude-code/issues/3370

onenite · 2026-04-14T23:32:48 1776209568

2 parent comments above say that you can use older version of claude code with opus 200k to compare. my guess is that eventually you’ll be able to set it in model settings yourself

dacox · 2026-04-14T19:03:17 1776193397

Yeah, I have been seeing lots of comments, tweets, etc, but given everything I have learned about these models - i do not think the change to 1M was innocuous. I'm not sure what they've claimed publicly, but I'm fairly certain they must be doing additional quantization, or at minimum additional quantization of the KV cache. Plus, sequence length can change things even when not fully utilized. I had to manually re-enable the "clear context and continue" feature as well.

giancarlostoro · 2026-04-14T19:47:51 1776196071

I used the heck out of it when it was announced, and it felt like I was using one of the best models I've ever used, but then so were all of their other customers, I don't think they accounted for such heavy load, or maybe follow up changes goofed something up, not sure. Like I said, the 1M token, for the first few days allowed me to bust out some interesting projects in one session from nothing to "oh my" in no time.

I'm thinking they should go back to all their old settings and as a user cap you at their old token limit, and ask you if you want to compact at your "soft" limit or burst for a little longer, to finish a task.

Jimpulse · 2026-04-15T13:33:56 1776260036

How do you re-enable that feature?

dgb23 · 2026-04-15T05:19:03 1776230343

The future of harnesses cannot be „resend the whole history on every step“ or whatever this terrible compaction is.

Most of the context is unstructured fluff, much of it is distracting or even plain wrong. Especially the „thinking“ tokens are often completely disjoint halucinations that don’t make any sense.

I think what will have to happen is that context looks less like a long chat and action log and more like a structured, short, schema validated state description, plus a short log trace that only grows until a checkpoint is reached, which produces a new state.

dyauspitr · 2026-04-15T17:55:38 1776275738

You’re going to loose a lot of natural language nuances then. Plus git is essentially your structured, validated state description.

imhoguy · 2026-04-14T19:30:01 1776195001

AI race to the bottom is a debt game now. Once the party is over somebody will have to pay the bill.

timacles · 2026-04-14T21:55:27 1776203727

It’s going to be crazy with the explanation they come up with why the us public has to pay to bail out AI for national security.

In a way, it’s true if china has superior AI then it’s dominance over US will materialize. But it’s not hard to see how this scenario is being used to essential lie and scam into trillions of debt.

Its interesting how the cutthroat space of big tech has manifested into an incidious hyper capitalist system where disrupting a system is it’s primary function. The system in this case is world order and western governments

joquarky · 2026-04-14T22:39:20 1776206360

"Move fast and break things" broke containment to the tech industry. Now you can see it everywhere.

breakingcups · 2026-04-14T18:13:03 1776190383

You seem to be vouched for now, no longer dead for me.

minimaxir · 2026-04-14T18:22:28 1776190948

Hmm, I can't edit the original comment to retract that edit either. Either my account is flagged for something or HN is being weird.

0123456789ABCDE · 2026-04-15T11:35:18 1776252918

this comment was made almost 1h after the one at the root of the thread

you can make changes to your posts up to 10 minutes after they were originally created — see: https://news.ycombinator.com/newsfaq.html#:~:text=minutes%20...

minimaxir · 2026-04-15T17:21:16 1776273676

That is not what that feature does, it just is the time your comment is publically shown.

Users have 2 hours to edit comments, and the button was gone within 1 hour.

TacticalCoder · 2026-04-14T18:27:41 1776191261

Everything looks good to me: you don't look like you have a flagged account (but then I don't work for HN).

stavros · 2026-04-14T22:14:33 1776204873

It's not alleged: https://www.ghacks.net/2026/03/27/anthropic-reduces-claude-s...

minimaxir · 2026-04-14T23:45:42 1776210342

That's a separate change to what the linked HN post described.

minimaxir · 2026-04-13T18:12:46 1776103966

Gemma 4 is not supported by the MLX engine yet.

Confiks · 2026-04-14T04:36:10 1776141370

It is, as I'm running it; it has been added this week. As I said I'm running the main version from Github and doing nothing special, see: https://news.ycombinator.com/item?id=47761308

minimaxir · 2026-04-13T15:42:36 1776094956

I have been building/vibecoding a similar tool and unfortunately came to the conclusion that in practice, there are just too many features dependent on the full Chrome stack that it's just more pragmatic to use a real Chromium installation despite the file size. Performance/image generation speed is still fine, though.

In Rust, the chromiumoxide crate is a performant way to interface with it for screenshots: https://crates.io/crates/chromiumoxide

ospider · 2026-04-14T02:35:07 1776134107

> there are just too many features dependent on the full Chrome stack

Do you mind elaborating on what features are missing?

mnutt · 2026-04-13T20:24:53 1776111893

I think you could in theory have a similar webkit-based stripped down headless crate that might have a good tradeoff of features, performance, and size.

minimaxir · 2026-04-13T03:35:50 1776051350

Because people might have missed it last thread, here's dang's response to the discourse:

> I don't think I've ever seen a thread this bad on Hacker News. The number of commenters justifying violence, or saying they "don't condone violence" and then doing exactly that, is sickening and makes me want to find something else to do with my life—something as far away from this as I can get. I feel ashamed of this community.

> Edit: for anyone wondering (or hoping), no I'm not leaving. That was a momentary expression of dismay.

https://news.ycombinator.com/item?id=47728106

mcdeltat · 2026-04-13T04:02:12 1776052932

I recently saw a lecture by neuroscientist Robert Sapolsky [1] which discussed the complexities of human violence. We both condone and don't condone violence all the time, depending on social context. And furthers, our ways of expressing violence are varied (even down to tiny things like the silent treatment). We (along with other animals) have always used aggression to enforce social order and obtain social benefit.

Perhaps something to think about in a scenario like this. Personally I think it's interesting that some people are so quick to condone aggressive attacks on powerful people, yet have no comment on those powerful people committing lower levels of violence against the masses. It's all social context.

[1] https://youtu.be/GRYcSuyLiJk?si=HhnAUKelmR7igO9x

Imustaskforhelp · 2026-04-13T13:45:10 1776087910

> Perhaps something to think about in a scenario like this. Personally I think it's interesting that some people are so quick to condone aggressive attacks on powerful people, yet have no comment on those powerful people committing lower levels of violence against the masses. It's all social context.

Can I just say that out of all of this discourse happening, this might be the most insightful yet succint position to explain my stance on all of this especially the "its all social context." line.

I feel like many of us here might share an answer publicly but I have always believed that if I am in the shoes of someone else, I might act the way they do so in a sense I understand the human part of it. A human did the violence and why. I understand that. Now we can call this violence inhuman, sure, but this action is still done by human and for many reasons. And I also understand why people condemn these actions, we wish to live in a clean and structural world and then we see the messiness of the world.

I just feel like just condemning an action would do nothing unless we change the ground conditions but that isn't in the hands of even many of us Hackernews users and this is basically a class aspect to it.

I personally feel like there are some similarities to this incident to the Trolley problem actually. Vsauce did a video about it worth watching[0]

Thank you for writing this comment.

[0]: https://www.youtube.com/watch?v=1sl5KJ69qiA

jbxntuehineoh · 2026-04-13T06:13:11 1776060791

only on this site would people need a neuroscience lecture to understand elements of human nature that are apparent to most elementary schoolers

yetihehe · 2026-04-13T06:40:42 1776062442

I believe that unique community of HN consist mostly of individuals that weren't able to fully understand those elements of human nature as elementary (and sometimes high-school) schoolers. I stand as one example of such person, it took me about 30 years before I understood that I lacked such innate understanding at school.

Teever · 2026-04-13T06:50:08 1776063008

There's also the international angle here.

How is a person from a nation that the US President has threatened to annex or invade supposed to feel about seeing domestic violence in the United States? From their perspective a divided United States is less of a personal threat to them.

All this talk about how 'we can't have this in a democracy!' forgets that many of us don't live in that particular democracy, and that particular democracy is threatening other democracies.

What should my response be if a North Korean General is executed? Or if a Russian oligarch 'falls out a window'? Or a corrupt Mexican politician is beheaded by a rival cartel?

These American oligarchs aren't my countrymen, They don't have my best interests in mind, they fund the people who threaten my country, and now they provide the American military with technology that it can use to attack my country.

Their lobbying and campaign contributes have resulted in a Mad King waging an unwinnable war that has severely damaged the global economy and has made my life demonstrably worse. I have never done anything to these people and yet they callously did this to all of us for personal profit well beyond what any human being could never need in a thousand life times.

At the end of the day the less cohesive the American tribe is the better off my tribe is. I wish our incentives were aligned but they just aren't and I am not in any way responsible for that.

ItsHarper · 2026-04-13T05:22:21 1776057741

I think you meant condemn, but otherwise, well said.

mcdeltat · 2026-04-13T06:11:55 1776060715

Ah yes in the second paragraph I definitely meant condemn, thank you.

UncleMeat · 2026-04-13T12:58:12 1776085092

It is fascinating to me that this was the thing that dang thinks is the most violent in the forum's history.

Not people advocating for hundreds of thousands of unnecessary deaths from covid. Not people advocating for bombing campaigns blowing children to smithereens. Not people advocating for mass cuts to programs treating people with tuberculosis. Not people advocating for mass cuts to programs feeding the starving. Not people defending ICE in murdering people either via gunshot or medical neglect in their disgusting prisons.

In fact, a lot discussion critical of that stuff just gets [flagged].

None of that counts as violence to dang. But threaten a billionaire? Oh that's a bridge too far.

minimaxir · 2026-04-11T00:10:45 1775866245

I didn't think Hacker News needed an explicit "calls for violence are bad" guideline but the comments here have shown otherwise.

hax0ron3 · 2026-04-11T18:15:34 1775931334

It would be extremely difficult to have politics discussion without condoning violence. Deciding what sorts of violence is ok is an inherent part of politics. In practice, there's no way to ban calls for violence without banning the discussion of wide swaths of political topics.

Teever · 2026-04-11T00:36:26 1775867786

Do you feel the same way about comments that support the US military action in Iran? Why or why not?

johnisgood · 2026-04-11T00:44:26 1775868266

It is unnecessary, and it was an obvious offense, not defense. Of course it is "bad". We (Trump) need(s) to stop creating wars and fucking up the economy, while killing others. It is bad all the way down.

chipsrafferty · 2026-04-11T02:54:47 1775876087

Which one is more bad?

Trump bombing hundreds of people or someone throwing a bomb at Trump because he keeps bombing hundreds of people?

vizzier · 2026-04-11T08:34:54 1775896494

People think the trolley problem is easy.

lovich · 2026-04-11T01:24:20 1775870660

If you grind people into a paste long enough, eventually some of them may object in one manner or another.

twoodfin · 2026-04-11T01:49:00 1775872140

I’m sorry, which specific people were “ground into paste” and when?

lovich · 2026-04-11T02:07:46 1775873266

Everyone too poor to thrive.

sneak · 2026-04-11T01:36:38 1775871398

I agree with the idea that calls for violence are bad; however most people in the world are more than happy to support both violence and calls for same against people and organizations they believe to be sufficiently significant threats.

Are calls for violence against Hitler during WW2 bad? How about the Japanese imperial navy?

How about calls for violence against Putin during his war of aggression?

This isn’t rhetoric; I’m just pointing out that it isn’t as black and white as people seem to make it. (It is black and white for me, as I’m with Asimov on the matter, but it isn’t for most humans.)

deaux · 2026-04-11T02:57:25 1775876245

If you can't think of a single occurrence in history that directly disproves your proposed guideline, it's time to drop whatever you're doing and study history.

If you can think of one, then you shouldn't be proposing introduction of guidelines that are blatantly false. Or would you like a "1+1 is not 2" guideline to accompany it?

stavros · 2026-04-11T00:51:22 1775868682

Are calls for violence bad when you're calling for throwing a molotov cocktail at a child? At an adult? At a serial killer? At someone who's about to shoot you unprovoked? At someone who murdered your family? At someone who's about to?

If you said "yes" to all of the above, I'd love to know your reasoning.

empthought · 2026-04-11T01:44:48 1775871888

Yes.

If you want a molotov cocktail thrown so badly, throw it yourself. Don't put it on other people to do it for you.

stavros · 2026-04-11T01:47:00 1775872020

Are the two choices "accept that violence is unconditionally bad" and "throw a molotov cocktail at Sam Altman's house"? Because that dichotomy seems a bit... false?

empthought · 2026-04-11T01:56:26 1775872586

Your question was about calling for violence.

lostlogin · 2026-04-11T00:53:51 1775868831

The general tone here is that freedom of speech is absolute and nothing should curtail that.

Not my personal view.

what · 2026-04-11T01:21:04 1775870464

I’d like to know your reasoning for answering “no” to all of the above.

stavros · 2026-04-11T01:26:30 1775870790

I guess we'll just have to find someone who answers no to all of that and ask them!

what · 2026-04-11T01:43:26 1775871806

I think my point was obvious. What is your justification for answering no to any of them?

stavros · 2026-04-11T01:45:30 1775871930

Alright, I'll explain. I don't think violence is bad against someone who's about to kill my family, because:

* I care about my family more than I care about a stranger.

* I care about people who don't kill people unprovoked more than I care about people who kill people unprovoked.

* My family are more than one person, versus the one killer.

That's why I answer no to that one.

what · 2026-04-11T02:09:40 1775873380

Sure, I care about certain people more than others and I’d be willing to use violence to defend myself or my family. But that’s not the same as cheering on or advocating for an attack on someone else that may or may not have done something to harm someone totally unrelated to you.

stavros · 2026-04-11T02:11:17 1775873477

It gets much more complicated when the person being harmed is someone who made and sold AI targeting systems that might be used against my country.

minimaxir · 2026-04-10T23:59:19 1775865559

It most likely tripped the flame war detector heuristic (comments > points), and there is definitely a flame war here.

EDIT: Looks like a mod rescued it (surprisingly) and it is now back to #2.

minimaxir · 2026-04-09T17:25:10 1775755510

Tweeting is easy. Managing the weirdos that respond to your tweets is hard.

boznz · 2026-04-10T05:46:12 1775799972

I think they have it set so that only followers can respond. Prevents most of the horrible stuff, but also downgrades you on the X algorithm. At least there are no weirdo's on the other social media platforms :-)

minimaxir · 2026-04-07T22:02:25 1775599345

Mythos is most definitely not in response to this announcement.

HN For You