More

gnatolf · 2026-04-02T05:16:38 1775106998

Well, there is a financial 'sink' - stockpiles and ammunition or other non-reusable military gear are basically the definition of money 'destroyed'. Their political value is almost non-existent actual money. If any, at all.

chickenbig · 2026-04-02T07:55:17 1775116517

> stockpiles and ammunition or other non-reusable military gear are basically the definition of money 'destroyed'

Goods like longer-lasting food, medical supplies or a strategic oil reserve are not wasted. The money that went into supplying them has gone back into the economy, and they serve a more strategic purpose than the market participants could have borne (i.e. societal insurance policies). The same could also be said of military stockpiles, and continuing to buy them sustains a capability that is hard to get back once lost.

NetMageSCW · 2026-04-02T14:41:32 1775140892

Those stockpiles weren’t created by putting money into a shredder and getting ammunition out. They were created by paying for the materials and labor. At that point the government’s money is frozen and stockpiled, but the economy still has the money that was spent.

gnatolf · 2026-03-11T17:25:05 1773249905

Sorry but isn't the bottleneck then simply to do even relevant things? Like how much of a qualified backlog do you have that your pipeline does not run dry?

gnatolf · 2026-02-19T19:04:12 1771527852

So let's put things we're interested in in the benchmarks.

I'm not against pelicans!

ghurtado · 2026-02-19T19:50:21 1771530621

I think the reason the pelican example is great is because it's bizarre enough that it's unlikely that to appear in the training as one unified picture.

If we picked something more common, like say, a hot dog with toppings, then the training contamination is much harder to control.

troymc · 2026-02-19T22:29:37 1771540177

I think it's now part of their training though, thanks to Simon constantly testing every new model against it, and sharing his results publicly.

There's a specific term for this in education and applied linguistics: the washback effect.

rvnx · 2026-02-19T20:14:26 1771532066

It's the most common SVG test, it's the equivalent of Will Smith eating spaghettis, so obviously they benchmax toward it

gnatolf · 2026-02-17T22:01:19 1771365679

Good point. So much functionality gets commoditized, we have to move goalposts more or less constantly.

gnatolf · 2026-02-17T21:20:18 1771363218

While this is funny, the actual race already started in how companies can nudge LLM results towards their products. We can't be saved from enshittification, I fear.

raddan · 2026-02-17T21:46:20 1771364780

I am excited about a future where I am constantly reminded to like and subscribe my LLM’s output.

abelitoo · 2026-02-17T22:57:28 1771369048

I'm concerned for a future where adults stop realizing they themselves sound like LLMs because the majority of their interaction/reading is output from LLMs. Decades of corporations being the ones molding the very language we use is going to have an interesting effect.

gnatolf · 2026-02-14T22:36:05 1771108565

More specifically regarding spec-driven development:

There's a good reason that most successful examples of those tools like openspec are to-do apps etc. As soon as the project grows to 'relevant' size of complexity, maintaining specs is just as hard as whatever other methodology offers. Also from my brief attempts - similar to human based coding, we actually do quite well with incomplete specs. So do agents, but they'll shrug at all the implicit things much more than humans do. So you'll see more flip-flopped things you did not specify, and if you nail everything down hard, the specs get unwieldy - large and overly detailed.

zozbot234 · 2026-02-15T00:03:16 1771113796

> if you nail everything down hard, the specs get unwieldy - large and overly detailed

That's a rather short-sighted way of putting it. There's no way that the spec is anywhere as unwieldly as the actual code, and the more details, the better. If it gets too large, work on splitting a self-contained subset of it to a separate document.

lelanthran · 2026-02-15T09:31:10 1771147870

> There's no way that the spec is anywhere as unwieldly as the actual code, and the more details, the better.

I disagree - the spec is more unwieldy, simply by the fact of using ambiguous language without even the benefit of a type checker or compiler to verify that the language has no ambiguities.

skydhash · 2026-02-15T17:02:41 1771174961

People are keen to forget that programming languages are specs. And a good technique for coding is to build up you own set of symbols (variables, struct, and functions) so that the spec become easier to write and edit. Writing spec with natural language is playing russian roulette with the goals of the system, using AI as the gun.

gnatolf · 2026-02-14T22:27:59 1771108079

Everybody feels like this, and I think nobody stays ahead of the curve for a prolonged time. There's just too many wrinkles.

But also, you don't have to upgrade every iteration. I think it's absolutely worthwhile to step off the hamster wheel every now and then, just work with you head down for a while and come back after a few weeks. One notices that even though the world didn't stop spinning, you didn't get the whiplash of every rotation.

gnatolf · 2026-02-08T07:30:11 1770535811

That is a wobbly assertion. You certainly would need to run the same compiler, forgo any recent optimisations, architecture updates and the likes if your code has numerical sensitive parts.

You certainly can get identical results, but it's equally certainly not going to be that simple a path frequently.

AlexeyBrin · 2026-02-08T12:37:22 1770554242

> You certainly can get identical results, but it's equally certainly not going to be that simple a path frequently.

But at least I know that if I need to, I can do it. With an LLM, if you don't store the original weights, all bets are off. Reproducibility of results can be a hard requirement in certain cases or industries.

layer8 · 2026-02-08T13:05:41 1770555941

The more important point is that even when you don’t get identical binary output, you still get identical observable behavior as specified by the programming language, unless there’s a compiler bug. That’s not the case for LLMs, they are more like an always randomly buggy compiler. You wouldn’t want to use such a compiler.

gnatolf · 2026-02-05T19:03:14 1770318194

Absolutely. A technically correct bike is very hard to draw in SVG without going overboard in details

falloutx · 2026-02-05T19:26:49 1770319609

Its not. There are thousands of examples on the internet but good SVG sites do have monetary blocks.

https://www.freepik.com/free-photos-vectors/bicycle-svg

jefftk · 2026-02-05T21:08:48 1770325728

Several of those have incorrect frames:

https://www.freepik.com/free-vector/cyclist_23714264.htm

https://www.freepik.com/premium-vector/bicycle-icon-black-li...

Or missing/broken pedals:

https://www.freepik.com/premium-vector/bicycle-silhouette-ic...

https://www.freepik.com/premium-vector/bicycle-silhouette-ve...

http://freepik.com/premium-vector/bicycle-silhouette-vector-...

gnatolf · 2026-02-05T21:41:48 1770327708

From smaller to larger nitpick, there's basically something wrong with all of the first 15 or so of these drawings. Thanks for agreeing :)

RussianCow · 2026-02-05T20:22:23 1770322943

I'm not positive I could draw a technically correct bike with pen and paper (without a reference), let alone with SVG!

gnatolf · 2026-01-23T05:26:04 1769145964

Yes, even if you create a single person account, you create an 'organization' to be billed. That's the whole confusion here. Y'all seemingly don't have an account at anthropic?

HN For You