More

smokel · 2026-06-05T17:29:11 1780680551

For those too lazy to watch someone talk on video for ages to make a point:

The link is to a famous YouTuber called PewDiePie and he uses a local LLM to parse his email, to save time with that. They have an autoreply system and get notified about urgent matters.

smokel · 2026-06-04T18:10:41 1780596641

Most kids that grew up during the timeline you described had no interest in computer architecture. The small minority that did care is probably the same size now.

The other 99% who were into yoyo-ing back then are now into TikTok, that's all.

Swizec · 2026-06-04T18:52:49 1780599169

> The other 99% who were into yoyo-ing back then are now into TikTok, that's all.

Hey dude, some of us were yo-yoing while waiting for Gentoo to build from stage 0. Compiling an OS on a single-core Athlon takes time.

For the 3 days it takes to build all the way up to KDE, you have no computer. Hope you didn’t forget something

lucaspiller · 2026-06-05T16:04:09 1780675449

Don't forget the fan controllers to try to make it silent and the neon lights. I still have that machine at my parents house, used it a couple of years ago to rip all my teenage CDs to digital formats.

picofarad · 2026-06-04T19:29:57 1780601397

distcc-pump And, I forget what the toolchain setup is called, but on gentoo its literally just `emerge -1av <toolchain-thing> distcc` on machine with beef and just `emerge -1 distcc` on athlon...

I found out how to do it consistently in 2010 and its like black magic knowing how to target a real OS at BS hardware.

Swizec · 2026-06-04T20:43:17 1780605797

I was doing this in 2003 and my computer was also the internet/network router for our house. When that thing was down, you had no access to external information that you didn’t pre-save somewhere.

One time I forgot to install network drivers and had to download them through my flip phone via GPRS and then awkwardly load onto the computer via a clunky USB connection. Fun times.

Also my English wasn’t this good yet. I’m sure it would’ve been a lot easier had I actually understood all the tutorials and documentation fully.

knotimpressed · 2026-06-04T23:39:59 1780616399

Some of my least favourite nights and most cherished childhood memories involve troubleshooting broken or missing network drivers the only functional Linux box I had working. Never had to use a flip phone, but sure came close a few times.

Nothing I’d ever willingly re-live if given the chance, but always fun to look back on and grin.

themanmaran · 2026-06-04T18:34:12 1780598052

I'd wager that even if you didn't nerd out on computer architecture, just living through progression of CDs -> mp3s -> ipods -> streaming gives kids a better grounding than the iPad is where music comes from they have today

trumpdong · 2026-06-04T21:16:22 1780607782

At school we have Disney Plus with a box with a thing with a hole in it! https://www.reddit.com/r/KidsAreFuckingStupid/comments/1tv4f...

lukan · 2026-06-04T20:23:13 1780604593

I would argue yoyo is way more healthy than TikTok.

petemill · 2026-06-05T02:44:16 1780627456

The difference is more of those people can use local file management than the new generation joining office environments.

smokel · 2026-06-04T08:05:11 1780560311

Do you swap SIM cards all the time? This seems to be the biggest blocking issue for me.

I tried switching phones once a week, which was heavenly. Might try that again, it requires some discipline.

mrweasel · 2026-06-04T09:54:19 1780566859

Apparently some phone companies can/will provide you with aux SIMs, which allows two phone to share a number, or so I've been lead to believe. I can't find a single provider here in Denmark that will issue me such a SIM. Kinda sad about that, because it would solve most of my issues.

I need a smartphone for a few things every so often, but most of the time a dumb phone is perfectly fine.

tristanj · 2026-06-04T10:44:56 1780569896

I do not, the gym phone has its own SIM. I have it running a cheap data-only eSIM from esimdb.com

If you need synchronized phone / text messages, I suggest Google Voice. When anyone rings your (free) Google Voice number, it will forward the call to multiple phone numbers. It will ring as a regular phone call, not as an app notification. However, text messages appear as app notifications.

smokel · 2026-06-03T06:23:19 1780467799

The academic paper is here: https://arxiv.org/abs/2606.03811

It's not fully described how things work exactly, but apparently it does not transfer entire LLMs as part of the worm. Now that would be interesting :)

tiborsaas · 2026-06-03T13:12:27 1780492347

The abstract says:

> The worm parasitically uses compromised machines to run open-weight large language models (LLMs) to sustain its reasoning, or extend its reach for further attacks.

smokel · 2026-06-03T13:53:56 1780494836

Thanks for pointing that out. I scanned the paper and found that in their main experiments, they use a shared GPU resource and do not copy LLMs to target machines. Apparently they did other experiments in the ablation study where they did copy LLMs.

So it's even worse than I expected. The intended worm can spread through my thermostat, and when it reaches a GPU host, it can spread even harder. Fun times ahead.

BLanen · 2026-06-03T14:41:09 1780497669

I wonder if gamma ray memory corruption will induce a sort of mutation and selection effect on non-ecc-memory hosts which will make the worms effectively evolve.

rdedev · 2026-06-03T19:29:53 1780514993

This reminded me of the geth from mass effect. They get smarter as more geth "agents" network together.

What if there is a worm that spread through thermostats and another that spread through smart fridges and they finally infect a laptop with a gpu. They can exchange notes while they are there. Fun times

saltcured · 2026-06-03T16:54:15 1780505655

You'll just have to starve it with a bunch of thermostats that lead it towards the GPU rich honey pot where you will extract it...

a1o · 2026-06-03T10:50:56 1780483856

I think an approach could be to use some engineered security issue or however people build botnets, and give it some AI llm that is small and minimal but comes with instructions to download models from hugging face, and some other minimal prompts and descriptions of tools. Then it could use this to grow in infected computers and try find more capable and vulnerable computers to run better capable models and also devise some minimal communication between the different points of the botnet. Perhaps set itself a goal to dominate the biggest amount of compute and have some other goal. Would be curious to see what happens.

m3kw9 · 2026-06-03T13:38:05 1780493885

When the worm makes someone's machine start to sound like a leaf blower, you are found out.

hamburgererror · 2026-06-03T07:10:11 1780470611

In the abstract, what does it mean "the attacker's marginal cost per new infection is zero"?

amoshebb · 2026-06-03T07:16:25 1780470985

If you infect a machine with GPU enough to run the localLLM needed to steal another machine, you can let it burn tokens all day for free because whoever you stole the first one from will pay the electric bill.

cyanydeez · 2026-06-03T12:18:05 1780489085

We're getting closer to the Matrix's "We do know it was us who blackened the skies"

smokel · 2026-06-01T06:54:34 1780296874

Well, people used MS-DOS which had basically no security model at all for at least 10 years. Most viruses were benign, but it was almost trivial to simply wipe the entire hard disk. People generally didn't care, and made backups.

Things have become a bit more complicated now that machines are connected all the time, and the risk of infection is no longer limited to physically inserting a floppy disk into a machine.

I suspect that the solution is not so much in trying to make our current systems secure, but to make disconnection more practical.

smokel · 2026-05-27T21:11:40 1779916300

Looks really nice. Do you plan to support this in the future? Are you planning to foster a community of developers around it, or are you thinking about hosting it as a service?

Asking for a friend, who also enjoys building projects with LLMs, but publishing and supporting them not so much.

nathanstitt · 2026-05-27T21:15:40 1779916540

yes, I'm planning to support it indefinitely - my company (~10 users) just started using it so we're very invested in it's success.

I'd love for people to write whatever crazy packages they can think of for it so it has a rich ecosystem. It has the ability for admins to add a git or NPM reference into the packages list and install packages on the fly like wordpress supports.

As for hosting as a service, maybe someday. I also own the tinycld.com so who knows.

smokel · 2026-05-27T17:49:42 1779904182

AutoCAD is $175 per user per month [1].

[1] https://www.autodesk.com/products/autocad/buy

bigbuppo · 2026-05-27T18:28:47 1779906527

AutoCAD is still the budget-friendly CAD program it has always been. You don't build big boats in AutoCAD.

rrr_oh_man · 2026-05-27T18:40:33 1779907233

Winch Design [0], which have built some of the world's largest superyachts [1], seem to be using AutoCad. [2] Afaik it's also the same with Lürssen (but don't quote me on that)

[0] https://winchdesign.com/ [1] https://www.superyachts.com/directory/1516/winch-design/flee... [2] https://www.autodesk.com/design-make/articles/naval-architec...

numpad0 · 2026-05-27T21:15:13 1779916513

Likely not the "base model" of AutoCAD.

Those tools are used in ways that they're integral to processes. They have their equivalents of ticket systems that are linked to code repositories with LFSs and bunch of IDE type tools and automated and manual test systems and build systems. Their equivalents of PR discussions and Selenium screenshots needs to check all boxes in the right ways for legal and traceability purposes.

Without all that might be $175/user/month but you're not shipping apps with just vi and bare gcc.

noosphr · 2026-05-27T21:48:28 1779918508

>Without all that might be $175/user/month but you're not shipping apps with just vi and bare gcc.

You're right, Linus uses Emacs.

Our_Benefactors · 2026-05-27T19:02:38 1779908558

As someone completely outside the 3D design world who always thought of AutoCAD as the gold standard - really? What program would be used instead? Please enlighten me.

so_it_be · 2026-05-27T18:37:09 1779907029

Except LLM's even with Vision are still useless at AutoCAD let alone Revit (please dont quote SCAD LLM's at me, useless). Knowledge based approaches still win.

I might agree "AutoCAD" is the current level LLM's are at, but wait until your design departments discovers "Revit", its another ballpark (in wasted cots, engineers on site still get "clashes").

Revit costs are high, and the end results are marginally better - but local LLM's tokens are cheaper 24/7 at "AutoCAD" level - "Revit" level tokens will make Ubers CTO/COO weep harder than they already do. While producing results no better than "Revit" does (engineers still face "clashes").

Hasz · 2026-05-27T18:42:27 1779907347

Cadence and Ansys have entered the chat. A bunch of other highly-specialized engineering software has entered the chat. Licenses are on the order of 10-100k/seat.

For a pretty funny comment about pricing.

https://www.reddit.com/r/chipdesign/comments/1ajrli2/cadence...

analog_daddy · 2026-05-28T11:24:25 1779967465

Glad to run into this after some time!

I guess we are welcoming the software people to the world of expensive tools. Just sad that the FOSS alternatives of these tools are not as powerful whereas software industry still has FOSS tools to fall back on.

smokel · 2026-05-27T17:44:04 1779903844

Does this analysis factor in potential caching of tokens on the server side? It seems that if they organize things well (as a model provider), they can save quite a lot on that. Looking at my Cursor statistics makes it clear that the token calculations are not at all trivial.

simonw · 2026-05-27T17:48:51 1779904131

I believe the ccusage tool I used takes cached token pricing into account.

smokel · 2026-05-27T17:37:49 1779903469

For coding assistance, I have tried OpenCode with several large open models through OpenRouter. All were fairly bad compared to Claude Opus. Could you provide some hints on how I should be holding these open models so that I might get more value out of them?

I agree with the common trope that open models lag behind by about a year, but something magical happened just around a year ago when the state of the art models became extremely useful. By this reasoning we're about to see open models perform well, but I'm afraid there is more to it than just waiting for another revolution around the sun.

Note, my application is coding assistance. Open models can be great for other purposes.

tariky · 2026-05-27T18:43:21 1779907401

I tried almost all OS models on opencode, none of them is on levels as opus 4.7.

In latest experiment I used opus for implementation plan then used cursor composer 2.5 for execution.

I must say that combo is really good. Main drawback of claude code is that is super slow. So when paired with composer that is super fast it flies.

cainxinth · 2026-05-27T19:24:42 1779909882

No one is claiming that OS is as good. They are saying it isn't that far behind SOTA commercial products. So why pay exorbitantly just to get something only a few percent better than the free option?

But there have been very good open source office apps for decades and few enterprises use them, so perhaps this is just the nature of B2B purchasing committees and 'nobody getting fired for buying IBM.'

Alex-Programs · 2026-05-28T15:37:03 1779982623

Because failures compound. My productivity has substantially improved since I switched from open models to a Codex subscription, because it doesn't need hand holding, and it doesn't pull stupid tricks occasionally.

slopinthebag · 2026-05-27T19:09:56 1779908996

Do more planning yourself, be smart about the context, break down tasks into smaller components, give it more guidance. You can't just lazily prompt it to complete large features autonomously and expect good results.

amilios · 2026-05-27T19:26:48 1779910008

But if the closed-source models can do this without the additional effort, that's a significant gap, no?

bigfishrunning · 2026-05-27T19:42:00 1779910920

See that's the thing, they can't. Every model needs hand holding and guidance.

amilios · 2026-05-27T19:57:23 1779911843

some require less hand-holding than others though

myaccountonhn · 2026-05-28T10:28:39 1779964119

No one is trying to argue that OS models are better than Opus 4.7. It's simply that they're good enough and cheaper.

10000truths · 2026-05-27T19:40:03 1779910803

The point is that the price gap is so much larger than the capability gap, that even with the extra compute needed to make up for the lack of capability, you can still come out ahead in terms of amortized $/work done.

flexagoon · 2026-05-27T20:04:37 1779912277

Is it really when they are hundreds of times more expensive?

eikenberry · 2026-05-27T19:44:28 1779911068

That is the 3-6 month sota-open gap people talk about, a time-window that continues to move as new models are released on both sides.

grttq · 2026-05-27T22:39:48 1779921588

Do you know what economic trade offs are?

Both implicit and explicit..?

eikenberry · 2026-05-27T19:38:44 1779910724

+1 .. just wanted to reiterate that this is the answer. The open models work great if you just do a little more of the design/architectural work up front and organize your work appropriately.

aniceperson · 2026-05-27T20:52:51 1779915171

a good harness is supposed to do what you are describing. sonnet on pi.dev is pretty terrible but fast. Claude Code has ridiculous amounts of prompt engineering at system prompt level and sub session spawing combined with low temperature, to provide the predictable results people like. CC screws up and you never see, because the harness auto corrects, while on OSS you see everything, and does not comes with the level of monitoring by default.

smokel · 2026-05-25T12:39:47 1779712787

In order to find out how real humans reply:

Please guess a number between 1 and 100.

bestouff · 2026-05-25T12:45:32 1779713132

relativeadv · 2026-05-25T12:59:17 1779713957

Grimblewald · 2026-05-25T23:32:18 1779751938

6*7=42

pieter_mj · 2026-05-25T14:16:51 1779718611

maxloh · 2026-05-25T13:55:40 1779717340

snerbles · 2026-05-25T13:22:18 1779715338

Ekaros · 2026-05-25T13:08:20 1779714500

Barbing · 2026-05-25T13:01:03 1779714063

Sure!

orphea · 2026-05-25T12:59:23 1779713963

rithdmc · 2026-05-25T12:57:08 1779713828

√67

zulban · 2026-05-25T12:51:38 1779713498

HappMacDonald · 2026-05-25T21:20:23 1779744023

i+7up

Wonkey · 2026-05-25T19:27:24 1779737244

nom · 2026-05-25T19:21:31 1779736891

casper14 · 2026-05-25T14:21:12 1779718872

HN For You