More

veidr · 2026-06-08T12:24:47 1780921487

"coding is largely solved" and "our app uses Electron" are incompatible, self-refutational statements

veidr · 2026-06-06T02:02:44 1780711364

2025 xmas day, was at my wife's parents' house in rural Japan, my kids were all playing with their cousins, I was posted up with my laptop just listening to some podcast about the benefits of making time for long walks in middle age (as if! ~lol) while running another "agentic team" experiment — 12 agents in parallel.

I'd been feeding these bots a few projects, over and over — the hard part was the feeding them — that is, giving them enough well-defined work to do. They weren't yet good enough to write real software you could keep — at least I'd never seen that — and my experiments were just about finding the edges, building my intuition, and playing with processes that might be useful someday.

These things had built my kids' weird magical-dominoes games a few times by that point — but the experiment had been repeated so many times that you could argue we had "written" that software in English, with a spec that had been built, reworked, and rebuilt many times.

But this time, the bots were building me a bespoke git client, unlike any other, and unlike anything I would take the time to write — waaaay to complicated, with too little benefit. I wanted it, but only for this one niche use case.

It was a GUI client to manage a collection of repos, about 200 of them in a monorepo where every subproject was a git submodule , which are the universal counterpart to node_modules — while the latter is notorious for being "the heaviest object in the universe", git submodules are widely acknowledged to be the most annoying objects in the universe.

Nevertheless, I had this weird monorepo, and I wanted to visualize and do stuff to this list of independent repos that were also git submodules of the parent monorepo: sort by outstanding commits, divergence from upstream, recency of activity, etc. Visualize them differently based on these things. Search across them, including the source code on branches other than the current one. Show the branch counts and number of branches and commits that existed locally but not pushed upstream. A bunch more boring stuff like that, but done across the full set of repos.

That project itself wasn't even interesting to me; that software would be marginally useful to me if it existed and worked, but the main point it was just a large enough chunk of work to keep a team of bots busy all day without a human in the loop.

In December 2025, AI coding agents were already useful with a human in the loop. Opinions varied a lot about how useful they were, but to me it was obvious we were going to use them for the rest of our careers as software engineers.

It was not yet obvious that we were going to let them write huge swaths of code, or entire programs, without any humans in the loop. I had never seen that produce something that worked well enough to be worth keeping.

And then, that day, I did. I had structured the workflow so that the git client was on the screen and auto-refreshing. I was listening to the podcast, drinking coffee, reading the news. The git client was a crude window with a table in the background, a single column showing the full path to each repo, and nothing else.

Then the table expanded. It got color coded numbers representing the commit/branch counts. It suddenly gained styles, and looked nice. A contextual menu started popping up, repeatedly, and grew to include several more menu items over the next few minutes. New confirmation dialogs popped up as the bots implemented and exercised the various features from my spec.

I remember my field of vision narrowing as I started to focus on what the bots were doing. They were just executing my loop — one bot would implement one bullet from my spec, another bot would review the code while another bot manually tested it, and tried to break it, run a code review gauntlet in a loop until there were no more findings, repeat.

I could see the progress play out on my screen as they worked. I had watched bot teams work before, but it had always been pretty janky, and something like a bad game that nobody would play, or a stupid to-do-list app, or — more often — something that didn't actually work.

This was the first time I had ever seen it work. This was the grail we'd been looking for, not sure if it really existed: a fleet of bots successfully building a piece of complex, useful software without human assistance. I could tell it was working, because the adversarial testing and usability checks were all happening right before my eyes.

So it _is_ possible, I thought to myself.

They did it all morning. The app worked. I used it every day after that, for several weeks, until I finally got that entire monorepo converted to a more sensible git subtree-based arrangement.

In the half year since then I've been in a kind of manic state some of my friends call cyberpsychosis, chasing that dream. I've now seen agentic fleets successfully build many things. I've also seen a bunch of failures, some subtle, some catastrophic and hilarious. I'm still building my intuition, and the laws of physics in this universe are mutating every few weeks. It's wild.

I am fortunate enough to work at a place that doesn't pressure engineers to climb a token leaderboard, or to use AI beyond what we deem prudent. This kind of agentic no-humans-in-the-loop coding is prohibited. The policy is that in this era where we all generate more code than ever, even by hand, it's the quality bar that must go up, not the speed of production.

That's awesome because it keeps me grounded in the old ways, and confines my cyberpsychosis to my weekends and evenings. I usually spend the weekend building up a couple software plans, honing them as best I can, and then unleashing the clankers Sunday night.

I'll let them run all week, sometimes giving them a poke or flipping them over a couple time in the evening, and then the next Saturday morning, I see what I've got. What I'm mainly interested in is: How can agentic fleet-coding processes evolve to produce better software and require less human interaction and inspection? And the corollary: How can software architectures evolve to safely consume more of this fundamentally untrustable code?

It's thrilling. Exhilarating. The near-infinite subsidized tokens are about to finally run out this month, alas. But for the past 6 months it's easily the best $400/month I have ever spent. :)

LearnYouALisp · 2026-06-07T20:19:09 1780863549

Hm, narrows eyes after the tenth perfectly pair-spaced em dash.

Scans downward ... this does remind one of the general tone of fanfiction, which in fact comprises a large proportion of the text base of data.

//The realization that potentially many of these comments may be fun- or profit-motivated 'advertising'.

veidr · 2026-06-08T01:13:43 1780881223

I like to think I'm part of the reason the bots use em dashes so much, since I've been using them — or the ASCII "--" that we used to have to type to represent them in the pre UTF-8 times — since you could write stuff and post it on the internet.

veidr · 2026-06-02T02:00:36 1780365636

This fixes a dozens-of-times-per-day annoyance for me.

The grid is good, but even better is the instant virtual display switching.

Nowhere is the death-by-a-thousand-paper-cuts annoyance of modern macOS worse than having to hit Ctrl→→→→→→→ and suffer those repeated animations, over and over.

xp84 · 2026-06-02T02:29:45 1780367385

It's every action on Mac and iOS that does this, and it has been increasing in intrusiveness for a decade. I can't be sure why they do it, but it comes off as though their visual designers are immature, thinking we want to see their impressive animations not just in a demo, not just in a tutorial that we go through once, where we are meant to grasp the relationships between the things, but over and over again, all day long, for decades.

I freaking don't. One time was plenty. I don't want any animation. And the "reduce animation" feature's implementation is a slap in the face: all the delay -- that part is non-negotiable apparently -- but with blurry crossfades instead.

skydhash · 2026-06-02T04:03:59 1780373039

I'm using cwm (x11) without a compositor (never noticed tearing). And it's so nice when everything is not trying to be cute with shadows, animations and round corners. Animation only makes sense when there's a direct action that controls it (like when swapping spaces or hovering) or the system wanting to inform us (notifications). And it's better be fast. Otherwise it's just visual effects that quickly become tiring after a few days.

chamomeal · 2026-06-02T03:23:28 1780370608

It is absolutely, positively mind boggling that you have to sit through those animations. And key presses don’t even take effect if your new desktop until the animation is done. It’s just lunacy.

How does a company with infinite resources and talented designers come up with shit like that??

coolmitch · 2026-06-02T02:13:43 1780366423

yes! it's the worst!

I've been using Instant Space Switcher (which got a small callout in tfa) as a targeted fix for this, and it's lifechanging

eproxus · 2026-06-02T08:30:40 1780389040

I've also switched to Instant Space Switcher, it is soo good! Previously I used BetterMouse for only this feature but they made the space switching worse in later versions (slower, on-par with the default macOS speed).

Here's the link if anyone is curious: https://github.com/jurplel/InstantSpaceSwitcher

cpt_sobel · 2026-06-02T09:51:22 1780393882

I also used to use BetterTouch tool just for this feature, no idea what they have been thinking over at Apple with this delay.

saila · 2026-06-02T06:29:52 1780381792

You can also do Ctrl-UpArrow then click the space you want. This isn't instant, but it might be a little better than repeatedly cycling through each desktop, especially if you have a lot of them. Turning off "Automatically rearrange Spaces based on most recent use" is also a must IMO.

Personally, I only open one app per desktop and just use Command-Tab. If you hold Command after Command-Tab, you can select an app with having to cycle through all of them.

oneeyedpigeon · 2026-06-02T07:25:47 1780385147

> I only open one app per desktop

So what benefit do you get from multiple desktops?

sgustard · 2026-06-02T05:12:09 1780377129

Tried this? defaults write com.apple.dock expose-animation-duration -float 0.05; killall Dock

OrangeMusic · 2026-06-02T13:48:28 1780408108

Yup. Doesn't work.

veidr · 2026-05-25T07:35:32 1779694532

No, he's stated the opposite, e.g. https://x.com/jarredsumner/status/2058283214981251080?s=46

But AFAICT he's never suggested they reviewed all the code, and that they didn't seems like a pretty safe assumption given the volume, and timeline.

I personally think the test suite passing counts for something, and I would bet they also set up some pretty intense LLM-powered verification loops and quality gates (which I hope the forthcoming blog post will detail). I've seen mechanical LLM ports that went extremely well (though nowhere near this scale, so we could review the code (which is how I know they went well)).

I think the most hysterical reactions that we are seeing from some people are premature, knee-jerk responses. We're gonna _find out_ if the Rust version really is better than Zig version, and soon.

And even if it is better overall, I think if there is an AI-slop-induced major bug we are definitely gonna know that, too, because we have a highly motivated community of folks ready to tweet the shit out of it the instant it is found.

So even as a pretty heavy daily user of Bun, I'm actually really glad they did this. The value of the public experiment is high, and if new Bun sucks, well, I still have Deno.

veidr · 2026-05-23T14:27:49 1779546469

i miss smart people writing blog posts

that stopped after twitter

and went asymptotically downhill from there

approaching, but never quite literally getting to the point of eating a dog shit sandwich

(despite the same nauseous feeling and bad taste in your mouth)

simonw · 2026-05-23T15:17:45 1779549465

I frequently joke with people that the reason I have influence in the AI world is that I'm blogging like it's the early 2000s, when everyone else gave up on blogging as a medium.

It's only partly a joke.

veidr · 2026-05-23T15:24:29 1779549869

And, haven't you also been doing so since around the turn of the millennium?

So, you might also be repped writ large in their their training data...

  (;^_^)

throwaway27448 · 2026-05-23T14:34:47 1779546887

Substack is thriving, btw. Curiously I simply have less desire to read the thoughts of "smart" people than ever. Either write a proper book or distract me from the horrors of the world.

veidr · 2026-05-23T14:59:03 1779548343

yeah, but substack is mostly just another twitter low-engagement farm

also, your last-line worldview... i mean i get it, but...

just basically sounds like the twitter origin story (T_T)

st3phvee · 2026-05-23T19:20:30 1779564030

> but substack is mostly just another twitter low-engagement farm

That, plus it's also full to the brim with LinkedIn-esque AI slop. There are still some decent writers there, for sure, but Substack is going downhill fast as more grifters join the platform in the hopes of making a quick, easy buck.

throwaway27448 · 2026-05-24T03:21:28 1779592888

How are you using it such that you even encounter writers you don't know?

st3phvee · 2026-05-24T11:49:42 1779623382

Via Substack's own recommendation algorithm, Substack Notes, and by perusing the leaderboards, both of which have been a thing on the platform for a while now. Substack's social media side is very Twitter-esque. Writers you follow "restack" publications (some of which are full of AI slop, unbeknownst to the restacker) and the algorithm also inserts "writers" you haven't encountered into your feed. ("Writers" is in scare quotes for a reason.)

throwaway27448 · 2026-05-24T15:37:20 1779637040

So—why do you use these features if you don't like them?

veidr · 2026-05-22T15:34:46 1779464086

running out of money, for an open source project of almost any kind, is safer than "running into money" with the wrong strings attached

(still reserving judgement on Bun, though — I mean, we'll soon see, one way or the other!)

veidr · 2026-05-16T15:25:24 1778945124

So true.

I had this gaming PC — and once a year doing excel and dropbox exchanges with my accountant, but other than that, gaming PC — and it never had an issue, from 2020 or 2021 to last month.

So I decided to move it to the living room, and connect it to our big TV, instead of the small TV — same LG manufacturer, same 4K res, mind you — and now it just freezes every 3-4 days. And freeze means just, the screen still shows whatever it was showing when it froze, no USB mouse or keyboard does anything, cannot be RDP'd to cannot be pinged... hold-down-power-button only answer.

(I have swapped all the cabels, just to be sure.)

The only differences: moved it 20 meters physically, connected it to a slightly newer TV. ¯\_(ಠ_ಠ)_/¯

macOS and Linux also do suck, but both are AFAICT way more predictable, and less random

mrb · 2026-05-16T15:44:13 1778946253

TBH your problem sounds like a hardware issue. Maybe the PC's new location is warmer due to a more enclosed space, triggering more unrecoverable hardware faults.

veidr · 2026-05-16T16:06:07 1778947567

I agree it sounds like that, but (having that same thought) I kept the temp in the living room 20℃ or less for a week but nah

My best guess at this point is the 2025 LG TVs have some different HDMI ARC something something compared to the 2019 it was plugged into before.

But also my point is that there's no way a human with 3 kids and job could ever know... it either starts working or I get a PlayStation or a different PC or whatever.

Or just tell my kids, "Hey, Death Stranding works on your Mac now, so shut the fuck up until you finish that whole game." ¯\_(ಠ_ಠ)_/¯

Our_Benefactors · 2026-05-16T18:32:27 1778956347

You could look into EDID settings, lots of weird quirks around that spec.

steve1977 · 2026-05-16T15:52:24 1778946744

> macOS and Linux also do suck, but both are AFAICT way more predictable, and less random

macOS maybe as long as you're only using Apple hardware. As soon as you use 3rd party peripherals, you're in for very interesting bugs that are not getting confirmed by Apple and suddenly disappear again with a macOS update (if you're lucky).

veidr · 2026-05-16T15:59:54 1778947194

yeah — i have my kids on Macs, bc I'm lazy, but just the ones with only two USB ports and nothing else — otherwise never-ending, unresolvable nightmare unless it's just some Apple thing you're plugging in

veidr · 2026-05-16T14:24:12 1778941452

Vernor Vinge has some hits and some misses, but A Deepness in the Sky (best to just take the plunge and read it without googling — it's good either way, but better if you don't even read the back of the paperback).

Then, a bit further afield but for me, at least, exercised what I liked in The Culture series, even though stylistically different: Spin by Robert Charles Wilson.

derektank · 2026-05-16T18:14:01 1778955241

I think A Fire Upon the Deep would be a more enjoyable starting place for someone that likes the Culture series, even though A Deepness in the Sky is generally considered the better novel.

veidr · 2026-05-17T11:14:46 1779016486

That is probably true, and I was assuming that if you read A Deepness in the Sky, you would go on to read A Fire Upon the Deep.

But I got it backwards. While A Deepness in the Sky is set earlier than A Fire Upon the Deep, it was actually published later, as a prequel.

So I agree. Read the first-published one, and if you like it, read the other.

dooglius · 2026-05-18T03:14:46 1779074086

I wish I had read Deepness first as Fire sort of spoils it (granted, either direction will spoil some things)

veidr · 2026-05-23T15:02:01 1779548521

OK now I reverse my reversal; you — and original me — are/am right: read Deepness first. First! haha

veidr · 2026-05-16T12:03:42 1778933022

I can understand where you are coming from, but I myself am coming from a quite different place. I'm a long-time Deno fan, and to me Bun was less interesting because a.) it seemed like a much-less-ambitious Deno, and b.) I don't want to learn Zig, so I wasn't likely to try to hack on Bun itself, even just recreationally.

But, I warmed up to Bun over the last couple years almost against my own will — trying to maintain a pretty large body of TypeScript code in a runtime-agnostic way (including even Node, since 24.2). I don't want to make any specific TypeScript runtime a requirement for my TypeScript code, unless there are really good reasons to do so.

But Bun (like Deno) kept providing those reasons. Postgres, SQLite, S3, websockets, local secrets (Keychain/wallet), bundling, compilation, killer speed. So I (somewhat grudgingly) started using Bun more, and even made it a requirement for some of my projects (albeit, in ways I could walk back later if needed).

Today, I have a bunch of API servers and frontend app servers which are bun build --compile --bytecode single executables ,that can run and be deployed virtually anywhere.

I've been very happy with it so far. But also, I don’t think that the way I am doing it is super-common, and now that they are doing this, uh... extremely ambitious LLM port, I am perfectly positioned to regret all of my decisions around Bun if this port ends up sucking.

So I'm a little nervous, but... what if it doesn't suck? That would be cool, because a.) they will have shown something interesting about what is possible with LLMs (albeit if you are rounds-to-a-trillion-dollars valuation frontier AI lab, lol, but still). And b.) going forward, Bun will be developed in Rust. We all have our own preferences, obviously, but to me, that's a win.

And if it does suck, though — that's super interesting too! Will be annoying to me to re-architect my Bun-specific shit to Deno, but for the world at large (and me, too) that's still interesting information!

Because Bun is perfectly positioned to do a huge LLM-powered port. They are one of the premier TS/JS runtimes, it's obviously and insane marketing pillar for the AI lab that bought them, they have unfathomable resources and access to the cutting-edge models that all of us don't get to play with yet, and for all intents and purposes, they have unlimited money to do this.

So if they can't do it — which will be really obvious, I think, if true — then it really just isn't possible yet, and all the naysayers were right.

coldtea · 2026-05-16T15:35:21 1778945721

>and to me Bun was less interesting because a.) it seemed like a much-less-ambitious Deno

I don't know, I've followed Deno, and it appeared to me an incredibly low ambition from the get go.

veidr · 2026-05-16T16:41:49 1778949709

lol — what you're saying doesn't make sense to me, but I'm sure it makes sense to somebody

What I was specifically referring to is Deno (originally) trying to fix the (glaring, fundamental) problems that Node imposes on the world, vs just do them faster.

coldtea · 2026-05-16T18:03:21 1778954601

Yes, but "fixing some fundamental Node problems" is a low bar, hardly the high mark of ambition now, was it?

And to offer a counter example, something like Dart appeared much more ambitious to me.

brokencode · 2026-05-16T22:13:57 1778969637

I guess it depends on how you define ambition. If you are talking about in an absolute sense, yeah of course, the Dart project had to build a whole language, VM, and ecosystem. That's way more ambitious than Deno.

Though if you look relative to the team size and resources going into it, a project like Deno can still be considered ambitious. Creating an alternative ecosystem to nodejs is a large undertaking.

veidr · 2026-05-16T18:20:59 1778955659

OK. But without changing programming laguages, "fix some fundamental Node problems" vs "don't fix those problems, just run them faster, and maybe inline the most popular dependencies"...

Surely we can agree that one of those positions is relatively less ambitious?

pjmlp · 2026-05-17T06:02:35 1778997755

Well it remains to be proven how they can make a business out of fixing nodejs fundamental problems.

veidr · 2026-05-17T11:34:23 1779017663

I think the Anthropic acquisition means that Bun isn't in that business anymore. Bun is still fixing fundamental Node problems, but that's no longer the business.

The business value the Bun team needed to deliver (to make the acquisition pay out) might very well be this controversial, but nevertheless spectacular, 6-day Zig→Rust port.

But beyond that, now Bun is just tooling used internally at Anthropic, which also happens to be open-source.

pjmlp · 2026-05-17T13:47:23 1779025643

I also meant Deno as well.

veidr · 2026-05-17T14:14:30 1779027270

Oh. Well, then, yes I agree. It certainly does remain to be proven if anybody can make "Node, but better" a business.

Certainly the recent layoffs¹ of ~half-or-so of the Deno team doesn't bode well for it, as AFAIK Bun was the only other significant player trying (to make it a business).

¹: https://www.reddit.com/r/Deno/comments/1rwjaeb/whats_going_o...

egorfine · 2026-05-16T15:11:18 1778944278

> what if it doesn't suck? > And if it does suck

Why not both? How about that: perfectly fine for Anthropic but suck for everyone else.

veidr · 2026-05-16T16:28:51 1778948931

well to me that would still count as "it sucks"

but sure anthropic might not agree

subarctic · 2026-05-16T12:23:26 1778934206

Is there much value in it being written in rust if it's all AI slop?

veidr · 2026-05-16T12:54:24 1778936064

Well "slop" is doing a lot of work there. If it's all incomprehensible garbage-code that no human can understand? Then... yeah very marginal value to me, in terms of hacking on it.

However, I think if it turns out that that's the case, then their port will fail in two ways (to paraphrase Hemingway): gradually, and then suddenly.

I don't think this port can be a success unless they end up — on the other side of it, not necessarily immediately — with maintainable Rust code.

txdv · 2026-05-16T12:20:33 1778934033

if they succeed nothing will change for you

gpm · 2026-05-16T16:19:40 1778948380

If they succeed the software will be more reliable with less memory issues that are very likely significant security issues at least some of the time.

When we've seen linux having a new significant exploit every other day now thanks to LLMs being better at weaponizing memory bugs this seems significant.

veidr · 2026-05-15T18:50:57 1778871057

No, and there's been a lot of confusion about that on this website.

They did cite Rust's safety as a motivating factor for the port. That doesn't imply trying to achieve that simultaneously with the language change — which is good, because that would be insane. (Or, if you prefer, even more insane.)

You cannot faithfully port a codebase to a new language while also radically re-architecting it. You have to choose.

They want the safety benefits of Rust going forward; i.e., after it's finished, when they then write new code in Rust.

swiftcoder · 2026-05-15T18:54:24 1778871264

Yeah, exactly. The typical approach is to do a mechanical translation such as with rust2c, that is full of unsafe, and then gradually refactor safety in.

Dylan16807 · 2026-05-15T19:00:10 1778871610

But nobody makes announcements and blog posts about running that.

pgporada · 2026-05-15T19:20:19 1778872819

There's several blog posts here. https://www.memorysafety.org/initiative/av1/

Dylan16807 · 2026-05-15T19:26:27 1778873187

And the first post is about the team working on the project, with about two and a half sentences on c2rust, and making it very clear they just started.

The newer posts go into detail about the rearchitecting that follows.

Ar-Curunir · 2026-05-15T19:16:56 1778872616

And indeed, the bun team has not done that

Dylan16807 · 2026-05-15T20:15:46 1778876146

Did they not make the announcement? And they definitely promised a blog post even if it's not out yet.

Ar-Curunir · 2026-05-15T21:12:58 1778879578

Not on their blog, website, or twitter, so no?

HN For You