More

sollewitt · 2026-04-05T16:01:26 1775404886

Pine sap. You can get a schnapps of it, obviously.

sollewitt · 2026-04-04T15:08:00 1775315280

Google is famously a slime mold.

DonHopkins · 2026-04-04T15:42:39 1775317359

I will not sit here idly as you disparage an entire kingdom of diverse, beautiful, highly efficient, decentralized problem-solvers. Some of my best friends are slime molds.

Slime Mold Identification & Appreciation (amazing photography)

https://www.facebook.com/groups/1510123272580859

sollewitt · 2026-03-31T15:30:37 1774971037

In CA AB-435 is written to apply a 5-point fit test to anyone under 16.

bombcar · 2026-04-01T11:46:14 1775043974

I knew a few drivers (small females) who we teased because they’d have to be in a car seat if they weren’t 18+.

BobaFloutist · 2026-03-31T18:05:14 1774980314

Is that not just further formalizing "Use a car seat until the kid is big enough that you would expect seat belts to function properly"?

sollewitt · 2026-03-09T16:23:20 1773073400

If you don't count the externalities, sure. Healthcare is a cost too. We need more holistic accounting, the financialising of everything into a tidy but ultimately false P&L column is literally killing us.

sollewitt · 2026-02-27T21:45:14 1772228714

Government physical mail is pretty great, you just need the right regulations.

gameman144 · 2026-02-27T21:51:04 1772229064

Government physical mail has the benefit that substantial tampering is way harder to do at scale.

It's the same vein as criminals using cash vs Bitcoin; both can hide crime, but one is way easier to scale up.

sollewitt · 2026-02-27T16:05:23 1772208323

But does it work? I’ve used LLMs for log analysis and they have been prone to hallucinate reasons: depending on the logs the distance between cause and effects can be larger than context, usually we’re dealing with multiple failures at once for things to go badly wrong, and plenty of benign issues throw scary sounding errors.

aluzzardi · 2026-02-27T16:36:34 1772210194

Post author here.

Yes, it works really well.

1) The latest models are radically better at this. We noticed a massive improvement in quality starting with Sonnet 4.5

2) The context issue is real. We solve this by using sub agents that read through logs and return only relevant bits to the parent agent’s context

hinkley · 2026-02-27T18:15:29 1772216129

So you’re not getting alerts at 2 am from hallucinations?

tclancy · 2026-02-28T01:07:04 1772240824

Not from AI no.

sollewitt · 2026-02-27T16:45:53 1772210753

I would be very interested in reading about this kind of orchestration and filtering than data acquisition if you have the energy for another post :)

shad42 · 2026-02-27T16:48:24 1772210904

We started writing very recently: https://www.mendral.com/blog - there is a another post we made yesterday about the overall architecture. And we have a long list of things we're planning to write about in more details.

Taking good note of your comment :)

aluzzardi · 2026-02-27T18:01:58 1772215318

We've actually started to gather metrics this week to write that exact post :) Coming soon!

verdverm · 2026-02-27T16:27:39 1772209659

It can, like all the other tasks, it's not magic and you need to make the job of the agent easier by giving it good instructions, tools, and environments. It's exactly the same thing that makes the life of humans easier too.

This post is a case study that shows one way to do this for a specific task. We found an RCA to a long-standing problem with our dev boxes this week using Ai. I fed Gemini Deep Research a few logs and our tech stack, it came back with an explanation of the underlying interactions, debugging commands, and the most likely fix. It was spot on, GDR is one of the best debugging tools for problems where you don't have full understanding.

If you are curious, and perhaps a PSA, the issue was that Docker and Tailscale were competing on IP table updates, and in rare circumstances (one dev, once every few weeks), Docker DNS would get borked. The fix is to ignore Docker managed interfaces in NetworkManager so Tailscale stops trying to do things with them.

aluzzardi · 2026-02-27T23:07:08 1772233628

> it's not magic and you need to make the job of the agent easier by giving it good instructions, tools, and environments.

This. We had much better success by letting the agent pull context rather trying to push what we thought was relevant.

Turns out it's exactly like a human: if you push the wrong context, it'll influence them to follow the wrong pattern.

verdverm · 2026-02-27T23:57:05 1772236625

I'd put it somewhere in the middle, but closer to the pull end.

- I force the AGENTS.md into the system prompt if the agent reads a directory, or file within, that contains one such file. This is anecdotally very good and saves on function calls and context growth in multiple ways. Sort them. I'm now doing this with planning and long-term task tracking markdown files.

- Everything else is pull, ideally be search, yet to substantially leverage subagents for context gathering. Savings elsewhere have pushed the need out.

btw, hi Al, I see you are working on a new company since our last collaboration, want to catch up sometime and talk shop?

sollewitt · 2026-02-27T16:50:15 1772211015

Thanks - that’s the maddening with flakes - is it the thing under test or the thing doing the testing? Hermeticity is a lie we tell ourselves :)

shad42 · 2026-02-27T16:32:35 1772209955

Mendral co-founder here, we built this infra to have our agent detect CI issues like flaky tests and fix them. Observing logs are useful to detect anomalies but we also use those to confirm a fix after the agent opens a PR (we have long coding sessions that verifies a fixe and re-run the CI if needed, all in the same agent loop).

So yes it works, we have customers in production.

hardolaf · 2026-02-28T21:15:51 1772313351

I can't get an LLM to properly handle analyzing a single 200K+ line log without making things up so whatever anyone is saying about this "working" is probably a lie.

kburman · 2026-02-27T17:33:05 1772213585

Honestly, with recent models, these types of tasks are very much possible. Now it mostly depends on whether you are using the model correctly or not.

sollewitt · 2026-02-18T16:05:55 1771430755

They are both the legislature and the judiciary.

sollewitt · 2026-02-12T01:29:45 1770859785

Society needs art. Artists produce art. There a pantheon of greats that had no commercial success in their lives but moved our culture, we’d be so much more culturally impoverished if we’d insisted they become shit plumbers.

sollewitt · 2026-02-03T22:36:11 1770158171

The bigger issue: datacenters in space are disposable. All the extremely recyclable aluminum, silica - you extract it, manufacture it and instead of recycling it when it’s done you incinerate it in the atmosphere and scatter the ashes far and wide across the earth, the harder to recapture later.

You do this when the most fragile part in the system fails. Solar panels good for 25 years but the SSDs burn out after 2? Incinerate the lot!

This kind of thinking is late capitalist brain rot. This kind of waste should be a crime.

timmmmmmay · 2026-02-04T04:49:08 1770180548

Aluminum is 8% of the earth's crust and silicon is 28%; I think we're good

sollewitt · 2026-02-03T04:47:36 1770094056

Also Wired and weirdly People Magazine (and before they were all fired J17)

HN For You