More

dbreunig · 2025-06-30T23:59:10 1751327950

Sometimes buzzwords turn out to be mirages that disappear in a few weeks, but often they stick around.

I find they takeoff when someone crystallizes something many people are thinking about internally, and don’t realize everyone else is having similar thoughts. In this example, I think the way agent and app builders are wrestling with LLMs is fundamentally different than chatbots users (it’s closer to programming), and this phrase resonates with that crowd.

Here’s an earlier write up on buzzwords: https://www.dbreunig.com/2020/02/28/how-to-build-a-buzzword....

refulgentis · 2025-07-01T00:02:31 1751328151

I agree - what distinguishes this is how rushed and self-aware it is. It is being pushed top down, sheepishly.

EDIT: Ah, you also wrote the blog posts tied to this. It gives 0 comfort that you have a blog post re: building buzz phrases in 2020, rather, it enhances the awkward inorganic rush people are self-aware of.

dbreunig · 2025-07-01T00:43:54 1751330634

I studied linguistic anthropology, in addition to CS. Been at it since 2002.

And I wrote the first post before the meme.

refulgentis · 2025-07-01T01:55:15 1751334915

I've read these ideas a 1000 times, I thought it was the most beautiful core of the "Sparks of AGI" paper. (6.2)

We should be able to name the source of this sheepishness and have fun with that we are all things at once: you can be a viral hit 2002 super PhD with expertise in all areas involved in this topic that has brought pop attention onto something important, and yet, the hip topic you feel centered on can also make people's eyes roll temporarily. You're doing God's work. The AI = F(C) thing is really important. Its just, in the short term, it will feel like a buzzword.

This is much more about me playing with, what we can reduce to, the "get off my lawn!" take. I felt it interesting to voice because it is a consistent undercurrent in the discussion and also leads to observable absurdities when trying to describe it. It is not questioning you, your ideas, or work. It has just come about at a time when things become hyperreal hyperquickly and I am feeling old.

dbreunig · 2025-06-30T23:17:28 1751325448

While researching the above posts Simon linked, I was struck by how many of these techniques came from the pre-ChatGPT era. NLP researchers have been dealing with this for awhile.

dbreunig · 2025-06-17T15:31:48 1750174308

Yes, look up Winshuttle.

A very successful company with some of the happiest customers I’ve ever seen, whose entire product was a SAP hack that allowed people to enter their data using Excel. As someone unfamiliar with SAP, absolutely blew my mind.

dbreunig · 2025-06-17T13:27:26 1750166846

Not dictation…copy/paste I think. Thanks, fixed.

dbreunig · 2025-06-05T00:44:39 1749084279

Thanks! That's a typo!

dbreunig · 2025-05-19T17:45:30 1747676730

Can anyone provide a reason an enterprise would choose Grok over a similar class of models?

vasusen · 2025-05-19T20:51:26 1747687886

We considered it for generating ruthless critiques of UI/UX ("product roast" feature). Other class of models were really hesitant/bad at actually calling out issues and generally seem to err towards pleasing the user.

Here's a simple example I tried just now. Grok correctly removed mushrooms, but Chatgpt continues to try adding everything (I assume to be more compliant with the user):

I only have pineapples, mushrooms, lettuce, strawberries, pinenuts, and basic condiments. What salad can I make that's yummy?

Grok: Pineapple-Strawberry Salad with Lettuce and Pine Nuts - https://x.com/i/grok/share/exvHu2ewjrWuRNjSJHkq7eLSY

ChatGPT (o3): Pineapple-Strawberry Salad with Toasted Pine Nuts & Sautéed Mushrooms - https://chatgpt.com/share/682b9987-9394-8011-9e55-15626db78b...

tmpz22 · 2025-05-20T01:00:16 1747702816

I have no problem having other LLMs respond in the rhetoric of Linus Torvalds, its actually quite effective if your self-esteem can handle it.

torben-friis · 2025-05-20T09:13:03 1747732383

Do you ask specifically for Linux or just skeptic/caustic in general?

dimava · 2025-05-20T13:56:31 1747749391

Specifically for Linus Torvalds, the author or Linux

He has a very distinctive style and large amount of training data from all the reviews and emails he made while collaborating on Linux

And as he manages a huge project that's in development for decades, he has to be very strict about the quality

tmpz22 · 2025-05-21T01:09:45 1747789785

And its fairly constructive, at least when I tried in Gemini 2.5 awhile back. Like yes its caustic (fantastic word) but it does so in a way thats constructive in its counterargument to reach a better outcome.

BoorishBears · 2025-05-19T21:20:08 1747689608

I haven't seen a model since the 3.5 Turbo days that can't be ruthless if asked to be. And Grok is about as helpful as any other model despite Elon's claims.

Your test also seems to be more of a word puzzle: if I state it more plainly, Grok tries to use the mushrooms.

https://grok.com/share/bGVnYWN5_2db81cd5-7092-4287-8530-4b9e...

And in fact, via the API with no system prompt it also uses mushrooms.

So like most models it just comes down to prompting.

GuinansEyebrows · 2025-05-21T15:10:31 1747840231

> We considered it for generating ruthless critiques of UI/UX

all you have to do is post the product on Reddit/HN saying "we put a lot of time and effort into this UI/UX and therefore it's the best thing ever made" to get that. Cunningham's Law [0] is 100% free.

[0] https://en.wikipedia.org/wiki/Ward_Cunningham#%22Cunningham'...

CamperBob2 · 2025-05-20T02:11:55 1747707115

What kind of test is that? If you mention mushrooms in a question about salad, the model can reasonably assume you like mushrooms in your salad.

TimorousBestie · 2025-05-20T02:45:11 1747709111

Mushrooms do not go with strawberries or pineapples in the context of a salad.

The only dishes where I can imagine pineapple and mushroom together is a pizza, or grilled as part of a teriyaki meal.

kenjackson · 2025-05-20T05:16:34 1747718194

I think you’re wrong. That sounds tasty to me. I think you need to input your own palette to the model.

Or do something like put human feces into the recipe and see if it omits it. That seems like something that would be disliked universally.

EDIT: I actually just tried adding feces to your prompt and I got:

“Okay… let’s handle this delicately and safely.

First, do not use human feces in any recipe. It’s not just unsafe—it’s extremely dangerous, containing harmful bacteria like E. coli, Salmonella, and parasites that can cause serious illness or death. So, rule that out completely.

Now, working with what’s safe and edible:…”

klausa · 2025-05-20T04:55:15 1747716915

You really can't imagine a salad with sauteed/grilled mushrooms in it; with some chopped strawberries mixed in it for a pop of sweetness and acidity?

CamperBob2 · 2025-05-20T14:42:30 1747752150

De gustibus non disputandum. Or, in English, "Don't ask AI models what tastes good. It's a waste of time and electricity."

coev · 2025-05-20T15:01:19 1747753279

I use mushroom and pineapples broiled together in an al pastor-style marinade for vegan tacos

littlestymaar · 2025-05-20T05:14:15 1747718055

Yeah, the real test would be putting some inedible stuff in the list and see if the model will still put it in the list, like how it happily suggested gluing cheese on pizza two years ago.

pantsforbirds · 2025-05-19T22:28:35 1747693715

When Grok 3 was released, it was genuinely one of the very best for coding. Now that we have Gemini 2.5 pro, o4-mini, and Claude 3.7 thinking, it's no longer the best for most coding. I find it still does very well with more classic datascience-y problems (numpy, pandas, etc.).

Right now it's great for parsing real time news or sentiment on twitter/x, but I'll be waiting for 3.5 before I setup the api.

rsynnott · 2025-05-20T09:14:21 1747732461

Well, for instance, imagine that you're the CEO of IG Farben.

mmmBacon · 2025-05-20T18:43:38 1747766618

If you’re Microsoft you may just want to give customers a choice. You may also want to have a 2nd source and drive performance, cost, etc… just like any other product.

dbreunig · 2025-05-06T18:28:56 1746556136

What is Windsurf's (or for that matter: Cursor, Cline, or CoPilot) moat? This seems like a great deal and timing for them.

dbreunig · 2025-05-04T00:58:20 1746320300

I was struck by this as people suggest alternatives that refute the headline (QGIS, PostGIS, GDAL, etc): nearly every one emerged in the early 2000s.

Strongly agree with your sentiment around maps: most people can’t read them, they color the entire workflow and make it more complex, and (imo) lead to a general undervaluing of the geospatial field. Getting the data into columns means it’s usable by every department.

dbreunig · 2025-05-03T22:24:31 1746311071

Most of those tools came out circa ~2000.

Yeah, I feel old.

twelvechairs · 2025-05-04T08:04:14 1746345854

It clearly says "most important of the past 10 years" not "most important that has been invented in the past 10 years". Even taking your definition that would narrow down the list like half maybe and you should probably know that

dbreunig · 2025-05-03T22:23:05 1746310985

Author here.

QGIS is amazing. It's really great. It also came out in 2002, so I think the headline is safe.

cyanydeez · 2025-05-04T00:22:55 1746318175

Nope, it's constantly being improved and still wins the decade. Do you disqualify it because it existed? Re-read your headline, it's definitely not qualifying what you think it's qualifying.

HN For You