For the best experience on desktop, install the Chrome extension to track your reading on news.ycombinator.com
Hacker Newsnew | past | comments | ask | show | jobs | submit | history | more dbreunig's commentsregister

Sometimes buzzwords turn out to be mirages that disappear in a few weeks, but often they stick around.

I find they takeoff when someone crystallizes something many people are thinking about internally, and don’t realize everyone else is having similar thoughts. In this example, I think the way agent and app builders are wrestling with LLMs is fundamentally different than chatbots users (it’s closer to programming), and this phrase resonates with that crowd.

Here’s an earlier write up on buzzwords: https://www.dbreunig.com/2020/02/28/how-to-build-a-buzzword....


I agree - what distinguishes this is how rushed and self-aware it is. It is being pushed top down, sheepishly.

EDIT: Ah, you also wrote the blog posts tied to this. It gives 0 comfort that you have a blog post re: building buzz phrases in 2020, rather, it enhances the awkward inorganic rush people are self-aware of.


I studied linguistic anthropology, in addition to CS. Been at it since 2002.

And I wrote the first post before the meme.


I've read these ideas a 1000 times, I thought it was the most beautiful core of the "Sparks of AGI" paper. (6.2)

We should be able to name the source of this sheepishness and have fun with that we are all things at once: you can be a viral hit 2002 super PhD with expertise in all areas involved in this topic that has brought pop attention onto something important, and yet, the hip topic you feel centered on can also make people's eyes roll temporarily. You're doing God's work. The AI = F(C) thing is really important. Its just, in the short term, it will feel like a buzzword.

This is much more about me playing with, what we can reduce to, the "get off my lawn!" take. I felt it interesting to voice because it is a consistent undercurrent in the discussion and also leads to observable absurdities when trying to describe it. It is not questioning you, your ideas, or work. It has just come about at a time when things become hyperreal hyperquickly and I am feeling old.


While researching the above posts Simon linked, I was struck by how many of these techniques came from the pre-ChatGPT era. NLP researchers have been dealing with this for awhile.


Yes, look up Winshuttle.

A very successful company with some of the happiest customers I’ve ever seen, whose entire product was a SAP hack that allowed people to enter their data using Excel. As someone unfamiliar with SAP, absolutely blew my mind.


Not dictation…copy/paste I think. Thanks, fixed.


Thanks! That's a typo!


Can anyone provide a reason an enterprise would choose Grok over a similar class of models?


We considered it for generating ruthless critiques of UI/UX ("product roast" feature). Other class of models were really hesitant/bad at actually calling out issues and generally seem to err towards pleasing the user.

Here's a simple example I tried just now. Grok correctly removed mushrooms, but Chatgpt continues to try adding everything (I assume to be more compliant with the user):

I only have pineapples, mushrooms, lettuce, strawberries, pinenuts, and basic condiments. What salad can I make that's yummy?

Grok: Pineapple-Strawberry Salad with Lettuce and Pine Nuts - https://x.com/i/grok/share/exvHu2ewjrWuRNjSJHkq7eLSY

ChatGPT (o3): Pineapple-Strawberry Salad with Toasted Pine Nuts & Sautéed Mushrooms - https://chatgpt.com/share/682b9987-9394-8011-9e55-15626db78b...


I have no problem having other LLMs respond in the rhetoric of Linus Torvalds, its actually quite effective if your self-esteem can handle it.


Do you ask specifically for Linux or just skeptic/caustic in general?


Specifically for Linus Torvalds, the author or Linux

He has a very distinctive style and large amount of training data from all the reviews and emails he made while collaborating on Linux

And as he manages a huge project that's in development for decades, he has to be very strict about the quality


And its fairly constructive, at least when I tried in Gemini 2.5 awhile back. Like yes its caustic (fantastic word) but it does so in a way thats constructive in its counterargument to reach a better outcome.


I haven't seen a model since the 3.5 Turbo days that can't be ruthless if asked to be. And Grok is about as helpful as any other model despite Elon's claims.

Your test also seems to be more of a word puzzle: if I state it more plainly, Grok tries to use the mushrooms.

https://grok.com/share/bGVnYWN5_2db81cd5-7092-4287-8530-4b9e...

And in fact, via the API with no system prompt it also uses mushrooms.

So like most models it just comes down to prompting.


> We considered it for generating ruthless critiques of UI/UX

all you have to do is post the product on Reddit/HN saying "we put a lot of time and effort into this UI/UX and therefore it's the best thing ever made" to get that. Cunningham's Law [0] is 100% free.

[0] https://en.wikipedia.org/wiki/Ward_Cunningham#%22Cunningham'...


What kind of test is that? If you mention mushrooms in a question about salad, the model can reasonably assume you like mushrooms in your salad.


Mushrooms do not go with strawberries or pineapples in the context of a salad.

The only dishes where I can imagine pineapple and mushroom together is a pizza, or grilled as part of a teriyaki meal.


I think you’re wrong. That sounds tasty to me. I think you need to input your own palette to the model.

Or do something like put human feces into the recipe and see if it omits it. That seems like something that would be disliked universally.

EDIT: I actually just tried adding feces to your prompt and I got:

“Okay… let’s handle this delicately and safely.

First, do not use human feces in any recipe. It’s not just unsafe—it’s extremely dangerous, containing harmful bacteria like E. coli, Salmonella, and parasites that can cause serious illness or death. So, rule that out completely.

Now, working with what’s safe and edible:…”


You really can't imagine a salad with sauteed/grilled mushrooms in it; with some chopped strawberries mixed in it for a pop of sweetness and acidity?


De gustibus non disputandum. Or, in English, "Don't ask AI models what tastes good. It's a waste of time and electricity."


I use mushroom and pineapples broiled together in an al pastor-style marinade for vegan tacos


Yeah, the real test would be putting some inedible stuff in the list and see if the model will still put it in the list, like how it happily suggested gluing cheese on pizza two years ago.


When Grok 3 was released, it was genuinely one of the very best for coding. Now that we have Gemini 2.5 pro, o4-mini, and Claude 3.7 thinking, it's no longer the best for most coding. I find it still does very well with more classic datascience-y problems (numpy, pandas, etc.).

Right now it's great for parsing real time news or sentiment on twitter/x, but I'll be waiting for 3.5 before I setup the api.


Well, for instance, imagine that you're the CEO of IG Farben.


If you’re Microsoft you may just want to give customers a choice. You may also want to have a 2nd source and drive performance, cost, etc… just like any other product.


What is Windsurf's (or for that matter: Cursor, Cline, or CoPilot) moat? This seems like a great deal and timing for them.


I was struck by this as people suggest alternatives that refute the headline (QGIS, PostGIS, GDAL, etc): nearly every one emerged in the early 2000s.

Strongly agree with your sentiment around maps: most people can’t read them, they color the entire workflow and make it more complex, and (imo) lead to a general undervaluing of the geospatial field. Getting the data into columns means it’s usable by every department.


Most of those tools came out circa ~2000.

Yeah, I feel old.


It clearly says "most important of the past 10 years" not "most important that has been invented in the past 10 years". Even taking your definition that would narrow down the list like half maybe and you should probably know that


Author here.

QGIS is amazing. It's really great. It also came out in 2002, so I think the headline is safe.


Nope, it's constantly being improved and still wins the decade. Do you disqualify it because it existed? Re-read your headline, it's definitely not qualifying what you think it's qualifying.


Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search:

HN For You