
mckennameyer · 2026-06-09T14:11:09 1781014269

Claude's great at reading what people say, but surprisingly bad at recognizing when a politician's stance is just the first signal in a negotiation.

mckennameyer · 2026-05-28T15:00:15 1779980415

You're right, just updated.

The original title took one framing from the back half of the post (3 update cycles that can loosely be called the "ChatGPT era, then xAI/Meta/Gemini era, then Anthropic era"), but that's definitely not the point here. Thanks for flagging.

dwohnitmok · 2026-05-28T15:09:32 1779980972

Nice!

mckennameyer · 2026-03-26T18:44:34 1774550674

So basically, if the malware was AI-generated (hence the fork bomb bug) and the investigation was AI-assisted (hence the speed), the attacker and the dev who caught it were probably using the same tools. Less "tip of the iceberg" and more that both sides just got faster.

mckennameyer · 2026-03-17T14:07:07 1773756427

It seems like a marketing play to seize on the protein movement. What will they do when fiber becomes the next craze?

mckennameyer · 2026-03-17T14:00:13 1773756013

Can definitely relate. I think forcing myself to run 1 session at a time feels so difficult not only from an efficiency POV but from an attention standpoint. Waiting for a session to finish, being alone with my thoughts... every day we're faced with things convincing us that multitasking is efficient when it's really not at all.

aray07 · 2026-03-17T14:05:08 1773756308

Yeah - it's definitely a new way of working that takes some getting used to!

mckennameyer · 2026-03-13T17:14:31 1773422071

For anyone following the Chalamet drama... next you'll have to look into how many times a best actor frontrunner has lost thanks to their ego in the last week of the race!

mckennameyer · 2026-03-10T13:55:32 1773150932

Do you think reasoning and behavioral effort should be separate knobs, or is bundling them the right call?

Bullhorn9268 · 2026-03-10T14:50:09 1773154209

I see the value in making it simple for the user, but here I feel it's a bit too much. Would probably prefer two.

mckennameyer · 2026-03-05T15:50:22 1772725822

Aren't vibe PRs way more likely to get abandoned? Sure they reduce reviewer load, but then everyone feels less urgency to do a human review after. Do you think the skill is making that better or worse?

parad0x0n · 2026-03-05T16:11:13 1772727073

Yeah, I guess figuring out how AI and your team can optimally work together is not that straightforward... probably every engineering team is trying to figure that out atm :D But if we already let AI write reviews, they should at least be as good as they can be.

mckennameyer · 2026-02-13T14:38:23 1770993503

We tested GPT-5 and Gemini Flash 3 at low, medium, and high effort on 169 instances with human-verified answers, scored against a frozen offline web corpus using Deep Research Bench. High effort consistently scored worse than lower thinking levels for both models. Methodology and raw data: https://everyrow.io/docs/notebooks/deep-research-bench-paret... (edited)

mckennameyer · 2026-01-22T19:28:49 1769110129

Interesting approach with the cascade. How do you decide when to escalate from fuzzy matching to LLM?

parad0x0n · 2026-01-22T19:54:42 1769111682

So fuzzy matching only makes sense if you expect the two columns to hold more or less the same data; otherwise you can skip that step.

And then you have to pick a threshold -> if the similarity of the two strings is above that threshold, it's a match; otherwise it's not. The threshold should be high to prevent false positives. The LLM will take care of the non-matches.
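
A minimal sketch of that cascade, for illustration only. It assumes Python's standard `difflib` for the string similarity and a hypothetical `llm_match` callback standing in for the LLM step; the names `cascade_match` and `similarity` and the 0.9 threshold are made up for this example and are not taken from the actual tool.

```python
from difflib import SequenceMatcher
from typing import Callable, Dict, List, Optional


def similarity(a: str, b: str) -> float:
    """Similarity ratio in [0, 1] between two whitespace/case-normalized strings."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def cascade_match(
    left: List[str],
    right: List[str],
    threshold: float = 0.9,  # assumed value; keep it high to avoid false positives
    llm_match: Optional[Callable[[str, List[str]], Optional[str]]] = None,
) -> Dict[str, Optional[str]]:
    """Match each left value to a right value: fuzzy first, LLM for the rest."""
    matches: Dict[str, Optional[str]] = {}
    for item in left:
        # Step 1: cheap fuzzy match against every candidate.
        best = max(right, key=lambda cand: similarity(item, cand), default=None)
        if best is not None and similarity(item, best) >= threshold:
            matches[item] = best
        elif llm_match is not None:
            # Step 2: escalate only the non-matches to the (hypothetical) LLM step.
            matches[item] = llm_match(item, right)
        else:
            matches[item] = None
    return matches


if __name__ == "__main__":
    escalated: List[str] = []

    def fake_llm(item: str, candidates: List[str]) -> Optional[str]:
        # Stand-in for a real LLM call; it only records what was escalated.
        escalated.append(item)
        return None

    result = cascade_match(["Acme Corp", "Globex"], ["acme corp.", "Initech"], llm_match=fake_llm)
    print(result)
    print("escalated to LLM:", escalated)
```

With a high threshold, only near-identical strings are accepted at the fuzzy step, so everything ambiguous ends up with the slower, more expensive LLM step.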
