I currently use MacWhisper and it is quite good, but it's great to see an alternative, especially as I've been looking to use more recent models!
I hope there will be a way to plug in other models: I currently work mostly with Whisper Large. Parakeet is slightly worse for non-English languages. But there are better recent developments.
I wish they had a "and we won't screw you in two weeks" plan at, say, 5x the price. It's worth it for my business, I'd pay it.
Should I switch back to API pricing? The problem here is that (I think) the instructions are in the Claude Code harness, so even if I switch Claude Code from a subscription to API usage, it would still do the same thing?
FWIW I've only ever been on the API based plan at work and we never seem to run into the majority of the problems people seem to be very vocal about. Outages still affect us, and we do have the intermittent voodoo feeling of "Claude seems stupider today", but nothing persistent.
Of course it's a stupid amount of money sometimes, but I generally feel like we get what we're paying for.
I never managed to get anything useful out of opencode, to be honest. I tried it many times, with various models. Claude Code always just worked better.
That's one of the possible explanations, but I think too many people are seeing the same symptoms (and some actually measured them).
An "economical explanation" is actually that Anthropic subscriptions are heavily subsidized and after a while they realized that they need to make Claude be more stingy with thinking tokens. So they modified the instructions and this is the result.
> but I think too many people are seeing the same symptoms (and some actually measured them).
Or too many people are slurping up anecdotes from the same watering hole that confirms their opinions. Outside of academic papers, I don't think I've ever seen an example of "measuring" output that couldn't also be explained by stochastic variability.
That is very, very interesting. I've been hoping to have an assistant in the workshop (hands-free!) that I could talk to and have it help me with simple tasks: timers, calculating, digging up notes, etc. — basically, what the phone assistants were supposed to be, but aren't.
"You will have to unlock your iPhone first" is kind of a deal-breaker when you are in the middle of mixing polyurethane resin and have gloves and a mask on.
More and more I find that we have the technology, but the supposedly "tech" companies are the gatekeepers, preventing us from using the technological advances and holding us back years behind the state of the art.
I'll be trying this out on my Macbook, looks very promising!
The computing power we all have in our pockets is staggering. It could be a tool that truly makes our lives easier, but instead it's mostly a device that is frustrating to use. Companies have decided to make it simply another conduit for advertising. It's a tool for them to sell us more stuff. Basic usability be damned.
Siri does have a setting that'll activate it if you say "hey siri" while the phone is locked. Obvious privacy and battery usage concerns though, and it's still Siri, so it's a little clunky.
I've been replacing my Google Homes and Chromecasts with Snapcast streamers, and this is the next thing I've been planning to look into.
It's truly absurd how the Google voice assistant USED to work properly for setting timers, playing music, etc, and then they had to break it 15 times and finally replace it with much slower AI that only kinda does what you want. I'm done.
Selfhosted is the way to go if you want to keep your sanity. My wife has basically given up on any Google/Apple voice assistants being able to do anything useful above "set a 10 minute timer".
> It's also possible to make an MLX version of it, which runs a little faster on Macs
FWIW, I found MLX variants to perform consistently worse (in terms of expected output, not speed) than GGUF in my measurements on the benchmark that matters to me (spam filtering). I used MLX models in LM Studio. GGUF was always slightly better.
Perhaps someone who knows more can pitch in and explain this.
It isn't 100% clear, but what quantization were you using for each? I've had worse results with MLX 8-bit than with Q4 GGUF on the same model; it seems mxfp8 or bf16 is needed when running with MLX to get something worthwhile out of them. I've done very little testing, though, so it could have been something specific to the model I was testing at the time.
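One reason quantization level can dominate the MLX-vs-GGUF comparison: the round-trip error grows quickly as you drop bits, so an 8-bit run in one format against a 4-bit run in another isn't apples to apples. A toy numpy sketch of symmetric linear quantization (not MLX's or llama.cpp's actual schemes, which use per-block scales and other tricks) shows the scale of the effect:

```python
import numpy as np

def quantize_roundtrip(x, bits):
    # Symmetric linear quantization: map to a signed integer grid and back.
    qmax = 2 ** (bits - 1) - 1
    scale = np.max(np.abs(x)) / qmax
    q = np.clip(np.round(x / scale), -qmax - 1, qmax)
    return q * scale

rng = np.random.default_rng(0)
w = rng.normal(size=10_000).astype(np.float32)  # stand-in for a weight tensor

for bits in (8, 4):
    err = np.mean(np.abs(w - quantize_roundtrip(w, bits)))
    print(f"{bits}-bit mean abs roundtrip error: {err:.5f}")
```

Halving the bit width multiplies the grid spacing by roughly 16x here, which is why comparing formats only makes sense at matched quantization levels.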
No, they can't, not unless we get rid of the fossil fuel lobby, which pretty much runs the world these days. Which isn't surprising, given that fossil fuels are the largest industry ever created by mankind. If you compare it to anything else which was actively harmful and yet big money tried to convince you it wasn't (like tobacco, alcohol, or really anything else), there is nothing that huge. So it isn't surprising that the industry fights change.
EV adoption has been successfully held back mostly by PR, Germany shifted from nuclear to coal and gas, the US president is doing everything to dismantle anything that isn't fossil fuel and promotes fossil fuels, the list goes on.
The fossil fuel lobby can only do so much. Solar has gotten so cheap it's taking over on its own. Companies are doing it for no reason other than the math makes sense. EV batteries are nearing that point too. You can only keep BYD out of the US for so long.
Sure, but you're attributing this, deliberately or not, to the wrong cause. It wasn't that the fossil fuel industry somehow won - it was a range of factors, possibly including geopolitics, some existing plants aging, an emotional response to the Fukushima nuclear disaster, and the Green lobby.
Basically, they voted to kill nuclear without a solid plan for an alternative, and coal/gas is the default option for filling the gaps left in the absence of timely and sufficiently rapid investment in other technologies.
Hmm. After former chancellor (Schroeder) heavily pushed Russian gas pipelines (Nord Stream 1 and 2) and then swiftly moved to working for Russian state-owned energy companies, including Nord Stream AG, Rosneft, and Gazprom, I have a different outlook on things.
One can never discount lobbying and influence behind the scenes, but Schroeder finished being Chancellor in 2005, which was six years before the initial post-Fukushima vote in question, and even longer since various aspects of the plan continued to be supported by various politicians.
He'd be a spectacularly successful lobbyist if your suspicion is correct.
I never understood the logic behind the thinking there. Why would you ever want to place menubar items UNDER the notch, if you know it's there and they won't be visible?
It's such an easy problem to fix, with such incredible usability consequences, I just don't get the thinking.
The notch itself is probably considered temporary internally. If you code a rule for the notch, then you have to consider which hardware macOS is running on in order to determine whether a notch is present for your "notch width calculation."
PoE is not obvious to implement (take it from someone who has done it with a fair share of mistakes), uses more expensive components than normal ethernet, takes up more space on the board, makes passing emissions certification more complex, and is more prone to mistakes that ruin boards in the field, causing support/warranty issues. In other words, a can of worms: not impossible to handle, but something you would rather avoid if possible.
I wouldn't call it "better", but the least-effort path among hobbyists and low end gear is often 12v or 24v sent over a pair with Gnd and a forgiving voltage regulator on the other end.
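The reason 24V is more forgiving than 12V over thin pairs is ohmic drop: for the same load power you draw half the current, so you lose half the voltage over the same wire. A quick back-of-the-envelope check, assuming ~24 AWG copper (typical of Cat5e) at roughly 0.084 ohm per meter per conductor:

```python
OHMS_PER_M_24AWG = 0.084  # approximate, per single 24 AWG conductor

def voltage_drop(volts_in, watts_load, length_m, ohms_per_m=OHMS_PER_M_24AWG):
    """Return (drop_volts, volts_at_load) for a single two-conductor run."""
    loop_resistance = 2 * length_m * ohms_per_m  # current flows out and back
    current = watts_load / volts_in              # first-order estimate
    drop = current * loop_resistance
    return drop, volts_in - drop

for v in (12.0, 24.0):
    drop, v_load = voltage_drop(v, watts_load=6.0, length_m=30.0)
    print(f"{v:.0f} V feed, 6 W load, 30 m: drop {drop:.2f} V -> {v_load:.2f} V at the device")
```

At 12V the device sees well under 10V after 30 meters, while at 24V the drop is small enough that almost any buck regulator on the far end copes without fuss.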
Really looking forward to testing and benchmarking this on my spam filtering benchmark. gemma-3-27b was a really strong model, surpassed later by gpt-oss:20b (which was also much faster). qwen models always had more variance.
If you wouldn't mind chatting about your usage, my email is in my profile, and I'd love to share experiences with other HNers using self-hosted models.
In my experience the contents of the message are all but totally irrelevant to the classification, and it is the behavior of the mailing peer that gives all the relevant features.
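To make "behavior of the mailing peer" concrete, here is a hypothetical sketch of the kind of connection-level features meant, scored additively. All field and function names here are illustrative, not any real filter's API; real systems weight these with trained models rather than hand-picked constants:

```python
from dataclasses import dataclass

@dataclass
class PeerObservation:
    helo_matches_rdns: bool       # HELO name agrees with the IP's reverse DNS
    has_rdns: bool                # sending IP has a PTR record at all
    retried_after_tempfail: bool  # real MTAs retry after a 4xx; many spam cannons don't
    spf_pass: bool                # envelope sender's SPF record authorizes the IP

def peer_score(obs: PeerObservation) -> int:
    """Crude additive score; higher means more likely a legitimate mailer."""
    score = 0
    score += 2 if obs.retried_after_tempfail else -2
    score += 1 if obs.helo_matches_rdns else -1
    score += 1 if obs.has_rdns else -2
    score += 1 if obs.spf_pass else 0
    return score

legit = PeerObservation(True, True, True, True)
cannon = PeerObservation(False, False, False, False)
print(peer_score(legit), peer_score(cannon))
```

None of these features look at the message body at all, which is the point: a well-behaved MTA and a fire-and-forget spam cannon separate cleanly on connection behavior alone.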