More

taytus · 2026-05-28T17:12:10 1779988330

Incremental gains compounds.

itake · 2026-05-28T17:21:22 1779988882

meta threw in the towel when it came to producing AI models since their gains couldn't keep up with China.

HDThoreaun · 2026-05-28T17:57:45 1779991065

Has meta stopped producing new models? I figured they were just regrouping after all the drama they’ve had recently. Meta’s massive user base means they don’t need to be involved in the customer acquisition rat race. Once they have a model they’re happy with they can have a billion people interacting with it within a month.

staticman2 · 2026-05-28T20:04:59 1779998699

Meta released a major new closed source model a month or so ago.

It didn't make a splash like a new open source release would have.

TurdF3rguson · 2026-05-28T20:55:02 1780001702

muse-spark is beating all the Chinese text models on lmarena leaderboard FYI. Maybe you only care about coding models.

paulddraper · 2026-05-28T17:16:23 1779988583

Exactly. Go back to Opus 4.5 and see how you like it.

You won't, really.

taytus · 2026-05-24T01:14:13 1779585253

Signaling.

taytus · 2026-05-11T14:16:26 1778508986

Nice Fiction story.

taytus · 2026-05-04T15:31:03 1777908663

Kimi feels almost limitless. I have the $40/month plan and I've never ever remotely close to hitting the limit. Using opus as the orchestrator.

cpursley · 2026-05-04T20:42:49 1777927369

I've had some good results with Kimi in Opencode. Can you tell me more about using Opus as orchestrator - what type of harness setup?

taytus · 2026-04-24T22:15:23 1777068923

Skills are not like hooks. Skills can and will inevitably be ignored.

AndyNemmity · 2026-04-25T06:20:36 1777098036

Skills are not ignored if you use a router in front of them, and they are actually called.

The problem is the base harnesses don't call them aggressively enough. Not that they don't work.

taytus · 2026-04-24T22:14:43 1777068883

Hooks are hard stops. In theory the model must respect them, unlike Claude.md or agents.md so yeah, it helps a lot.

xienze · 2026-04-24T22:29:51 1777069791

Yes, in theory. But these are inherently non-deterministic systems interpreting English prose. It's not the same thing as a real honest-to-God program that executes a deterministic algorithm to verify the output.

I can't believe we've sunk this low, to start complaining that the non-deterministic black box didn't respect "YOU MUST DO THIS" or "DO NOT DO THIS" commands in a Markdown file. We used to be engineers.

tkiolp4 · 2026-04-24T22:29:48 1777069788

That has never been true.

taytus · 2026-04-24T22:13:02 1777068782

Yes an no. Some skills are very very tuned to our own workflows. The model providers may come up with some similar alternatives but not always. Also, sometimes you need a solution now and not in three months.

taytus · 2026-04-24T17:15:20 1777050920

I'd recommend Kimi k2.6 for your use. It is an excellent model at a fraction of the cost, and you can use Claude Code with it.

I did a 1:1 map of all my Claude Code skills, and it feels like I never left Opus.

Super happy with the results.

wolttam · 2026-04-24T17:17:13 1777051033

I was saying the same until DeepSeek v4 this morning... sorry, Kimi. The competition is intense!

Aldipower · 2026-04-24T18:33:04 1777055584

Fascinated, a bummer that DeepSeek does not offer a DPA or opt-out for training. This renders it unusable for my use cases unfortunately. At least z.ai GLM has a somewhat DPA in Singapore.

wolttam · 2026-04-24T18:55:00 1777056900

The weights are open and you can use the model with any third party provider that gives you the DPA you want.

For my use-case, I want the providers to get my tokens as long as they plan to keep releasing open-weight models

folmar · 2026-04-24T18:36:53 1777055813

If you don't use a lot of quota the cheapest monthly Claude Code is $20, Kimi Code is $19, i.e. the cost difference is minuscule.

Kimi wants my phone number on signup so a no-go for me.

ramoz · 2026-04-24T17:17:39 1777051059

What provider do you use for Kimi

skippyboxedhero · 2026-04-24T18:51:47 1777056707

The provider is a massive issue. People moving off Claude tend to assume this is solved.

Claude's uptime is terrible. The uptime of most other providers is even worse...and you get all the quantization, don't know what model you are actually getting, etc.

Leynos · 2026-04-24T21:54:09 1777067649

Kimi 2.5 was like using Sonnet 4 on a flaky ADSL line. I haven't tried K2.6 yet, but the physical unreliability of the connection was too off-putting.

bigethan · 2026-04-24T19:46:20 1777059980

OpenRouter and I'm toying around with Hermes. Seems good so far, but haven't really gotten into anything heavy yet. Though the "freedom" of not sweating the token pause and the costs not being too high is real.

taytus · 2026-04-24T18:03:12 1777053792

Straight from them, but I know other providers like io.net can be faster but I like to directly support the project.

subscribed · 2026-04-24T20:56:52 1777064212

Thx. I'll try with my personal projects (because dues to the data collection and ToS most providers are forbidden in my company), if I can opt out of training on my input.

I'm just getting a but tired of using Opus 2.6 which eats my whole allowance and then some £££ going through the 4kB prompt to review ~13 kB text file twice - and that's on top of the sometimes utter bonkers, bad, lazy answers I'm not getting even from the local Gemma 4 E4B.

spaceman_2020 · 2026-04-24T21:29:30 1777066170

did you just copy-paste or is there a difference in the way kimi uses skills?

taytus · 2026-04-24T22:08:29 1777068509

I don’t have the prompt at hand but basically I told Kimi (paraphrasing): I have these Claude code skills, and I know it uses different tool calls than you but read them and re-write them as your own tools.

I also created a mini framework so it can test that the skills are actually working after implementation.

Everything runs perfectly.

taytus · 2026-04-24T10:53:51 1777028031

>I asked DS itself and it denied this

Bro, seriously?

taytus · 2026-04-23T23:22:41 1776986561

“after evals and dogfooding” couldn’t have done this before releasing the model? We are paying $200/month to beta test the software for you.

HN For You