More

someguy101010 · 2026-06-01T16:18:13 1780330693

You may have heard of Programmatic Tool Calling from anthropic or Code Mode from cloudflare for calling mcp servers. I have been working on an open source version of this and recently put together some basic docs, and thought this page would be a good one to share.

There are plenty of other features to explore, but this one alone has resulted in significant improvements in the speed of execution in many tasks including minecraft playing agents, computer use agents, and for querying and aggregating information from my telemetry systems in a lightweight low risk way.

someguy101010 · 2026-05-15T03:18:10 1778815090

if i didn't have to prompt it to learn from its mistakes and it just "intuitively" knew to do that

someguy101010 · 2026-03-05T18:29:33 1772735373

have built in this space which led me to develop a minizinc mcp server [0] for scheduling bocce tournaments [1]. scheduling with constraints is a np hard problem and it makes sense people struggle. tools exist to solve this problem but they are complex and hard to use for non technical folks, and even technical folks. am hoping a tool like this can bridge the gap and would like to bring it to your awareness if you aren't already thinking about the problem this way :)

edit: after reading a bit more of description looks like yall are taking a similar approach, kudos!

[0] https://github.com/r33drichards/minizinc-mcp

[1] https://github.com/r33drichards/bocce-scheduler

skorisep · 2026-03-05T19:22:29 1772738549

This is awesome! Completely agree: modeling each real life scenario as a constraint satisfaction problem is tricky in and of itself (especially with the diversity of non-intersecting constraints we encounter) and something we are actively working on. Using LLMs as a layer above has made it much more tractable. Curious how the bocce scheduling has fared in real world scenarios. How was the performance?

someguy101010 · 2026-03-01T19:08:14 1772392094

yep! thats the motivation behind https://github.com/r33drichards/mcp-js

I want to be able to give agents access to computation in a secure way without giving them full access to a computer

someguy101010 · 2026-01-29T02:01:09 1769652069

Does this support bedrock?

jmuncor · 2026-01-29T03:40:59 1769658059

Could add support if you need it! Just let me know :D

someguy101010 · 2025-12-18T17:57:11 1766080631

Is it possible to provide a llm a skill through the mcp resource feature?

uhgrippa · 2025-12-18T18:48:12 1766083692

In a way yes, you can reduce context usage by a non-negligible amount approaching it this way. I'm investigate this on my skill validation/analysis/bidirectional MCP server project and hope to have it as a released feature soon: https://github.com/athola/skrills

theshrike79 · 2025-12-18T18:35:37 1766082937

It’s also possible to implement an MCP as a skill

someguy101010 · 2025-12-16T16:31:03 1765902663

as an infrastructure engineer the idea of being able to train computer use agents without provisioning infrastructure sounds amazing!

a common use case i run into is i want to be able to configure corporate vpn software on windows machines. is there a link for a getting started guide i could try this out with?

frabonacci · 2025-12-16T16:37:05 1765903025

Yes, in a simulated environment you can do this today using plain JS and connecting to a real VPN, while driving the desktop UI. No infra provisioning needed.

If you need a real Windows OS + corporate VPN, we also support binding agents to actual Windows sandboxes. This example shows automating a Windows app behind a VPN: https://cua.ai/docs/example-usecases/windows-app-behind-vpn

you'll need to define a new task in the cua-bench registry first though - just sign up on the website for early access!

someguy101010 · 2025-12-14T14:30:02 1765722602

have used it, and i do like it, but the licensing situation is not great. It open source but its not free software by any means.

someguy101010 · 2025-12-08T14:06:05 1765202765

wrote about this a bit too in https://www.robw.fyi/2025/10/24/simple-control-flow-for-auto...

ran into this when writing agents to fix unit tests. often times they would just give up early so i started writing the verifiers directly into the agent's control flow and this produced much more reliable results. i believe claude code has hooks that do something similar as well.

someguy101010 · 2025-12-01T20:01:36 1764619296

clearly you have never worked in enterprise

HN For You