You may have heard of Programmatic Tool Calling from Anthropic or Code Mode from Cloudflare for calling MCP servers. I have been working on an open source version of this, recently put together some basic docs, and thought this page would be a good one to share.
There are plenty of other features to explore, but this one alone has significantly sped up execution in many tasks, including Minecraft-playing agents, computer use agents, and querying and aggregating information from my telemetry systems in a lightweight, low-risk way.
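To make the telemetry case concrete, here is a minimal sketch of the general idea: the agent writes one small program that calls a tool several times and aggregates the results in-process, so only the summary goes back to the model. This is not the project's actual API; `fetch_latencies` and its data are hypothetical stand-ins for a real telemetry tool.

```python
from statistics import mean
from typing import Callable, Dict, List


def fetch_latencies(service: str) -> List[float]:
    """Hypothetical stand-in for a telemetry tool call (not a real API)."""
    fake_data = {"api": [120.0, 80.0, 100.0], "db": [30.0, 50.0]}
    return fake_data[service]


def summarize(services: List[str], tool: Callable[[str], List[float]]) -> Dict[str, float]:
    """Call the tool once per service and keep only the mean latency.

    The raw samples never leave this function, so they never need to
    pass through the model's context.
    """
    return {service: mean(tool(service)) for service in services}


summary = summarize(["api", "db"], fetch_latencies)
print(summary)
```

The speedup in a setup like this would come from replacing one model round trip per tool call with a single program run.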
I have built in this space, which led me to develop a MiniZinc MCP server [0] for scheduling bocce tournaments [1]. Scheduling with constraints is an NP-hard problem, so it makes sense that people struggle. Tools exist to solve this problem, but they are complex and hard to use for non-technical folks, and even for technical folks. I am hoping a tool like this can bridge the gap, and I would like to bring it to your awareness if you aren't already thinking about the problem this way :)
Edit: after reading a bit more of the description, it looks like y'all are taking a similar approach, kudos!
This is awesome! Completely agree: modeling each real-life scenario as a constraint satisfaction problem is tricky in and of itself (especially with the diversity of non-intersecting constraints we encounter) and is something we are actively working on. Using LLMs as a layer above has made it much more tractable. Curious how the bocce scheduling has fared in real-world scenarios. How was the performance?
In a way, yes: you can reduce context usage by a non-negligible amount by approaching it this way. I'm investigating this in my skill validation/analysis/bidirectional MCP server project and hope to have it as a released feature soon: https://github.com/athola/skrills
As an infrastructure engineer, the idea of being able to train computer use agents without provisioning infrastructure sounds amazing!
A common use case I run into is configuring corporate VPN software on Windows machines. Is there a link to a getting started guide I could try this out with?
Yes, in a simulated environment you can do this today using plain JS and connecting to a real VPN, while driving the desktop UI. No infra provisioning needed.
I ran into this when writing agents to fix unit tests. Oftentimes they would just give up early, so I started writing the verifiers directly into the agent's control flow, and this produced much more reliable results. I believe Claude Code has hooks that do something similar as well.
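A rough sketch of what putting the verifier in the control flow can look like: the loop, not the agent, decides when the task is done, and each failure is fed back into the next attempt. Everything here is an assumption for illustration; `fake_agent` and `fake_verifier` are hypothetical stand-ins for a real model call and a real test run.

```python
from typing import Callable, Optional, Tuple

Verifier = Callable[[str], Tuple[bool, str]]


def run_until_verified(
    propose: Callable[[str], str],
    verify: Verifier,
    max_attempts: int = 3,
) -> Tuple[Optional[str], int]:
    """Keep asking for candidates until the verifier passes or attempts run out."""
    feedback = ""
    for attempt in range(1, max_attempts + 1):
        candidate = propose(feedback)
        passed, feedback = verify(candidate)
        if passed:
            return candidate, attempt
    return None, max_attempts


# Hypothetical agent: returns canned patches instead of calling a model.
_canned_patches = iter(["return a - b", "return a + b"])


def fake_agent(feedback: str) -> str:
    return next(_canned_patches)


# Hypothetical verifier: builds the patched function and checks one case.
# A real one would more likely run the project's test suite.
def fake_verifier(patch: str) -> Tuple[bool, str]:
    namespace: dict = {}
    exec(f"def add(a, b):\n    {patch}", namespace)
    ok = namespace["add"](2, 3) == 5
    return ok, ("" if ok else "add(2, 3) should be 5")


result, attempts = run_until_verified(fake_agent, fake_verifier)
print(result, attempts)
```

Because the exit condition lives in the loop, the agent cannot declare success early; it only stops when the check passes or the attempt budget is spent.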