Following up on my last post about optimizing tool selection with differentiable programming, I’ve been thinking about how to extend those ideas to full agent workflows. This post shares some early experiments using DSPy to optimize routing and structure end-to-end for a sample customer service agent workflow. Feedback welcome!
for sure — I think there's a way here where we ought to be able to learn multiple tool calls and prompts jointly from real-world data. investigating that next.
re: different tools (apis vs mcps). in my mind, there should be no real difference in what kind of tool is called at this point, since I model selection as a softmax over a label set of tools.
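a minimal sketch of that framing — tool selection as a softmax over discrete labels, where the label set is agnostic to whether each tool is an API or an MCP server. The tool names and the logits here are hypothetical placeholders; in practice the logits would come from a router model or an LLM head:

```python
import numpy as np

# hypothetical label set -- api vs mcp makes no difference at this layer
TOOLS = ["search_api", "crm_mcp", "refund_api"]

def select_tool(logits: np.ndarray) -> tuple[str, np.ndarray]:
    """Softmax over the tool label set; returns the argmax tool and the
    full distribution (useful for a differentiable training signal)."""
    exp = np.exp(logits - logits.max())  # numerically stable softmax
    probs = exp / exp.sum()
    return TOOLS[int(probs.argmax())], probs

# placeholder logits, e.g. produced by a router for the current turn
tool, probs = select_tool(np.array([0.2, 1.5, -0.3]))
```

keeping the distribution around (not just the argmax) is what lets a loss flow back through the selection step.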
that said, an idea I want to investigate is whether tools can live in a learned embedding space, where selection isn’t a softmax over discrete labels but a nearest-neighbor or attention mechanism over continuous vectors.
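a sketch of what that could look like — tools as vectors in a learned embedding space, with selection as scaled dot-product attention over them rather than a softmax over fixed labels. The embeddings here are random stand-ins for learned ones, and the query stands in for some encoding of the current workflow state:

```python
import numpy as np

rng = np.random.default_rng(0)
d = 8                                        # embedding dimension (arbitrary)
tool_embeddings = rng.normal(size=(3, d))    # one row per tool; learned in practice
query = rng.normal(size=d)                   # stand-in for the current state encoding

# scaled dot-product attention over the tool vectors
scores = tool_embeddings @ query / np.sqrt(d)
weights = np.exp(scores - scores.max())
weights /= weights.sum()

soft_tool = weights @ tool_embeddings        # differentiable "soft" selection
nearest = int(scores.argmax())               # hard nearest-neighbor pick
```

the appeal is that new tools become new vectors rather than new labels, so the selector doesn't need a fixed-size output head.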
this is the intuition I'm developing here and in some of my other comments on this thread (see the differentiable state machine comment).
+1 - you can propagate the loss for a whole workflow across prompts + tools, which would make it much easier to build resilient workflows. or "agents" as everyone calls them now ;)
+1 - the biggest issue today is not being able to fine-tune the llm to learn the specifics of how to make a tool call better over time, which an approach like this could bring to the table.