More

skeptrune · 2026-04-03T19:20:44 1775244044

yea chromadb is not the point. multiple data storage solutions work

kenforthewin · 2026-04-03T19:26:59 1775244419

I see .. so you're not using the vectors at all. Where are the evaluations showing this chromaFS approach is performing better than vectors?

skeptrune · 2026-04-03T21:47:48 1775252868

Working on publishing those, but publishing benchmarks requires a lot of attention to detail so it will likely be a bit longer.

skeptrune · 2026-04-03T19:14:34 1775243674

agreed!

skeptrune · 2026-04-03T18:43:55 1775241835

We would also be super interested to see that comparison. I agree that there isn't a specific reason why Chroma would be required to build something like this.

skeptrune · 2026-04-03T18:42:33 1775241753

I agree that would have been the way to go given more time and resources. However, setting up a FUSE mount would have taken significantly longer and required additional infrastructure.

skeptrune · 2026-04-03T18:30:58 1775241058

100% agree. However, if there were no resource tradeoffs, then a FUSE mount would probably be the way to go.

skeptrune · 2026-04-03T18:30:08 1775241008

Modern OCR tooling is quite good. If the knowledge you are adding into your search database is able to be OCR'd then I think the approach we took here is able to be generalized.

skeptrune · 2026-04-03T18:28:49 1775240929

Hmmm, the post is an attempt to explain that Mintlify migrated from embedding-retrieval->reranker->LLM to an agent loop with access to call POSIX tools as it desires. Perhaps we didn't provide enough detail?

dmix · 2026-04-03T18:39:04 1775241544

That matches what I'm curious about. Where an LLM is doing the bulk of information discovery and tool calling directly. Most simpler RAGs have an LLM on the frontend mostly just doing simpler query clean up, subqueries and taxonomy, then again later to rerank and parse the data. So I'd imagine the prompting and guardrails part is much more complicated in an agent loop approach, since it's more powerful and open ended.

skeptrune · 2026-04-03T18:27:35 1775240855

Vector search has moved from a "complete solution" to just one tool among many which you should likely provide to an agent.

skeptrune · 2026-04-03T18:26:35 1775240795

I think it's cool that LLMs can effectively do this kind of categorization on the fly at relatively large scale. When you give the LLM tools beyond just "search", it really is effectively cheating.

skeptrune · 2026-04-03T18:24:17 1775240657

100% agree a FUSE mount would be the way to go given more time and resources.

Putting Chroma behind a FUSE adapter was my initial thought when I was implementing this but it was way too slow.

I think we would also need to optimize grep even if we had a FUSE mount.

This was easier in our case, because we didn’t need a 100% POSIX compatibility for our read only docs use case because the agent used only a subset of bash commands anyway to traverse the docs. This also avoids any extra infra overhead or maintenance of EC2 nodes/sandboxes that the agent would have to use.

darkteflon · 2026-04-03T21:43:24 1775252604

Did you guys look at Firecracker-based options such as E2B and Fly.io? We’ve had positive early results on latency, but yeah … too early to tell where we end up on cost.

skeptrune · 2026-04-03T21:46:37 1775252797

Yea we did and actually use Daytona for another product, but it would have been too slow here.

readitalready · 2026-04-03T19:31:18 1775244678

Yah my Claude Code agents run a ton of Python and bash scripts. You're probably missing out on a lot of tool use cases without full tool use through POSIX compatibility.

skeptrune · 2026-04-03T21:46:58 1775252818

agreed. hopefully we can get there soon

Galanwe · 2026-04-03T19:18:31 1775243911

Makes sense, thanks for clarifying!

HN For You