What interested me most wasn’t the model itself, but the surrounding system design: automated context retrieval, evaluation loops, and memory that improves the agent over time.
I’ve been experimenting with recreating a similar setup, but with a different goal: making the configuration more accessible for any company.
The result is an open-source YAML + Markdown framework where you define context sources, tools, and behavior explicitly instead of writing Python code. The idea is to make agent context easier to reason about, version, and iterate on, especially for data teams.
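To make that concrete, here's a rough sketch of what an agent defined this way could look like. The keys and source types below are illustrative guesses, not nao's actual schema; check the repo for the real format:

```yaml
# Hypothetical agent definition (illustrative keys, not the actual nao schema)
agent:
  name: analytics-helper
  instructions: ./prompts/analyst.md   # behavior lives in Markdown, not Python

context:
  sources:
    - type: dbt
      path: ./models                   # pull schema and docs from the dbt project
    - type: warehouse
      connection: snowflake_prod

tools:
  - run_sql
  - list_tables
```

The point is that everything the agent can see or do is declared in one versionable file, so iterating on context is a diff review rather than a code change.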
You’re right that it’s important to keep facts, numbers, constants, and the like, and that’s exactly how the agent is prompted. It’s not really summarization; it’s removing noise (UI chrome, menus, repeated labels, etc.) while keeping all the numbers and facts. I’ve been using the tool myself for a while and it’s held up well. If we find that keeping everything as-is works better, we can always turn off densification and keep the raw capture.
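As a toy illustration of that kind of densification (purely my own sketch, not the tool's actual logic): drop short navigation-like fragments while always keeping lines that carry digits or real sentence content.

```python
import re

# Toy noise filter: keep lines with numbers or sentence-like content,
# drop short UI-chrome fragments (menus, labels, buttons).
NOISE = {"home", "menu", "login", "sign up", "share", "next", "prev"}

def densify(text: str) -> str:
    kept = []
    for line in text.splitlines():
        s = line.strip()
        if not s:
            continue
        if s.lower() in NOISE:
            continue  # known UI chrome
        # Always keep lines containing digits (prices, dates, metrics),
        # plus anything long enough to be a real sentence.
        if re.search(r"\d", s) or len(s.split()) >= 4:
            kept.append(s)
    return "\n".join(kept)

page = "Menu\nLogin\nRevenue grew 14% in Q3\nShare\nThe team shipped two releases this quarter"
print(densify(page))
# → Revenue grew 14% in Q3
#   The team shipped two releases this quarter
```

The real prompt-driven version is fuzzier than a regex, but the invariant is the same: facts and numbers are never allowed to fall out.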
One thing I keep seeing in practice is that “memory” problems are often less about storage and more about structure + retrieval strategy.
Vector search helps sometimes, but for a lot of agent workflows we’ve had better results with explicit context organization (files, metadata, rules) rather than semantic similarity alone.
Curious how you’re thinking about memory updates over time — append-only vs rewriting summaries?
That matches our experience pretty closely.
A lot of “memory” issues we saw weren’t about storage capacity, but about what kind of information is allowed to persist and how it’s structured. Once everything is flattened into one blob, retrieval strategy becomes the only lever left — which is where vectors often get overused.
In Mneme, updates are intentionally asymmetric:

- Facts are append-only and explicitly curated (they’re meant to be boring and stable).
- Task state is rewritten as work progresses.
- Context is disposable and aggressively compacted or dropped.
The idea is that only a small subset of information deserves long-term durability; everything else should be easy to overwrite or forget.
This reduces the need for heavy retrieval logic in the first place, since the model is usually operating over a much smaller, more explicit working set.
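A minimal sketch of that asymmetry (my own toy model of the idea, not Mneme's actual API): each tier gets its own update rule, so durability is a property of the structure rather than of retrieval logic.

```python
from dataclasses import dataclass, field

@dataclass
class AgentMemory:
    """Toy model of asymmetric memory updates: durable facts,
    rewritable task state, disposable working context."""
    facts: list = field(default_factory=list)        # append-only
    task_state: dict = field(default_factory=dict)   # rewritten in place
    context: list = field(default_factory=list)      # aggressively compacted

    def add_fact(self, fact: str) -> None:
        # Facts only accumulate; nothing here can overwrite or delete them.
        if fact not in self.facts:
            self.facts.append(fact)

    def update_task(self, **state) -> None:
        # Task state is overwritten as work progresses.
        self.task_state.update(state)

    def compact_context(self, keep_last: int = 3) -> None:
        # Context is disposable: keep only the recent tail.
        self.context = self.context[-keep_last:]

mem = AgentMemory()
mem.add_fact("billing DB is Postgres 14")
mem.update_task(step="writing migration", progress=0.4)
mem.context = [f"tool output {i}" for i in range(10)]
mem.compact_context()
print(len(mem.context))  # → 3
```

Because the working set stays small and explicit, the model rarely needs a similarity search to find what matters.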
This is super helpful — most writeups skip over the actual communication steps, so seeing the All-to-All flow laid out makes it much clearer.
Curious from your experiments: at 1M+ context, does communication start dominating vs compute?
I keep seeing cases where bigger context windows are technically possible but don’t translate into better results unless the context is very structured, so I wonder where the real scaling limit ends up being in practice.
As we scale to 1M+ context length at inference time, the biggest bottleneck is memory, and tackling that at scale means paying the price of communication overhead. Fortunately, the GPUs smartly prefetch data for the next step while the previous step is computing, masking the communication overhead and keeping response times realistic at that scale.
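The overlap they describe is essentially double buffering: issue the transfer for step n+1 while step n computes. A rough sketch of the pattern using threads (generic, nothing GPU-specific; `fetch` and `compute` are hypothetical stand-ins):

```python
from concurrent.futures import ThreadPoolExecutor
import time

def fetch(step):
    # Stand-in for moving KV cache / weights across the interconnect.
    time.sleep(0.05)
    return f"data-{step}"

def compute(data):
    # Stand-in for the attention/MLP work for one step.
    time.sleep(0.05)
    return f"out({data})"

def run(num_steps):
    results = []
    with ThreadPoolExecutor(max_workers=1) as io:
        nxt = io.submit(fetch, 0)                 # prefetch the first step
        for step in range(num_steps):
            data = nxt.result()                   # blocks only if the fetch is slower
            if step + 1 < num_steps:
                nxt = io.submit(fetch, step + 1)  # kick off the next transfer now
            results.append(compute(data))         # compute overlaps the transfer
    return results

print(run(4))
# → ['out(data-0)', 'out(data-1)', 'out(data-2)', 'out(data-3)']
```

When fetch and compute take comparable time, the slower of the two sets the pace instead of their sum, which is exactly the "masked communication" effect.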
The quality degradation as context length increases is a whole other science problem.
Here is our contact page; feel free to contact us and we'll help you set up: https://docs.getnao.io/docs/support/support
Also, we'll be adding a new dbt onboarding flow tomorrow!
Probably in a few months. For now we're focusing on making the experience great for a restricted number of warehouses. But you can reach out by email and we'll keep you updated.
Repo: [https://github.com/getnao/nao](https://github.com/getnao/nao)
Would love feedback from people who have tried deploying analytics or data agents before.