I just did the opposite and am seeing better results.
Claude was struggling to use the ‘gh’ command to reliably read and respond to code review line level comments because it had to use the api. I had it write a few simple command line tools and a skill for invoking it, instantly better results.
Human DX optimizes for discoverability and forgiveness.
Agent DX optimizes for predictability and defense-in-depth.
These are different enough that retrofitting a human-first CLI for agents is a losing bet.
I recently formalized a `.agent/workflows/knowledge-extraction.md` to extract high-signal engineering patterns from Antigravity’s persistence layer to make them explicit and portable.