Maybe they used Claude to come up with a good method to do this. /s
But I was also wondering, how this even works. The AI agent can have its own cursors and none of its actions interrupt my own workflow at all? Maybe I need to try this.
Also, this sounds like it would be very expensive since from my understanding each app frame needs to be analysed as an image first, which is pretty token intensive.
I am working on a Dungeons & Dragons combat tracker: "Top Of The Round".
For those DMs that use tools like these, my app sits between Shieldmaiden and Improved Initiative in terms of features/complexity. I tried to offer as many features as possible but "hide" them in a way that makes it easy to understand the most important information like initiative order, health and conditions, stat-blocks. But then I added many buttons with keyboard shortcuts and a quick-access command-palette (think MacOS Spotlight or Alfred on Linux) that lets you access even more commands and features just by typing.
> the OpenShell piece that does the security sandboxing. Gives a lot more granular control over exec and network egress calls. Docker doesn't provide this out of the box.
Same for me, I hope it won't be laggy at all and the UI is good to use. So far I like my setup with a chromecast and my phone as the remote/tv library (youtube, netflix, amazon video, disney etc.).
I especially like that I can start watching something and at the same time I can browse around for something else to watch.
It looks like the remote has volume buttons on the right side by the way.
But I was also wondering, how this even works. The AI agent can have its own cursors and none of its actions interrupt my own workflow at all? Maybe I need to try this.
Also, this sounds like it would be very expensive since from my understanding each app frame needs to be analysed as an image first, which is pretty token intensive.
reply