Hermes Agent + Gemini 3.5 Flash + Computer Use via trycua.
I asked it to open Chrome, navigate to ChatGPT, and generate an image.
It took 1 minute 47 seconds to submit the request, then another 30 seconds after the image was generated to analyze the result and respond.
(This doesn't include ChatGPT's image generation time.)
It's still slow, but it's pretty cool for tasks you don't want to sit around and do yourself.
Video speed: 4×
显示更多