Exclusive: Honor Debuts a New AI Agent That Can Read and Understand Your Screen

0
73


It chose a restaurant, but then couldn’t complete the process as the spot it chose required a credit card to confirm a reservation, at which point the user had to take over. You can be flexible in your query—in another example, asking it to book a “highly rated” restaurant meant it would look at reviews with high scores, though the agent doesn’t do any more research than that. It’s not cross-referencing OpenTable reviews with data from other parts of the web, especially since all of this data is processed on device and isn’t sent to the cloud.

This kind of agentic artificial intelligence is the current buzzword in the tech sphere. My colleague Will Knight recently tested an AI assistant that could browse the web and perform tasks online. Google late last year unveiled its Gemini 2 AI model trained to take actions on your behalf. It also renews the idea of a generative user interface for smartphones—at MWC 2024, we saw a few companies working on ways to interact with apps without using apps at all, instead leaning on AI assistants to generate a user interface as you issued a command.

Honor’s approach feels somewhat like what Rabbit—of the infamous Rabbit R1—is doing with Teach Mode, where you train its assistant manually to complete a task. There’s no need to access an app’s Application Programming Interface (API), which is the traditional way apps or services communicate with each other. The agent memorizes the process, allowing you to then issue the command and have it execute the task.

But Honor says its self-reliant AI execution model isn’t trained to follow strict steps—it’s capable of multimodal screen context recognition to perform tasks autonomously. Instead of having to train the assistant to learn every single part of the OpenTable app, it is capable of understanding the semantic elements of the user interface and will follow-through with a multi-step process to execute your request. Honor highlighted that this process was more cost effective: “Unlike competitors such as Apple, Samsung, and Google, which rely on external APIs—resulting in higher operational costs—Honor’s AI Agent independently manages a wide range of tasks.”

Closeup of a mobile phone screen showing Honor AI is handling the task the user requested

Photograph: Julian Chokkattu

LEAVE A REPLY

Please enter your comment!
Please enter your name here