Google set to unveil ‘Project Jarvis’ AI model for task automation, sources say

Powered by a future version of Google’s Gemini, Jarvis reportedly only works with a web browser


News Desk October 28, 2024

Google may unveil its own take on Rabbit’s large action model concept as early as December, according to "The Information". Reportedly codenamed “Project Jarvis,” the tool aims to perform tasks on behalf of users, including “gathering research, purchasing a product, or booking a flight,” as disclosed by three sources with direct knowledge of the project.

Powered by a future version of Google’s Gemini, Jarvis reportedly operates exclusively within a web browser, fine-tuned specifically for Chrome.

The tool is designed to “automate everyday, web-based tasks” by analysing and interacting with on-screen elements, such as clicking buttons or entering text after taking screenshots. As it stands, the tool requires “a few seconds” to perform each action, *The Information* notes.

Google’s move aligns with efforts by other leading AI companies to enhance digital task automation. Microsoft’s upcoming Copilot Vision, for instance, will allow users to discuss webpages they’re viewing, while Apple Intelligence is anticipated to handle screen-based actions across multiple apps within the next year. Anthropic recently launched a beta version of Claude, though it has been described as “cumbersome and error-prone,” and OpenAI is reportedly working on a similar model.

However, "The Information" cautions that Google’s plan to preview Jarvis in December could still change. The company may opt for a limited release to a select group of testers to help identify and resolve any issues.

COMMENTS

Replying to X

Comments are moderated and generally will be posted if they are on-topic and not abusive.

For more information, please see our Comments FAQ