Jarvis will be able to respond to user requests by taking screenshots and interpreting them before performing any actions, such as pressing a button or entering text.
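The screenshot-then-act behaviour described above can be pictured as a simple loop. The sketch below is only an illustration under stated assumptions: Google has not published how Jarvis works internally, and every name here (`Action`, `run_agent`, `take_screenshot`, `interpret`, `perform`) is hypothetical. The "screenshot" is a plain dictionary standing in for an image, and the "interpreter" is a hand-written rule standing in for a language model.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Action:
    """A single UI step (hypothetical type, not part of any real API)."""
    kind: str        # "type" or "click"
    target: str      # name of the UI element to act on
    text: str = ""   # text to enter, used only for "type" actions


def run_agent(
    request: str,
    take_screenshot: Callable[[], dict],
    interpret: Callable[[str, dict, List[Action]], Optional[Action]],
    perform: Callable[[Action], None],
    max_steps: int = 10,
) -> List[Action]:
    """Repeat: capture the screen, decide on one action, carry it out."""
    history: List[Action] = []
    for _ in range(max_steps):
        screenshot = take_screenshot()                    # 1. observe
        action = interpret(request, screenshot, history)  # 2. interpret
        if action is None:                                # nothing left to do
            break
        perform(action)                                   # 3. act
        history.append(action)
    return history


def demo():
    """Run the loop against a fake in-memory page instead of a real browser."""
    page = {"search_box": "", "submitted": False}

    def take_screenshot() -> dict:
        return dict(page)  # a copy of the page state stands in for an image

    def interpret(request, screenshot, history):
        # Hand-written rule standing in for a model's decision.
        if screenshot["search_box"] == "":
            return Action("type", "search_box", request)
        if not screenshot["submitted"]:
            return Action("click", "search_button")
        return None

    def perform(action: Action) -> None:
        if action.kind == "type":
            page[action.target] = action.text
        elif action.kind == "click":
            page["submitted"] = True

    history = run_agent("train tickets to Berlin", take_screenshot, interpret, perform)
    return history, page


if __name__ == "__main__":
    steps, final_page = demo()
    for step in steps:
        print(step)
    print(final_page)
```

The step limit (`max_steps`) is a design choice of this sketch, not a reported feature: it simply keeps the loop from running forever if the interpreter never decides it is finished.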
In general, the agent will be able to help users with everyday tasks such as shopping, booking tickets, or doing research.
Google is likely to introduce Jarvis in December 2024, along with the next major version of its flagship language model, Gemini. It is not yet known whether the agent will be available only in Chrome or whether it will also work in other browsers.