Google may be close to unveiling an AI agent that can interact with your web browser and help you automate your daily tasks. According to The Information, the company is working on a “computer-based agent” codenamed Project Jarvis, which could be previewed as early as December. Sources who spoke to The Information said Jarvis “frequently captures screenshots of what’s on his computer screen before taking any action, such as clicking a button or typing in a text field.” “By interpreting the shot, it responds to human commands.”
Jarvis is reportedly made to work only in web browsers, specifically Chrome, to help with common tasks such as research, shopping, and booking flight tickets. The Verge reports that the announcement comes as Google continues to expand the functionality of Gemini AI, with the next generation model expected to be announced in December. Gemini Live, Google’s AI chatbot, gained support for dozens of new languages this month, and Gemini integration recently extended to Google Meet, Photos, and other apps.
Jarvis’ news comes just days after Anthropic introduced similar, but seemingly more expansive, capabilities to its Claude AI. The company says this feature includes computer skills and “the ability to use a wide range of standard tools and software programs designed for humans.” Currently available as a public beta.