When ChatGPT arrived from OpenAI at the end of 2022, wowing the public with the way it answered questions, wrote term papers and generated computer code, Google found itself playing catch-up. Like other tech giants, the company had spent years developing similar technology but had not released a product as advanced as ChatGPT.
(The New York Times sued OpenAI and its partner, Microsoft, in December, claiming copyright infringement of news content related to AI systems.)
Google released its own chatbot, Bard, in March to middling reviews. In the weeks that followed, the company merged its two leading AI labs — Google Brain and DeepMind — and announced that the combined lab was developing new AI technology called Gemini.
Gemini is what researchers call a large language model, or LLM, a mathematical system that can learn skills by analysing vast amounts of data, including books, computer programs and online chatter. By identifying patterns in all that text, an LLM can learn to generate text on its own. That means it can write poetry, generate computer code and even carry on a conversation.
It is also prone to mistakes. It can get facts wrong or “hallucinate” — make stuff up.
Gemini is a “multimodal” system, meaning it can respond to both images and sounds. After analysing a maths problem that included graphs, shapes and other images, it could answer the question much the way a high school student would.
In December, Google used a limited version of this technology to upgrade Bard. Now, the company has retired the Bard name and is releasing a more powerful version of the technology through the Gemini app, which is available on Android phones and the web. A version for iPhones will arrive “in the coming weeks”, Google said.
Sub required for full version
Google created a free but limited version of the Gemini app. A more powerful version — called Gemini Advanced and underpinned by a version of Google’s Ultra language model — is available for a $36.99 monthly subscription. Google offers a free two-month trial.
Google has released benchmark test results claiming that Ultra outperformed OpenAI’s latest technology, GPT-4, in several key areas, including generating computer code and summarising news articles.
The Gemini app can also generate, analyse and respond to images. Users can upload a photo from their Super Bowl party, for instance, and ask the app to generate a caption.
Google also said it would offer similar technology through the Google Workspace and Google Cloud business services. This will allow customers to use the technology alongside apps like Gmail and Google Docs.
On Android phones, the new app will replace Google Assistant if users download Gemini. Like Google Assistant, it can respond to voice commands, though it also responds to text commands.
Google said it would also continue to offer and improve Google Assistant.
Last year, OpenAI released a similar version of its ChatGPT chatbot that can respond to voice commands. Most industry insiders believe that the AI technology that drives chatbots like ChatGPT will merge with and replace digital assistants like Apple’s Siri and Amazon’s Alexa.
Written by: Cade Metz
©2024 THE NEW YORK TIMES