Zoubin Ghahramani, vice president of Google Research, speaking at an AI event.
Google is preparing to launch its answer to rival OpenAI’s GPT-4: Gemini.Gemini is a next-gen, multimodal AI model due for release later this year.Google has already distributed early versions of Gemini, The Information reported.
Google is getting ready to launch Gemini, its answer to OpenAI’s GPT-4.
The tech is a next-gen, multimodal AI model being worked on by a team of researchers pulled from Google’s now-merged AI divisions DeepMind and Google Brain.
The model is being promoted as a significant advancement in natural language processing. Google is expected to release the model sometime later this year.
The upcoming release has been closely watched by the tech world, with many speculating about the model’s ability to better its main competitor, OpenAI’s large language model GPT-4.
While details on Google’s new offering are still few and far between, here’s what we know so far.
Gemini is multimodal
Google’s Gemini is a multimodal AI, meaning it can process more than one type of data.
The model is expected to be able to process images and text, allowing for features such as written analysis of visual graphs, The Information reported. The tech giant is also upping the tech’s code-generating abilities to try and take on Microsoft’s GitHub Copilot, which is powered by OpenAI, the outlet added.
Google said in a blog post that once testing and fine-tuning had been completed, the model would be available in various sizes and capabilities.
Inspired by AlphaGo
Gemini owes a debt to AlphaGo, which was developed by Google’s DeepMind and became the first computer program to defeat a professional human Go player. AI history was made back in 2016 when AlphaGo beat Lee Sedol, one of the world’s greatest Go players, at his own game.
The techniques used in AlphaGo would be combined with the technology that powers ChatGPT, Demis Hassabis, the boss of DeepMind, told Wired in June.
“At a high level you can think of Gemini as combining some of the strengths of AlphaGo-type systems with the amazing language capabilities of the large models,” he said.
Google is so close to launching Gemini that the tech giant has been handing out an early version of the model to a small group of companies, The Information reported.
One person who had tested the tech told the outlet it may have an advantage on GPT-4 because it leverages Google’s data from consumer products, as well as information gathered from the internet. The addition should mean the model can more accurately understand the user’s intentions, the report said.
The person also said the model appeared to generate fewer incorrect answers, a common problem in artificial intelligence known as hallucinations. AI-powered chatbots have been known to present incorrect information as fact. Back in February, Google’s ad for ChatGPT rival Bard showed the AI chatbot giving an inaccurate answer.
Researchers behind the SemiAnalysis blog have also predicted that Google’s Gemini would likely outperform GPT-4 because of Google’s access to top-flight chips.