Google has announced the introduction of Gemini, the most capable and general multimodal AI model it has ever built.
Gemini is the result of large-scale collaborative efforts by teams across Google, including their colleagues at Google Research. It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across and combine different types of information including text, code, audio, image and video.
Gemini is also their most flexible model yet — able to efficiently run on everything from data centers to mobile devices. Its state-of-the-art capabilities will significantly enhance the way developers and enterprise customers build and scale with AI.
Google has been rigorously testing its Gemini models and evaluating their performance on a wide variety of tasks. From natural image, audio and video understanding to mathematical reasoning, Gemini Ultra’s performance exceeds current state-of-the-art results on 30 of the 32 widely-used academic benchmarks used in large language model (LLM) research and development.
With a score of 90.0%, Gemini Ultra is the first model to outperform human experts on MMLU (massive multitask language understanding), which uses a combination of 57 subjects such as math, physics, history, law, medicine and ethics for testing both world knowledge and problem-solving abilities.
Google and Alphabet CEO Sundar Pichai said: “Every technology shift is an opportunity to advance scientific discovery, accelerate human progress, and improve lives. I believe the transition we are seeing right now with AI will be the most profound in our lifetimes, far bigger than the shift to mobile or to the web before it. AI has the potential to create opportunities — from the everyday to the extraordinary — for people everywhere. It will bring new waves of innovation and economic progress and drive knowledge, learning, creativity and productivity on a scale we haven’t seen before.”
Although this new AI model introduced by Google has been getting attention for appearing to be the first model to beat OpenAI’s GPT-4, this is is only going to be the case for the Gemini Ultra version, which won’t be available until next year. Nevertheless, the report that Google released along with the announcement of Gemini’s release reveals that it outperforms OpenAI’s GPT-3.5 in many aspects.