Google Launches AI Model Gemini

Google has launched Gemini, a large language model (LLM) that can understand different types of information, including text, audio, images, and video. Google calls Gemini its most capable and general-purpose AI model.

Here are some details about Gemini:

  • Size: Gemini comes in three sizes
  • Power: Gemini Pro outperformed OpenAI’s GPT-3.5
  • Availability: Gemini is available in Bard and Pixel phones
  • Plans: Google plans to expand the advanced version of Gemini next year

Google AI is a division of Google dedicated to artificial intelligence. Google has research facilities in various parts of the world such as Zurich, Paris, Israel, and Beijing.

The case for Bard may have just gotten more compelling, though: as of today, for English-speaking users in 170 countries, Bard is now powered by Google’s new Gemini model, which it says matches and even exceeds OpenAI’s tech in a number of ways. (Google says Gemini is coming to more languages and countries “in the near future.”)

Bard is now running Gemini Pro, the middle tier of the Gemini series. Ultra is the biggest and slowest but the most capable, Nano is small and fast and meant for on-device tasks, and Pro sits right in the middle. It’s meant to be the Goldilocks version of the model, really: fast and efficient while still as capable as possible.

Sissie Hsiao, who runs Bard and Assistant at Google, said in a press briefing that Gemini represents the “biggest and best upgrade yet” for Bard. It should be a marked improvement for just about everything Bard already does: summarizing, brainstorming, writing, and the like.

Sundar Pichai, Google’s CEO, tells me that, in his testing, he’s found that there’s not so much a whizbang new feature as there is just an overall improvement across the board. “

I think people are just going to find that the product got a lot better,” he says. “It understands their intent better, it’s answering better. It’s more factual, higher quality. If you’re trying to code it’s better!”

Right now, Bard is still just a chatbot: you type, it types back. But there’s a new version of Bard coming soon that could be much more. Next year, Google is planning to launch a preview of “Bard Advanced,” powered by Gemini Ultra, which is the most powerful and capable version of Google’s new large language model.

Gemini Ultra is also the multimodal version of the model, meaning it can accept and create images, audio, and video in addition to just text. The non-text interactions are where Gemini in general really shines, says Demis Hassabis, the head of Google DeepMind. “We built it to be natively multimodal from the ground up,” he says.

“That’s one of the new capabilities that it has… the kinds of seamless integration and reasoning it can do across modalities.” Google’s demos included the YouTuber Mark Rober using Bard to make the perfect paper airplane — including by taking photos of his designs to get AI-provided feedback — and parents uploading pictures of their children’s homework to get help figuring out where their math went wrong.

That’s all just demos and promotional videos for now, though. Pichai says he thinks of this launch both as a big moment for Bard and as the very beginning of the Gemini era. But if Google’s benchmarking is right, the new model might already make Bard as good a chatbot as ChatGPT. And that’s already a pretty impressive feat.

California18

Welcome to California18, your number one source for Breaking News from the World. We’re dedicated to giving you the very best of News.

Leave a Reply