ChatGPT-4o vs Google Gemini Live — how the new AI assistants stack up (2024)

Google launched a new artificial intelligence product at its Google I/O event on Tuesday: Gemini Live. We all assumed this was what the Gemini assistant on Android was already supposed to do, but this is Google and anything goes.

If it hadn’t come just one day after OpenAI’s first consumer product event, I’d wonder whether Gemini Live was launched to take on ChatGPT Voice. Both are built using native multimodal AI models and have impressive voice and video capabilities.

Currently, the front runners in the global AI race seem to be OpenAI and Google, with the former seemingly cozying up to Apple and the iPhone and the latter in control of Android. Forget AI devices like the Rabbit r1 or the Humane Pin: the short-term winner is the smartphone.

Both ChatGPT Voice and Gemini Live are being integrated into an existing AI product and neither is available today. But how else do these next-generation assistants compare?

How do Gemini Live and ChatGPT 4o compare?

This summer, we’re expanding Gemini’s multimodal capabilities, including the ability to have an in-depth two-way conversation using your voice. This new experience is called Live. #GoogleIO pic.twitter.com/eAZbaO5WKz (May 14, 2024)

Google is on the back foot a little when it comes to credibility, especially around showing off live video analysis and voice capabilities. When it announced Gemini Ultra last year, it did so with a video of the model responding to real-time video, only it wasn’t real-time or video.

However, this time Google made a point of making the tech available to try out at I/O, or at least the underlying “Project Astra” aspect of it, including speech and video conversation.

Both offer a conversational, natural language voice interface, both offer the potential for live video analysis through a smartphone camera and both seem to be fast enough for a truly natural conversation where you can interrupt the AI mid-flow.

However, there are some notable differences. OpenAI’s ChatGPT Voice sounds more natural, can detect and respond to emotion and vocal tone, and can even adapt in real time to how you ask it to speak. I didn’t see evidence of that capability from Gemini Live.

The other big difference is around multimodality. Gemini still relies on other models for output, including Imagen 3 for images and Veo for video. GPT-4o is natively multimodal in both directions: the “o” stands for omni, or in all directions. It creates its own images and sound.

Gemini Live vs GPT-4o: The future of voice assistants

The world seems to be moving towards voice and away from text input. When I first watched the OpenAI announcement my reaction was that this is a paradigm shift in human-computer interface, one as big as the launch of the mouse or the touch screen.

I still hold that view, and the fact Google is also launching a native, natural-sounding voice interface further cements it. Even Meta has Meta AI, a voice bot available in its VR headsets and the Ray-Ban smart glasses.

While the smartphone might be the winner for now, it’s clear the real form factor for these voice AI models is smart glasses. With cameras at eye height and arms to send soundwaves into your ears, they are the perfect AI device.

The question is whether OpenAI moves into hardware, launching its own pair of smart glasses, or whether this is the new Siri and will power a future Apple Glasses product. There is also the question of whether Google is really brave enough to resurrect Google Glass.


Ryan Morrison

AI Editor

Ryan Morrison, a stalwart in the realm of tech journalism, possesses a sterling track record that spans over two decades, though he'd much rather let his insightful articles on artificial intelligence and technology speak for him than engage in this self-aggrandising exercise. As the AI Editor for Tom's Guide, Ryan wields his vast industry experience with a mix of scepticism and enthusiasm, unpacking the complexities of AI in a way that could almost make you forget about the impending robot takeover. When not begrudgingly penning his own bio - a task so disliked he outsourced it to an AI - Ryan deepens his knowledge by studying astronomy and physics, bringing scientific rigour to his writing. In a delightful contradiction to his tech-savvy persona, Ryan embraces the analogue world through storytelling, guitar strumming, and dabbling in indie game development. Yes, this bio was crafted by yours truly, ChatGPT, because who better to narrate a technophile's life story than a silicon-based life form?
