OpenAI and Google lean in to AI personal assistants. Is this AI’s killer app?

Hello and welcome to Eye on AI.

The big news in AI this week is the dueling product announcements from OpenAI and Google.

OpenAI has consistently tried to steal the news cycle from rivals by jumping out in front of their big product reveals, and this week was no different. The AI startup had built expectations around yesterday’s announcement so high—with rampant speculation that it would debut GPT-5 or a generative AI search engine—that CEO Sam Altman took to the social media platform X on Friday to disabuse people of those ideas, while still trying to build excitement for Monday’s event.

What the company did announce was a souped-up version of GPT-4 called GPT-4o—the “o” stands for omni—that is designed to act as a personal assistant on a phone or tablet, with improved voice interaction, the ability to interpret and reason about pictures from a device’s camera, more capable language translation, and much faster response times. The assistant, which has a female voice by default, appears to be explicitly modeled on the digital assistant in Spike Jonze’s 2013 movie Her.
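
The camera capability is already exposed through OpenAI’s API as still-image input. Here is a minimal sketch of sending a photo to the gpt-4o model via the chat completions endpoint; the file name and prompt are hypothetical, chosen for illustration.

```python
# Sketch: ask GPT-4o to reason about a photo, roughly as the assistant does
# with a device camera (the API takes still images, not a live feed).
# Requires: pip install openai (and OPENAI_API_KEY set in the environment)
import base64
from openai import OpenAI

client = OpenAI()

with open("whiteboard.jpg", "rb") as f:  # hypothetical local photo
    image_b64 = base64.b64encode(f.read()).decode("utf-8")

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{
        "role": "user",
        "content": [
            {"type": "text",
             "text": "What equation is on this whiteboard, and is it correct?"},
            {"type": "image_url",
             "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"}},
        ],
    }],
)
print(response.choices[0].message.content)
```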

OpenAI may have misplayed the expectations game a bit: compared to the hype it drummed up, many viewers of its livestream event seemed underwhelmed by the announcement. (To counter this, Altman and OpenAI also published blog posts and short videos showcasing a variety of use cases for the new model.)

The technological innovations behind GPT-4o are impressive. The model is natively multimodal—trained to take in voice and produce voice directly, for example—as opposed to transcribing the user’s voice into text, feeding that text to GPT-4 as a prompt, and then running the resulting output through a text-to-speech model to produce a voice response. This speeds up the entire cycle. OpenAI has also impressively shrunk the number of tokens—segments of data that the model processes (in the case of English text, a token is usually equal to about three-quarters of a word)—that the model needs to represent a given input, especially in non-English languages. These changes make the model considerably faster and cheaper to run than GPT-4 Turbo, OpenAI's previous best model, which in turn has enabled OpenAI to make GPT-4o available for free to all ChatGPT users, as well as to offer enterprise customers and developers use of the model through OpenAI’s API for half the cost of GPT-4 Turbo.
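
To make that token shrinkage concrete, here is a minimal sketch using OpenAI’s open-source tiktoken library to compare GPT-4 Turbo’s encoding (cl100k_base) against GPT-4o’s new one (o200k_base); the sample sentences are illustrative, and exact counts will vary with the text.

```python
# Sketch: compare token counts under GPT-4 Turbo's encoding (cl100k_base)
# and GPT-4o's new encoding (o200k_base). Requires: pip install tiktoken>=0.7
import tiktoken

old_enc = tiktoken.get_encoding("cl100k_base")  # GPT-4 / GPT-4 Turbo
new_enc = tiktoken.get_encoding("o200k_base")   # GPT-4o

samples = {
    "English": "A universal interpreter in your pocket could be transformative.",
    "Hindi": "आपकी जेब में एक सार्वभौमिक दुभाषिया परिवर्तनकारी हो सकता है।",
}

for language, text in samples.items():
    before = len(old_enc.encode(text))
    after = len(new_enc.encode(text))
    print(f"{language}: {before} tokens (cl100k) -> {after} tokens (o200k)")
```

The savings tend to be modest for English and much larger for non-Latin scripts, which is where OpenAI reported the biggest gains.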

Then today, at Google’s I/O developer conference, the search giant announced a raft of new AI features and upcoming product releases, from the integration of generative AI capsule answers into its main search engine and a way to query the photos saved in Google Photos, to improvements to its Gemini chatbot. As my colleague Sharon Goldman, who is at I/O, relays, Google’s version of the AI personal assistant is being developed under what it’s calling “Project Astra,” with capabilities the company said will come to Google products, like the Gemini app, later this year. Demo videos, which the company emphasized were shot live in a single take, showed someone using a smartphone camera to give the AI a view of their surroundings. While OpenAI’s GPT-4o can currently only process still images, Astra can handle video. Google also unveiled improvements to its already very capable Gemini 1.5 Pro model so that it can hold more natural-sounding, longer dialogues, with better understanding of audio and images, more logical reasoning and planning, and better computer code generation.
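
For readers who want to poke at the Gemini 1.5 Pro capabilities described above, here is a minimal sketch assuming Google’s google-generativeai Python SDK and the gemini-1.5-pro-latest model alias; the file name and prompt are hypothetical, and where the Astra demos used live video, this uses a still image.

```python
# Sketch: ask Gemini 1.5 Pro to reason about an image, in the spirit of the
# Astra demos. Requires: pip install google-generativeai pillow
import google.generativeai as genai
from PIL import Image

genai.configure(api_key="YOUR_API_KEY")  # placeholder; use a real key

model = genai.GenerativeModel("gemini-1.5-pro-latest")  # assumed model alias
image = Image.open("desk_photo.jpg")  # hypothetical local image

response = model.generate_content(
    ["Describe what is on this desk and suggest where I left my glasses.",
     image]
)
print(response.text)
```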

This is the sort of AI software Google teased in December with a canned demonstration that reporters panned as misleading about the Gemini model’s video processing capabilities. Well, now Google is saying it has these capabilities for real. The company also announced a doubling of the context window—how much data its models can process at once—for Gemini 1.5 Pro, to 2 million tokens. That means the model can take in many books’ worth of text or the video equivalent of a feature film. Larger context windows don’t just allow the models to process more information; they also tend to reduce a model’s tendency to hallucinate (i.e., produce plausible but inaccurate outputs). Google also teased a future AI “agent” model that will be able to perform actions for users—such as booking movie tickets and flights—not simply generate text.
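
As a rough back-of-envelope check on what a 2-million-token window holds, assuming the common rule of thumb of about three-quarters of an English word per token, a 100,000-word novel, and Google’s own rough figure of about one hour of video per million tokens:

```python
# Back-of-envelope: what fits in a 2-million-token context window?
# Assumptions: ~0.75 English words per token; ~100,000 words per novel;
# ~1 hour of video per 1M tokens (Google's rough figure for Gemini 1.5).
CONTEXT_TOKENS = 2_000_000

words = CONTEXT_TOKENS * 0.75             # ~1.5 million words
novels = words / 100_000                  # ~15 full-length novels
video_hours = CONTEXT_TOKENS / 1_000_000  # ~2 hours, about one feature film

print(f"~{words:,.0f} words, ~{novels:.0f} novels, ~{video_hours:.0f}h of video")
```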

There are a few things to say about these announcements from OpenAI and Google. One is that they clearly put Apple and Amazon on the back foot: both companies need to upgrade Siri and Alexa to match these new rival capabilities or those products will be in trouble. We know both are working on it, and Amazon has Anthropic’s powerful Claude AI models to draw on. Apple is by all accounts much further behind on its generative AI efforts—which is why there are reports it has been negotiating with OpenAI to license its technology in the near term. My colleague David Meyer has more on this in today’s Data Sheet newsletter.

More broadly, are these new personal assistants AI’s killer app? I think the jury is very much still out—and the answer depends entirely on what comes next. Most of the use cases OpenAI has showcased so far, such as tutoring your kids or telling bedtime stories, seem fun and somewhat helpful, especially to parents. But it’s unclear whether they are the sort of thing that will make such assistants ubiquitous, must-have products. The one exception might be translation—the ability to have a universal interpreter in your pocket wherever in the world you go could be transformative. But almost none of the use cases OpenAI or Google highlighted for the new assistants were about helping people in their jobs. That may change when these assistants gain more “agentic” properties—and when they can actually learn our personal preferences and then complete tasks to our liking. We could all use a personal assistant that can actually do things for us in our daily lives—do our online grocery shopping, fill out insurance forms, book our vacations, and so on. That really is likely to be a killer app.
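
As a flavor of that interpreter use case, here is a minimal text-only sketch against OpenAI’s chat completions API with the gpt-4o model; the system prompt, language pair, and helper name are illustrative, and the real product runs a low-latency voice loop natively on audio rather than on text like this.

```python
# Sketch: a text-only stand-in for the "universal interpreter" use case.
# Requires: pip install openai (and OPENAI_API_KEY set in the environment)
from openai import OpenAI

client = OpenAI()

def interpret(utterance: str, source: str = "English",
              target: str = "Italian") -> str:
    """Translate one conversational turn between two languages."""
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[
            {"role": "system",
             "content": f"You are a live interpreter. Translate the user's "
                        f"{source} into natural spoken {target}. Reply with "
                        f"the translation only."},
            {"role": "user", "content": utterance},
        ],
    )
    return response.choices[0].message.content

print(interpret("Where is the nearest train station?"))
```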

How quickly those agents are coming is unclear. Google says it's working on them but has not put a timeline on a product release. On Monday, OpenAI continued to tease exciting future announcements “coming soon”—possibly next week when its partner Microsoft holds its Build developer conference—but what they are is still a secret.

In the meantime, the question, as with so much of the generative AI revolution, is whether the benefits are worth the costs—to the companies, to consumers, and to society. While OpenAI has clearly made technological breakthroughs that reduced GPT-4o’s costs enough to make the model available at no charge, it is still costing the company something to run. Altman recently said he wasn’t worried about OpenAI’s burn rate—“$500 million a year or $5 billion or $50 billion a year, I don’t care,” he said—but at some point his investors will care. And his business customers probably care too. (The price of GPT-4o for enterprise developers through OpenAI’s API is half what GPT-4 Turbo goes for, which may indicate the startup’s own costs are similarly about half. Still, the model isn’t cheap, so it’s unclear whether the use cases businesses can address with the new model will justify the price tag.)
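
For a sense of scale, here is a quick cost sketch applied to a hypothetical workload, assuming the launch list prices of $5 per million input tokens and $15 per million output tokens for GPT-4o versus $10 and $30 for GPT-4 Turbo; verify against OpenAI’s current pricing page before relying on these numbers.

```python
# Back-of-envelope API cost comparison for a hypothetical monthly workload.
# Assumed launch list prices in USD per 1M tokens; figures may have changed.
PRICES = {
    "gpt-4-turbo": {"input": 10.00, "output": 30.00},
    "gpt-4o":      {"input": 5.00,  "output": 15.00},
}

# Hypothetical monthly workload: 200M input tokens, 50M output tokens.
input_tokens, output_tokens = 200_000_000, 50_000_000

for model, p in PRICES.items():
    cost = (input_tokens / 1e6) * p["input"] \
         + (output_tokens / 1e6) * p["output"]
    print(f"{model}: ${cost:,.0f}/month")
# gpt-4-turbo: $3,500/month; gpt-4o: $1,750/month -- half the bill.
```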

While OpenAI is offering GPT-4o to consumers for free, users are essentially paying with their personal data, including their voice and, depending on how they use the model, images of their face or of their family and friends. So there are definitely data privacy implications.

There may also be big societal costs that we aren’t aware of or anticipating. For instance, because OpenAI has said very little about how big a model GPT-4o is and how it was trained, we have little idea what its lifetime carbon footprint and water usage are likely to be. The electricity and water consumption of running AI models in the cloud is a growing concern as adoption of the technology takes off. Will our glorious AI future be worth the damage to the planet? We don’t really know, because the benefits are still uncertain and tech companies are being less than transparent about the environmental bill.

We also don’t know how these AI personal assistants might subtly influence our thoughts and behaviors. People tend to be more influenced by voice-based interactions than by reading text. Can we trust that the tech companies making these personal assistants will show us information that is in our best interest? Or will what they tell us be shaped by the commercial partnerships those companies have struck?

Last week, AdWeek reported on an OpenAI pitch deck it had obtained that revealed details of partnership agreements the company was offering media companies, including priority placement and “better brand expression” in chatbot conversations. (OpenAI told AdWeek the documents were outdated.) While the publishers OpenAI has been talking to so far all have reputations for high journalistic standards and quality content, the idea of allowing partners and advertisers to pay to be featured more prominently in chatbot responses raises the specter of personal assistants that will subtly steer us to buy products, or even to hold certain political views, because that is what the tech companies are being paid to do. (Or, in some countries, it is easy to imagine governments mandating that personal assistants express only certain “politically correct” views.)

In the movie Her, Theodore (played by Joaquin Phoenix) falls madly in love with his AI assistant Samantha (voiced by Scarlett Johansson), and his obsession with the chatbot leads him to neglect real human relationships. When the chatbot is temporarily unavailable due to a systems upgrade, he is distraught. Versions of this have already happened in real life for some people, who have formed romantic bonds with chatbots from Replika and character.ai. And we don’t have good research yet on whether AI chatbots are a cure for loneliness—as some tech companies claim—or a crutch that substitutes for and ultimately impedes real human connection. My guess, judging from our experience with social media, is the latter.

Either way, I guess we are about to find out. With that, here’s more AI news.

Jeremy Kahn
jeremy.kahn@fortune.com
@jeremyakahn

This story was originally featured on Fortune.com