OpenAI debuts GPT-4o 'omni' model now powering ChatGPT

Kyle Wiggers

Updated 13 May 2024 at 4:21 pm

OpenAI announced a new flagship generative AI model on Monday that it calls GPT-4o. The "o" stands for "omni," referring to the model's ability to handle text, speech, and video. GPT-4o is set to roll out "iteratively" across the company's developer and consumer-facing products over the next few weeks.

OpenAI CTO Mira Murati said that GPT-4o provides "GPT-4-level" intelligence but improves on GPT-4's capabilities across multiple modalities and media.

"GPT-4o reasons across voice, text and vision," Murati said during a streamed presentation at OpenAI's offices in San Francisco on Monday. "And this is incredibly important, because we're looking at the future of interaction between ourselves and machines."

GPT-4 Turbo, OpenAI's previous "most advanced" model, was trained on a combination of images and text and could analyze both to accomplish tasks like extracting text from images or describing the content of those images. But GPT-4o adds speech to the mix.

What does this enable? A variety of things.

GPT-4o greatly improves the experience in OpenAI's AI-powered chatbot, ChatGPT. The platform has long offered a voice mode that reads the chatbot's responses aloud using a text-to-speech model, but GPT-4o supercharges this, allowing users to interact with ChatGPT more like an assistant.

For example, users can ask the GPT-4o-powered ChatGPT a question and interrupt ChatGPT while it's answering. The model delivers "real-time" responsiveness, OpenAI says, and can even pick up on nuances in a user's voice, responding with generated voices in "a range of different emotive styles" (including singing).

GPT-4o also upgrades ChatGPT's vision capabilities. Given a photo -- or a desktop screen -- ChatGPT can now quickly answer related questions, on topics ranging from "What's going on in this software code?" to "What brand of shirt is this person wearing?"

ChatGPT's desktop app in use in a coding task.

These features will evolve further in the future, Murati says. While today GPT-4o can look at a picture of a menu in a different language and translate it, in the future, the model could allow ChatGPT to, for instance, "watch" a live sports game and explain the rules to you.

"We know that these models are getting more and more complex, but we want the experience of interaction to actually become more natural, easy, and for you not to focus on the UI at all, but just focus on the collaboration with ChatGPT," Murati said. "For the past couple of years, we've been very focused on improving the intelligence of these models … But this is the first time that we are really making a huge step forward when it comes to the ease of use."

GPT-4o is more multilingual as well, OpenAI claims, with enhanced performance in around 50 languages. And in OpenAI's API and Microsoft's Azure OpenAI Service, GPT-4o is twice as fast as GPT-4 Turbo, costs half as much and has higher rate limits, the company says.

At present, voice isn't a part of the GPT-4o API for all customers. OpenAI, citing the risk of misuse, says that it plans to first launch support for GPT-4o's new audio capabilities to "a small group of trusted partners" in the coming weeks.

GPT-4o is available in the free tier of ChatGPT starting today and to subscribers to OpenAI's premium ChatGPT Plus and Team plans with "5x higher" message limits. (OpenAI notes that ChatGPT will automatically switch to GPT-3.5, an older and less capable model, when users hit the rate limit.) The improved ChatGPT voice experience underpinned by GPT-4o will arrive in alpha for Plus users in the next month or so, alongside enterprise-focused options.

In related news, OpenAI announced that it's releasing a refreshed ChatGPT UI on the web with a new, "more conversational" home screen and message layout, and a desktop version of ChatGPT for macOS that lets users ask questions via a keyboard shortcut or take and discuss screenshots. ChatGPT Plus users will get access to the app first, starting today, and a Windows version will arrive later in the year.

Elsewhere, the GPT Store, OpenAI's library of third-party chatbots built on its AI models and the tools for creating them, is now available to users of ChatGPT's free tier. And free users can take advantage of ChatGPT features that were formerly paywalled, such as a memory capability that allows ChatGPT to "remember" preferences for future interactions, file and photo uploads, and web search for answers to timely questions.
