OpenAI Makes New & Formidable GPT-4o Free For Everyone
By Mikelle Leow, 14 May 2024
Photo 309455957 © Ralf Liebhold | Dreamstime.com
OpenAI, the research lab co-founded by Elon Musk, has unveiled its latest advancement in artificial intelligence: GPT-4o. This new model, announced at its spring update, builds upon the success of its predecessors, GPT-3 and GPT-4, but with a crucial twist: it’s “omni.” The “o” stands for its ability to handle not just text, but also audio and vision. Alongside this, the ChatGPT chatbot is getting a facelift with a more intuitive interface and advanced features aimed at making conversations smoother and more engaging.
GPT-4o promises the intellectual might of GPT-4 but at a much swifter pace. It brings improvements across text, voice, and vision, setting a new benchmark in AI capabilities. The model will be available to both free and paid users.
Say hello to GPT-4o, our new flagship model which can reason across audio, vision, and text in real time: https://t.co/MYHZB79UqN
— OpenAI (@OpenAI) May 13, 2024
Text and image input rolling out today in API and ChatGPT with voice and video in the coming weeks. pic.twitter.com/uuthKZyzYx
ChatGPT desktop app
Notably, a new desktop app for macOS, complete with a convenient keyboard shortcut (Option + Space), allows users to query ChatGPT instantly and even discuss screenshots directly within the app.
Voice interaction takes a leap forward with the ability to have voice conversations directly from your computer. At launch, the app uses the existing Voice Mode, with GPT-4o’s enhanced audio and video capabilities set to arrive later. By clicking the headphone icon in the bottom right corner of the desktop app, users can start brainstorming sessions, interview preparations, or casual discussions using their voice.
Live demo of GPT-4o voice variation pic.twitter.com/b7lLJkhBt1
— OpenAI (@OpenAI) May 13, 2024
The macOS app is available to Plus users starting today, with broader availability in the coming weeks. A Windows version is on the horizon, expected later this year.
oMG
One of the standout features of GPT-4o is its advanced image comprehension. Users can snap a photo of a menu in a foreign language and get it translated, learn about the cuisine’s history, or receive recommendations.
Prompt: “A first person view of a robot typewriting the following journal entries:
1. yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?
the text is large, legible and clear. the robot’s hands type on the typewriter.” Image via OpenAI
Future updates promise even more natural voice interactions and real-time video capabilities, such as showing ChatGPT a live sports game and asking it to explain the rules. OpenAI is gearing up to release these advanced features in a new Voice Mode, initially available in alpha for Plus users.
Expanding accessibility, GPT-4o enhances language capabilities across more than 50 languages, ensuring a smoother experience for a global audience. The rollout is beginning with ChatGPT Plus and Team users, with Enterprise users to follow.
A slice for ChatGPT Free users
ChatGPT Free users will also get a taste of GPT-4o, albeit with usage limits. Plus users get message limits up to five times higher than those of free users, while Team and Enterprise users benefit from even greater allowances, according to OpenAI.
ChatGPT Free accounts will now experience GPT-4 level intelligence, access responses from both the model and the web, analyze data, create charts, chat about photos, and utilize the Memory feature for a more tailored experience. When a user reaches the GPT-4o limit, ChatGPT will automatically switch to GPT-3.5 so the conversation can continue.
[via OpenAI, images via various sources]