OpenAI launches GPTo, improving ChatGPT's text, visual and audio capabilities

OpenAI has introduced a new artificial intelligence model called GPT-4o

Published - May 14, 2024 07:42 am IST - SAN FRANCISCO

OpenAI says it does works faster than previous versions and can reason across text, audio and video in real time [File]

OpenAI says it does works faster than previous versions and can reason across text, audio and video in real time [File] | Photo Credit: REUTERS

OpenAI's latest update to its artificial intelligence model can mimic human cadences in its verbal responses and can even try to detect people's moods.

The effect conjures up images of the 2013 Spike Jonze move “Her,” where the (human) main character falls in love with an artificially intelligent operating system, leading to some complications.

While few will find the new model seductive, OpenAI says it does works faster than previous versions and can reason across text, audio and video in real time.

GPT-4o, short for “omni,” will power OpenAI's popular ChatGPT chatbot, and will be available to users, including those who use the free version, in the coming weeks, the company announced during a short live-streamed update. CEO Sam Altman, who was not one of the presenters at the event, simply posted the word “her” on the social media site X.

(For top technology news of the day, subscribe to our tech newsletter Today’s Cache)

During a demonstration with Chief Technology Officer Mira Murati and other executives, the AI bot chatted in real time, adding emotion — specifically “more drama” — to its voice as requested. It also helped walk through the steps needed to solve a simple math equation without first spitting out the answer, and assisted with a more complex software coding problem on a computer screen.

It also took a stab at extrapolating a person's emotional state by looking at a selfie video of their face (deciding he was happy since he was smiling) and translated English and Italian to show how it could help people who speak different languages have a conversation.

Gartner analyst Chirag Dekate said the update, which lasted less than 30 minutes, gave the impression OpenAI is playing catch-up to larger rivals.

“Many of the demos and capabilities showcased by OpenAI seemed familiar because we had seen advanced versions of these demos showcased by Google in their Gemini 1.5 pro launch,” Dekate said. “While Open AI had a first-mover advantage last year with ChatGPT and GPT3, when compared to their peers, especially Google, we now are seeing capability gaps emerge.”

Google plans to hold its I/O developer conference on Tuesday and Wednesday, where it is expected to unveil updates to its own Gemini, its AI model.

0 / 0
Sign in to unlock member-only benefits!
  • Access 10 free stories every month
  • Save stories to read later
  • Access to comment on every story
  • Sign-up/manage your newsletter subscriptions with a single click
  • Get notified by email for early access to discounts & offers on our products
Sign in

Comments

Comments have to be in English, and in full sentences. They cannot be abusive or personal. Please abide by our community guidelines for posting your comments.

We have migrated to a new commenting platform. If you are already a registered user of The Hindu and logged in, you may continue to engage with our articles. If you do not have an account please register and login to post comments. Users can access their older comments by logging into their accounts on Vuukle.