ADVERTISEMENT

OpenAI used YouTube data to train some of its models: Report

June 15, 2023 03:11 pm | Updated 03:35 pm IST

OpenAI, the company behind ChatGPT, reportedly used YouTube data to train some AI models

OpenAI used YouTube data to train some of its models: Report | Photo Credit: Reuters

OpenAI, the company behind the AI-powered chatbot ChatGPT, used YouTube data to train some of its AI models, reported tech outlet The Information, citing an anonymous source.

ADVERTISEMENT

The outlet also reported that Google, which owns YouTube, has been using the video sharing platform’s data to train its own model Gemini.

As more Big Tech companies pivot to developing their AI capabilities or AI-powered offerings, there have been debates about the scraping of data, including copyrighted media, for the purpose of training models.

ADVERTISEMENT

While companies behind text-to-image generators have been subject to lawsuits revolving around violating the copyright of artists, many large language models are being developed in secrecy with little to no transparency about the content in their training data.

(For top technology news of the day, subscribe to our tech newsletter Today’s Cache)

In April, billionaire Elon Musk threatened to sue Microsoft, which has invested heavily in OpenAI. Musk alleged that the software maker “trained illegally with the use of Twitter data.”

This is a Premium article available exclusively to our subscribers. To read 250+ such premium articles every month
You have exhausted your free article limit.
Please support quality journalism.
You have exhausted your free article limit.
Please support quality journalism.
The Hindu operates by its editorial values to provide you quality journalism.
This is your last free article.

ADVERTISEMENT

ADVERTISEMENT