Business CircleBusiness Circle
  • Home
  • AI News
  • Startups
  • Markets
  • Finances
  • Technology
  • More
    • Human Resource
    • Marketing & Sales
    • SMEs
    • Lifestyle
    • Trading & Stock Market
What's Hot

When to Use Digital vs. Human Outreach in B2B Marketing

April 23, 2026

The Missing Link That Could Trigger a Robotics Boom

April 23, 2026

India’s app market is booming — but global platforms are capturing most of the gains

April 23, 2026
Facebook Twitter Instagram
Thursday, April 23
  • Advertise with us
  • Submit Articles
  • About us
  • Contact us
Business CircleBusiness Circle
  • Home
  • AI News
  • Startups
  • Markets
  • Finances
  • Technology
  • More
    • Human Resource
    • Marketing & Sales
    • SMEs
    • Lifestyle
    • Trading & Stock Market
Subscribe
Business CircleBusiness Circle
Home » OpenAI transcribed over a million hours of YouTube videos to train its LLMs, Google engaged in same practice
Technology

OpenAI transcribed over a million hours of YouTube videos to train its LLMs, Google engaged in same practice

Business Circle TeamBy Business Circle TeamApril 8, 2024Updated:August 21, 2025No Comments3 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
OpenAI transcribed over a million hours of YouTube videos to train its LLMs, Google engaged in same practice
Share
Facebook Twitter LinkedIn Pinterest Email


A scorching potato: One of many many controversial parts surrounding generative AIs and their massive language fashions’ (LLM) coaching information is the potential copyright infringements. It is a matter underneath the highlight as soon as once more following a report that OpenAI transcribed over 1,000,000 hours of YouTube movies to coach GPT-4. Why did not YouTube proprietor Google object? As a result of it did the identical factor.

To be able to entry extra respected English language-based textual content on the web in 2021, OpenAI researchers created a speech recognition device referred to as Whisper, writes The New York Occasions. It was designed to transcribe audio from YouTube movies, giving the corporate a trove of information to coach its LLMs.

OpenAI reportedly knew that scraping YouTube information was legally questionable however did it anyway, assuming such motion can be honest use. The Occasions writes that OpenAI president Greg Brockman was personally concerned in amassing movies that have been transcribed.

One would think about Google being lower than glad about OpenAI’s actions, however that will have been hypocritical provided that Google additionally transcribed YouTube movies for its AI fashions, doubtlessly violating creators’ copyrighted materials.

YouTube CEO Neal Mohan stated throughout an interview with Bloomberg final week that the platform’s phrases of service don’t allow unauthorized transcripts or downloading of video content material. When requested about OpenAI’s transcribing, he stated, “I’ve seen stories that it might or might not have been used. I’ve no info myself.”

Google spokesperson Matt Bryant repeated the ToS guidelines, including that the corporate takes “technical and authorized measures” to forestall this type of unauthorized apply “when now we have a transparent authorized or technical foundation to take action.” Google stated that its AI fashions “are educated on some YouTube content material” that’s allowed underneath agreements with creators.

The NY Occasions states that Google has expanded its phrases of service, giving it extra rights to make use of shopper information resembling publicly obtainable Google Docs and restaurant critiques on Google Maps for the corporate’s AI fashions. The revised coverage was launched on July 1 within the hope that the Independence Day weekend would act as a distraction.

Meta was additionally stated to be contemplating shady strategies of accomplishing extra information for its LLM coaching. The NY Occasions writes that the Fb mum or dad thought-about amassing copyrighted information from the web, even when that meant dealing with lawsuits, as negotiations with license holders would take too lengthy.

Hundreds of organizations and people are complaining and submitting lawsuits towards massive AI firms over the usage of their content material with out fee or acknowledgment. The New York Occasions is suing OpenAI and Microsoft for utilizing its copyrighted information articles. In February, OpenAI accused the publication of paying somebody to “hack” its well-known chatbot and different merchandise to generate deceptive proof supporting these claims.

Masthead: Souvik Banerjee



Source link

Engaged Google hours LLMs Million OpenAI Practice Train transcribed Videos YouTube
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Business Circle Team
Business Circle Team
  • Website

Related Posts

India’s app market is booming — but global platforms are capturing most of the gains

April 23, 2026

Microsoft told carbon-removal project developers it is shelving contracts under negotiation; Microsoft has been a major backer of carbon capture tech (Alastair Marsh/Bloomberg)

April 23, 2026

The Bafta games awards showed me again that honouring art over commerce is a win for all | Games

April 23, 2026

The shadowy SIM farms behind those incessant scam texts – and how to stay safe

April 23, 2026
LATEST UPDATES

When to Use Digital vs. Human Outreach in B2B Marketing

April 23, 2026

The Missing Link That Could Trigger a Robotics Boom

April 23, 2026

India’s app market is booming — but global platforms are capturing most of the gains

April 23, 2026

Ryan Giggs Loses £100,000 as George’s Dining Room Restaurant Collapses with £563,600 Debt Pile

April 23, 2026

Redesign Work Using Job Enrichment and Job Enlargement

April 23, 2026

Why an East Coast Traveler Loves His Alaska Airlines Credit Card

April 23, 2026

Subscribe to Updates

Get the latest sports news from SportsSite about soccer, football and tennis.

Business, Finance and Market Growth News Site

Important Pages
  • Advertise with us
  • Submit Articles
  • About us
  • Contact us
Recent Posts
  • When to Use Digital vs. Human Outreach in B2B Marketing
  • The Missing Link That Could Trigger a Robotics Boom
  • India’s app market is booming — but global platforms are capturing most of the gains
© 2026 BusinessCircle.co
  • Privacy Policy
  • Terms and Conditions
  • Cookie Privacy Policy
  • Disclaimer
  • DMCA

Type above and press Enter to search. Press Esc to cancel.