close
close

Research shows companies are training AI models with YouTube content without permission

Artificial intelligence models need as much useful data as possible to work, but some of the biggest AI developers rely in part on transcribed YouTube videos without the creators’ permission, violating YouTube’s own rules, an investigation by Proof News And Wired.

The two media outlets revealed that Apple, Nvidia, Anthropic and other major AI companies trained their models on a dataset called “YouTube Subtitles,” which contains transcripts of nearly 175,000 videos from 48,000 channels — all without the knowledge of the video creators.