Runway’s AI video generator trained on thousands of scraped YouTube videos

Jul 26, 2024 01:19 AM - 5 months ago 138099

Runway trained its AI text-to-video generator connected thousands of YouTube videos and pirated films, according to a study from 404 Media. A spreadsheet of training data obtained by 404 Media includes links to YouTube channels belonging to awesome intermezo companies, specified arsenic Netflix, Disney, Nintendo, and Rockstar Games, on pinch creators for illustration MKBHD, Linus Tech Tips, and Sam Kolder.

There are besides links to channels owned by news outlets for illustration The Verge, The New Yorker, Reuters, and Wired. “The channels successful that spreadsheet were a company-wide effort to find bully value videos to build nan exemplary with,” a erstwhile Runway worker tells 404 Media. “This was past utilized arsenic input to a monolithic web crawler which downloaded each nan videos from each those channels, utilizing proxies to debar getting blocked by Google.” 

Runway is an AI startup that has received millions successful funding from Google genitor institution Alphabet and Nvidia. It has created awesome devices that let users to make realistic-looking AI videos arsenic good arsenic ones that seizure a peculiar animation type. Runway’s latest tool, Gen-3 Alpha, launched successful June and tin “create videos successful immoderate style you tin imagine.” Like different AI models, Gen-3 Alpha needs to ingest a breadth of contented erstwhile training.

In summation to YouTube channels, 404 Media besides recovered that Runway’s dataset contains links to piracy sites for illustration KissCartoon, which lets you watch anime and different animated contented for free. It’s still not clear whether Runway utilized each of nan videos successful this spreadsheet to train its Gen-3 Alpha exemplary — and we whitethorn ne'er find out. In an question and reply pinch TechCrunch successful June, Runway cofounder Anastasis Germanidis said nan institution uses “curated, soul datasets” to train its models, but he didn’t supply further detail.

When reached for comment, Google pointed The Verge to a connection from YouTube CEO Neal Mohan, who told Bloomberg successful April that training AI connected nan platform’s videos is simply a “clear violation” of its policies. The Verge reached retired to Runway pinch a petition for remark but didn’t instantly perceive back.

Runway isn’t nan only AI institution that has had its AI training information linked to YouTube. Earlier this year, OpenAI CTO Mira Murati said she “wasn’t sure” whether nan company’s text-to-video generator, Sora, trained connected YouTube. Meanwhile, a caller study from Proof News and Wired recovered that Anthropic, Apple, Nvidia, and Salesforce trained their AI models connected much than 170,000 YouTube videos.

More