Get the latest tech news

US Lawmaker Proposes a Public Database of All AI Training Material


An anonymous reader quotes a report from Ars Technica: Amid a flurry of lawsuits over AI models' training data, US Representative Adam Schiff (D-Calif.) has introduced (PDF) a bill that would require AI companies to disclose exactly which copyrighted works are included in datasets training AI system...

The Generative AI Disclosure Act "would require a notice to be submitted to the Register of Copyrights prior to the release of a new generative AI system with regard to all copyrighted works used in building or altering the training dataset for that system," Schiff said in a press release. Under Schiff's law, The New York Times would need to consult the database to ID all articles used to train ChatGPT or any other AI system. Any AI maker who violates the act would risk a "civil penalty in an amount not less than $5,000," the proposed bill said.

Get the Android app

Or read this on Slashdot

Read more on:

Photo of lawmaker

lawmaker

Photo of training material

training material

Photo of public database

public database

Related news:

News photo

US Lawmaker Cited NYC Protests in a Defense of Warrantless Spying

News photo

US lawmakers vote 50-0 to force sale of TikTok despite angry calls from users | Lawmaker: TikTok must "sever relationship with the Chinese Communist Party."

News photo

Robotaxis ‘do not belong in the city of Los Angeles,’ lawmaker says