Get the latest tech news

Generative AI Systems Miss Vast Bodies of Human Knowledge, Study Finds


Generative AI models trained on internet data lack exposure to vast domains of human knowledge that remain undigitized or underrepresented online. English dominates Common Crawl with 44% of content. Hindi accounts for 0.2% of the data despite being spoken by 7.5% of the global population. Tamil repr...

None

Get the Android app

Or read this on Slashdot

Read more on:

Photo of Study

Study

Photo of systems

systems

Photo of human knowledge

human knowledge

Related news:

News photo

Kids who use social media score lower on reading and memory tests, a study shows

News photo

Systems as Mirrors

News photo

Study: Artificial intelligence (AI) is wrecking havoc on university assessments and exams