DataSet/DataTable - Search News

AI Chart Understanding Breakthrough: MIT-IBM Dataset Lets Small Models Beat GPT-4o

MIT and IBM released ChartNet, a 1.7-million-sample synthetic training dataset that lets compact open-source vision-language ...

Crypto Briefing

Nvidia and FPT release 900K synthetic personas dataset for Vietnam

Nvidia and FPT released 900,000 synthetic personas on Hugging Face to train AI models that understand Vietnamese language, ...

3don MSN

HETDEX opens massive Cosmic Noon dataset to scientists, novices and AI

The Hobby-Eberly Telescope Dark Energy Experiment (HETDEX)—which recently completed the largest survey ever taken of the early universe—has released all of its immense, information-rich database to ...

Wired

Harvard Is Releasing a Massive Free AI Training Dataset Funded by OpenAI and Microsoft

Harvard University announced Thursday it’s releasing a high-quality dataset of nearly 1 million public-domain books that could be used by anyone to train large language models and other AI tools. The ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results