Introduction Artificial Intelligence lives on data. Without data, large language models (LLMs) cannot learn, adapt, or make ...
A research team led by Prof. Liu Liangyun from the Aerospace Information Research Institute of the Chinese Academy of ...
QVAC launches Genesis II, expanding the world’s largest synthetic AI dataset to 148B tokens and 19 domains for better ...
A new dataset from Lawrence Livermore National Laboratory maps one million cis-lunar orbits, highlighting orbital stability challenges, space domain awareness needs, and planning requirements for Moon ...
Research paper details a new kind of dataset for open-ended dialogue similar to Google's AI Search Generative Experience Google researchers created a new form of dataset to train language models for ...
Language models like GPT-4 and Claude are powerful and useful, but the data on which they are trained is a closely guarded secret. The Allen Institute for AI (AI2) aims to reverse this trend with a ...
Margaret Mitchell, an AI ethics researcher at Hugging Face, tells WIRED about a new dataset designed to test AI models for bias in multiple languages. We spoke about a new dataset she helped create to ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results