Large language models (LLMs) can learn complex reasoning tasks without relying on large datasets, according to a new study by researchers at Shanghai Jiao Tong University. Their findings show that ...
A new training framework developed by researchers at Tencent AI Lab and Washington University in St. Louis enables large language models (LLMs) to improve themselves without requiring any ...
OpenAI believes its data was used to train DeepSeek’s R1 large language model, multiple publications reported today. DeepSeek is a Chinese artificial intelligence provider that develops open-source ...
In recent months, the AI industry has started moving toward so-called simulated reasoning models that use a “chain of thought” process to work through tricky problems in multiple logical steps. At the ...
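To make the "chain of thought" process described above concrete, here is a minimal Python sketch of how such a prompt is typically assembled. Everything in it is illustrative: the function names (`build_cot_prompt`, `call_model`) and the prompt wording are our own assumptions, not the implementation of any model mentioned in these stories.

```python
# Minimal sketch of chain-of-thought prompting: the model is instructed to
# work through intermediate steps before committing to an answer.
# `call_model` is a hypothetical stand-in for whatever LLM client you use;
# only the prompt structure is the point here.

def build_cot_prompt(question: str) -> str:
    """Wrap a question in an instruction to reason step by step."""
    return (
        "Work through the problem in numbered logical steps, "
        "then give the final answer on a line starting with 'Answer:'.\n\n"
        f"Problem: {question}"
    )

def call_model(prompt: str) -> str:
    """Hypothetical LLM call -- swap in your provider's client here."""
    raise NotImplementedError

if __name__ == "__main__":
    # Inspect the prompt a reasoning request would carry.
    print(build_cot_prompt(
        "A train leaves at 3:40 pm and arrives at 5:05 pm. "
        "How long is the trip?"
    ))
```

Requesting a fixed "Answer:" line is a common convenience: the intermediate steps can be logged or inspected while the final answer is still easy to parse out programmatically.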
Large language models can generate useful insights, but without a true reasoning layer, such as a knowledge graph paired with graph-based retrieval, they're flying blind. The major builders of large language ...
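For readers unfamiliar with the "reasoning layer" this piece alludes to, the sketch below shows the basic shape of graph-based retrieval: store facts as triples, pull the neighborhood of any entity the question mentions, and prepend those facts to the prompt. The triples (drawn from the Magistral story below), the one-hop traversal, and the function names are illustrative assumptions, not any vendor's actual system.

```python
# Illustrative sketch of grounding an LLM prompt in a tiny in-memory
# knowledge graph. Real systems use dedicated graph stores and far richer
# query logic; this only shows the retrieve-then-prompt pattern.

from collections import defaultdict

# Knowledge graph as (subject, relation, object) triples.
TRIPLES = [
    ("Magistral", "developed_by", "Mistral AI"),
    ("Magistral", "is_a", "reasoning model"),
    ("Mistral AI", "based_in", "Paris"),
]

# Index triples by subject for cheap neighborhood lookups.
graph = defaultdict(list)
for s, r, o in TRIPLES:
    graph[s].append((r, o))

def retrieve_facts(query: str, hops: int = 1) -> list[str]:
    """Collect facts about entities named in the query, expanding `hops` steps."""
    frontier = {e for e in graph if e.lower() in query.lower()}
    facts, seen = [], set()
    for _ in range(hops + 1):
        next_frontier = set()
        for entity in frontier - seen:
            seen.add(entity)
            for rel, obj in graph[entity]:
                facts.append(f"{entity} --{rel}--> {obj}")
                next_frontier.add(obj)
        frontier = next_frontier
    return facts

def grounded_prompt(question: str) -> str:
    """Prepend retrieved graph facts so the model reasons over them."""
    facts = "\n".join(retrieve_facts(question)) or "(no facts found)"
    return f"Known facts:\n{facts}\n\nQuestion: {question}"

print(grounded_prompt("Who makes Magistral and where are they based?"))
```

The design point is that the model answers from explicit, inspectable facts rather than from whatever its weights happen to encode, which is what the article means by not "flying blind".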
1. The "Data Trash" Problem: AI models are only as good as the information they ingest. For most enterprises today, data is ...
“I think it’s very cool what they pulled off,” said Kevin Jablonka, a digital chemist at the University of Jena, after checking out Ether0, a novel AI system that’s revolutionizing how large language ...
It's perfect for privacy-conscious folks looking to break away from ChatGPT ...
Mistral AI SAS today introduced Magistral, a new lineup of reasoning-optimized large language models. The series comprises two models at launch. The first, Magistral Small, is available under ...