We need a new Turing test to assess AI’s real-world knowledge


A fresh set of benchmarks is needed to assess artificial intelligence's understanding of the real world, according to experts.

Artificial intelligence (AI) models have shown impressive performance on law exams, answering multiple-choice, short-answer, and essay questions as well as humans do [1]. However, they struggle with real-world legal tasks.

Some lawyers have learnt this the hard way: they have been fined for filing AI-generated court briefs that misrepresented principles of law and cited non-existent cases.

Experts such as Chaudhri, principal scientist at Knowledge Systems Research in Sunnyvale, California, emphasize the need for better evaluation methods.



Nature, 2025-10-30
