ChemBench: A Benchmark for LLMs in Chemistry

Popular：

Virtualization DNS security formal verification reachability analysis compiler errors macro conflict web extension development framework Bitmap Graphics API inconsistencies All Tags

ChemBench: A Benchmark for LLMs in Chemistry

2025-06-16

ChemBench is a new benchmark dataset designed to evaluate the performance of large language models (LLMs) in chemistry. It features a diverse range of chemistry questions spanning various subfields, categorized by difficulty. Results show leading LLMs outperforming human experts overall, but limitations remain in knowledge-intensive questions and chemical reasoning. ChemBench aims to advance chemical LLMs and provide tools for more robust model evaluation.

(www.nature.com)

AI Chemistry

DARPA Shatters Records with Long-Range Wireless Power Beaming

AI Coding Agents: From Helpful Assistants to Essential Partners