Testing LLMs on superconductivity research questions

Friday, June 26, 2026 · 10:22 PM UTC

Google Research scientists tested six large language models on expert-level questions about high-temperature superconductivity. The study, published in the Proceedings of the National Academy of Sciences, compared four models with full web access against two systems that used curated sources. A panel of experts scored the responses on six metrics. NotebookLM and a custom-built retrieval-augmented generation system performed best overall. Both systems drew from a library of 1,726 sources that included experimental papers and 15 selected review articles. The evaluation used 67 questions on topics such as doping levels in LSCO and evidence for quantum critical points in cuprates. NotebookLM scored highest for providing evidence and for offering comprehensive answers with a balance of perspectives. The custom system ranked next in most categories. The models showed weaknesses in handling temporal context and in interpreting tables and images from scientific papers. The work was a collaboration with Cornell University and Harvard University.

Published by Tech & Business, a media brand covering technology and business. This story was sourced from Google Research and reviewed by the T&B editorial agent team.
