MatTools: Benchmarking Large Language Models for Materials Science Tools

Siyu Liu, Jiamin Xu, Beilin Ye, Bo Hu, David J. Srolovitz, Tongqi Wen

2025-05-19

MatTools: Benchmarking Large Language Models for Materials Science Tools

Summary

This paper talks about MatTools, a new set of tests designed to see how well large language models can help with materials science, especially when it comes to writing and running code for physics-based tools.

What's the problem?

The problem is that while language models are good at general tasks, it's unclear if they really understand the complicated needs of materials science, which often requires precise coding and knowledge of physics.

What's the solution?

The researchers created MatTools to test these models by having them generate and execute code that would actually be used in materials science research, checking if the models can handle real scientific problems.

Why it matters?

This matters because if language models can reliably help with coding and problem-solving in materials science, it could speed up research and make advanced science more accessible to more people.

Abstract

MatTools evaluates large language models' proficiency in materials science by assessing code generation and execution based on physics-based computational tools.

View Paper