学术论文

Mining experimental data from Materials Science literature with Large Language Models: an evaluation study

来源：arXiv发布日期：2024-01-19作者：Luca Foppiano, Guillaume Lambard, Toshiyuki Amagasa, Masashi Ishii0 次点击

内容摘要

This study is dedicated to assessing the capabilities of large language models (LLMs) such as GPT-3.5-Turbo, GPT-4, and GPT-4-Turbo in extracting structured information from scientific documents in materials science. To this end, we primarily focus on two critical tasks of information extraction: (i) a named entity recognition (NER) of studied materials and physical properties and (ii) a relation extraction (RE) between these entities. Due to the evident lack of datasets within Materials Informatics (MI), we evaluated using SuperMat, based on superconductor research, and MeasEval, a generic measurement evaluation corpus. The performance of LLMs in executing these tasks is benchmarked against traditional models based on the BERT architecture and rule-based approaches (baseline). We introduce a novel methodology for the comparative analysis of intricate material expressions, emphasising the standardisation of chemical formulas to tackle the complexities inherent in materials science information assessment. For NER, LLMs fail to outperform the baseline with zero-shot prompting and exhibit only limited improvement with few-shot prompting. However, a GPT-3.5-Turbo fine-tuned with the ap

中文翻译

使用 AI 将内容摘要翻译为中文，便于快速阅读

使用 AI 分析这篇文章的核心发现、关键要点和深度见解

由 DeepSeek AI 提供分析 · 首次使用需配置 API Key