A Survey of Large Language Model-Driven Scientific Hypothesis Generation

Authors: Chang Yuan^1,2, Li Ziyue^1,2, Kong Yuanbo^1,2, Le Xiaoqiu^1,2
Institutions:
1. National Science Library, Chinese Academy of Sciences
2. Department of Information Resources Management, School of Economics and Management, University of Chinese Academy of Sciences
Corresponding author: Le Xiaoqiu Email: lexq@mail.las.ac.cn
Submit Time: 2026-04-02 22:05:51

Abstract:
[Objective] To systematically review the methodological framework and application progress of large language model (LLM)-driven scientific hypothesis generation, and to reveal the current research landscape and development trends in this field. [Coverage] Using keywords such as "Large Language Models" and "Scientific Hypothesis Generation", we searched databases including WOS, Google Scholar, and CNKI. Representative literature from 2021 to 2026 was screened, yielding a final set of 98 papers for analysis. [Methods] An analytical framework was established along three dimensions: generation process logic, evolution of technical pathways, and key issues. Existing approaches at each of the four stages (knowledge acquisition, preliminary hypothesis generation, iterative refinement, and evaluation and validation) were systematically reviewed. The underlying technical architectures were comparatively analyzed, core difficulties and current solutions were examined in depth, and relevant benchmark datasets and representative applications were summarized. [Results] The capabilities of LLMs in knowledge integration and association discovery offer a new paradigm for scientific hypothesis generation and have already yielded experimentally verified hypotheses in real-world scenarios across multiple domains. Current research shows five technical pathways being used in combination: context engineering, supervised fine-tuning, reinforcement learning, planning and search, and multi-agent collaboration. A preliminary methodology has taken shape for the core generation process; however, challenges remain in knowledge clue discovery, innovative hypothesis reasoning, and credibility, with model hallucination and intrinsic reasoning capabilities being the primary bottlenecks. [Limitations] As this emerging interdisciplinary field evolves rapidly, some of the most recent works may not be fully covered. This study focuses on reviewing methodological frameworks and does not provide a systematic quantitative performance comparison of current methods. [Conclusions] Large language models have demonstrated the capability to assist in, or even autonomously generate, scientifically valuable hypotheses, enabling scalable and cross-disciplinary hypothesis exploration. Future research should seek breakthroughs in balancing reliability with novelty, enhancing deep reasoning capabilities, innovating human-AI collaborative paradigms, and establishing closed-loop integration between hypothesis generation and experimental verification.

Keywords: Large Language Models; Scientific Hypothesis Generation; AI for Science; Autonomous Scientific Discovery

From: Chang Yuan
Subject: Computer Science >> Computer Application Technology
Comments: This paper has been accepted by the journal Data Analysis and Knowledge Discovery (《数据分析与知识发现》).
Contribution: Accepted
Cite as: ChinaXiv:202604.00021 (or this version ChinaXiv:202604.00021V1)
DOI:10.12074/202604.00021
CSTR:32003.36.ChinaXiv.202604.00021
TXID: 239ae2aa-3427-422e-95c8-3392ac56dc03
Recommended reference: Chang Yuan, Li Ziyue, Kong Yuanbo, Le Xiaoqiu. A Survey of Large Language Model-Driven Scientific Hypothesis Generation. [DOI:10.12074/202604.00021]

Version History

[V1] 2026-04-02 22:05:51 ChinaXiv:202604.00021V1



