Shandong Science

   

Evaluation of innovativeness instem cell research articles using DeepSeek

MA Chunjian1, Wang Chao2*, XU Haiyun2, WANG Lekang2, ZHANG Xin3, CHEN Liang4   

  1. 1. Information Research Institute of Shandong Academy Sciences, Qilu University of Technology (Shandong Academy of Sciences),
     Jinan 250014, China; 2. Shandong University of Technology, Zibo 255049, China; 3. Chengdu Library and Information Center, Chinese Academy of Sciences, Chengdu 6102994, China;  4. Institute of Scientific and Technical Information of China, Beijing 100038, China
  • Received:2025-12-04 Accepted:2025-12-29 Online:2026-06-01
  • Contact: Wang Chao E-mail:kingtaoist@yeah.net

Abstract: Current approaches to assessing the innovativeness of scientific and technological papers rely predominantly on expert peer review, a process that is often inefficient and subject to bias. Quantitative metrics offer greater objectivity but are largely retrospective and provide limited foresight or explanatory insight. This study proposes a novel framework for evaluating the innovativeness of stem cellscientific and technological papers using the DeepSeek model. Focusing on a corpus of stem cell research articles, the titles and abstracts of representative papers were vectorized using the bge-large-en-v1.5 model to construct a semantic vector database. Subsequently, the deepseek-reasoner model was applied to extract innovation-related features, which were organized into a vectorized innovation feature database. The two databases were subsequently integrated using a weighted fusion strategy. Target papers were then evaluated through FAISS-based vector retrieval and Top-k similarity matching within the unified database, resulting in a final innovativeness score and ranking. The results were rigorously validated against scores generated by the unassisted DeepSeek model to assess the framework’s effectiveness in evaluating innovativeness in biomedical scientific and technological papers. Empirical results indicate that the DeepSeek model tends to overestimate innovation when used without calibration. However, after targeted training, the model exhibits substantially improved stability and validity in innovation assessment, highlighting its strong potential for identifying innovative dimensions and distinguishing features in scientific literature.

Key words: generative large language models, innovation assessment, semantic embeddings, automated evaluation, peer review

CLC Number: 

  • Cite this article

    MA Chunjian, Wang Chao, XU Haiyun, WANG Lekang, ZHANG Xin, CHEN Liang. Evaluation of innovativeness instem cell research articles using DeepSeek[J].Shandong Science, 0, (): 1-.

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0), which permits third parties to freely share (i.e., copy and redistribute the material in any medium or format) and adapt (i.e., remix, transform, or build upon the material) the articles published in this journal, provided that appropriate credit is given, a link to the license is provided, and any changes made are indicated. The material may not be used for commercial purposes. For details of the CC BY-NC 4.0 license, please visit: https://creativecommons.org/licenses/by-nc/4.0