Frontiers of Data and Computing ›› 2024, Vol. 6 ›› Issue (4): 106-115.

CSTR: 32002.14.jfdc.CN10-1649/TP.2024.04.009

doi: 10.11871/jfdc.issn.2096-742X.2024.04.009

• Special Issue: Fundamental Software Stack and Systems for National Scientific Data Centers • Previous Articles     Next Articles

Research on Knowledge Extraction Method for Agricultural Science and Technology Policies Based on Deep Learning

ZHAO Xiaodan(),HU Lin*()   

  1. Agricultural Information Institute of CAAS, Beijing 100081, China
  • Received:2024-01-05 Online:2024-08-20 Published:2024-08-20

Abstract:

[Application Background] Agricultural science and technology policies have a significant impact on technological progress and the development of agricultural production. Policies issued by different government departments have correlations with conceptual entities. [Objective] Addressing the issue of time-consuming and labor-intensive manual feature design for named entity recognition and relationship extraction in agricultural science and technology policies, this study introduces a knowledge extraction approach utilizing the BERT-BiLSTM-CRF model. [Method] Using a new annotation pattern adapted to the domain corpus, directly modeling triplets, instead of the traditional separate modeling or joint extraction, transforms the entity and relationship extraction problem into a sequence labeling task. The experiment involved 19,779 sentences and 376,721 characters of policy text, identifying eight types of entities such as policy and industry, and ten types of relationships such as citation and publication. [Results] The model achieves an accuracy of 81.61%, a recall of 85.34%, and an F1 score of 83.47% on the corpus. The results of the experiments demonstrate that the suggested approach proficiently extracts entities and relationships related to agricultural science and technology policies, and its performance surpasses that of other classical models.

Key words: agricultural science and technology policies, BERT-BiLSTM-CRF, knowledge extraction, entity recognition