数据与计算发展前沿 ›› 2024, Vol. 6 ›› Issue (5): 159-168.

CSTR: 32002.14.jfdc.CN10-1649/TP.2024.05.015

doi: 10.11871/jfdc.issn.2096-742X.2024.05.015

• • 上一篇    下一篇

多源异构数据接入技术及其在垂直领域应用框架的探索

刘宝琪(),薛非*(),孟生炜,张成鲁   

  1. 军事科学院,军事科学信息研究中心,北京 100142
  • 收稿日期:2023-10-09 出版日期:2024-10-20 发布日期:2024-10-21
  • 通讯作者: * 薛非(E-mail: huanhuansuperman@163.com
  • 作者简介:刘宝琪,军事科学院军事科学信息研究中心,助理研究员,硕士,主要研究方向是数据分析与应用。
    本文主要负责文章框架搭建,初稿撰写。
    LIU Baoqi, Master, is an assistant researcher at the Military Science Information Research Center, Academy of Military Science. Her main research direction is data analysis and application.
    In this paper, she is mainly responsible for building the framework and writing the first draft.
    E-mail: baoqi87@foxmail.com|薛非,军事科学院军事科学信息研究中心,副研究员,硕士,主要研究方向是数据工程。
    本文主要负责技术指导,总体统筹。
    Xue Fei, Master, is an associate researcher at the Military Science Information Research Center, Academy of Military Science.. His main research direction is data engineering.
    In this paper, he is mainly responsible for technical guidance and overall coordination.
    E-mail: huanhuansuperman@163.com
  • 基金资助:
    军队网信XXX重点项目“某数据资源体系建设”

Research on Multi-Source Heterogeneous Data Access Technology and Its Application Framework in Vertical Field

LIU Baoqi(),XUE Fei*(),MENG Shengwei,ZHANG Chenglu   

  1. Military Science Information Research Center, Academy of Military Science, Beijing 100142, China
  • Received:2023-10-09 Online:2024-10-20 Published:2024-10-21

摘要:

【目的】多源异构数据接入技术的突破对于切实解决垂直领域“数据孤岛”严峻问题,实现数据获取的全局性、真实性、实时性保障以及数据的互通融合应用尤为重要。【方法】本文在相关文献基础上,分析基于ETL的多源异构数据接入技术、基于语义本体的多源异构数据接入技术、基于机器人流程自动化的多源异构数据接入技术和基于云—端融合资源反射机制的多源异构数据接入技术等各类技术的内在机理与特点。【结果】对4种多源异构数据接入技术在垂直领域应用的典型框架进行探索设计,并进一步对4种技术特征进行归纳总结并横向对比,以期为相关实践应用提供有指导价值的示范与参考。【结论】本文对垂直领域大数据集成与互操作实践提供理论基础,如何充分利用技术关键方法更好地解决实时数据接入等技术局限性将作为未来研究重点。

关键词: 多源异构数据, 数据接入, 互操作技术

Abstract:

[Objective] The breakthrough of multi-source heterogeneous data access technology is particularly important for effectively solving the severe problem of "data island" in the vertical field, achieving global, authenticity, real-time guarantee of data acquisition, and data fusion applications. [Methods] Through literature analysis, current researches on multi-source heterogeneous data access technologies of ETL based, semantic ontology based, robot process automation based and cloud side integration resource reflection mechanism based are summarized. The characteristics of various technologies are analyzed. [Results] Typical frameworks for the application of four multi-source heterogeneous data access technologies in the vertical field are explored and designed, and four technical features are summarized and compared, in order to provide valuable demonstration and reference for relevant practical applications. [Conclusions] A theoretical basis for the integration and interoperability practices of big data in the vertical field is provided, and how to fully utilize key technical methods to better solve the technical limitations of real-time data access will be the focus of future research.

Key words: multi-source heterogeneous data, data access, interoperability technology