罗婷婷,赵瑞雪,李娇,傅智杰,武丽丽,黄永文,鲜国建.面向多源异构科技信息治理的元数据标准规范体系构建[J].数字图书馆论坛,2021,(4):58~67 |
面向多源异构科技信息治理的元数据标准规范体系构建 |
Construction of Metadata Standard System for Multi-source and Isomerized Scientific and Technological Information Governance |
投稿时间:2021-04-01 |
DOI:10.3772/j.issn.1673-2286.2021.04.009 |
中文关键词: 多源异构;科技信息;元数据;标准体系 |
英文关键词: Multi-source and Isomerized; Scientific and Technological Information; Metadata; Standard System |
基金项目:本研究得到国家社会科学基金项目“科技论文全景式摘要知识图谱构建与应用研究”(编号:19BTQ061)资助。 |
作者 | 单位 | 罗婷婷 | 中国农业科学院农业信息研究所 | 赵瑞雪 | 中国农业科学院农业信息研究所 农业农村部农业大数据重点实验室 | 李娇 | 中国农业科学院农业信息研究所 | 傅智杰 | 中国工程院战略咨询中心 | 武丽丽 | 中国工程院战略咨询中心 | 黄永文 | 中国农业科学院农业信息研究所 | 鲜国建 | 中国农业科学院农业信息研究所 农业农村部农业大数据重点实验室 |
|
摘要点击次数: 2267 |
全文下载次数: 4059 |
中文摘要: |
为开展大数据环境下多类型、多来源、异构化科技信息的汇聚治理,实现大数据资源的规范化描述与互联互通,提高数据资源的可发现、可利用和开放共享能力,本文基于元数据理论和知识对象建模思想,构建了一套广泛适用的、可扩展的元数据标准规范体系,覆盖13类通用容器、24类资源元素集描述规范及28个规范编码表,并编制了配套的XML Schema形式化描述规范,实现对多类多源异构元数据向统一的XML格式转化、验证和解析等自动化处理。该规范体系已在中国工程科技知识中心开展了应用验证,指导30余个分中心完成24类数据资源超过亿级数据的转化汇交,有力支撑了工程科技“元数据海”的建设,快捷、高效地实现了近百类专业领域特色资源元数据标准规范的定制与应用。 |
英文摘要: |
Based on the theory of metadata and the idea of knowledge object-oriented modeling, this paper constructs a flexible and widely applicable metadata standard system, covering 13 kinds of generic containers, 24 kinds of resource element set description specifications and 28 specification coding tables, to carry out the aggregation governance of multi-type, multi-source and isomerized technology resources in the big data environment, realize the standardized description and interconnection of big data resources, and improve the discoverability, utilizability and sharability of data resources. Moreover, this paper compiles a supporting XML Schema formal description specification to support the computer’s automatic processing of conversion, verification and parsing from multi-type and multi-source heterogeneous metadata to unified XML format. The standard system has been applied and verified in China Knowledge Centre for Engineering Sciences and Technology, and has guided more than 30 sub-centers to complete the transformation and collection of 24 kinds of data resources with more than 100 million-level data, strongly supporting the construction of “metadata sea” of engineering sciences and technology. In addition, the customization and application of metadata standards and specifications for characteristic resources in nearly 100 kinds of professional fields have been further realized quickly and efficiently based on the system. |
查看全文
查看/发表评论 下载PDF阅读器 |
关闭 |
|
|
|