Zhao Yunhua (赵蕴华). GATE-based Extraction of Chinese Patent Abstracts [J]. Digital Library Forum (数字图书馆论坛), 2008, (11):
GATE-based Extraction of Chinese Patent Abstracts (基于GATE的中文专利摘要的抽取)
Submitted: 2008-10-07  Revised: 2008-10-07
Keywords: Chinese patent abstracts; GATE; information extraction
Funding: Other
Abstract:
Based on reading and analyzing Chinese patent abstracts on new energy vehicles, this paper proposes a principle for judging the content of patent abstracts. By studying and adapting GATE, an open-source information extraction tool developed abroad, and ICTCLAS, the Chinese word segmentation tool from the Chinese Academy of Sciences, it then implements batch extraction of Chinese patent abstracts, providing an ample corpus for the automatic construction of a patent knowledge base.