Journal of Applied Sciences ›› 2024, Vol. 42 ›› Issue (2): 323-333. doi: 10.3969/j.issn.0255-8297.2024.02.013

• Computer Science and Applications •

Automatic Event Semantic Division Based on Instance Distribution Constraints

GAO Jianqi, LUO Xiangfeng, PEI Xinmiao

  1. School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
  • Received: 2021-12-26 Online: 2024-03-31 Published: 2024-03-28
  • Corresponding author: LUO Xiangfeng, research fellow; research interest: massive network information processing. E-mail: luoxf@shu.edu.cn
  • Funding:
    Supported by the National Natural Science Foundation of China (No. 91746203) and the Shanghai Outstanding Academic Leaders Program (No. 20XD1401700)

Abstract: To address the difficulty of aggregating event semantics that are discretely distributed across news text collections, this paper proposes an automatic event semantic division algorithm based on instance distribution constraints. First, a distant supervision method is used to construct a training dataset for event semantic division. Second, an event semantic classifier based on instance distribution constraints is designed to determine whether adding a new event trigger affects the aggregation of event semantics. Finally, an event semantic set generation algorithm is built on this classifier; without requiring predefined event types, it automatically divides discretely distributed event triggers into different event semantic sets. Experimental results show that the proposed method effectively achieves automatic division of event semantics and offers a new approach to high-quality aggregation of event semantics.

Key words: instance distribution constraint, automatic event semantic division, distant supervision, event semantic classifier, set generation algorithm
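
The third step of the abstract, set generation driven by a classifier, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not specify the algorithm's details, so the greedy first-fit policy, the function names `generate_event_sets` and `toy_fits`, and the toy vectors are hypothetical. The toy cosine-similarity check merely stands in for the paper's trained event semantic classifier, which is not reproduced here.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical classifier interface: given an existing event semantic set and
# a candidate trigger, return True if adding the trigger keeps the set coherent.
Classifier = Callable[[Sequence[str], str], bool]


def generate_event_sets(triggers: Sequence[str], fits: Classifier) -> List[List[str]]:
    """Greedy first-fit grouping (an assumed policy, not the paper's algorithm).

    No event types are fixed in advance: a new set is opened whenever the
    classifier rejects the trigger for every existing set.
    """
    event_sets: List[List[str]] = []
    for trigger in triggers:
        for event_set in event_sets:
            if fits(event_set, trigger):
                event_set.append(trigger)
                break
        else:
            # No existing set accepted the trigger, so it starts a new one.
            event_sets.append([trigger])
    return event_sets


# Toy 2-D vectors, invented purely so the sketch can run end to end.
TOY_VECTORS: Dict[str, Tuple[float, float]] = {
    "attack": (1.0, 0.0),
    "bomb": (0.9, 0.1),
    "elect": (0.0, 1.0),
    "vote": (0.1, 0.9),
}


def cosine(a: Tuple[float, float], b: Tuple[float, float]) -> float:
    dot = a[0] * b[0] + a[1] * b[1]
    return dot / (math.hypot(*a) * math.hypot(*b))


def toy_fits(event_set: Sequence[str], trigger: str, threshold: float = 0.8) -> bool:
    """Stand-in classifier: accept only if the trigger is close to every member."""
    candidate = TOY_VECTORS[trigger]
    return all(cosine(TOY_VECTORS[m], candidate) >= threshold for m in event_set)


if __name__ == "__main__":
    print(generate_event_sets(["attack", "elect", "bomb", "vote"], toy_fits))
```

With the toy data, the four triggers fall into two sets, one containing "attack" and "bomb" and one containing "elect" and "vote", even though no event types were declared beforehand. Passing the classifier in as a callable keeps the grouping loop independent of how the classifier is trained.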

CLC number: