论文标题

原油相关的事件提取和处理:转移学习方法

Crude Oil-related Events Extraction and Processing: A Transfer Learning Approach

论文作者

Lee, Meisin, Soon, Lay-Ki, Siew, Eu-Gene

论文摘要

通过传统监督学习范式提取事件的挑战之一是需要大量注释的数据集以实现令人满意的模型性能。在金融和经济领域的事件提取方面,它的资源较少,这更具挑战性。本文提出了一个完整的框架,用于提取和加工在CroudeNews语料库中发现的原油相关事件,通过利用转移学习的有效性来解决注释稀缺和阶级失衡问题。除事件提取外,我们还特别强调事件属性(极性,模态和强度)分类,以确定每个事件的事实确定性。我们首先通过监督学习构建基线模型,然后利用转移学习方法来提高事件提取模型的性能,尽管注释的数据数量有限和严重的类不平衡。这是通过转移学习框架中的方法来完成的,例如域自适应预训练,多任务学习和顺序转移学习。根据实验结果,与通过标准监督学习训练的基线模型相比,我们能够改善F1和MCC1得分中的所有事件提取子任务模型。从原油新闻中提取的准确和整体事件对于下游任务,例如了解事件链和学习事件事件 - 事件 - 事件关系非常有用,这些关系可用于其他下游任务,例如商品价格预测,摘要等,以支持广泛的业务决策。

One of the challenges in event extraction via traditional supervised learning paradigm is the need for a sizeable annotated dataset to achieve satisfactory model performance. It is even more challenging when it comes to event extraction in the finance and economics domain, a domain with considerably fewer resources. This paper presents a complete framework for extracting and processing crude oil-related events found in CrudeOilNews corpus, addressing the issue of annotation scarcity and class imbalance by leveraging on the effectiveness of transfer learning. Apart from event extraction, we place special emphasis on event properties (Polarity, Modality, and Intensity) classification to determine the factual certainty of each event. We build baseline models first by supervised learning and then exploit Transfer Learning methods to boost event extraction model performance despite the limited amount of annotated data and severe class imbalance. This is done via methods within the transfer learning framework such as Domain Adaptive Pre-training, Multi-task Learning and Sequential Transfer Learning. Based on experiment results, we are able to improve all event extraction sub-task models both in F1 and MCC1-score as compared to baseline models trained via the standard supervised learning. Accurate and holistic event extraction from crude oil news is very useful for downstream tasks such as understanding event chains and learning event-event relations, which can be used for other downstream tasks such as commodity price prediction, summarisation, etc. to support a wide range of business decision making.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源