1 Citation 650 Views 85 Downloads
The text dataset of the military field is the basis for event extraction in military field, and high-quality data set can effectively promote the study of event extraction in this field,However, the event extraction data set commonly used in the real world (such as ACE2005, etc.) is oriented to the general field, and the text corpus resources on military events are scarce. Therefore, we collect a large amount of military news content from public military news websites; On the basis of text content analysis, we firstly establish an event model of military news that includes event types, entity types and entity relationship types. Secondly, the text data is manually labeled according to the event model, which is iteratively verified and corrected simultaneously. Finally, a dataset of 13,000 high-quality military news events with a full variety of labels was obtained. We make this military news event dataset publicly available in this paper.
650 views reported since publication in 2022.