Tibetan Text Summarization Dataset

Xiaodong Yan

doi:10.11922/11-6035.csd.2021.0098.zh

Abstract

Automatic text summarization is a key task in natural language processing, and high-quality datasets can effectively advance summarization research. Recent work has moved toward generating abstractive summaries with deep learning methods. However, high-quality, large-scale summarization datasets available to the public are lacking, and such datasets are difficult to construct manually. The Tibetan text summarization task is still in its infancy because of the lack of public datasets. To promote the development of Tibetan informatization, we manually constructed a small Tibetan text summarization dataset, which is composed of 1,000 real Tibetan news articles, each with a short summary. In addition, we constructed more than 3,500 article keywords for the news articles as a supplement to text summarization tasks.


Journal: China Scientific Data	Publication Date: Jun 30, 2022
Citations: 1	License type: cc-by


