Abstract
On Wikipedia, articles about various topics can be created and edited independently in each language version. As a result, the quality of information about the same topic can differ between languages. Any interested user can improve an article, and the extent of such improvement may depend on the article's popularity. The goal of this study is to show which topics are best represented in different language versions of Wikipedia, using the results of a quality assessment of over 39 million articles in 55 languages. We also analyze how popular selected topics are among readers and authors in various languages. We used two approaches to assign articles to topics. In the first, we selected 27 main multilingual categories and analyzed all their connections with subcategories, based on information extracted from over 10 million categories in 55 language versions. To assign each article to one of the 27 main categories, we took into account over 400 million links from articles to over 10 million categories and over 26 million links between categories. In the second approach, we used data from DBpedia and Wikidata. We also show how the results of the study can be used to build local and global rankings of Wikipedia content.
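The category-based assignment described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how the category graph is traversed or how ties are resolved. The sketch assumes a breadth-first search upward through parent categories, returning the first main category reached. The function name `assign_main_category`, the tie-breaking by input order, and the toy category names are all hypothetical.

```python
from collections import deque


def assign_main_category(article_categories, parent_categories, main_categories):
    """Return the nearest main category reachable from an article, or None.

    article_categories: list of categories the article links to directly.
    parent_categories: dict mapping a category to a list of its parent categories.
    main_categories: set of top-level categories (27 in the study).
    Assumption: "nearest" means fewest category-to-category links (BFS order).
    """
    queue = deque(article_categories)
    seen = set(article_categories)  # guards against cycles in the category graph
    while queue:
        category = queue.popleft()
        if category in main_categories:
            return category
        for parent in parent_categories.get(category, ()):
            if parent not in seen:
                seen.add(parent)
                queue.append(parent)
    return None


# Toy data, invented for illustration only.
parents = {
    "Volcanoes of Italy": ["Volcanoes", "Geography of Italy"],
    "Volcanoes": ["Geology"],
    "Geology": ["Science"],
    "Geography of Italy": ["Geography"],
}
main = {"Science", "Geography"}

print(assign_main_category(["Volcanoes of Italy"], parents, main))
```

In this toy graph, "Geography" is two links away from the article's category and "Science" is three, so the breadth-first rule picks "Geography". A real category graph can contain cycles, which is why visited categories are tracked.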
Highlights
Nowadays, in order to make the right economic decisions, one needs to analyze and interpret a vast amount of information.
We present the assessment of quality and popularity of Wikipedia articles in different languages related to selected topics.
Our experiments showed that each language version has a specific ratio between the number of articles and the number of categories.
Summary
Making the right economic decisions requires analyzing and interpreting a vast amount of information. The quantity and quality of that information largely determine the quality of decisions in various branches of the economy, so access to proper sources of information matters. Information quality is determined by various characteristics. High-quality information is essential for effective operation and decision-making in organizations [1]. Inaccurate and incomplete information may have a negative impact on a company’s competitive edge [2].