Abstract

Many industries face an increasing need for smart systems that support the processing and generation of digital content. This is due both to an ever-increasing amount of incoming content that needs to be processed faster and more efficiently, and to ever-increasing pressure to publish new content in ever shorter cycles. In a research and technology transfer project, we are developing a platform that provides content curation services that can be integrated into Content Management Systems, among others. These curation services comprise semantic text and document analytics processes as well as knowledge technologies that can be applied to document collections. The key objective is to support digital curators in their daily work, i.e., to (semi-)automate processes that human experts are normally required to carry out intellectually and, typically, without tool support. The goal is to enable knowledge workers to become more efficient and more effective as well as to produce high-quality content. In this article we focus on the current state of development with regard to semantic storytelling in our four use cases.

Highlights

  • Digital content and online media have reached an unprecedented level of relevance and importance, especially with regard to commercial, political and societal aspects

  • Given that the same number of documents was clustered into topics using six different models and that the length of the summary for each topic was fixed at a maximum of 200 words, we discovered that the bag of words approach yields lengthier summaries than tf/idf

  • We developed curation technologies that can be applied in the sector-specific content curation use cases of companies active in different sectors
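
The highlight on clustering contrasts two document representations, bag of words and tf/idf. As a minimal sketch of how the two differ, the following Python snippet builds both for a toy corpus. It is not the authors' pipeline: the function names, the whitespace tokenisation, the example sentences and the unsmoothed idf formula log(N / df) are all assumptions made for illustration.

```python
import math
from collections import Counter


def bag_of_words(docs):
    """Return raw term counts per document (hypothetical helper)."""
    return [Counter(doc.lower().split()) for doc in docs]


def tf_idf(docs):
    """Return tf-idf weights per document: count * log(N / document frequency).

    Assumption: plain unsmoothed idf; real toolkits often add smoothing.
    """
    counts = bag_of_words(docs)
    n_docs = len(docs)
    # Document frequency: in how many documents each term occurs.
    doc_freq = Counter(term for c in counts for term in c)
    return [
        {term: tf * math.log(n_docs / doc_freq[term]) for term, tf in c.items()}
        for c in counts
    ]


# Invented example sentences, not data from the article.
docs = [
    "the curator sorts the incoming documents",
    "the curator translates the foreign articles",
    "the timeline shows the events",
]

bow = bag_of_words(docs)
weights = tf_idf(docs)

print(bow[0])      # raw counts: frequent function words dominate
print(weights[0])  # tf-idf: terms shared by every document get weight 0
```

In this sketch a word that occurs in every document keeps a high count in the bag-of-words vector but receives a tf/idf weight of zero, so the two representations can rank the same terms very differently before any clustering or summarisation step is applied.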

Summary

Introduction

Digital content and online media have reached an unprecedented level of relevance and importance, especially with regard to commercial, political and societal aspects. A multitude of examples exist in multiple sectors and branches of media (television, radio, blogs, journalism etc.). All these different professional environments can benefit immensely from semantic technologies that support knowledge workers, who typically work under high time pressure, in their activities: finding relevant information, highlighting important concepts, sorting incoming documents, translating articles in foreign languages, suggesting interesting topics etc. We refer to these semantic services, which can be applied flexibly in different professional environments that all involve the processing, analysis, translation, evaluation, contextualisation, verification, synthesis and production of digital information, as Curation Technologies. For each use case we present a prototype application, all of which are currently in experimental use in these companies.

Curation Technologies
Named Entity Recognition and Named Entity Linking
Geographical Localisation Module and Map Visualisations
Temporal Expression Analysis and Timelining
Text Classification and Document Clustering
Coreference Resolution
Monolingual and Cross-Lingual Event Detection
Single and Multi-document Summarisation
User Interaction in the Curation Technologies Prototypes
Semantic Storytelling
Sector
Related Work
Findings
Conclusions