Information Extraction from Social Media: A Hands-On Tutorial on Tasks, Data, and Open Source Tools

Shubhanshu Mishra,Rezvaneh Rezapour,Jana Diesner

doi:10.1007/978-3-030-99739-7_74

Abstract

AbstractInformation extraction (IE) is a common sub-area of natural language processing that focuses on identifying structured data from unstructured data. The community of Information Retrieval (IR) relies on accurate and high-performance IE to be able to retrieve high quality results from massive datasets. One example of IE is to identify named entities in a text, e.g., “Barack Obama served as the president of the USA”. Here, Barack Obama and USA are named entities of types of PERSON and LOCATION, respectively. Another example is to identify sentiment expressed in a text, e.g., “This movie was awesome”. Here, the sentiment expressed is positive. Finally, identifying various linguistic aspects of a text, e.g., part of speech tags, noun phrases, dependency parses, etc., which can serve as features for additional IE tasks. This tutorial introduces participants to a) the usage of Python based, open-source tools that support IE from social media data (mainly Twitter), and b) best practices for ensuring the reproducibility of research. Participants will learn and practice various semantic and syntactic IE techniques that are commonly used for analyzing tweets. Additionally, participants will be familiarized with the landscape of publicly available tweet data, and methods for collecting and preparing them for analysis. Finally, participants will be trained to use a suite of open source tools (SAIL for active learning, TwitterNER for named entity recognition3, and SocialMediaIE for multi task learning), which utilize advanced machine learning techniques (e.g., deep learning, active learning with human-in-the-loop, multi-lingual, and multi-task learning) to perform IE on their own or existing datasets. Participants will also learn how social context can be integrated in Information Extraction systems to make them better. The tools introduced in the tutorial will focus on the three main stages of IE, namely, collection of data (including annotation), data processing and analytics, and visualization of the extracted information. More details can be found at: https://socialmediaie.github.io/tutorials/.KeywordsInformation extractionMulti-task learningNatural language processingSocial media dataTwitterMachine learning bias

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Information Extraction from Social Media: A Hands-On Tutorial on Tasks, Data, and Open Source Tools

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Information extraction from digital social trace data with applications to social media and scholarly communication data
Shubhanshu Mishra
ACM SIGIR Forum | VOL. 54
Shubhanshu MishraShubhanshu Mishra
01 Jun 2020
ACM SIGIR Forum | VOL. 54

FVI-BD: Multiple File Extraction using Fusion Vector Investigation (FVI) in Big Data Hadoop Environment
V Vadivu ... N Kavitha
International Journal on Recent and Innovation Trends in Computing and Communication | VOL. 11
V Vadivu, et. al.V Vadivu ... N Kavitha
13 Jul 2023
International Journal on Recent and Innovation Trends in Computing and Communication | VOL. 11

Arabic Part Of Speech (POS) Tagging Analysis using Bee Colony Optimization (BCO) Algorithm on Quran Corpus
Arief Fatchul Huda ... Dian Rachmat Gumelar
-
Arief Fatchul Huda, et. al.Arief Fatchul Huda ... Dian Rachmat Gumelar
19 Aug 2021
19 Aug 2021

Improving Code-mixed POS Tagging Using Code-mixed Embeddings
S Nagesh Bhattu ... D V L N Somayajulu
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 19
S Nagesh Bhattu, et. al.S Nagesh Bhattu ... D V L N Somayajulu
29 Mar 2020
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Information Extraction from Social Media: A Hands-On Tutorial on Tasks, Data, and Open Source Tools

Abstract

Talk to us

Similar Papers