DICO: A Graph-DB Framework for Community Detection on Big Scholarly Data

Fabio Mercorio, Giancarlo Sperli, Vincenzo Moscato, Mario Mezzazanica, Antonio Picariello

doi:10.1109/tetc.2019.2952765

Abstract

The widespread use of online social networks has also reached the scientific field, in which researchers interact with each other by publishing or citing papers. The huge amount of information about scientific research documents has been described by the term Big Scholarly Data. In this article we propose a framework, namely Discovery Information using COmmunity detection (DICO), for identifying overlapping communities of authors from Big Scholarly Data by modeling authors' interactions through a novel graph-based data model that combines document metadata with semantic information. In particular, DICO presents three distinctive characteristics: i) the coauthorship network is built from publication records using a novel approach for estimating the weight of relationships between users; ii) a new community detection algorithm based on Node Location Analysis has been developed to identify overlapping communities; iii) some built-in queries are provided to browse the generated network, though any graph-traversal query can be implemented through the Cypher query language. An experimental evaluation has been carried out to assess the efficacy of the proposed community detection algorithm on benchmark networks. Finally, to show its usefulness, DICO has been tested on a real-world Big Scholarly Dataset, the DBLP+AMiner dataset, which contains 1.7M+ distinct authors, 3M+ papers, and 25M+ citation relationships.
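To make the first characteristic concrete, here is a minimal sketch of how a weighted coauthorship network could be derived from publication records. The abstract does not specify DICO's weighting scheme, so the rule below is an assumption swapped in for illustration: each paper with n authors adds 1/(n-1) to every pair of its authors, a common fractional-counting convention. The function name `coauthorship_weights` and the toy records are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def coauthorship_weights(papers):
    """Return {(author_a, author_b): weight} with author_a < author_b.

    Assumed weighting (NOT taken from the DICO paper): a paper with
    n >= 2 distinct authors adds 1 / (n - 1) to each pair of its authors.
    """
    weights = defaultdict(float)
    for authors in papers:
        unique = sorted(set(authors))  # drop duplicate names within one record
        n = len(unique)
        if n < 2:
            continue  # a single-author paper creates no coauthorship edge
        for a, b in combinations(unique, 2):
            weights[(a, b)] += 1.0 / (n - 1)
    return dict(weights)


# Hypothetical toy records; each inner list is the author list of one paper.
papers = [
    ["Alice", "Bob"],
    ["Alice", "Bob", "Carol"],
    ["Carol"],
]
weights = coauthorship_weights(papers)
for pair, weight in sorted(weights.items()):
    print(pair, weight)
```

Under this assumed rule, pairs who publish together often and in small teams get heavier edges, which is the kind of signal a community detection step can then exploit. Any other weight estimate can replace the `1.0 / (n - 1)` term without changing the rest of the sketch.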

Journal: IEEE Transactions on Emerging Topics in Computing
Publication Date: Oct 1, 2021
Citations: 30

Similar Papers

Team Recognition in Big Scholarly Data: Exploring Collaboration Intensity
Shuo Yu ... Jiaofei Zhong
01 Nov 2017

Cultures of the Central Highlands, New Guinea
K E Read
Southwestern Journal of Anthropology | VOL. 10
01 Apr 1954

Comparative Study to Measure the Quality of Big Scholarly Data and Its Hypothetical Mapping Towards Granular Computing
Md Manjur Ahmed ... Md Abdul Kader
Advanced Science Letters | VOL. 24
01 Oct 2018

Subspace based network community detection using sparse linear coding
Arif Mahmood ... Michael Small
01 May 2016
