Hybrid Distributed Techniques for Lower-dimensional Representation of Documents in the Scientific Literature

Bahareh Kazemi

doi:10.32920/ryerson.14657058.v1

Abstract

Surfing data mining techniques for representing data sources have specifically attracted much attention among researchers. Given the curse of dimensionality in representing text using the traditional Bag-of-words models, lower-dimensional representation of text has been an important line of research due to its impact on many prediction, and recommendation tasks. This thesis studies two main different viewpoints in text representation using content and citation information and then, different existing approaches along with their advantages, limitations and drawbacks are reviewed. A novel hybrid distributed technique for text representation is proposed where the textual content of documents is projected into a vector representation using an artificial neural network . To test the performance of the new proposed technique, the well known link-prediction problem is selected to serve as a benchmark. A comparison is performed with other common techniques by predicting the existence of citation links between tuple of papers in a large citation graph.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Hybrid Distributed Techniques for Lower-dimensional Representation of Documents in the Scientific Literature

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Hybrid Distributed Techniques for Lower-dimensional Representation of Documents in the Scientific Literature
Bahareh Kazemi
-
Bahareh KazemiBahareh Kazemi
23 May 2021
23 May 2021

Lower bounds for artificial neural network approximations: A proof that shallow neural networks fail to overcome the curse of dimensionality
Philipp Grohs ... Sarah Koppensteiner
Journal of Complexity | VOL. 77
Philipp Grohs, et. al.Philipp Grohs ... Sarah Koppensteiner
01 Mar 2023
Journal of Complexity | VOL. 77

Content-based Node2Vec for representation of papers in the scientific literature
B Kazemi ... A Abhari
Data & Knowledge Engineering | VOL. 127
B Kazemi, et. al.B Kazemi ... A Abhari
14 Feb 2020
Data & Knowledge Engineering | VOL. 127

Deep neural network approximations for solutions of PDEs based on Monte Carlo algorithms
Philipp Grohs ... Arnulf Jentzen
Partial Differential Equations and Applications | VOL. 3
Philipp Grohs, et. al.Philipp Grohs ... Arnulf Jentzen
08 Jun 2022
Partial Differential Equations and Applications | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hybrid Distributed Techniques for Lower-dimensional Representation of Documents in the Scientific Literature

Abstract

Talk to us

Similar Papers