Identifying Semantic in High-Dimensional Web Data Using Latent Semantic Manifold

Ajit Kumar,Sanjeev Maskara,I-Jen Chiang

doi:10.4236/jdaip.2015.34014

Ajit Kumar, Sanjeev Maskara + Show 1 more

Open Access

https://doi.org/10.4236/jdaip.2015.34014

Copy DOI

Abstract

Latent Semantic Analysis involves natural language processing techniques for analyzing relationships between a set of documents and the terms they contain, by producing a set of concepts (related to the documents and terms) called semantic topics. These semantic topics assist search engine users by providing leads to the more relevant document. We develope a novel algorithm called Latent Semantic Manifold (LSM) that can identify the semantic topics in the high-dimensional web data. The LSM algorithm is established upon the concepts of topology and probability. Asearch tool is also developed using the LSM algorithm. This search tool is deployed for two years at two sites in Taiwan: 1) Taipei Medical University Library, Taipei, and 2) Biomedical Engineering Laboratory, Institute of Biomedical Engineering, National Taiwan University, Taipei. We evaluate the effectiveness and efficiency of the LSM algorithm by comparing with other contemporary algorithms. The results show that the LSM algorithm outperforms compared with others. This algorithm can be used to enhance the functionality of currently available search engines.

Highlights

In the traditional approach to data gathering, we collect data on a few well-chosen variables, and manually perform various tasks, such as finding relevant information, analyzing them, making decisions, and so on [1].How to cite this paper: Kumar, A., Maskara, S. and Chiang, I.-J. (2015) Identifying Semantic in High-Dimensional Web Data Using Latent Semantic Manifold
This paper aims to explain the Latent Semantic Manifold algorithm, its deployment, and performance evaluation
The proposed Latent Semantic Manifold (LSM) algorithm is based upon the concepts of probability and topology, which identifies the latentsemantic in data

Summary

Introduction

In the traditional approach to data gathering, we collect data on a few well-chosen variables, and manually perform various tasks, such as finding relevant information, analyzing them, making decisions, and so on [1].How to cite this paper: Kumar, A., Maskara, S. and Chiang, I.-J. (2015) Identifying Semantic in High-Dimensional Web Data Using Latent Semantic Manifold. In the traditional approach to data gathering, we collect data on a few well-chosen variables, and manually perform various tasks, such as finding relevant information, analyzing them, making decisions, and so on [1]. (2015) Identifying Semantic in High-Dimensional Web Data Using Latent Semantic Manifold. In this high-tech era, the high volumes of data are generated with high velocity from a variety of resources ( known as 3 V—Volume, Velocity, and Variety) [2] [3]. Gigantic repositories that include data, texts, and media have rapidly grown during recent years [5]-[9]. Several huge repositories are freely available for the public use on the World Wide Web causing another problem—the relevant information is buried in the irrelevant ones

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Data Analysis and Information Processing	Publication Date: Jan 1, 2015
Citations: 47	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Identifying Semantic in High-Dimensional Web Data Using Latent Semantic Manifold

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Data Analysis and Information Processing

Lead the way for us

Similar Papers

Clinical Application Driven Physiology in Biomedical Engineering Laboratory Course Education
R.N Schmidt
-
R.N SchmidtR.N Schmidt
01 Jan 2004
01 Jan 2004

Bridging biomedical basics with practical applications in BME laboratory education
J.P Giuffrida
-
J.P GiuffridaJ.P Giuffrida
01 Jan 2004
01 Jan 2004

Millennial Students’ Online Search Strategies are Associated With Their Mental Models of Search
Leslie Bussert
Evidence Based Library and Information Practice | VOL. 6
Leslie BussertLeslie Bussert
14 Sep 2011
Evidence Based Library and Information Practice | VOL. 6

Exploration on Construction of Biomedical Engineering Laboratory
...
-
, et. al. ...
30 Jun 2017
30 Jun 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Identifying Semantic in High-Dimensional Web Data Using Latent Semantic Manifold

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Data Analysis and Information Processing