Using Combined List Hierarchy and Headings of HTML Documents for Learning Domain-Specific Ontology

Muhammad Ahsan Raza ,Binish Raza,Taiba Jabeen,Munnawar Abbas,Sehrish Raza

doi:10.14569/ijacsa.2020.0110431

Abstract

HTML pages contain unstructured and diverse information. However, these documents lack semantics and are not machine understandable. Semantic webs aim to add formal semantics to web data, whereas ontology provides formal semantics to a domain and is thus considered a foundation of semantic webs. Domain ontologies can be constructed manually, but this process is tedious and inefficient. Thus, this study presents an ontology learning (OL) model to create domain ontologies automatically from a set of HTML pages. The key insight of this research is that it combines the list structure and headings of HTML pages to recognize the ontology vocabulary. The approach also incorporates synonym relationships with ontology and allows the semantic interpretation of ontology concepts. We implement the proposed OL approach to build sports ontology from a collection of sports domain HTML documents. The new sports ontology is tested using FaCT++ reasoner; results show no inconsistency in the ontology. Furthermore, experts evaluate the successful mapping of HTML lists and headings to the ontology vocabulary. The proposed OL approach performs effectively and achieves 92.7% and 95.4% precision values for list and heading mapping, respectively.

Highlights

HTML is a markup language that is used to write web pages over the World Wide Web [1]
We evaluated our approach by using the sports domain dataset, which consists of 105 HTML documents collected from https://www.sports.ru website1
We initially evaluated the new ontology learned by our ontology learning (OL) model by using a semantic reasoner

Summary

INTRODUCTION

HTML is a markup language that is used to write web pages over the World Wide Web [1] It consists of elements called tags, which have a fixed definition. Web browsers are tools that interpret these tags and display the web pages Many web applications, such as data mining, machine learning, artificial intelligence, and natural language processing, facilitate the retrieval of information from web pages to fulfill user information requirements [2,3,4]. The vision of semantic webs is to achieve HTML documents that are understandable by machines To achieve this vision, a formal manner of representing semantics is required. Ontology has emerged as an approach that represents the machine-understandable semantics of a domain and is currently considered the heart of semantic web technologies [5].

Textual-based OL Techniques

Knowledge-based OL Techniques

Semistructured-based Techniques

List Extractor

Heading Extractor

Hierarchy Identification

List and Heading Merger

Add Synonyms

IMPLEMENTATION AND RESULTS

Evaluation Measures

Result Analysis

CONCLUSION

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Using Combined List Hierarchy and Headings of HTML Documents for Learning Domain-Specific Ontology

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications

Lead the way for us

Journal: International Journal of Advanced Computer Science and Applications	Publication Date: Jan 1, 2020
License type: cc-by

Similar Papers

Exploiting affinity propagation for automatic acquisition of domain concept in ontology learning
Iqbal Qasim ... Dong-Ho Lee
-
Iqbal Qasim, et. al.Iqbal Qasim ... Dong-Ho Lee
01 Sep 2011
01 Sep 2011

Preface for LLMs4OL 2024: The 1st Large Language Models for Ontology Learning Challenge at the 23rd ISWC
Hamed Babaei Giglou ... Sören Auer
Open Conference Proceedings | VOL. 4
Hamed Babaei Giglou, et. al.Hamed Babaei Giglou ... Sören Auer
02 Oct 2024
Preface for LLMs4OL 2024: The 1st Large Language Models for Ontology Learning Challenge at the 23rd ISWC
Hamed Babaei Giglou ... Sören Auer

Ontology extension based on axiomatic cognitive model for Ontology learning
Dehai Zhang ... Naiyao Wang
-
Dehai Zhang, et. al. Dehai Zhang ... Naiyao Wang
01 Oct 2016
01 Oct 2016

Semantic Web Mining: Using Ontology Learning and Grammatical Rule Inference Technique
C S Bhatia ... Suresh Jain
-
C S Bhatia, et. al.C S Bhatia ... Suresh Jain
01 Jul 2011
01 Jul 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Using Combined List Hierarchy and Headings of HTML Documents for Learning Domain-Specific Ontology

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Advanced Computer Science and Applications