Augmenting Data Systems with Prediction based Embeddings

Rahul Ramachandran,Derek Koehl,M Ramasubramanian,Tsengdar Lee,Iksha Gurung,Carson Davis,Manil Maskey

doi:10.1109/igarss47720.2021.9555031

Abstract

One of the challenges of improving the search and use of complex Earth science data is designing and incorporating semantic components in existing Earth science data systems. Many projects have addressed this by using a knowledge engineering approach. However, using ontologies has inherent limitations as a practical and scalable approach. Data-driven strategies based on natural language processing, coupled with Machine Learning, provide an alternative approach. Data-driven approaches utilize existing corpus available as unstructured text. This paper describes a hybrid strategy that uses a data-driven approach to build an embedding from a large corpus of Earth science journal publications while leveraging existing ontologies to develop validation tests to evaluate the embedding's robustness and correctness. The paper also describes the use of this embedding in two different applications. The first application provides a semantic mapping service to bridge the gap between a science application need and the appropriate instruments or datasets required to address that need. The second application is keyword recommender to make the data set tagging process efficient for the data operators and ensure keyword consistency within a data catalog.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Augmenting Data Systems with Prediction based Embeddings

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Evolution of Information Management at the GSFC Earth Sciences (GES) Data and Information Services Center (DISC): 2006–2007
S Kempler ... B Vollmer
IEEE Transactions on Geoscience and Remote Sensing | VOL. 47
S Kempler, et. al.S Kempler ... B Vollmer
01 Jan 2009
Evolution of Information Management at the GSFC Earth Sciences (GES) Data and Information Services Center (DISC): 2006–2007
S Kempler ... B Vollmer

Earth observing system (EOS) data and information system (EOSDIS) — evolution update and future
M Esfandiari ... H Ramapriyan
-
M Esfandiari, et. al.M Esfandiari ... H Ramapriyan
01 Jan 2007
01 Jan 2007

Developing Metrics for NASA Earth Science Interdisciplinary Data Products and Services
Zhong Liu ... James Acker
Data Science Journal | VOL. 21
Zhong Liu, et. al.Zhong Liu ... James Acker
11 Feb 2022
Data Science Journal | VOL. 21

Machine learning and natural language processing methods to identify ischemic stroke, acuity and location from radiology reports
David M Greer ... Rebecca Zhang
-
David M Greer, et. al.David M Greer ... Rebecca Zhang
19 Jun 2020
19 Jun 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Augmenting Data Systems with Prediction based Embeddings

Abstract

Talk to us

Similar Papers