Simple and Powerful Data Analytics with Deep Natural Language Understanding

Nicholas Cassimatis

doi:10.13163/scior.preprints.5

Abstract

There exists a vast amount of data freely available that contains insights that can transform and accelerate progress in science, technology, and business. The great majority of these insights go undiscovered because it is so difficult and time-consuming to analyze data. A key source of this difficulty is that questions and analyses that take human scientists only a few seconds to formulate in human language can take days using existing computer interfaces and data analysis software. For example, consider the task of determining which are the five molecules that most highly correlate with the expression of genes regulating neurotransmitters involved in creating or erasing longterm memories. This is a task that now can take a highly trained researcher days to accomplish. This is true even though all the data required is publicly available and the computational power to analyze it is very inexpensive. Such data analyses are so expensive because of the computer interfaces people must use to perform them. Today, one cannot simply ask a computer in ordinary human language “Which are the five molecules that most highly correlate with the expression of genes regulating neurotransmitters involved in creating or erasing long-term memories” and get the answer. Instead, because of the limitations of computer natural language processing abilities, people must use cumbersome software and often write specially tailored programs to perform such analyses. If, however, these limitations of natural language processing were overcome, scientists would become dramatically more productive. They would be able to generate and explore hypotheses that in the past were too time-consuming. They could ask and answer, literally, orders of magnitudes more questions than they could in the past. A key challenge to elevating the natural language processing abilities of computers to the level where they will be useful in data analysis is the gulf between the nature of human and computer languages. Human language is generally ambiguous, incomplete, and often non-literal while computer languages are specifically and purposely designed to be unambiguous, complete, and literal. Although simple queries such as restaurants near boston can be understood by search engines and products such as Siri, many of the most important use cases require computers to understand much more complex utterances.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Simple and Powerful Data Analytics with Deep Natural Language Understanding

Abstract

Talk to us

Similar Papers

More From: Proceedings of Science Open Reviewed

Lead the way for us

Similar Papers

Role of NLP in Indian Regional Languages
...
IBMRD s Journal of Management & Research | VOL. 3
, et. al. ...
01 Sep 2014
IBMRD s Journal of Management & Research | VOL. 3

Developments in The Field of Natural Language Processing

International Journal of Advanced Research in Computer Science | VOL. 8

30 Apr 2017
International Journal of Advanced Research in Computer Science | VOL. 8

Impact of Deep Learning on Natural Language Processing
Arun Kumar Singh ... Pushpa Choudhary
-
Arun Kumar Singh, et. al.Arun Kumar Singh ... Pushpa Choudhary
08 May 2024
08 May 2024

Analysis on connection and translation of natural language to traditional computer language
Hanxiang Liu
Applied and Computational Engineering | VOL. 19
Hanxiang LiuHanxiang Liu
23 Oct 2023
Applied and Computational Engineering | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Simple and Powerful Data Analytics with Deep Natural Language Understanding

Abstract

Talk to us

Similar Papers

More From: Proceedings of Science Open Reviewed