Abstract

In this paper, we present a method to provide proactive assistance in text checking, based on usage relationships between words as structured on the Web. For a given sentence, the method builds a connectionist structure of relationships between word n-grams. This structure is then parameterized by means of an unsupervised, language-agnostic optimization process. Finally, the method produces a representation of the sentence that allows the least prominent usage-based relational patterns to emerge, making it easy to spot badly written and unpopular text. The study includes the problem statement and its characterization in the literature, as well as the proposed solution approach and some experimental results.
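The core idea of scoring word n-grams by their Web usage can be sketched as follows. This is a minimal illustration, not the paper's implementation: the `usage_counts` dictionary is a hypothetical stand-in for the hit counts a search engine would return for each n-gram query.

```python
# Minimal sketch: extract contiguous word n-grams from a sentence and flag
# the least "used" ones. Real usage counts would come from Web queries;
# here they are supplied as a plain dictionary for illustration.

def word_ngrams(sentence, n):
    """Return the list of contiguous word n-grams in the sentence."""
    words = sentence.split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]

def least_used(sentence, n, usage_counts):
    """Return the n-grams whose usage count equals the minimum observed."""
    grams = word_ngrams(sentence, n)
    counts = {g: usage_counts.get(g, 0) for g in grams}
    lowest = min(counts.values())
    return [g for g, c in counts.items() if c == lowest]
```

An n-gram absent from the counts defaults to zero usage, so unseen word pairs surface as the least prominent patterns, mirroring the "lowest usage category" described in the paper.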

Highlights

  • In this paper, we present a method to provide proactive assistance in text checking, based on usage relationships between words as structured on the Web

  • In order to test the effectiveness of the system, a collection of 80 sentences has been derived from the British National Corpus (BNC) [40]

  • A correct result is an atypical subsequence discovered in the sentence, whereas a correct absence of result is a good sentence in which no atypical subsequence has been discovered, i.e., the lowest usage category is empty (the lowest usage category contains the zero usage value by default, so from a technical standpoint this condition means that the category contains only the zero usage value); the terms positive and negative refer to the expectation, whereas the terms true and false refer to whether that expectation corresponds to the observation
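The evaluation terminology in the last highlight can be made concrete with a small sketch. The function below is illustrative only, assuming "positive" means an atypical subsequence was expected (the test sentence is known to be bad) and "true" means the system's observation matched that expectation.

```python
# Hypothetical sketch of the evaluation terminology: positive/negative
# refers to the expectation, true/false to whether the observation agrees.

def classify(expected_atypical, observed_atypical):
    """Map expectation vs. observation to a confusion-matrix category."""
    if expected_atypical and observed_atypical:
        return "true positive"    # bad sentence, atypical subsequence found
    if expected_atypical and not observed_atypical:
        return "false negative"   # bad sentence, nothing flagged
    if not expected_atypical and observed_atypical:
        return "false positive"   # good sentence, something wrongly flagged
    return "true negative"        # good sentence, nothing flagged
```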


Summary

Related Work

To the best of our knowledge, no work has been done in the field of text analysis using a connectionist model and the Web. The research field of open-world approaches to text correction is characterized by a variety of specialized NLP sub-tasks. The training process leads to scalability issues when applied to complex problems or to large training sets without guidance. For this reason, web-based NLP models are typically supervised models using annotated training data, or unsupervised models that rely on external resources, such as taxonomies, to strengthen their results. In [14], the authors present a method for correcting real-word spelling errors, i.e., errors that occur when a user mistakenly types a correctly spelled word when another was intended. An unsupervised statistical method for correcting preposition errors is proposed in [19]. In [28], the authors propose a way of using web counts for several lexical disambiguation tasks, such as part-of-speech tagging, spelling correction, and word sense disambiguation. The system is not language-specific and can be used with other languages by adapting the phonetic codes and transformation rules

Problem Formulation
Input Sentence and Operators
Search Engines and Hit Counts
The Connectionist Structure
The Visual Output of the Network
Overall Components of the System
The Determination of the Weights
Experimental Results
Conclusions and Future Works

