Unseen Words Research Articles

Entity type recognition is used as a pre-processing step in common applications like summarization of text, classifying documents or automatic answering of questions posed in natural language. Here, ‘entity’ refers to concrete and abstract objects identified by proper and common nouns. Entity recognition focuses on detecting instances of types like person, location, organization, and so on. For example, an entity recognizer would take as input: George Washington was the first President of the United States of America. and output: <noun.person> George Washington </noun.person> was the first <noun.person> President </noun.person> of the <noun.location> United States of America </noun.Location>. The task can be performed using machine learning techniques to train a system that recognizes entities with performance comparable to a human annotator. Challenges like the lack of a large annotated training data corpus, impossible nature of listing all entity types, and ambiguity in language make this problem hard. There are existing entity recognizers which perform this task but with fair performance. One of the ways adopted to improve the performance of an existing entity recognizer is feature engineering. We initially find out which of the existing features, used in the recognizer, affect the performance most strongly. We accomplish this by adding and removing one or more features at a time from the feature list. We then use the training data to train a model and test to find out which set of features are important. The evaluation metric involves finding the precision, recall and f-score (which is the harmonic mean of precision and recall). As a next step, we add new features like word clusters and bigram word features to find out any improvements. Word clusters help when the training data does not have some words, but words belonging to the same cluster are present in the training data. This helps tagging unseen words in the test set. We also experiment with varying the size of the training data to find out how it affects the performance. Additionally, we look into Wikipedia as a source of additional features for the training data. Wikipedia has an elaborate internal link structure that can provide vital information about the category of a word. This category can be linked to a broader-sensed entity type.

Read full abstract

Recent multivariate analyses of fMRI activation have shown that discriminative classifiers such as Support Vector Machines (SVM) are capable of decoding fMRI-sensed neural states associated with the visual presentation of categories of various objects. However, the lack of a generative model of neural activity limits the generality of these discriminative classifiers for understanding the underlying neural representation. In this study, we propose a generative classifier that models the hidden factors that underpin the neural representation of objects, using a multivariate multiple linear regression model. The results indicate that object features derived from an independent behavioral feature norming study can explain a significant portion of the systematic variance in the neural activity observed in an object-contemplation task. Furthermore, the resulting regression model is useful for classifying a previously unseen neural activation vector, indicating that the distributed pattern of neural activities encodes sufficient signal to discriminate differences among stimuli. More importantly, there appears to be a double dissociation between the two classifier approaches and within- versus between-participants generalization. Whereas an SVM-based discriminative classifier achieves the best classification accuracy in within-participants analysis, the generative classifier outperforms an SVM-based model which does not utilize such intermediate representations in between-participants analysis. This pattern of results suggests the SVM-based classifier may be picking up some idiosyncratic patterns that do not generalize well across participants and that good generalization across participants may require broad, large-scale patterns that are used in our set of intermediate semantic features. Finally, this intermediate representation allows us to extrapolate the model of the neural activity to previously unseen words, which cannot be done with a discriminative classifier.

Read full abstract

Unseen Words Research Articles

Related Topics

Articles published on Unseen Words

Rich entity recognition in English text

Quantitative modeling of the neural representation of objects: How semantic feature norms can account for fMRI activation

Speech Recognition With Flat Direct Models

Importance of High-Order N-Gram Models in Morph-Based Speech Recognition

Automated recognition of brain region mentions in neuroscience literature.

Speech retrieval from unsegmented finnish audio using statistical morpheme-like units for segmentation, recognition, and retrieval

An introduction to voice search

An analysis on document length retrieval trends in language modeling smoothing

Morph-based speech recognition and modeling of out-of-vocabulary words across languages

Timing of the brain events underlying access to consciousness during the attentional blink

A direct intracranial record of emotions evoked by subliminal words

Stochastic Korean Word-Spacing with Smoothing Using Korean Spelling Checker

Rare Events and Closed Domains: Two Delicate Concepts in Speech Synthesis

Cerebral mechanisms of word masking and unconscious repetition priming.

Unsupervised Learning of Word Segmentation Rules with Genetic Algorithms and Inductive Logic Programming

Similarity-Based Models of Word Cooccurrence Probabilities

Source models for natural language text

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Unseen Words Research Articles

Related Topics

Articles published on Unseen Words

Rich entity recognition in English text

Quantitative modeling of the neural representation of objects: How semantic feature norms can account for fMRI activation

Speech Recognition With Flat Direct Models

Importance of High-Order N-Gram Models in Morph-Based Speech Recognition

Automated recognition of brain region mentions in neuroscience literature.

Speech retrieval from unsegmented finnish audio using statistical morpheme-like units for segmentation, recognition, and retrieval

An introduction to voice search

An analysis on document length retrieval trends in language modeling smoothing

Morph-based speech recognition and modeling of out-of-vocabulary words across languages

Timing of the brain events underlying access to consciousness during the attentional blink

A direct intracranial record of emotions evoked by subliminal words

Stochastic Korean Word-Spacing with Smoothing Using Korean Spelling Checker

Rare Events and Closed Domains: Two Delicate Concepts in Speech Synthesis

Cerebral mechanisms of word masking and unconscious repetition priming.

Unsupervised Learning of Word Segmentation Rules with Genetic Algorithms and Inductive Logic Programming

Similarity-Based Models of Word Cooccurrence Probabilities

Source models for natural language text