Noisy Annotations Research Articles

Background: Gene Ontology (GO) is a community effort to represent the functional features of gene products. GO annotations (GOA) provide functional associations between GO terms and gene products. Because of limited resources, only a small portion of annotations are manually checked by curators; the rest are electronically inferred. Although quality control techniques have been applied to the annotations, the community consistently reports that a considerable number of noisy (or incorrect) annotations remain. Given how widely annotations are used, identifying noisy annotations is an important but seldom studied open problem.

Results: We introduce a novel approach called NoGOA to predict noisy annotations. First, NoGOA applies sparse representation to the gene-term association matrix to reduce the impact of noisy annotations, and uses the sparse representation coefficients to measure the semantic similarity between genes. Second, it makes a preliminary prediction of the noisy annotations of a gene based on aggregated votes from that gene's semantic neighbors. Next, NoGOA estimates the ratio of noisy annotations for each evidence code from direct annotations in GOA files archived at different times, weights the entries of the association matrix by the estimated ratios, and propagates the weights to the ancestors of direct annotations using the GO hierarchy. Finally, it integrates the evidence-weighted association matrix with the aggregated votes to predict noisy annotations. Experiments on archived GOA files of six model species (H. sapiens, A. thaliana, S. cerevisiae, G. gallus, B. taurus and M. musculus) demonstrate that NoGOA achieves significantly better results than other related methods, and that removing noisy annotations improves the performance of gene function prediction.

Conclusions: The comparative study justifies the effectiveness of integrating evidence codes with sparse representation for predicting noisy GO annotations.
Code and datasets are available at http://mlda.swu.edu.cn/codes.php?name=NoGOA.
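
The voting and integration steps described in the abstract can be illustrated with a minimal sketch. It is not the NoGOA implementation: cosine similarity stands in for the sparse representation coefficients, the linear combination with `alpha` is an assumed way of integrating votes and evidence weights, and all function names, parameters, and the toy data are hypothetical. Ancestor propagation over the GO hierarchy is left out.

```python
from math import sqrt

def cosine(u, v):
    """Cosine similarity between two annotation vectors (stand-in similarity)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = sqrt(sum(a * a for a in u))
    nv = sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def neighbor_votes(assoc, gene, k=2):
    """Similarity-weighted share of the k most similar genes annotated with each term."""
    sims = [(cosine(assoc[gene], row), j) for j, row in enumerate(assoc) if j != gene]
    top = sorted(sims, reverse=True)[:k]
    total = sum(s for s, _ in top)
    n_terms = len(assoc[gene])
    if total == 0:
        return [0.0] * n_terms
    return [sum(s * assoc[j][t] for s, j in top) / total for t in range(n_terms)]

def rank_noisy(assoc, evidence_weight, gene, k=2, alpha=0.5):
    """Return the gene's annotated terms, most suspicious (lowest score) first."""
    votes = neighbor_votes(assoc, gene, k)
    scored = [
        (alpha * votes[t] + (1 - alpha) * evidence_weight[gene][t], t)
        for t in range(len(votes))
        if assoc[gene][t]  # only existing annotations can be noisy
    ]
    return [t for _, t in sorted(scored)]

# Toy data: rows are genes, columns are GO terms (1 = annotated).
assoc = [
    [1, 1, 0, 1],
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
]
# Assumed per-annotation reliability derived from evidence codes.
evidence_weight = [
    [0.9, 0.9, 0.0, 0.3],
    [0.9, 0.9, 0.0, 0.0],
    [0.9, 0.9, 0.0, 0.0],
    [0.0, 0.0, 0.9, 0.9],
]
print(rank_noisy(assoc, evidence_weight, gene=0))
```

In this toy example the fourth term of the first gene gets no support from its two nearest neighbors and carries a low evidence weight, so it is ranked as the most likely noisy annotation.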

Having sufficient training images with fully annotated object locations is critical for modern learning-based image annotation, retrieval, and object detection methods. Collecting such annotations for large-scale datasets is notoriously tedious because the process involves a large amount of manual cropping and hand labeling. In this work, following the principle of games with a purpose (GWAP), we design a purposive hidden-object game (P-HOG), which imperceptibly embeds object localization into an enjoyable game and thus attracts many people to contribute image annotations voluntarily. In particular, besides being as engaging as popular hidden-object games (HOGs), P-HOG can automatically generate satisfactory game images (i.e., "hide" certain items in target images) by integrating several semantic and visual processing techniques. P-HOG also has a built-in mechanism to prevent players from cheating. The mechanism inherits the merit of reCAPTCHA and identifies potential cheating behavior based on the annotation accuracy on some known items. Moreover, P-HOG filters noisy annotations with a weighted majority method, which improves the accuracy of the raw annotations from the players. Most importantly, players play P-HOG only for entertainment and are unaware of the background data collection procedure. The collected data are used to construct a large database, which may benefit general learning-based algorithms for multimedia tasks. To the best of our knowledge, this is the first work dedicated to this specific and important task under the GWAP framework.
We conduct a pilot study of the game prototype. Comprehensive experiments show that P-HOG appeals to general players and is effective for collecting a large number of object locations with satisfactory accuracy, which in turn improves algorithmic performance on both tag refinement and image annotation tasks.
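
The combination of known-item checks and weighted majority filtering described above can be sketched as follows. This is an illustration under stated assumptions, not the P-HOG implementation: annotations are simplified to discrete labels (for example, the grid cell a player clicked), a player's weight is assumed to be their accuracy on the known items, and the `min_accuracy` cutoff and all names are hypothetical.

```python
from collections import defaultdict

def player_accuracy(answers, known):
    """Fraction of known (gold) items that a player answered correctly."""
    checked = [item for item in known if item in answers]
    if not checked:
        return 0.0
    return sum(answers[item] == known[item] for item in checked) / len(checked)

def weighted_majority(all_answers, known, min_accuracy=0.5):
    """Pick, for each unknown item, the label with the largest total player weight."""
    weights = {p: player_accuracy(a, known) for p, a in all_answers.items()}
    tallies = defaultdict(lambda: defaultdict(float))
    for player, answers in all_answers.items():
        if weights[player] < min_accuracy:
            continue  # treated as a potential cheater in this sketch
        for item, label in answers.items():
            if item not in known:
                tallies[item][label] += weights[player]
    return {item: max(votes, key=votes.get) for item, votes in tallies.items()}

# Toy data: two known items and one item whose location is being collected.
known = {"gold1": "A", "gold2": "B"}
all_answers = {
    "ann": {"gold1": "A", "gold2": "B", "img7": "cell3"},
    "bob": {"gold1": "A", "gold2": "C", "img7": "cell5"},
    "cy":  {"gold1": "A", "gold2": "B", "img7": "cell3"},
    "dee": {"gold1": "X", "gold2": "Y", "img7": "cell5"},
}
print(weighted_majority(all_answers, known))
```

Here the player who misses both known items is excluded, and the remaining votes are weighted by accuracy, so the label supported by the two fully accurate players wins.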

Articles published on Noisy Annotations

An Interactive Method to Improve Crowdsourced Annotations.

Regression with re-labeling for noisy data

Dominant Sets for “Constrained” Image Segmentation

Suspicious Loitering Detection from Annotated CCTV Feed Using CEP Based Approach

Weakly Supervised Salient Object Detection Using Image Labels

Identifying noisy functional annotations of proteins using sparse semantic similarity

NoGOA: predicting noisy GO annotations using evidences and sparse representation

Multi-Label Classification from Multiple Noisy Sources Using Topic Models

Weakly Supervised Semantic Segmentation Using Superpixel Pooling Network

NoisyGOA: Noisy GO annotations prediction using taxonomic and semantic similarity

Combined retrieval: A convenient and precise approach for Internet image retrieval

AggNet: Deep Learning From Crowds for Mitosis Detection in Breast Cancer Histology Images.

A Factor Graph Approach to Automated GO Annotation.

Dexter

Joint 3-D vessel segmentation and centerline extraction using oblique Hough forests with steerable filters.

Part and Attribute Discovery from Relative Annotations

Separate or joint? Estimation of multiple labels from crowdsourced annotations

Learning by aggregating experts and filtering novices: a solution to crowdsourcing problems in bioinformatics

Purposive Hidden-Object-Game: Embedding Human Computation in Popular Game

Scholarometer: A Social Framework for Analyzing Impact across Disciplines
