Manual Markup Research Articles

Gold-standard annotated corpora have become important resources for the training and testing of natural-language-processing (NLP) systems designed to support biocuration efforts, and ontologies are increasingly used to facilitate curational consistency and semantic integration across disparate resources. Bringing together the respective power of these, the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of full-length, open-access biomedical journal articles with extensive manually created syntactic, formatting and semantic markup, was previously created and released. This initial public release has already been used in multiple projects to drive development of systems focused on a variety of biocuration, search, visualization, and semantic and syntactic NLP tasks. Building on its demonstrated utility, we have expanded the CRAFT Corpus with a large set of manually created semantic annotations relying on Uberon, an ontology representing anatomical entities and life-cycle stages of multicellular organisms across species as well as types of multicellular organisms defined in terms of life-cycle stage and sexual characteristics. This newly created set of annotations, which has been added for v2.1 of the corpus, is by far the largest publicly available collection of gold-standard anatomical markup and is the first large-scale effort at manual markup of biomedical text relying on the entirety of an anatomical terminology, as opposed to annotation with a small number of high-level anatomical categories, as performed in previous corpora. In addition to presenting and discussing this newly available resource, we apply it to provide a performance baseline for the automatic annotation of anatomical concepts in biomedical text using a prominent concept recognition system. The full corpus, released with a CC BY 3.0 license, may be downloaded from http://bionlp-corpora.sourceforge.net/CRAFT/index.shtml. 
Database URL: http://bionlp-corpora.sourceforge.net/CRAFT/index.shtml

Latent fingerprint matching has played a critical role in identifying suspects and criminals. However, compared to rolled and plain fingerprint matching, latent identification accuracy is significantly lower due to complex background noise, poor ridge quality and overlapping structured noise in latent images. Accordingly, manual markup of various features (e.g., region of interest, singular points and minutiae) is typically necessary to extract reliable features from latents. To reduce this markup cost and to improve the consistency of feature markup, fully automatic and highly accurate ("lights-out" capability) latent matching algorithms are needed. In this paper, a dictionary-based approach is proposed for automatic latent segmentation and enhancement towards the goal of achieving "lights-out" latent identification systems. Given a latent fingerprint image, a total variation (TV) decomposition model with L1 fidelity regularization is used to remove piecewise-smooth background noise. The texture component image obtained from the decomposition of the latent image is divided into overlapping patches. A ridge structure dictionary, which is learnt from a set of high-quality ridge patches, is then used to restore ridge structure in these latent patches. The ridge quality of a patch, which is used for latent segmentation, is defined as the structural similarity between the patch and its reconstruction. Orientation and frequency fields, which are used for latent enhancement, are then extracted from the reconstructed patch. To balance robustness and accuracy, a coarse-to-fine strategy is proposed. Experimental results on two latent fingerprint databases (i.e., NIST SD27 and WVU DB) show that the proposed algorithm outperforms the state-of-the-art segmentation and enhancement algorithms and boosts the performance of a state-of-the-art commercial latent matcher.
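The patch-level part of the pipeline described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: the TV decomposition step is skipped, so the input stands in for the texture component; a synthetic dictionary of sinusoidal ridge atoms replaces the learnt dictionary; greedy matching pursuit replaces whatever sparse coder the paper uses; and normalized correlation stands in for the structural similarity measure. All function names and parameter values are hypothetical.

```python
import numpy as np


def extract_patches(image, size, step):
    """Return overlapping size x size patches (flattened) and their corners."""
    patches, corners = [], []
    for r in range(0, image.shape[0] - size + 1, step):
        for c in range(0, image.shape[1] - size + 1, step):
            patches.append(image[r:r + size, c:c + size].ravel())
            corners.append((r, c))
    return np.array(patches, dtype=float), corners


def ridge_dictionary(size, n_orient=8, period=8.0):
    """Synthetic stand-in for a learnt dictionary: zero-mean, unit-norm
    sinusoidal atoms at several orientations (assumption, not from the paper)."""
    ys, xs = np.mgrid[0:size, 0:size]
    atoms = []
    for theta in np.arange(n_orient) * np.pi / n_orient:
        phase = 2 * np.pi * (xs * np.cos(theta) + ys * np.sin(theta)) / period
        for wave in (np.cos(phase), np.sin(phase)):
            atom = wave.ravel() - wave.mean()
            atoms.append(atom / np.linalg.norm(atom))
    return np.array(atoms)


def reconstruct(patch, dictionary, n_atoms=2):
    """Greedy matching pursuit: approximate the patch with a few atoms."""
    residual = patch - patch.mean()
    approx = np.zeros_like(residual)
    for _ in range(n_atoms):
        scores = dictionary @ residual          # correlation with each atom
        k = int(np.argmax(np.abs(scores)))      # best-matching atom
        approx += scores[k] * dictionary[k]
        residual = residual - scores[k] * dictionary[k]
    return approx


def patch_quality(patch, approx, eps=1e-8):
    """Normalized correlation between a patch and its reconstruction,
    used here as a simple proxy for structural similarity."""
    p = patch - patch.mean()
    a = approx - approx.mean()
    return float(p @ a / (np.linalg.norm(p) * np.linalg.norm(a) + eps))


def quality_map(image, size=16, step=8):
    """Ridge quality per patch; thresholding it gives a crude segmentation."""
    dictionary = ridge_dictionary(size)
    patches, corners = extract_patches(image, size, step)
    scores = [patch_quality(p, reconstruct(p, dictionary)) for p in patches]
    return corners, scores


# Usage: a clean synthetic ridge pattern versus pure noise.
xs = np.arange(32)
ridges = np.tile(np.cos(2 * np.pi * xs / 8.0), (32, 1))
noise = np.random.default_rng(0).standard_normal((32, 32))
_, ridge_scores = quality_map(ridges)
_, noise_scores = quality_map(noise)
```

In this toy setting, patches that look like ridges are reconstructed almost perfectly and score close to 1, while noise patches score much lower, which is the property the abstract relies on for segmentation. Orientation and frequency estimation from the reconstructed patch, and the coarse-to-fine strategy, are left out of the sketch.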

Articles published on Manual Markup

Methodology for extracting narratives from social media big data

Keyword Extraction from Kazakh News Dataset with BERT

Differentiation of Livestock Internal Organs Using Visible and Short-Wave Infrared Hyperspectral Imaging Sensors.

Hybrid Active Contour Mammographic Mass Segmentation and Classification

Automatic construction of the dialog tree based on unmarked text corpora in Russian

Illumination estimation challenge: The experience of the first 2 years

Automatic Pulse Classification for Artefact Removal Using SAX Strings, a CENTER-TBI Study.

Metric Classification of Traumatic Brain Injury Epileptiform Activity from Electroencephalography Data

Epileptiform Activity Detection and Classification Algorithms of Rats with Post-traumatic Epilepsy

SDHK meets NER

An approach for EEG of post traumatic sleep spindles and epilepsy seizures detection and classification in rats

Gold-standard ontology-based anatomical annotation in the CRAFT Corpus.

Bioluminescence-Based Tumor Quantification Method for Monitoring Tumor Progression and Treatment Effects in Mouse Lymphoma Models

Neural network based automatic fingerprint recognition system for overlapped latent images

Segmentation and Enhancement of Latent Fingerprints: A Coarse to Fine Ridge Structure Dictionary.

Unsupervised Learning for Syntactic Disambiguation
