Label Map Research Articles

Affine image registration is a cornerstone of medical-image analysis. While classical algorithms can achieve excellent accuracy, they solve a time-consuming optimization for every image pair. Deep-learning (DL) methods learn a function that maps an image pair to an output transform. Evaluating the function is fast, but capturing large transforms can be challenging, and networks tend to struggle if a test-image characteristic shifts from the training domain, such as the resolution. Most affine methods are agnostic to the anatomy the user wishes to align, meaning the registration will be inaccurate if algorithms consider all structures in the image. We address these shortcomings with SynthMorph, a fast, symmetric, diffeomorphic, and easy-to-use DL tool for joint affine-deformable registration of any brain image without preprocessing. First, we leverage a strategy that trains networks with widely varying images synthesized from label maps, yielding robust performance across acquisition specifics unseen at training. Second, we optimize the spatial overlap of select anatomical labels. This enables networks to distinguish anatomy of interest from irrelevant structures, removing the need for preprocessing that excludes content which would impinge on anatomy-specific registration. Third, we combine the affine model with a deformable hypernetwork that lets users choose the optimal deformation-field regularity for their specific data, at registration time, in a fraction of the time required by classical methods. This framework is applicable to learning anatomy-aware, acquisition-agnostic registration of any anatomy with any architecture, as long as label maps are available for training. We analyze how competing architectures learn affine transforms and compare state-of-the-art registration tools across an extremely diverse set of neuroimaging data, aiming to truly capture the behavior of methods in the real world. SynthMorph demonstrates high accuracy and is available at https://w3id.org/synthmorph, as a single complete end-to-end solution for registration of brain magnetic resonance imaging (MRI) data.

In a conventional speech emotion recognition (SER) task, a classifier for a given language is trained on a pre-existing dataset for that same language. However, where training data for a language do not exist, data from other languages can be used instead. We experiment with cross-lingual and multilingual SER, working with Amharic, English, German, and Urdu. For Amharic, we use our own publicly available Amharic Speech Emotion Dataset (ASED). For English, German and Urdu, we use the existing RAVDESS, EMO-DB, and URDU datasets. We followed previous research in mapping labels for all of the datasets to just two classes: positive and negative. Thus, we can compare performance on different languages directly and combine languages for training and testing. In Experiment 1, monolingual SER trials were carried out using three classifiers, AlexNet, VGGE (a proposed variant of VGG), and ResNet50. The results, averaged for the three models, were very similar for ASED and RAVDESS, suggesting that Amharic and English SER are equally difficult. Similarly, German SER is more difficult, and Urdu SER is easier. In Experiment 2, we trained on one language and tested on another, in both directions for each of the following pairs: Amharic↔German, Amharic↔English, and Amharic↔Urdu. The results with Amharic as the target suggested that using English or German as the source gives the best result. In Experiment 3, we trained on several non-Amharic languages and then tested on Amharic. The best accuracy obtained was several percentage points greater than the best accuracy in Experiment 2, suggesting that a better result can be obtained when using two or three non-Amharic languages for training than when using just one non-Amharic language. Overall, the results suggest that cross-lingual and multilingual training can be an effective strategy for training an SER classifier when resources for a language are scarce.

Label Map Research Articles

Related Topics

Articles published on Label Map

Transformer models for Land Cover Classification with Satellite Image Time Series

Exploring graph learning techniques for enhancing cross-domain applications in computer vision and natural language processing

Anatomy-aware and acquisition-agnostic joint registration with SynthMorph.

DeepWaterFraction: A globally applicable, self-training deep learning approach for percent surface water area estimation from Landsat mission imagery

Occlusion-aware deep convolutional neural network via homogeneous Tanh-transforms for face parsing

Stereo sample generation‐based domain generalization network for stereo matching

MMSeaIce: a collection of techniques for improving sea ice mapping with a multi-task model

Multi3D: 3D-aware multimodal image synthesis

Multi-Label Supervised Contrastive Learning

RAPHIA: A deep learning pipeline for the registration of MRI and whole-mount histopathology images of the prostate

Raw Spectral Filter Array Imaging for Scene Recognition.

Medical Image Description Based on Multimodal Auxiliary Signals and Transformer

Nighttime Thermal Infrared Image Translation Integrating Visible Images

Controllable multi-domain semantic artwork synthesis

A feasibility study of applying generative deep learning models for map labeling

A data mining method based on label mapping for long-term and short-term browsing behaviour of network users

MAS-CL: An End-to-end Multi-atlas Supervised Contrastive Learning Framework for Brain ROI Segmentation.

Reversible data hiding in encrypted images with multi-prediction and adaptive huffman encoding

High-Capacity Reversible Data Hiding in Encrypted Images Based on Pixel Prediction and QuadTree Decomposition

Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Label Map Research Articles

Related Topics

Articles published on Label Map

Transformer models for Land Cover Classification with Satellite Image Time Series

Exploring graph learning techniques for enhancing cross-domain applications in computer vision and natural language processing

Anatomy-aware and acquisition-agnostic joint registration with SynthMorph.

DeepWaterFraction: A globally applicable, self-training deep learning approach for percent surface water area estimation from Landsat mission imagery

Occlusion-aware deep convolutional neural network via homogeneous Tanh-transforms for face parsing

Stereo sample generation‐based domain generalization network for stereo matching

MMSeaIce: a collection of techniques for improving sea ice mapping with a multi-task model

Multi3D: 3D-aware multimodal image synthesis

Multi-Label Supervised Contrastive Learning

RAPHIA: A deep learning pipeline for the registration of MRI and whole-mount histopathology images of the prostate

Raw Spectral Filter Array Imaging for Scene Recognition.

Medical Image Description Based on Multimodal Auxiliary Signals and Transformer

Nighttime Thermal Infrared Image Translation Integrating Visible Images

Controllable multi-domain semantic artwork synthesis

A feasibility study of applying generative deep learning models for map labeling

A data mining method based on label mapping for long-term and short-term browsing behaviour of network users

MAS-CL: An End-to-end Multi-atlas Supervised Contrastive Learning Framework for Brain ROI Segmentation.

Reversible data hiding in encrypted images with multi-prediction and adaptive huffman encoding

High-Capacity Reversible Data Hiding in Encrypted Images Based on Pixel Prediction and QuadTree Decomposition

Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages