During a crisis, people post large numbers of informative and non-informative tweets on Twitter. Informative tweets provide helpful information, such as reports of affected individuals, infrastructure damage, and resource availability and needs. In contrast, non-informative tweets provide no information useful to humanitarian organizations or victims. Identifying informative tweets during a disaster is a challenging task. People often post images along with text on Twitter during a disaster, so image features, in addition to text features, are crucial for identifying informative tweets. However, existing methods rely on text features alone and ignore image features when identifying crisis-related tweets. This paper proposes a novel approach that considers image features along with text features. It comprises a text-based classification model, an image-based classification model, and a late-fusion step. The text-based classification model combines a Convolutional Neural Network (CNN) and an Artificial Neural Network (ANN): the CNN extracts text features from a tweet, and the ANN classifies the tweet based on those extracted features. The image-based classification model uses a fine-tuned VGG-16 architecture to extract image features and classify the image attached to a tweet. The outputs of the text-based and image-based classification models are combined using a late-fusion technique to predict the tweet label. Extensive experiments on Twitter datasets from several crises, such as the Mexico earthquake and the California wildfires, demonstrate the effectiveness of the proposed method, which outperforms state-of-the-art methods on various evaluation metrics for identifying informative tweets during a disaster.
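The late-fusion step described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes each unimodal model outputs class probabilities over (informative, non-informative), and it uses a simple weighted average as the fusion rule; the weight `w_text` and the averaging rule itself are hypothetical choices for illustration.

```python
import numpy as np

def late_fusion(p_text, p_image, w_text=0.5):
    """Fuse per-class probabilities from a text model and an image model.

    p_text, p_image: probability vectors over the classes
                     (e.g., [informative, non-informative]).
    w_text: hypothetical weight given to the text model; the remainder
            (1 - w_text) goes to the image model.
    Returns the fused probability vector and the predicted class index.
    """
    p_text = np.asarray(p_text, dtype=float)
    p_image = np.asarray(p_image, dtype=float)
    # Weighted average of the two models' probability outputs.
    fused = w_text * p_text + (1.0 - w_text) * p_image
    return fused, int(fused.argmax())

# Example: the text model is confident the tweet is informative,
# the image model mildly agrees; the fused prediction is class 0.
fused, label = late_fusion([0.8, 0.2], [0.6, 0.4])
```

Other fusion rules (e.g., taking the maximum probability, or learning the weights on a validation set) fit the same interface; only the combination line changes.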