Visual Concepts Research Articles

Decoding semantic concepts for imagination and perception tasks (SCIP) is important for rehabilitation medicine as well as cognitive neuroscience. Electroencephalography (EEG) is commonly used in these fields because it is a low-cost, noninvasive technique with high temporal resolution. However, EEG signals are noisy and have a low signal-to-noise ratio, which makes decoding EEG-based semantic concepts for imagination and perception tasks (SCIP-EEG) challenging. Currently, neural network algorithms such as CNN, RNN, and LSTM have almost reached their limits in EEG signal decoding due to their inherent shortcomings. The emergence of transformer methods has improved the classification performance of neural networks for EEG signals. However, transformer models have a large number of parameters and high complexity, which hinders their use in brain-computer interface (BCI) applications. EEG signals have high spatial correlation, and the relationships between signals from different electrodes are complex. Capsule neural networks can effectively model the spatial relationships between electrodes through vector representations and a dynamic routing mechanism, and can therefore achieve more accurate feature extraction and classification. This paper proposes a spatio-temporal capsule network with a self-correlation routing mechanism for the classification of semantic conceptual EEG signals. By improving the feature extraction and routing mechanism, the model captures the highly variable spatio-temporal features of EEG signals more effectively and establishes connections between capsules, thereby enhancing classification accuracy and model efficiency. The performance of the proposed model was validated using the publicly accessible semantic concept dataset for imagined and perceived tasks from the University of Bath. Our model achieved average accuracies of 94.9%, 93.3%, and 78.4% in the three sensory modalities (pictorial, orthographic, and audio), respectively. The overall average accuracy across the three sensory modalities is 88.9%. Compared to existing advanced algorithms, the proposed model achieved state-of-the-art performance, significantly improving classification accuracy. Additionally, the proposed model is more stable and efficient, making it a better solution for SCIP-EEG decoding.
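
For readers unfamiliar with the dynamic routing mechanism that capsule networks rely on, the following is a minimal sketch of standard routing-by-agreement in plain Python. It is not the self-correlation routing proposed in the paper, whose details the abstract does not give. The function names, the toy prediction vectors, and the reading of input capsules as electrode-level features are all assumptions made for illustration.

```python
import math

def squash(s):
    """Scale a vector so its length lies in [0, 1) while keeping its direction."""
    sq_norm = sum(x * x for x in s)
    if sq_norm == 0.0:
        return [0.0 for _ in s]
    scale = sq_norm / (1.0 + sq_norm) / math.sqrt(sq_norm)
    return [scale * x for x in s]

def softmax(row):
    """Numerically stable softmax over one list of logits."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def dynamic_routing(u_hat, num_iters=3):
    """u_hat[i][j] is the vector that input capsule i predicts for output capsule j."""
    num_in, num_out, dim = len(u_hat), len(u_hat[0]), len(u_hat[0][0])
    b = [[0.0] * num_out for _ in range(num_in)]  # routing logits, start uniform
    v = []
    for _ in range(num_iters):
        # Coupling coefficients: each input capsule distributes itself over outputs.
        c = [softmax(row) for row in b]
        v = []
        for j in range(num_out):
            s_j = [sum(c[i][j] * u_hat[i][j][d] for i in range(num_in))
                   for d in range(dim)]
            v.append(squash(s_j))
        # Raise the logit wherever a prediction agrees with the output (dot product).
        for i in range(num_in):
            for j in range(num_out):
                b[i][j] += sum(u_hat[i][j][d] * v[j][d] for d in range(dim))
    return v

# Toy input (made-up numbers): three input capsules, two output capsules, 2-D vectors.
u_hat = [
    [[1.0, 0.0], [0.1, 0.1]],
    [[0.9, 0.1], [0.0, 0.2]],
    [[1.0, 0.1], [-0.1, 0.0]],
]
v = dynamic_routing(u_hat)
lengths = [math.sqrt(sum(x * x for x in vec)) for vec in v]
```

In this toy case all three input capsules make similar predictions for the first output capsule, so its vector ends up longer than the second one. Capsule networks usually read the vector length as the confidence that the corresponding class is present.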

The increasing availability of biomedical data creates valuable resources for developing new deep learning algorithms to support experts, especially in domains where collecting large volumes of annotated data is not trivial. Biomedical data include several modalities containing complementary information, such as medical images and reports: images are often large and encode low-level information, while reports give a summarized, high-level description of the findings, often concerning only a small part of the image. However, only a few methods can effectively link the visual content of images with the textual content of reports, which prevents medical specialists from fully benefiting from the recent opportunities offered by deep learning models. This paper introduces a multimodal architecture that creates a robust biomedical data representation by encoding fine-grained text representations within image embeddings. The architecture aims to tackle data scarcity (by combining supervised and self-supervised learning) and to create multimodal biomedical ontologies. The architecture is trained on over 6,000 colon whole slide images (WSIs), each paired with its corresponding report, collected from two digital pathology workflows. The evaluation of the multimodal architecture involves three tasks: WSI classification (on data from pathology workflows and from public repositories), multimodal data retrieval, and linking between textual and visual concepts. Notably, the latter two tasks are available by architectural design without further training, showing that the multimodal architecture can be adopted as a backbone to solve specific tasks. The multimodal data representation outperforms the unimodal one on the classification of colon WSIs and halves the data needed to reach accurate performance, reducing the computational power required and thus the carbon footprint. Combining images and reports through self-supervised algorithms makes it possible to mine databases and extract new information without new annotations from experts. In particular, the multimodal visual ontology, which links semantic concepts to images, may pave the way to advancements in medicine and biomedical analysis domains, not limited to histopathology.
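
As an illustration of why multimodal retrieval needs no further training once images and reports share an embedding space, the sketch below ranks report embeddings by cosine similarity to an image embedding. This is a generic technique, not necessarily the paper's procedure. The function names, the report keys, and the three-dimensional vectors are hypothetical placeholders, not outputs of the described architecture.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def retrieve(query, candidates, top_k=1):
    """Return the keys of the top_k candidates most similar to the query."""
    ranked = sorted(candidates,
                    key=lambda name: cosine(query, candidates[name]),
                    reverse=True)
    return ranked[:top_k]

# Hypothetical embeddings in a shared image-text space (made-up values).
report_embeddings = {
    "report_a": [0.9, 0.1, 0.0],
    "report_b": [0.3, 0.9, 0.1],
    "report_c": [0.0, 0.2, 0.9],
}
wsi_embedding = [0.8, 0.2, 0.1]  # stands in for an encoded whole slide image

best = retrieve(wsi_embedding, report_embeddings)
ranking = retrieve(wsi_embedding, report_embeddings, top_k=3)
```

The same function works in the other direction, with a report embedding as the query and image embeddings as candidates, which is the practical meaning of retrieval being available by design.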

Articles published on Visual Concepts

An inherently interpretable deep learning model for local explanations using visual concepts.

Focus Group Interview: University-Industrial Collaboration

Design for shared driverless vehicles of the future

Ferrimagnet-Based Neuromorphic Device Mimicking the Ventral Visual Pathway for High-Accuracy Target Recognition.

Estimation of the Weight and Volume of Lime (Citrus aurantifolia (Christm.) Swingle) Fruit Using Computer Vision Based on Traditional Machine Learning and Deep Learning

Multiexposed Image-Fusion Strategy Using Mutual Image Translation Learning with Multiscale Surround Switching Maps

Advanced Personal Budget Analytics: Combining Optical Character Recognition and Natural Language Processing for Automated Budget Categorization and Insight Extraction

Lifelong Generative Adversarial Autoencoder.

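KRITIK TERHADAP PENGAJARAN VISUALIASI DAN AFIRMASI TEOLOGI KEMAKMURAN BERDASARKAN EKSPOSISI KEJADIAN 15:5-6 [A Critique of the Teaching of Visualization and Affirmation in Prosperity Theology Based on an Exposition of Genesis 15:5-6]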

The Semiotics of Place in Saudi Women’s Poetry: Ashjan Hindi as a Model

Multimodality-guided Visual-Caption Semantic Enhancement

Related work analysis for determination of fatigue state based on eye movements monitoring

A Spatio-Temporal Capsule Neural Network with Self-Correlation Routing for EEG Decoding of Semantic Concepts of Imagination and Perception Tasks.

FICE: Text-conditioned fashion-image editing with guided GAN inversion

Multimodal representations of biomedical knowledge from limited training whole slide images and reports using deep learning

Connecting plant science education in undergraduate life science courses to plant awareness disparity, Vision and Change, and sustainability careers

Pemanfaatan Infografis Bagi Pengelola Wisata Alam Pulau Semut [Use of Infographics for Managers of the Pulau Semut Nature Tourism Site]

Perancangan Desain Karakter pada Buku Interaktif Pengenalan Literasi Keuangan untuk Anak Usia Dini [Character Design for an Interactive Book Introducing Financial Literacy to Young Children]

Understanding 3D seismic data visualization with C++, OpenGL and GLSL

Multimodal semantic enhanced representation network for micro-video event detection
