Significant Improvement In Recall Research Articles

The accurate identification of protein-ligand binding sites is of critical importance in understanding and modulating protein function. Accordingly, ligand binding site prediction has remained a research focus for over three decades, with over 50 methods developed and a change of paradigm from geometry-based methods to machine learning. In this work, we collate 13 ligand binding site predictors spanning 30 years, focusing on the latest machine learning-based methods such as VN-EGNN, IF-SitePred, GrASP, PUResNet, and DeepPocket, and compare them to the established P2Rank, PRANK and fpocket and to earlier methods like PocketFinder, Ligsite and Surfnet. We benchmark the methods against the human subset of our new curated reference dataset, LIGYSIS. LIGYSIS is a comprehensive protein-ligand complex dataset comprising 30,000 proteins with bound ligands, which aggregates biologically relevant unique protein-ligand interfaces across biological units of multiple structures from the same protein. LIGYSIS is an improvement for testing methods over earlier datasets like sc-PDB, PDBbind, binding MOAD, COACH420 and HOLO4K, which either include 1:1 protein-ligand complexes or consider asymmetric units. Re-scoring of fpocket predictions by PRANK and DeepPocket displays the highest recall (60%), whilst IF-SitePred presents the lowest recall (39%). We demonstrate the detrimental effect that redundant prediction of binding sites has on performance, as well as the beneficial impact of stronger pocket scoring schemes, with improvements of up to 14% in recall (IF-SitePred) and 30% in precision (Surfnet). Finally, we propose top-N+2 recall as the universal benchmark metric for ligand binding site prediction and urge authors to share not only the source code of their methods, but also that of their benchmark.

Scientific contributions: This study conducts the largest benchmark of ligand binding site prediction methods to date, comparing 13 original methods and 15 variants using 10 informative metrics. The LIGYSIS dataset is introduced, which aggregates biologically relevant protein-ligand interfaces across multiple structures of the same protein. The study highlights the detrimental effect of redundant binding site prediction and demonstrates significant improvement in recall and precision through stronger scoring schemes. Finally, top-N+2 recall is proposed as a universal benchmark metric for ligand binding site prediction, with a recommendation for open-source sharing of both methods and benchmarks.
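
The abstract names top-N+2 recall without defining it. The sketch below shows one plausible reading, offered as an illustration rather than the authors' implementation: for a protein with N observed binding sites, only the N+2 highest-ranked predictions are considered, and an observed site counts as recalled if any of them lies close enough to it. The function name, the use of pocket centres, and the 12 Å distance threshold are all assumptions made for this example.

```python
from math import dist


def top_n_plus_k_recall(proteins, k=2, threshold=12.0):
    """Fraction of observed sites matched by the top N+k ranked predictions.

    proteins: list of (observed_centres, ranked_predicted_centres) pairs,
              where each centre is an (x, y, z) tuple and predictions are
              sorted from best to worst score.
    k:        extra predictions allowed beyond N, the number of observed sites.
    threshold: maximum centre-to-centre distance for a match (assumed value).
    """
    hits = 0
    total = 0
    for observed, predicted in proteins:
        # Keep only the N + k best-ranked predictions for this protein.
        top = predicted[: len(observed) + k]
        for site in observed:
            total += 1
            if any(dist(site, pocket) <= threshold for pocket in top):
                hits += 1
    return hits / total if total else 0.0


# Hypothetical protein with two observed sites and five ranked predictions.
example = [
    (
        [(0.0, 0.0, 0.0), (30.0, 0.0, 0.0)],
        [
            (50.0, 50.0, 50.0),
            (1.0, 0.0, 0.0),
            (100.0, 0.0, 0.0),
            (31.0, 0.0, 0.0),
            (200.0, 0.0, 0.0),
        ],
    )
]

top_n = top_n_plus_k_recall(example, k=0)
top_n_plus_2 = top_n_plus_k_recall(example, k=2)
```

In this toy case the second site is only found by the fourth-ranked prediction, so it is missed when the cut-off is N but recovered when the cut-off is N+2, which is the kind of ranking tolerance the metric is meant to allow.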

Background and Objective: Medical imaging techniques are widely employed in disease diagnosis and treatment. A readily available medical report can be a useful tool in assisting an expert investigating a patient's health. A radiologist can benefit from an automatic system that translates a medical image into a radiological report while preparing a final report. Previous attempts at automatic medical report generation rely on image captioning algorithms that do not take domain-specific visual and textual content into account, which raises questions about the credibility of the generated reports.

Methods: In this work, a novel Adaptive Multilevel Multi-Attention (AMLMA) approach is proposed that uses domain-specific visual-textual knowledge to generate a thorough and believable radiological report for any view of a human chest X-ray image. The proposed approach leverages the encoder-decoder framework combined with multiple adaptive attention mechanisms. The potential of a convolutional neural network (CNN) with a residual attention module (RAM) is demonstrated as a strong visual encoder for multi-label abnormality detection. Multilevel visual features (local and global) are extracted from the proposed visual encoder to retrieve regional-level and abstract-level radiology-based semantic information. Word2Vec and FastText word embeddings are trained on medical reports to acquire radiological knowledge and are then used as textual encoders, feeding a Bi-directional Long Short-Term Memory (Bi-LSTM) network that learns the relationships between medical terms in radiological reports. The AMLMA employs a weighted multilevel association of adaptive visual-semantic attention and visual-based linguistic attention mechanisms. This association of adaptive attention is used as a decoder and produces significant improvements in the report generation task.

Results: The proposed approach is evaluated on the publicly available Indiana University chest X-ray (IU-CXR) dataset. The CNN with RAM shows a significant improvement in recall (0.4423), precision (0.1803) and F1-score (0.2551) for the prediction of multiple abnormalities in an X-ray image. The language generation metrics for the proposed variants were computed using the COCO-caption evaluation Application Program Interface (API). The AMLMA model with trained embeddings generates convincing radiology reports and outperforms state-of-the-art (SOTA) approaches, with high scores for Bleu-4 (0.172), Meteor (0.247), Rouge_L (0.376) and CIDEr (0.381). In addition, a new “Unique Index” (UI) statistic is introduced to highlight the model’s ability to generate unique reports.

Conclusion: The overall architecture aids the understanding of various X-ray image views and the generation of relevant normal and abnormal radiography statements. The proposed model emphasizes multi-level visual-textual knowledge with an adaptive attention mechanism to balance visual and linguistic information for the generation of an admissible radiology report.
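
As a quick sanity check on how the three classification metrics relate, the sketch below computes F1 as the harmonic mean of precision and recall. The function name is chosen for this example only. Applied to the reported precision and recall it gives roughly 0.256, close to but not identical to the reported 0.2551; the small gap would be consistent with the reported F1 being averaged per label or per image, although the abstract does not say how it was computed.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values reported in the abstract for the CNN with RAM.
f1_from_reported = f1_score(precision=0.1803, recall=0.4423)
print(round(f1_from_reported, 4))
```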

Articles published on Significant Improvement In Recall

Comparative evaluation of methods for the prediction of protein-ligand binding sites.

NewsZoom (focus on the important): Text summarization of News Articles based on named entity recognition using Spacy library

Synthetic Data-Driven Real-Time Detection Transformer Object Detection in Raining Weather Conditions

Boost recall in quasi-stellar object selection from highly imbalanced photometric datasets

Chinese English language learners’ vocabulary retention: Investigating the effectiveness of neuro/metacognitive and socio-cultural strategies

Effects of Virtual Reality on Complex Building System Recall

Hierarchical User Intention-Preference for Sequential Recommendation with Relation-Aware Heterogeneous Information Network Embedding.

Translating medical image to radiological report: Adaptive multilevel multi-attention approach

Design of a Music Recommendation Model on the Basis of Multilayer Attention Representation

Low-light image enhancement using Gaussian Process for features retrieval

Improving Regional and Teleseismic Detection for Single-Trace Waveforms Using a Deep Temporal Convolutional Neural Network Trained with an Array-Beam Catalog.

Randomized Controlled Trial of a Computer-Based Education Program in the Home for Solid Organ Transplant Recipients

Learning Genetic Regulatory Network Connectivity from Time Series Data

Pilot study using an Internet-based program in informed consent

A Theatrical Intervention to Improve Cognition in Intact Residents of Long Term Care Facilities

Effects of a knowledge base manipulation on children's recall

Effects of Cues on Memory in Alcoholics and Controls

The effectiveness of document neighboring in search enhancement

Optical disc technology for records management: A user perspective

Long-Term Memory of Operativity Figures
