Hidden Representations Research Articles

Stack Overflow is very helpful for software developers who are seeking answers to programming problems. Previous studies have shown that a growing number of questions are of low quality and thus obtain less attention from potential answerers. Gao et al. proposed an LSTM-based model (i.e., BiLSTM-CC) to automatically generate question titles from the code snippets to improve the question quality. However, only using the code snippets in the question body cannot provide sufficient information for title generation, and LSTMs cannot capture the long-range dependencies between tokens. This paper proposes CCBERT, a deep learning based novel model to enhance the performance of question title generation by making full use of the bi-modal information of the entire question body. CCBERT follows the encoder–decoder paradigm and uses CodeBERT to encode the question body into hidden representations, a stacked Transformer decoder to generate predicted tokens, and an additional copy attention layer to refine the output distribution. Both the encoder and decoder perform the multi-head self-attention operation to better capture the long-range dependencies. This paper builds a dataset containing around 200,000 high-quality questions filtered from the data officially published by Stack Overflow to verify the effectiveness of the CCBERT model. CCBERT outperforms all the baseline models on the dataset. Experiments on both code-only and low-resource datasets show the superiority of CCBERT with less performance degradation. The human evaluation also shows the excellent performance of CCBERT concerning both readability and correlation criteria. CCBERT is capable of automatically capturing the bi-modal semantic information from the entire question body and parsing the long-range dependencies to achieve better performance. Therefore, CCBERT is an effective approach for generating Stack Overflow question titles.

BackgroundPersonalized medicine requires the patient similarity analysis for providing specific treatments tailed for each patient. However, the patient similarity analysis in personalized clinical scenarios encounters challenges, which are twofold. First, heterogeneous and multi-type data are usually recorded to Electronic Health Records (EHRs) during the course of admissions, which makes it difficult to measure the patient similarity. Second, disease progression manifests diverse disease states at different times, which brings sequential complexity to dynamically retrieve similar patients' sequences. Materials and methodsTo overcome the above-mentioned challenges, we propose a novel dynamic patient similarity analysis model based on deep learning. Specifically, the proposed model embeds the multi-type and heterogeneous data into hidden representations with a specially designed embedding and attention module. Thereafter, the proposed model retrieves similar patients' sequences based on these hidden representations in a dynamic manner. More importantly, we adopt two clinical tasks, i.e., diagnosis prediction and medication recommendation, to validate the effectiveness of the proposed model. It is worth noticing that the proposed model integrates a drug-drug interaction (DDI) knowledge graph in the medication recommendation task to reduce adverse reactions caused by combinational treatments, such that a more rational strategy can be realized. We evaluate our proposed model using the critical care database MIMIC-III, which includes 5,430 patients covering 14,096 clinical visits. ResultsThe proposed model outperforms several state-of-the-art methods. For diagnosis prediction, the average PR-AUC score of the proposed model reaches 0.6200, which is significantly higher than that of the baseline models (0.2497∼0.5407). Meanwhile, for medication recommendation, the average PR-AUC of the proposed model is 0.6682 (Jaccard: 0.4070; F1: 0.5672; Recall: 0.7832) whereas the K-nearest model can only reach 0.3805 (Jaccard: 0.3911; F1: 0.5465; Recall: 0.5705). In addition, our proposed model achieves a lower DDI rate. ConclusionWe propose a novel dynamic patient similarity analysis model, which can be implemented into a decision support system for clinical tasks including diagnosis prediction, surgical procedure selection, medication recommendation, etc. Also, the proposed model serves as an explainable protocol in clinical practice thanks to its analogy to real clinical reasoning where a doctor diagnoses diseases and prescribes medications according to the previous cured patients empirically.

Hidden Representations Research Articles

Articles published on Hidden Representations

Extracting Sentence Embeddings from Pretrained Transformer Models

Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI.

An Attentive Inductive Bias for Sequential Recommendation beyond the Self-Attention

Redundant representations help generalization in wide neural networks * ,

LaenNet: Learning robust GCNs by propagating labels

Retrosynthesis Prediction with Local Template Retrieval

ReCoMIF: Reading comprehension based multi-source information fusion network for Chinese spoken language understanding

TTS-Guided Training for Accent Conversion Without Parallel Data

Learning Representations by Graphical Mutual Information Estimation and Maximization.

A novel feature integration and entity boundary detection for named entity recognition in cybersecurity

Adaptation to CT Reconstruction Kernels by Enforcing Cross-Domain Feature Maps Consistency.

Improving Stack Overflow question title generation with copying enhanced CodeBERT model and bi-modal information

Deep Dynamic Patient Similarity Analysis: Model Development and Validation in ICU

Feature Encoding With Autoencoders for Weakly Supervised Anomaly Detection.

Masked Contrastive Representation Learning for Reinforcement Learning.

МОДЕЛЬ РАСПРОСТРАНЕНИЯ ДЕСТРУКТИВНЫХ ВОЗДЕЙСТВИЙ В ОГРАНИЧЕННЫХ КОЛЛЕКТИВАХ НА ОСНОВЕ ОГРАНИЧЕННОЙ МАШИНЫ БОЛЬЦМАНА

Blind microscopy image denoising with a deep residual and multiscale encoder/decoder network.

ML-ANet: A Transfer Learning Approach Using Adaptation Network for Multi-label Image Classification in Autonomous Driving

Interpretable deep recommender system model for prediction of kinase inhibitor efficacy across cancer cell lines

Information Flows of Diverse Autoencoders

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Hidden Representations Research Articles

Articles published on Hidden Representations

Extracting Sentence Embeddings from Pretrained Transformer Models

Deciphering the Feature Representation of Deep Neural Networks for High-Performance AI.

An Attentive Inductive Bias for Sequential Recommendation beyond the Self-Attention

Redundant representations help generalization in wide neural networks * ,

LaenNet: Learning robust GCNs by propagating labels

Retrosynthesis Prediction with Local Template Retrieval

ReCoMIF: Reading comprehension based multi-source information fusion network for Chinese spoken language understanding

TTS-Guided Training for Accent Conversion Without Parallel Data

Learning Representations by Graphical Mutual Information Estimation and Maximization.

A novel feature integration and entity boundary detection for named entity recognition in cybersecurity

Adaptation to CT Reconstruction Kernels by Enforcing Cross-Domain Feature Maps Consistency.

Improving Stack Overflow question title generation with copying enhanced CodeBERT model and bi-modal information

Deep Dynamic Patient Similarity Analysis: Model Development and Validation in ICU

Feature Encoding With Autoencoders for Weakly Supervised Anomaly Detection.

Masked Contrastive Representation Learning for Reinforcement Learning.

МОДЕЛЬ РАСПРОСТРАНЕНИЯ ДЕСТРУКТИВНЫХ ВОЗДЕЙСТВИЙ В ОГРАНИЧЕННЫХ КОЛЛЕКТИВАХ НА ОСНОВЕ ОГРАНИЧЕННОЙ МАШИНЫ БОЛЬЦМАНА

Blind microscopy image denoising with a deep residual and multiscale encoder/decoder network.

ML-ANet: A Transfer Learning Approach Using Adaptation Network for Multi-label Image Classification in Autonomous Driving

Interpretable deep recommender system model for prediction of kinase inhibitor efficacy across cancer cell lines

Information Flows of Diverse Autoencoders