Neural Generators Research Articles

Fault localization (FL) and automated program repair (APR) are two main tasks of automatic software debugging. Compared with traditional methods, deep learning-based approaches have been demonstrated to achieve better performance in FL and APR tasks. However, the existing deep learning-based FL methods ignore the deep semantic features or only consider simple code representations. And for APR tasks, existing template-based APR methods are weak in selecting the correct fix templates for more effective program repair, which are also not able to synthesize patches via the embedded end-to-end code modification knowledge obtained by training models on large-scale bug-fix code pairs. Moreover, in most of FL and APR methods, the model designs and training phases are performed separately, leading to ineffective sharing of updated parameters and extracted knowledge during the training process. This limitation hinders the further improvement in the performance of FL and APR tasks. To solve the above problems, we propose a novel approach called MTL-TRANSFER, which leverages a multi-task learning strategy to extract deep semantic features and transferred knowledge from different perspectives. First, we construct a large-scale open-source bug datasets and implement 11 multi-task learning models for bug detection and patch generation sub-tasks on 11 commonly used bug types, as well as one multi-classifier to learn the relevant semantics for the subsequent fix template selection task. Second, an MLP-based ranking model is leveraged to fuse spectrum-based, mutation-based and semantic-based features to generate a sorted list of suspicious statements. Third, we combine the patches generated by the neural patch generation sub-task from the multi-task learning strategy with the optimized fix template selecting order gained from the multi-classifier mentioned above. Finally, the more accurate FL results, the optimized fix template selecting order, and the expanded patch candidates are combined together to further enhance the overall performance of APR tasks. Our extensive experiments on widely-used benchmark Defects4J show that MTL-TRANSFER outperforms all baselines in FL and APR tasks, proving the effectiveness of our approach. Compared with our previously proposed FL method TRANSFER-FL (which is also the state-of-the-art statement-level FL method), MTL-TRANSFER increases the faults hit by 8/11/12 on Top-1/3/5 metrics (92/159/183 in total). And on APR tasks, the number of successfully repaired bugs of MTL-TRANSFER under the perfect localization setting reaches 75, which is 8 more than our previous APR method TRANSFER-PR. Furthermore, another experiment to simulate the actual repair scenarios shows that MTL-TRANSFER can successfully repair 15 and 9 more bugs (56 in total) compared with TBar and TRANSFER, which demonstrates the effectiveness of the combination of our optimized FL and APR components.

Read full abstract

Online shopping has become a crucial way to encourage daily consumption, where the User-generated, or crowdsourced product comments, can offer a broad range of feedback on e-commerce products. As a result, integrating critical opinions or major attitudes from the crowdsourced comments can provide valuable feedback for marketing strategy adjustment or product-quality monitoring. Unfortunately, the scarcity of annotated ground truth on the integrated comment, or the limited gold integration reference, has incurred the infeasibility of the regular supervised-learning-based comment integration. To resolve this problem, in this article, inspired by the principle of Transfer Learning, we propose a three-stage transferable and generative crowdsourced comment integration framework ( TTGCIF ) based on zero-and-few-shot learning with the support of domain distribution alignment. The proposed framework aims at generating abstractive integrated comment in target domain via the enhanced neural text generation model, by referring the available integration resource in related source domains, to avoid the exhausted effort on resource annotation devoted to the target domain. Specifically, at the first stage, to enhance the domain transferability, representations on the crowdsourced comments have been aligned up between the source and target domain, by minimizing the domain distribution discrepancy in the kernel space. At the second stage, Zero-shot comment integration mechanism has been adopted to deal with the dilemma that none of the gold integration reference may be available in target domain. In other words, taking the sample-level semantic prototype as input, the enhanced neural text generation model in TTGCIF is trained to learn data semantic association among different domains via semantic prototype transduction, so that the “ unlabeled ” crowdsourced comments in target domain can be associated with existing integration references in related source domains. At the third stage, based on the parameters trained at the second stage, fast domain adaptation mechanism in a Few-shot manner has also been adopted by seeking most potential parameters along the gradient direction constrained by instances across multiple source domains. In this way, parameters in TTGCIF can be sensitive to any alteration on training data, ensuring that even if only few annotated resource in target domain are available for “Fine-tune,” TTGCIF can still react promptly to achieve effective target domain adaptation. According to the experimental results, TTGCIF can achieve the best transferable product comment integration performance in target domain, with fast and stable domain adaption effect depending on no more than 10% annotated resource in target domain. More importantly, even if TTGCIF has not been fine-tuned on the target domain, yet by referring to the available integration resource in related source domains, the integrated comments generated by TTGCIF on the target domain are still superior to those generated by models already fine-tuned on the target domain.

Read full abstract

Neural Generators Research Articles

Related Topics

Articles published on Neural Generators

MTL-TRANSFER: Leveraging Multi-task Learning and Transferred Knowledge for Improving Fault Localization and Program Repair

Residential PV-battery scheduling with stochastic optimization and neural network-driven scenario generation

Meta-learning based blind image super-resolution approach to different degradations

Neural Sequence Generation with Constraints via Beam Search with Cuts: A Case Study on VRP

A cross-attention augmented model for event-triggered context-aware story generation

Swallowing-like activity elicited in neonatal rat medullary slice preparation

Pre-Trained Language Models for Text Generation: A Survey

Deep Learning Analysis With Gray Scale and Doppler Ultrasonography Images to Differentiate Graves' Disease.

What can rhetoric bring us? Incorporating rhetorical structure into neural related work generation

Chaotic neural network for information security

Event type induction using latent variables with hierarchical relationship analysis

Neural Network-Based Sum-Frequency Generation Spectra of Pure and Acidified Water Interfaces with Air.

Are the P600 and P3 ERP components linked to the task-evoked pupillary response as a correlate of norepinephrine activity?

Процессы когнитивного контроля в тесте Струпа и их отражение в связанных с событиями потенциалах (обзор)

Generative adversarial reduced order modelling

Electromyography-Based Biomechanical Cybernetic Control of a Robotic Fish Avatar

Node-Wise Monotone Barrier Coupling Law for Formation Control.

Content policy and access limitations on commercial neural networks as an incentive to artivism

Three-stage Transferable and Generative Crowdsourced Comment Integration Framework Based on Zero- and Few-shot Learning with Domain Distribution Alignment

Virtual Topologies for Populating Overhead Low-Voltage Broadband over Powerlines Topology Classes by Exploiting Neural Network Topology Generator Methodology (NNTGM) - Part 1: Theory

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Neural Generators Research Articles

Related Topics

Articles published on Neural Generators

MTL-TRANSFER: Leveraging Multi-task Learning and Transferred Knowledge for Improving Fault Localization and Program Repair

Residential PV-battery scheduling with stochastic optimization and neural network-driven scenario generation

Meta-learning based blind image super-resolution approach to different degradations

Neural Sequence Generation with Constraints via Beam Search with Cuts: A Case Study on VRP

A cross-attention augmented model for event-triggered context-aware story generation

Swallowing-like activity elicited in neonatal rat medullary slice preparation

Pre-Trained Language Models for Text Generation: A Survey

Deep Learning Analysis With Gray Scale and Doppler Ultrasonography Images to Differentiate Graves' Disease.

What can rhetoric bring us? Incorporating rhetorical structure into neural related work generation

Chaotic neural network for information security

Event type induction using latent variables with hierarchical relationship analysis

Neural Network-Based Sum-Frequency Generation Spectra of Pure and Acidified Water Interfaces with Air.

Are the P600 and P3 ERP components linked to the task-evoked pupillary response as a correlate of norepinephrine activity?

Процессы когнитивного контроля в тесте Струпа и их отражение в связанных с событиями потенциалах (обзор)

Generative adversarial reduced order modelling

Electromyography-Based Biomechanical Cybernetic Control of a Robotic Fish Avatar

Node-Wise Monotone Barrier Coupling Law for Formation Control.

Content policy and access limitations on commercial neural networks as an incentive to artivism

Three-stage Transferable and Generative Crowdsourced Comment Integration Framework Based on Zero- and Few-shot Learning with Domain Distribution Alignment

Virtual Topologies for Populating Overhead Low-Voltage Broadband over Powerlines Topology Classes by Exploiting Neural Network Topology Generator Methodology (NNTGM) - Part 1: Theory