General Modeling Language Research Articles

The tremendous success of Stack Overflow has accumulated an extensive corpus of software engineering knowledge, thus motivating researchers to propose various solutions for analyzing its content. The performance of such solutions hinges significantly on the selection of representation models for Stack Overflow posts. As the volume of literature on Stack Overflow continues to burgeon, it highlights the need for a powerful Stack Overflow post representation model and drives researchers’ interest in developing specialized representation models that can adeptly capture the intricacies of Stack Overflow posts. The state-of-the-art (SOTA) Stack Overflow post representation models are Post2Vec and BERTOverflow, which are built upon neural networks such as convolutional neural network and transformer architecture (e.g., BERT). Despite their promising results, these representation methods have not been evaluated in the same experimental setting. To fill the research gap, we first empirically compare the performance of the representation models designed specifically for Stack Overflow posts (Post2Vec and BERTOverflow) in a wide range of related tasks (i.e., tag recommendation, relatedness prediction, and API recommendation). The results show that Post2Vec cannot further improve the SOTA techniques of the considered downstream tasks, and BERTOverflow shows surprisingly poor performance. To find more suitable representation models for the posts, we further explore a diverse set of transformer-based models, including (1) general domain language models (RoBERTa, Longformer, and GPT2) and (2) language models built with software engineering related textual artifacts (CodeBERT, GraphCodeBERT, seBERT, CodeT5, PLBart, and CodeGen). This exploration shows that models like CodeBERT and RoBERTa are suitable for representing Stack Overflow posts. However, it also illustrates the “No Silver Bullet” concept, as none of the models consistently wins against all the others. Inspired by the findings, we propose SOBERT, which employs a simple yet effective strategy to improve the representation models of Stack Overflow posts by continuing the pre-training phase with the textual artifact from Stack Overflow. The overall experimental results demonstrate that SOBERT can consistently outperform the considered models and increase the SOTA performance significantly for all the downstream tasks.

Read full abstract

ImportanceThe workload of clinical documentation contributes to healthcare costs and professional burnout. The advent of generative AI language models presents a promising solution. The perspective of clinicians may contribute to effective and responsible implementation of such tools. ObjectiveTo evaluate 3 uses for generative AI for clinical documentation in pediatric emergency medicine (PEM), measuring time savings, effort reduction, physician attitudes, and identifying potential risks and barriers. DesignA mixed-methods study. SettingSingle Pediatric Emergency Department. Participants10 PEM attending physicians. InterventionParticipants were asked to write a supervisory note for four clinical scenarios, with varying level of complexity, twice without any assistance and twice with the assistance of ChatGPT Version 4.0. Participants evaluated two additional ChatGPT-generated clinical summaries: a structured handoff and a visit summary for a family, written at an 8th grade reading level. Finally, a semi-structured interview was performed to assess physicians’ perspective on the use of ChatGPT in PEM. Main Outcomes and MeasuresBetween subjects’ comparison of the effort and time taken to complete the supervisory note with and without ChatGPT assistance. Effort was measured using a self-reported Likert scale of 0-10. Physicians’ scoring of and attitude toward the ChatGPT-generated summaries was measured using a 0-10 Likert scale and open-ended questions. Summaries were scored for completeness, accuracy, efficiency, readability, and overall satisfaction. A thematic analysis was performed to analyze the content of the open-ended questions and to identify key themes. ResultsChatGPT yielded a 40% reduction in time and a 33% decrease in effort for supervisory notes in intricate cases, with no discernible impact on simpler ones. ChatGPT-generated summaries for structured handoffs and family-letters were highly rated, ranging from 7.0 to 9.0 out of 10, and most participants favored their inclusion in clinical practice. However, there were several critical reservations, out of which a set of general recommendations for applying ChatGPT to clinical summaries was formulated. ConclusionsPEM attendings in our study perceived that ChatGPT can deliver high-quality summaries while saving time and effort in many scenarios, but not all.

Read full abstract

General Modeling Language Research Articles

Related Topics

Articles published on General Modeling Language

Learning to Rank in Generative Retrieval

Emotion detection in educational dialogues by transfer learning

AI-assisted literature exploration of innovative Chinese medicine formulas.

Explanation–Question–Response dialogue: An argumentative tool for explainable AI

Representation Learning for Stack Overflow Posts: How Far Are We?

Assisting the infection preventionist: Use of artificial intelligence for health care–associated infection surveillance

The Latest Progress in Human-Computer Dialogue System

Performance of Two Artificial Intelligence Generative Language Models on the Orthopaedic In-Training Examination.

Study on the Impact of Utilizing ChatGPT and Other AI Tools for Feedback in EAP Writing Classrooms on the Discursive Writing Performance of English Major Students

Framework-based qualitative analysis of free responses of Large Language Models: Algorithmic fidelity.

Evaluation of a Generative Language Model Tool for Writing Examination Questions

CodeKGC: Code Language Model for Generative Knowledge Graph Construction

Standardized nomenclature for litigational legal prompting in generative language models

Exploring the use of ChatGPT in predicting anterior circulation stroke functional outcomes after mechanical thrombectomy: a pilot study

Incorporating evidence into mental health Q&A: a novel method to use generative language models for validated clinical content extraction

Large-scale text analysis using generative language models: A case study in discovering public value expressions in AI patents

Generating Role-Playing Game Quests With GPT Language Models

Harnessing the Power of Generative AI for Clinical Summaries: Perspectives From Emergency Physicians

Application of generative language models to orthopaedic practice

Observing Schrödinger’s cat with artificial intelligence: emergent classicality from information bottleneck

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

General Modeling Language Research Articles

Related Topics

Articles published on General Modeling Language

Learning to Rank in Generative Retrieval

Emotion detection in educational dialogues by transfer learning

AI-assisted literature exploration of innovative Chinese medicine formulas.

Explanation–Question–Response dialogue: An argumentative tool for explainable AI

Representation Learning for Stack Overflow Posts: How Far Are We?

Assisting the infection preventionist: Use of artificial intelligence for health care–associated infection surveillance

The Latest Progress in Human-Computer Dialogue System

Performance of Two Artificial Intelligence Generative Language Models on the Orthopaedic In-Training Examination.

Study on the Impact of Utilizing ChatGPT and Other AI Tools for Feedback in EAP Writing Classrooms on the Discursive Writing Performance of English Major Students

Framework-based qualitative analysis of free responses of Large Language Models: Algorithmic fidelity.

Evaluation of a Generative Language Model Tool for Writing Examination Questions

CodeKGC: Code Language Model for Generative Knowledge Graph Construction

Standardized nomenclature for litigational legal prompting in generative language models

Exploring the use of ChatGPT in predicting anterior circulation stroke functional outcomes after mechanical thrombectomy: a pilot study

Incorporating evidence into mental health Q&A: a novel method to use generative language models for validated clinical content extraction

Large-scale text analysis using generative language models: A case study in discovering public value expressions in AI patents

Generating Role-Playing Game Quests With GPT Language Models

Harnessing the Power of Generative AI for Clinical Summaries: Perspectives From Emergency Physicians

Application of generative language models to orthopaedic practice

Observing Schrödinger’s cat with artificial intelligence: emergent classicality from information bottleneck