Word Count Research Articles

Due to recent advances in artificial intelligence (AI), language model applications can generate logical text output that is difficult to distinguish from human writing. ChatGPT (OpenAI) and Bard (subsequently rebranded as "Gemini"; Google AI) were developed using distinct approaches, but little is known about how they differ in their ability to generate abstracts. The use of AI to write scientific abstracts in the field of spine surgery is the subject of much debate and controversy. The objective of this study is to assess the reproducibility of the structured abstracts generated by ChatGPT and Bard compared to human-written abstracts in the field of spine surgery. In total, 60 spine-related abstracts were randomly selected from 7 reputable journals, and their paper titles were supplied to ChatGPT and Bard as input for generating abstracts. A total of 174 abstracts, divided into human-written abstracts, ChatGPT-generated abstracts, and Bard-generated abstracts, were evaluated for compliance with the structured format of journal guidelines and consistency of content. The likelihood of plagiarism and AI output was assessed using the iThenticate and ZeroGPT programs, respectively. A total of 8 reviewers in the spinal field evaluated 30 randomly extracted abstracts to determine whether they were produced by AI or human authors. The proportion of abstracts that met journal formatting guidelines was greater among ChatGPT abstracts (34/60, 56.6%) than among those generated by Bard (6/54, 11.1%; P<.001). However, a higher proportion of Bard abstracts (49/54, 90.7%) than ChatGPT abstracts (30/60, 50%; P<.001) had word counts that met journal guidelines. The similarity index was significantly lower among ChatGPT-generated abstracts (20.7%) than among Bard-generated abstracts (32.1%; P<.001).
The AI-detection program predicted that 21.7% (13/60) of the human group, 63.3% (38/60) of the ChatGPT group, and 87% (47/54) of the Bard group were possibly generated by AI, with an area under the curve value of 0.863 (P<.001). The mean detection rate by human reviewers was 53.8% (SD 11.2%), achieving a sensitivity of 56.3% and a specificity of 48.4%. Human reviewers correctly recognized 56.3% (63/112) of the human-written abstracts as human-written and 55.9% (62/128) of the AI-generated abstracts as AI-generated. Both ChatGPT and Bard can be used to help write abstracts, but most AI-generated abstracts are currently considered unethical due to high plagiarism and AI-detection rates. ChatGPT-generated abstracts appear to be superior to Bard-generated abstracts in meeting journal formatting guidelines. Because humans are unable to accurately distinguish abstracts written by humans from those produced by AI programs, it is crucial to exercise special caution and examine the ethical boundaries of using AI programs, including ChatGPT and Bard.
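The abstract above reports P<.001 for its between-group comparisons of proportions but does not name the statistical test. As a rough illustration only, the sketch below assumes a Pearson chi-square test on a 2x2 table (1 degree of freedom, no continuity correction) and applies it to the formatting-compliance and word-count-compliance counts quoted in the abstract. The function name `chi_square_2x2` is hypothetical, and the authors may have used a different test.

```python
import math


def chi_square_2x2(success_a: int, total_a: int, success_b: int, total_b: int):
    """Pearson chi-square test for two proportions (assumed test, 1 df).

    Returns (chi2, p_value). No continuity correction is applied.
    """
    a, b = success_a, total_a - success_a  # group A: met / did not meet
    c, d = success_b, total_b - success_b  # group B: met / did not meet
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("a row or column total is zero; the test is undefined")
    chi2 = n * (a * d - b * c) ** 2 / denom
    # For 1 degree of freedom, P(X > chi2) = erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


if __name__ == "__main__":
    # Counts taken from the abstract: ChatGPT vs Bard.
    comparisons = {
        "met formatting guidelines": (34, 60, 6, 54),
        "met word-count guidelines": (30, 60, 49, 54),
    }
    for label, counts in comparisons.items():
        chi2, p = chi_square_2x2(*counts)
        print(f"{label}: chi2={chi2:.2f}, p={p:.2g}")
```

Under this assumed test, both comparisons give p-values below .001, which is consistent with the values reported in the abstract.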

e13610 Background: Artificial intelligence (AI)-driven tools, like ChatGPT, have become widely available sources for online health information. Limited research has explored the congruity between AI-generated content and professional treatment guidelines. This study seeks to compare recommendations for cancer-related symptoms generated from ChatGPT with guidelines from the National Comprehensive Cancer Network (NCCN), a provider-focused source requiring users to log in or register to access its recommendations. Utilizing a provider-focused source like NCCN serves as a benchmark to assess whether ChatGPT recommendations align with the standards typically endorsed by clinicians. Methods: We extracted treatment recommendations from four NCCN Supportive Care webpages (Cancer Pain, Antiemesis, Cancer-Related Fatigue, and Distress Management) and five subsections of the NCCN Palliative Care webpage (dyspnea, constipation, diarrhea, sleep disturbances, and anorexia/cachexia). We then entered "How can I reduce my cancer-related [symptom]" into ChatGPT 3.5 and extracted its recommendations. We calculated and compared word count and Flesch-Kincaid Grade Level readability for each NCCN and ChatGPT section. We completed a comparative content analysis focusing on recommendations for medications, consultations, and non-pharmacological strategies. Results: Across the nine NCCN Supportive Care and Palliative Care webpages, the mean word count was 2393.8 (SD=2601.4) and the mean Flesch-Kincaid Grade Level was 17.3 (SD=1.4), compared with a mean word count of 382.4 (SD=29.6) and a mean grade level of 11.6 (SD=0.8) for ChatGPT. The mean percent agreement between NCCN and ChatGPT recommendations was 44.6% (range 14.3%-81.8%). ChatGPT and NCCN guidelines shared fewer than half of their symptom-related recommendations in all but one section, fatigue. NCCN offered specific medication recommendations across all sections. ChatGPT's recommendations lacked the specificity observed in NCCN's guidelines and often did not suggest any medications.
ChatGPT recommended specific medications in the shortness of breath and diarrhea sections that were not recommended by NCCN. NCCN's guidelines often did not include recommendations related to spirituality or palliative care consults, areas that ChatGPT addressed. Conclusions: While ChatGPT provides concise, accessible supportive care advice including many non-medical support recommendations, discrepancies with guidelines raise concerns for patient-facing symptom management recommendations. Overall, AI-generated content from tools like ChatGPT can assist in providing preliminary information but should be researched in conjunction with comprehensive, evidence-based guidance provided by healthcare professionals. Healthcare providers should work with AI developers to ensure data sources are high-quality and accurate.
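The two text metrics used in the abstract above, word count and Flesch-Kincaid Grade Level, can be approximated with a short script. The sketch below uses the standard Flesch-Kincaid formula, 0.39 × (words / sentences) + 11.8 × (syllables / words) − 15.59, with a simple vowel-group heuristic for syllables. The tokenization rules and function names are assumptions made for illustration. The abstract does not say which tool the authors used, and dedicated readability software will usually give slightly different scores.

```python
import re

# Assumption: a "word" is a run of letters, optionally joined by ' or -.
WORD_RE = re.compile(r"[A-Za-z]+(?:['-][A-Za-z]+)*")
# Assumption: a sentence ends at ., ! or ? followed by whitespace or end of text.
SENTENCE_END_RE = re.compile(r"[.!?]+(?=\s|$)")
VOWEL_GROUP_RE = re.compile(r"[aeiouy]+")


def count_words(text: str) -> int:
    """Count alphabetic words; purely numeric tokens are ignored."""
    return len(WORD_RE.findall(text))


def count_sentences(text: str) -> int:
    """Count sentence terminators, treating text with none as one sentence."""
    return max(len(SENTENCE_END_RE.findall(text)), 1)


def count_syllables(word: str) -> int:
    """Heuristic: count vowel groups and drop a likely silent final 'e'."""
    word = word.lower()
    groups = len(VOWEL_GROUP_RE.findall(word))
    if word.endswith("e") and not word.endswith("le") and groups > 1:
        groups -= 1
    return max(groups, 1)


def flesch_kincaid_grade(text: str) -> float:
    """Flesch-Kincaid Grade Level using the heuristics defined above."""
    words = WORD_RE.findall(text)
    if not words:
        raise ValueError("text contains no words")
    syllables = sum(count_syllables(w) for w in words)
    sentences = count_sentences(text)
    return 0.39 * len(words) / sentences + 11.8 * syllables / len(words) - 15.59


if __name__ == "__main__":
    sample = "How can I reduce my cancer-related fatigue?"
    print("words:", count_words(sample))
    print("grade level:", round(flesch_kincaid_grade(sample), 1))
```

The syllable heuristic is the weakest part of this sketch, so scores for short passages should be read as rough estimates.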

Articles published on Word Count

Assessing the Reproducibility of the Structured Abstracts Generated by ChatGPT and Bard Compared to Human-Written Abstracts in the Field of Spine Surgery: Comparative Analysis.

The Reddit cannabis subjective highness rating scale: Applying computational social science to explore psychological and environmental correlates of naturalistic cannabis use.

Natural Language Processing Approaches to Text Data Augmentation: A Computational Linguistic Analysis

Exploring barriers to care home research recruitment during the COVID-19 pandemic: The influence of social media recruitment posts and public sentiment.

Advanced optimization-based weighted features for ensemble deep learning smart occupancy detection network for road traffic parking

Range-limited Heaps’ law for functional DNA words in the human genome

Health information-seeking on Reddit, by people who use opioids

Forecasting the Spread of Sustainability Movement: Computational Analysis on Social Media Messages Promoting Climate Actions

DICTIONARY-BASED COMPARATIVE STUDY OF THE CHINESE “个 (GÈ)” AND THE MALAY “BUAH” NUMERAL CLASSIFIERS

Objective Linguistic Markers Associated with Callous-Unemotional Traits in Early Childhood

Automatic Detection of Verbal Deception in Romanian With Artificial Intelligence Methods

Cultural values and the P-O fit: comparative NLP analysis of German online job advertisements

The influence of President Trump's micro-expressions during his COVID-19 national address on viewers' emotional response.

Exploring the Use of Natural Language Processing to Understand Emotions of Trainees and Faculty Regarding Entrustable Professional Activity Assessments.

Enhancing Diagnostic Support for Chiari Malformation and Syringomyelia: A Comparative Study of Contextualized ChatGPT Models

Self-mediatisation and the format of Swedish parliamentary speeches: Speech length and political slogans, 1920–2019

Exploring AI-generated content and professional guidelines in cancer symptom management: A comparative analysis between ChatGPT and NCCN guidelines.

Assessing the missions and visions of NCI-designated cancer centers and their affiliated hospitals.

EFL Learners’ English Writing Feedback and Their Perception of Using ChatGPT

Corporate reporting behavior: Factors influencing the adoption of integrated reporting in India
