Character Recognition Research Articles

More efforts are being put into improving Large Language Models’ (LLM) capabilities than into dealing with their implications. Current LLMs are able to generate high quality texts seemingly indistinguishable from those written by human experts. While offering great potentials, such breakthroughs also pose new challenges for safe and ethical uses of LLMs in education, science, and a multitude of other areas. To add up, the majority of current approaches in LLM text detection are either computationally expensive or need access to the LLMs’ internal computations, both of which hinder their public accessibility. With such motivation, this paper presents a novel metric learning paradigm for detection of LLM-generated texts that is able to balance among computational costs, accessibility, and performances. Specifically, the detection is based on learning a similarity function between a given text and an equivalent example generated by LLMs that outputs high values for LLM-LLM text pairs and low values for LLM-human text pairs. In terms of architecture, the detection framework includes a pretrained language model for the text embedding task and a newly designed deep metric model. The metric component can be trained on triplets or pairs of same-context instances to signify the distances between human texts and LLM ones while reducing that among LLM texts. Next, we develop five datasets totalling over 95,000 contexts and triplets of responses in which one from human and two from GPT-3.5 TURBO or GPT-4 TURBO for benchmarking. Experiment studies show that our best architectures maintain F1 scores in between 0.87 to 0.95 across the tested corpora in multiple experiment settings. The metric framework also demands significantly less time in training and inference compared to the RoBERTa, LLaMA 3, Mistral v0.3, and Ghostbuster, while keeping 90% to 150% performances of the best benchmark.

Read full abstract

Recently, as the types of imported food and the design of their packaging become more complex and diverse, digital recognition technologies such as barcodes, QR (quick response) codes, and OCR (optical character recognition) are attracting attention in order to quickly and easily check safety information (e.g., food ingredient information and recalls). However, consumers are still exposed to inaccurate and inconvenient situations because legacy technologies require dedicated terminals or include information other than safety information. In this paper, we propose a deep learning-based packaging recognition system which can easily and accurately determine food safety information with a single image captured through a smartphone camera. The detection algorithm learned a total of 100 kinds of product images and optimized YOLOv7 to secure an accuracy of over 95%. In addition, a new SUS (system usability scale)-based questionnaire was designed and conducted on 71 consumers to evaluate the usability of the system from the individual consumer’s perspective. The questionnaire consisted of three categories, namely convenience, accuracy, and usefulness, and each received a score of at least 77, which confirms that the proposed system has excellent overall usability. Moreover, in terms of task completion rate and task completion time, the proposed system is superior when it compared to existing QR code- or Internet-based recognition systems. These results demonstrate that the proposed system provides consumers with more convenient and accurate information while also confirming the sustainability of smart food consumption.

Read full abstract

Character Recognition Research Articles

Related Topics

Articles published on Character Recognition

Emotion detection in text: advances in sentiment analysis

The Effect of a Sibling Shared Reading Intervention on the Reading Development of Early School-Aged Children in Rural China

A Metric-Based Detection System for Large Language Model Texts

Comparative Analysis of Object Detection Models for Sheet Music Recognition: A Focus on YOLO and OMR Technologies

Retraction Note: Optical handwritten character recognition for Tamil language using CNN-VGG-16 model with RF classifier

Natural Language Processing for Electronic Health Record Optimization in Android Applications

Vision Aid: Developing An Assistive Mobile Application for Visually Impaired Indivuals

A Survey on Multimodal Large Language Models

A Multi-Level Embedding Framework for Decoding Sarcasm Using Context, Emotion, and Sentiment Feature

Reverse Sign Language Recognition System Using Machine Learning

Research on Closed-Loop Control of Screen-Based Guidance Operations in High-Speed Railway Passenger Stations Based on Visual Detection Model

Application of Binary Image Quality Assessment Methods to Predict the Quality of Optical Character Recognition Results

Text kernel expansion for real-time scene text detection

Ensemble automated approaches for producing high‐quality herbarium digital records

Text Detection on Industrial Barrel Label with Convolutional Attention and Dual‐Branch Feature Network

Consumer Usability Test of Mobile Food Safety Inquiry Platform Based on Image Recognition

Similarity Distractors Increase the Burden of Chinese Character Selection and Updating in Working Memory.

Enhancing scene text detectors with realistic text image synthesis using diffusion models

EGO-LM: An efficient, generic, and out-of-the-box language model for handwritten text recognition

Navigating the Landscape of AI-Generated Text Detection: Issues and Solutions for Upholding Academic Integrity

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Character Recognition Research Articles

Related Topics

Articles published on Character Recognition

Emotion detection in text: advances in sentiment analysis

The Effect of a Sibling Shared Reading Intervention on the Reading Development of Early School-Aged Children in Rural China

A Metric-Based Detection System for Large Language Model Texts

Comparative Analysis of Object Detection Models for Sheet Music Recognition: A Focus on YOLO and OMR Technologies

Retraction Note: Optical handwritten character recognition for Tamil language using CNN-VGG-16 model with RF classifier

Natural Language Processing for Electronic Health Record Optimization in Android Applications

Vision Aid: Developing An Assistive Mobile Application for Visually Impaired Indivuals

A Survey on Multimodal Large Language Models

A Multi-Level Embedding Framework for Decoding Sarcasm Using Context, Emotion, and Sentiment Feature

Reverse Sign Language Recognition System Using Machine Learning

Research on Closed-Loop Control of Screen-Based Guidance Operations in High-Speed Railway Passenger Stations Based on Visual Detection Model

Application of Binary Image Quality Assessment Methods to Predict the Quality of Optical Character Recognition Results

Text kernel expansion for real-time scene text detection

Ensemble automated approaches for producing high‐quality herbarium digital records

Text Detection on Industrial Barrel Label with Convolutional Attention and Dual‐Branch Feature Network

Consumer Usability Test of Mobile Food Safety Inquiry Platform Based on Image Recognition

Similarity Distractors Increase the Burden of Chinese Character Selection and Updating in Working Memory.

Enhancing scene text detectors with realistic text image synthesis using diffusion models

EGO-LM: An efficient, generic, and out-of-the-box language model for handwritten text recognition

Navigating the Landscape of AI-Generated Text Detection: Issues and Solutions for Upholding Academic Integrity