Real-time Algorithm Research Articles

Vocal biomarkers, derived from acoustic analysis of vocal characteristics, offer noninvasive avenues for medical screening, diagnostics, and monitoring. Previous research demonstrated the feasibility of predicting type 2 diabetes mellitus through acoustic analysis of smartphone-recorded speech. Building upon this work, this study explores the impact of audio data compression on acoustic vocal biomarker development, which is critical for broader applicability in health care. The objective of this research is to analyze how common audio compression algorithms (MP3, M4A, and WMA) applied by 3 different conversion tools at 2 bitrates affect features crucial for vocal biomarker detection. The impact of audio data compression on acoustic vocal biomarker development was investigated using uncompressed voice samples converted into MP3, M4A, and WMA formats at 2 bitrates (320 and 128 kbps) with MediaHuman (MH) Audio Converter, WonderShare (WS) UniConverter, and Fast Forward Moving Picture Experts Group (FFmpeg). The data set comprised recordings from 505 participants, totaling 17,298 audio files, collected using a smartphone. Participants recorded a fixed English sentence up to 6 times daily for up to 14 days. Feature extraction, including pitch, jitter, intensity, and Mel-frequency cepstral coefficients (MFCCs), was conducted using Python and Parselmouth. The Wilcoxon signed rank test and the Bonferroni correction for multiple comparisons were used for statistical analysis. In this study, 36,970 audio files were initially recorded from 505 participants, with 17,298 recordings meeting the fixed sentence criteria after screening. Differences between the audio conversion software, MH, WS, and FFmpeg, were notable, impacting compression outcomes such as constant or variable bitrates. Analysis encompassed diverse data compression formats and a wide array of voice features and MFCCs. Wilcoxon signed rank tests yielded P values, with those below the Bonferroni-corrected significance level indicating significant alterations due to compression. The results indicated feature-specific impacts of compression across formats and bitrates. MH-converted files exhibited greater resilience compared to WS-converted files. Bitrate also influenced feature stability, with 38 cases affected uniquely by a single bitrate. Notably, voice features showed greater stability than MFCCs across conversion methods. Compression effects were found to be feature specific, with MH and FFmpeg showing greater resilience. Some features were consistently affected, emphasizing the importance of understanding feature resilience for diagnostic applications. Considering the implementation of vocal biomarkers in health care, finding features that remain consistent through compression for data storage or transmission purposes is valuable. Focused on specific features and formats, future research could broaden the scope to include diverse features, real-time compression algorithms, and various recording methods. This study enhances our understanding of audio compression's influence on voice features and MFCCs, providing insights for developing applications across fields. The research underscores the significance of feature stability in working with compressed audio data, laying a foundation for informed voice data use in evolving technological landscapes.

Read full abstract

Rapid and non-destructive automatic statistics of cherry tomatoes at different ripeness stages help better manage resources during harvesting, storage, and transportation processes. Currently, the inspection of cherry tomatoes (ripeness assessment and counting) still faces challenges, such as excluding background cherry tomatoes, detecting heavily obscured ones, and tracking similar feature extraction across frames. This study presented a statistical algorithm for cherry tomatoes with different ripeness. Firstly, a complete field of view was achieved by stitching images from dual cameras. Then, during the detection phase, the proposed depth information mapping and morphological operations were employed to filter out background cherry tomatoes effectively. Secondly, the SimAM attention module was introduced to enhance the focus of the YOLO v7-tiny model on small and occluded targets. The ReID feature extraction model was replaced with the lighter and more powerful MobileNeXt model, with the input resolution adapted to 64 × 64 based on the morphological characteristics of cherry tomatoes. Finally, the statistics of cherry tomatoes at different ripeness levels were conducted using the improved DeepSORT algorithm. The ablation experimental results prove the effectiveness of the proposed algorithm. The improved YOLO v7-tiny has a mAP of 87.3 % on the dataset, combining depth information mapping with morphological operations. Compared with the original DeepSORT algorithm, the improved DeepSORT algorithm has RMSE decreased by 15.46 to 3.11, and R2 increased by 0.071 to 0.998. The statistical algorithm enables real-time statistical of the number of cherry tomatoes at different ripeness levels during inspection.

Read full abstract

Real-time Algorithm Research Articles

Articles published on Real-time Algorithm

Near real-time interpolative algorithm for modelling air quality in underground mines

Research on Complementary Filtered Attitude Solution Method for Quadcopter Based on Double Filter Preprocessing

AIDER: Aircraft Icing Potential Area DEtection in Real-Time Using 3-Dimensional Radar and Atmospheric Variables

Comprehensive Travel Companion: A Hybrid Recommendation and Route Planning System for Globetrotting

Fire and smoke real-time detection algorithm for coal mines based on improved YOLOv8s.

Efficiency Comparison CRAFT vs. SLP Methods in Production Optimization

Anomaly Detection Algorithm for Urban Infrastructure Construction Equipment based on Multidimensional Time Series

Impact of Audio Data Compression on Feature Extraction for Vocal Biomarker Detection: Validation Study.

Adapting support vector optimisation algorithms to textual gender classification

A Compact Handheld Sensor Package with Sensor Fusion for Comprehensive and Robust 3D Mapping.

A Lightweight Vehicle Detection Method Fusing GSConv and Coordinate Attention Mechanism.

Voice Recognition Based Vital Parameter Monitoring Patient BED

Decision Support System for Optimizing Tactics and Strategies of Sports Competition Using Reinforcement Learning Algorithm

Real-time statistical algorithm for cherry tomatoes with different ripeness based on depth information mapping

Interferometer-based chemical sensor on chip with enhanced responsivity and low-cost interrogation

Development of a vision system integrated with industrial robots for online weld seam tracking

Real-Time Point Cloud Clustering Algorithm Based on Roadside LiDAR

Structurally Aware 3D Gas Distribution Mapping Using Belief Propagation: A Real-Time Algorithm for Robotic Deployment

Optimal and Quasi-Optimal Automatic Tuning of Vibration Neutralizers

Vision SLAM algorithm for wheeled robots integrating multiple sensors.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Real-time Algorithm Research Articles

Articles published on Real-time Algorithm

Near real-time interpolative algorithm for modelling air quality in underground mines

Research on Complementary Filtered Attitude Solution Method for Quadcopter Based on Double Filter Preprocessing

AIDER: Aircraft Icing Potential Area DEtection in Real-Time Using 3-Dimensional Radar and Atmospheric Variables

Comprehensive Travel Companion: A Hybrid Recommendation and Route Planning System for Globetrotting

Fire and smoke real-time detection algorithm for coal mines based on improved YOLOv8s.

Efficiency Comparison CRAFT vs. SLP Methods in Production Optimization

Anomaly Detection Algorithm for Urban Infrastructure Construction Equipment based on Multidimensional Time Series

Impact of Audio Data Compression on Feature Extraction for Vocal Biomarker Detection: Validation Study.

Adapting support vector optimisation algorithms to textual gender classification

A Compact Handheld Sensor Package with Sensor Fusion for Comprehensive and Robust 3D Mapping.

A Lightweight Vehicle Detection Method Fusing GSConv and Coordinate Attention Mechanism.

Voice Recognition Based Vital Parameter Monitoring Patient BED

Decision Support System for Optimizing Tactics and Strategies of Sports Competition Using Reinforcement Learning Algorithm

Real-time statistical algorithm for cherry tomatoes with different ripeness based on depth information mapping

Interferometer-based chemical sensor on chip with enhanced responsivity and low-cost interrogation

Development of a vision system integrated with industrial robots for online weld seam tracking

Real-Time Point Cloud Clustering Algorithm Based on Roadside LiDAR

Structurally Aware 3D Gas Distribution Mapping Using Belief Propagation: A Real-Time Algorithm for Robotic Deployment

Optimal and Quasi-Optimal Automatic Tuning of Vibration Neutralizers

Vision SLAM algorithm for wheeled robots integrating multiple sensors.