Data Simulation in Machine Olfaction with the R Package Chemosensors

Andrey Ziyatdinov, Alexandre Perera-Lluna

doi:10.1371/journal.pone.0088839

Abstract

In machine olfaction, the design of applications based on gas sensor arrays is highly dependent on the robustness of the signal and data processing algorithms. Because testing algorithms on public benchmarks is not common practice in the field, we propose software for performing data simulations in machine olfaction by generating parameterized sensor array data. The software is implemented as an R package, chemosensors, which is open-access, platform-independent and self-contained. We introduce the concept of a virtual sensor array, which can be used as a data generation tool. In this work, we describe the data simulation workflow, which consists of scenario definition, virtual array parameterization and the generation of sensor array data. We also give examples of processing the simulated data as a proof of concept for the parameterized sensor array data: the benchmarking of classification algorithms, the evaluation of linear and non-linear regression algorithms, and the biologically inspired processing of sensor array data. All the results presented were obtained under version 0.7.6 of the chemosensors package, whose home page is chemosensors.r-forge.r-project.org.
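
The three-step workflow named in the abstract (scenario definition, virtual array parameterization, data generation) can be illustrated with a minimal Python sketch. This is not the chemosensors API: every name below (`define_scenario`, `make_virtual_array`, `generate_data`), the gas labels, the concentration values and the linear response with Gaussian noise are assumptions chosen for illustration only, and the package's own simulation models may differ substantially.

```python
import random

GASES = ("A", "B", "C")  # hypothetical analyte names, chosen for illustration


def define_scenario(n_repeats):
    """Step 1: scenario definition, as a list of (label, concentrations) samples."""
    # Hypothetical pure-analyte concentrations, one tuple entry per gas in GASES.
    pure = {
        "A": (0.05, 0.0, 0.0),
        "B": (0.0, 0.05, 0.0),
        "C": (0.0, 0.0, 1.0),
    }
    return [(label, conc) for label, conc in pure.items() for _ in range(n_repeats)]


def make_virtual_array(n_sensors, noise_sd, seed):
    """Step 2: parameterize the array with per-sensor sensitivities and a noise level."""
    rng = random.Random(seed)
    sensitivities = [[rng.uniform(0.5, 2.0) for _ in GASES] for _ in range(n_sensors)]
    return {"sensitivities": sensitivities, "noise_sd": noise_sd, "rng": rng}


def generate_data(array, scenario):
    """Step 3: generate one response vector per sample (assumed linear response plus noise)."""
    rows = []
    for label, conc in scenario:
        response = [
            sum(s * c for s, c in zip(sens, conc))
            + array["rng"].gauss(0.0, array["noise_sd"])
            for sens in array["sensitivities"]
        ]
        rows.append((label, response))
    return rows


scenario = define_scenario(n_repeats=3)
array = make_virtual_array(n_sensors=4, noise_sd=0.01, seed=42)
data = generate_data(array, scenario)
print(len(data), "samples x", len(data[0][1]), "sensors")
```

Fixing the seed makes the generated data set reproducible, so the same parameterized array can be reused to compare different processing algorithms on identical data.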

Highlights

Data sharing plays an important role in the fields of computer science, statistics and machine learning.
The website of the University of California at Irvine (UCI) Machine Learning Repository is an example of the way the machine learning community sets data repository standards and provides educational resources and open-access benchmarking material.
The chemosensors package is organized around the S4 classes of simulation models (see Table 2), and the implementation of the classes shares some common features.

Summary

Introduction

Data sharing plays an important role in the fields of computer science, statistics and machine learning. It has been one of the key factors enabling impressive developments in fields related to biological science, and in statistical genetics and bioinformatics. The website of the University of California at Irvine (UCI) Machine Learning Repository is an example of the way the machine learning community sets data repository standards and provides educational resources and open-access benchmarking material. This website contains over 200 data sets from different theoretical domains, including results from data generators. The Genetic Analysis Workshops approach current analytical problems by making both real and simulated data sets available to investigators worldwide. The use of simulated data is a widely accepted practice for evaluating the performance of computer algorithms and can be found in many computer science publications.

Journal: PLoS ONE
Publication Date: Feb 26, 2014
Citations: 22
License type: CC BY 4.0
