Fog computing promises to enable machine learning tasks to scale to large volumes of data by distributing processing across connected devices. Two key challenges to achieving this goal are (i) heterogeneity in devices' compute resources and (ii) topology constraints on which devices can communicate with each other. We address these challenges by developing a novel network-aware distributed learning methodology in which devices optimally share local data processing and send their learned parameters to a server for periodic aggregation. Unlike traditional federated learning, our method enables devices to offload their data processing tasks to each other, with these decisions optimized to trade off the costs associated with data processing, offloading, and discarding. We analytically characterize the optimal data transfer solution under different assumptions on the fog network scenario, showing, for example, that the value of offloading is approximately linear in the range of computing costs in the network when the cost of discarding is modeled as decreasing linearly in the amount of data processed at each node. Our experiments on real-world data traces from our testbed confirm that, for varying distributions of data across devices, our algorithms substantially improve network resource utilization without sacrificing the accuracy of the learned model. We also investigate the effect of network dynamics on model learning and resource costs.
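As a rough illustration of the processing/offloading/discarding trade-off described above, the sketch below formulates a toy three-device instance as a linear program. The topology (one-hop offloading links 1→2 and 2→3), the cost and capacity values, and the use of scipy.optimize.linprog are illustrative assumptions only, not the paper's actual formulation.

```python
# Minimal sketch of the data processing / offloading / discarding trade-off
# as a linear program, under assumed (hypothetical) costs and topology.
import numpy as np
from scipy.optimize import linprog

d   = np.array([10.0, 4.0, 2.0])   # data units generated at each device
cap = np.array([3.0, 8.0, 6.0])    # local processing capacities
p   = np.array([1.0, 0.2, 0.4])    # per-unit processing costs
delta = 2.0                         # per-unit penalty for discarded data
c12, c23 = 0.3, 0.1                 # per-unit offloading costs on links 1->2, 2->3

# Decision variables: [x1, x2, x3, r1, r2, r3, s12, s23]
#   x_i: data processed locally at device i, r_i: data discarded at i,
#   s_ij: data offloaded from i to j (and then processed at j).
cost = np.concatenate([p, [delta] * 3, [c12 + p[1], c23 + p[2]]])

# Conservation: each device's own data is processed, discarded, or offloaded.
A_eq = np.array([
    [1, 0, 0, 1, 0, 0, 1, 0],   # x1 + r1 + s12 = d1
    [0, 1, 0, 0, 1, 0, 0, 1],   # x2 + r2 + s23 = d2
    [0, 0, 1, 0, 0, 1, 0, 0],   # x3 + r3       = d3
])
# Capacity: local work plus incoming offloads must fit each device's budget.
A_ub = np.array([
    [1, 0, 0, 0, 0, 0, 0, 0],   # x1       <= cap1
    [0, 1, 0, 0, 0, 0, 1, 0],   # x2 + s12 <= cap2
    [0, 0, 1, 0, 0, 0, 0, 1],   # x3 + s23 <= cap3
])

res = linprog(cost, A_ub=A_ub, b_ub=cap, A_eq=A_eq, b_eq=d,
              bounds=[(0, None)] * 8, method="highs")
x, r, s = res.x[:3], res.x[3:6], res.x[6:]
print(f"processed={x.round(2)}, discarded={r.round(2)}, offloaded={s.round(2)}")
print(f"total cost={res.fun:.2f}")
```

In this toy instance the solver shifts work away from the device with the highest processing cost toward neighbors with spare capacity, discarding data only when the combined offloading and remote processing cost would exceed the discard penalty; the paper's actual model and analysis are, of course, more general.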