Sampling Paradigm Research Articles

Negative sampling plays a crucial role in implicit-feedback-based collaborative filtering, where it leverages massive unlabeled data to generate negative signals for guiding supervised learning. The current state-of-the-art approaches focus on utilizing hard negative samples that contain more information to establish a better decision boundary. To strike a balance between efficiency and effectiveness, most existing methods adopt a two-pass approach: in the first pass, a fixed number of unobserved items are sampled using a simple static distribution, while, in the second pass, a more sophisticated negative sampling strategy is employed to select the final negative items. However, selecting negative samples solely from the original items in a dataset is inherently restricted due to the limited available choices, and thus may not be able to effectively contrast positive samples. In this paper, we empirically validate this observation through meticulously designed experiments and identify three major limitations of existing solutions: ambiguous trap, information discrimination, and false negative samples. Our response to such limitations is to introduce “denoised” and “augmented” negative samples that may not exist in the original dataset. This direction renders a few substantial technical challenges. First, constructing augmented negative samples may introduce excessive noise that eventually distorts the decision boundary. Second, the scarcity of supervision signals hampers the denoising process. To this end, we introduce a novel generic denoising and augmented negative sampling (DANS) paradigm and provide a concrete instantiation. First, we disentangle the hard and easy factors of negative items. Then, we regulate the augmentation of easy factors by carefully considering the direction and magnitude. Next, we propose a reverse attention mechanism to learn a user’s negative preference, which allows us to perform a dimension-level denoising procedure on hard factors. 
Finally, we design an advanced negative sampling strategy to identify the final negative samples, taking into account both the score function used in existing methods and a novel metric called synthesization gain. Through extensive experiments on real-world datasets, we demonstrate that our method substantially outperforms state-of-the-art baselines. Our code is publicly available at https://github.com/Asa9aoTK/ANS-Recbole.
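
The generic two-pass scheme that the abstract attributes to existing methods can be sketched roughly as follows. This is a minimal illustration of that baseline, not the DANS method itself. It assumes an inner-product score function and assumes the second pass simply keeps the highest-scoring candidate as the "hard" negative; the function name and parameters are hypothetical.

```python
import random


def dot(u, v):
    """Inner product, used here as an assumed score function."""
    return sum(a * b for a, b in zip(u, v))


def two_pass_negative_sample(user_vec, item_vecs, observed, n_candidates, rng):
    """Return the index of one negative item for a user.

    Pass 1: draw a fixed number of unobserved items uniformly at random
            (the simple static distribution).
    Pass 2: keep the candidate the current model scores highest
            (an assumed hard-negative selection rule).
    """
    unobserved = [i for i in range(len(item_vecs)) if i not in observed]
    if not unobserved:
        raise ValueError("no unobserved items to sample from")
    candidates = rng.sample(unobserved, min(n_candidates, len(unobserved)))
    return max(candidates, key=lambda i: dot(user_vec, item_vecs[i]))


# Toy usage: item 0 is observed, so the negative comes from items 1 to 3.
rng = random.Random(0)
item_vecs = [[1.0, 0.0], [0.9, 0.0], [0.1, 0.0], [0.5, 0.0]]
hard_negative = two_pass_negative_sample([1.0, 0.0], item_vecs, {0}, 10, rng)
```

Because every negative returned this way is an existing item, the sketch also shows the restriction the abstract points out: the choice is limited to items already in the dataset.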


Introduction: Sleep is an important behavioral biomarker for patients with serious mental illness (SMI). The ability to accurately quantify sleep in a real-world setting could thus provide insight into patient well-being. In this study, patients in a sleep lab wore a patch that is part of a digital medicine system (aripiprazole with sensor (AS)) designed to provide objective records of medication ingestion. The patch provided accelerometer and electrocardiogram (ECG) data; polysomnography (PSG) data was collected to be used as the gold standard for sleep stage classification. The accelerometer and ECG data were used to build machine learning classification models to distinguish periods of wake from periods of sleep. To optimize these models for a real-world environment, different data sampling paradigms and methodologies were explored, and resultant model performances were analyzed.

Methods: Data was collected for a total of 220 nights, across 73 unique subjects: 42 subjects had a diagnosed SMI (schizophrenia, bipolar disorder I, or major depressive disorder) and 31 subjects were healthy volunteers. PSG data, which provides a sleep stage designation at 30-second intervals, was combined into 5-minute windows, labeled as either “Sleep” or “Wake” based on which class comprised the majority of the 30-second intervals within the window. Accelerometer and ECG features were derived for each 5-minute window. Models were trained with three learning methodologies: a light gradient boosting machine (LGBM), a conditional random field (CRF), and a long short-term memory (LSTM) network. Model performance was tested with the full complement of accelerometer and ECG data, as well as down-sampled subsets of data. Additionally, ECG data from the PSG system was incorporated to test the effect of other ECG sampling strategies.

Results: CRF models produced the best classification performance (AUC = 0.91) with the full patch dataset. Down-sampling to include less than half of the accelerometer data did begin to degrade the specificity of the model. Down-sampling to include less frequent ECG collection did not have a significant effect on model performance; however, changing the sampling paradigm to continuous ECG collection from a block sampling paradigm did lead to more robust classification of when a patient was awake.

Conclusions: Accurately recording sleep in a logistically simple way can provide insights into the well-being of SMI patients. Combining these insights with the objective medication ingestion records provided by AS would be of great value to SMI patients, as well as their caregivers and physicians. This research explores what amount of sensor data is required to accurately quantify sleep and some of the machine learning strategies that can ameliorate data limitations, providing guidance for the optimization of digital device design.

Funding: Otsuka Pharmaceutical Development & Commercialization, Inc.
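
The window-labeling step described in the methods (30-second PSG epochs merged into 5-minute windows by majority class) could look something like the sketch below. Several details are assumptions not stated in the abstract: that the epoch labels have already been reduced to “Sleep” or “Wake”, that an exact 5/5 tie is labeled “Wake”, and that a trailing partial window is dropped. The function name is hypothetical.

```python
def label_windows(epoch_labels, epochs_per_window=10):
    """Collapse 30-second epoch labels into 5-minute window labels.

    Ten 30-second epochs make one 5-minute window. A window is "Sleep"
    when strictly more than half of its epochs are "Sleep"; ties fall
    to "Wake" (an assumption). Leftover epochs that do not fill a whole
    window are ignored (also an assumption).
    """
    windows = []
    last_start = len(epoch_labels) - epochs_per_window
    for start in range(0, last_start + 1, epochs_per_window):
        chunk = epoch_labels[start:start + epochs_per_window]
        n_sleep = sum(1 for label in chunk if label == "Sleep")
        windows.append("Sleep" if n_sleep > len(chunk) / 2 else "Wake")
    return windows


# Toy usage: two full windows plus three leftover epochs.
epochs = ["Sleep"] * 7 + ["Wake"] * 3 + ["Wake"] * 6 + ["Sleep"] * 4 + ["Sleep"] * 3
window_labels = label_windows(epochs)
```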


Articles published on Sampling Paradigm

Dynamics of second-language learners' semantic memory networks: Evidence from a snowball sampling paradigm.

A deployable film method to enable replicable sampling of low-abundance environmental microbiomes

Denoising and Augmented Negative Sampling for Collaborative Filtering

Neural Network With a Preference Sampling Paradigm for Imbalanced Data Classification.

Integrating Mobile and Fixed-Site Black Carbon Measurements to Bridge Spatiotemporal Gaps in Urban Air Quality.

BosonSampling.jl: A Julia package for quantum multi-photon interferometry

The “bubbles”-study: validation of ultra-short scales for the assessment of positive mental health, life satisfaction, and perceived social support

Scaling and Masking: A New Paradigm of Data Sampling for Image and Video Quality Assessment

The "bubbles"-study: Validation of ultra-short scales for the assessment of depression, anxiety and stress symptoms.

Force modulation: A behavioural marker of mind-wandering.

Reducing the In band Network Telemetry overhead through the spatial sampling: Theory and experimental results

Eelbrain, a Python toolkit for time-continuous analysis with temporal response functions.

Dynamic multi-objective workflow scheduling for combined resources in cloud

A temporal shift reconstruction network for compressive video sensing

Attention Constraints and Learning in Categories

Sampling by Difference as a Method of Applying the Sampling Kantorovich Model in Digital Image Processing

Optimization of Sleep classification in Patients With Serious Mental Illness Using Accelerometer and ECG Data From a Wearable Patch

Shame on me for needing you: A multistudy examination of links between receiving interpersonal emotion regulation and experiencing shame.

The Effect of Situational Experiment Conditions on Hasty Decision Making in the 'Beads Task'.

Spectral Gap-Based Seismic Survey Design
