Abstract

Stochastic optimization of the Area Under the Precision-Recall Curve (AUPRC) is a crucial problem in machine learning. Despite extensive studies on AUPRC optimization, its generalization remains an open problem. In this work, we present the first study of algorithm-dependent generalization for stochastic AUPRC optimization. Three obstacles stand in the way. First, our consistency analysis shows that most existing stochastic estimators are biased under commonly used biased sampling strategies. To address this issue, we propose a stochastic estimator with sampling-rate-invariant consistency and reduce the consistency error by estimating full-batch scores with a score memory. Second, standard techniques for algorithm-dependent generalization analysis cannot be applied directly to listwise losses. To fill this gap, we extend model stability from instance-wise losses to listwise losses. Third, AUPRC optimization involves a compositional optimization problem, which complicates the computation; we reduce this computational complexity via matrix spectral decomposition. Building on these techniques, we derive the first algorithm-dependent generalization bound for AUPRC optimization. Motivated by the theoretical results, we propose a generalization-induced learning framework that improves AUPRC generalization by equivalently enlarging the batch size and the number of valid training examples. Experiments on image retrieval and long-tailed classification demonstrate the effectiveness and soundness of our framework.
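
To make the score-memory idea concrete, below is a minimal PyTorch sketch of a smoothed average-precision (AUPRC surrogate) estimator that ranks mini-batch positives against cached full-batch scores instead of against the mini-batch alone. This is an illustrative sketch only: the function name `soft_ap_with_memory`, the sigmoid temperature `tau`, and the memory-update scheme are assumptions for exposition, not the paper's exact algorithm.

```python
import torch

def soft_ap_with_memory(scores_batch, idx_batch, labels_all, score_memory, tau=0.01):
    """Smoothed Average Precision (AUPRC surrogate) loss for a mini-batch.

    Each positive anchor in the batch is ranked against full-batch scores
    held in a score memory, rather than against the mini-batch alone.

    scores_batch : (B,) fresh model scores for the mini-batch examples
    idx_batch    : (B,) dataset indices of those examples
    labels_all   : (N,) binary labels for the whole training set
    score_memory : (N,) cached scores for the whole training set
    tau          : temperature of the sigmoid surrogate for the 0/1 indicator
    """
    # Refresh the cached scores for this batch (no gradient through the cache).
    score_memory = score_memory.clone()
    score_memory[idx_batch] = scores_batch.detach()

    pos_mask_batch = labels_all[idx_batch].bool()
    if not pos_mask_batch.any():
        return scores_batch.sum() * 0.0  # no positives: zero loss, keeps the graph

    anchors = scores_batch[pos_mask_batch]                   # (P,) positive anchors
    # Smoothed indicator 1[s_j >= s_i], evaluated against the full-batch memory.
    diff = score_memory.unsqueeze(0) - anchors.unsqueeze(1)  # (P, N)
    ind = torch.sigmoid(diff / tau)

    pos_mask_all = labels_all.bool()
    rank_pos = ind[:, pos_mask_all].sum(dim=1)  # positives ranked above each anchor
    rank_all = ind.sum(dim=1)                   # all examples ranked above each anchor
    ap = (rank_pos / rank_all.clamp(min=1e-8)).mean()
    return 1.0 - ap  # minimize loss <=> maximize smoothed AP

# Usage sketch with made-up sizes: N = dataset size, B = batch size.
N, B = 1000, 32
labels = torch.randint(0, 2, (N,)).float()
memory = torch.zeros(N)
scores = torch.randn(B, requires_grad=True)
idx = torch.randperm(N)[:B]
loss = soft_ap_with_memory(scores, idx, labels, memory)
loss.backward()
```

Because the denominator of each precision term sums over all N cached scores rather than only the B mini-batch scores, the estimate does not depend on the positive/negative ratio sampled into the batch, which is in the spirit of the sampling-rate-invariant consistency described above; the residual consistency error then comes only from the staleness of the cached scores.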
