Proceedings of the 38th Annual Conference of JSAI (2024)
Online ISSN: 2758-7347
Session ID: 1B4-GS-2-01

Automated Model Performance Evaluation via Contrastive Learning of Distilled Surrogate Model
*Makoto KAWANO, Kazuki KAWAMURA
Abstract

Machine learning systems operated in the real world suffer from performance degradation: data distribution shift during operation lowers accuracy below the level observed at model validation. Detecting this degradation enables appropriate measures such as model retraining or structural revision. However, continuously labeling operational data is unrealistic because of its high cost. This study therefore focuses on estimating a model's performance on unlabeled test data. Since accuracy on test data cannot be computed directly without labels, previous studies have estimated it using distances or metrics that correlate with test accuracy. One such study uses adversarial accuracy, but it requires adversarial training to be carried out jointly with the model under evaluation, making it inapplicable to pre-trained models. To address this, we propose CoLDS, a method that estimates the test performance of an arbitrary model without labels by converting the model under evaluation into a surrogate model via knowledge distillation and performing adversarial training on the surrogate. This paper evaluates the effectiveness of CoLDS through experiments and reports the results.
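
Note: the abstract gives no implementation details, so the following is only a minimal sketch of the pipeline it describes, not the authors' CoLDS implementation. It assumes a PyTorch setting with a frozen pre-trained "teacher" (the model under evaluation), a trainable surrogate network, and a loader of unlabeled operational inputs; the distillation temperature, FGSM attack, epsilon, and the use of the surrogate's own pseudo-labels are all illustrative assumptions.

    # Sketch only: distill the evaluated model into a surrogate on unlabeled data,
    # adversarially train the surrogate, and use its adversarial accuracy as a
    # label-free proxy signal correlated with test accuracy.
    import torch
    import torch.nn.functional as F

    def distill_surrogate(teacher, surrogate, unlabeled_loader, epochs=5, T=4.0, lr=1e-3):
        """Train the surrogate to mimic the frozen teacher via KL on softened logits."""
        teacher.eval()
        opt = torch.optim.Adam(surrogate.parameters(), lr=lr)
        for _ in range(epochs):
            for (x,) in unlabeled_loader:            # unlabeled batches: inputs only
                with torch.no_grad():
                    t_logits = teacher(x)
                s_logits = surrogate(x)
                loss = F.kl_div(
                    F.log_softmax(s_logits / T, dim=1),
                    F.softmax(t_logits / T, dim=1),
                    reduction="batchmean",
                ) * T * T
                opt.zero_grad()
                loss.backward()
                opt.step()
        return surrogate

    def fgsm_perturb(model, x, y, eps=0.03):
        """Single-step FGSM perturbation; assumes inputs scaled to [0, 1]."""
        x_adv = x.clone().detach().requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        loss.backward()
        return (x_adv + eps * x_adv.grad.sign()).clamp(0.0, 1.0).detach()

    def adversarially_train(surrogate, unlabeled_loader, epochs=3, eps=0.03, lr=1e-4):
        """Fine-tune the surrogate on adversarial examples built against its own
        pseudo-labels, since no ground-truth labels exist during operation."""
        opt = torch.optim.Adam(surrogate.parameters(), lr=lr)
        for _ in range(epochs):
            for (x,) in unlabeled_loader:
                surrogate.eval()
                with torch.no_grad():
                    pseudo = surrogate(x).argmax(dim=1)
                x_adv = fgsm_perturb(surrogate, x, pseudo, eps)
                surrogate.train()
                opt.zero_grad()
                loss = F.cross_entropy(surrogate(x_adv), pseudo)
                loss.backward()
                opt.step()
        return surrogate

    def adversarial_accuracy(surrogate, unlabeled_loader, eps=0.03):
        """Fraction of adversarial examples on which the surrogate keeps its
        original prediction; used as a label-free proxy for test accuracy."""
        surrogate.eval()
        kept, total = 0, 0
        for (x,) in unlabeled_loader:
            with torch.no_grad():
                pseudo = surrogate(x).argmax(dim=1)
            x_adv = fgsm_perturb(surrogate, x, pseudo, eps)
            with torch.no_grad():
                kept += (surrogate(x_adv).argmax(dim=1) == pseudo).sum().item()
            total += x.size(0)
        return kept / total

    # Usage sketch:
    #   surrogate = distill_surrogate(teacher, surrogate, loader)
    #   surrogate = adversarially_train(surrogate, loader)
    #   proxy = adversarial_accuracy(surrogate, loader)   # correlate with true test accuracy

How the proxy score is mapped to an actual accuracy estimate (e.g., via a fitted correlation across shifted datasets) is not specified in the abstract and is left out of this sketch.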

© 2024 The Japanese Society for Artificial Intelligence