Publication details

Journal Article

General framework for binary classification on top samples

Adam L., Mácha V., Šmídl Václav, Pevný T.

: Optimization Methods & Software vol.37, 5 (2022), p. 1636-1667

: GA18-21409S, GA ČR

: general framework, classification, ranking, accuracy at the top, Neyman–Pearson, Pat&Mat

: 10.1080/10556788.2021.1965601

: http://library.utia.cas.cz/separaty/2022/AS/smidl-0551866.pdf

: https://www.tandfonline.com/doi/full/10.1080/10556788.2021.1965601

(eng): Many binary classification problems minimize misclassification above (or below) a threshold. We show that instances of ranking problems, accuracy at the top, or hypothesis testing may be written in this form. We propose a general framework to handle these classes of problems and show which formulations (both known and newly proposed) fall into this framework. We provide a theoretical analysis of this framework and mention selected possible pitfalls the formulations may encounter. We show the convergence of the stochastic gradient descent for selected formulations even though the gradient estimate is inherently biased. We suggest several numerical improvements, including the implicit derivative and stochastic gradient descent. We provide an extensive numerical study.

: BC

: 10102