Document Type

Article

Original Publication Date

2010

Journal/Book/Conference Title

Annals of Operations Research

Volume

174

Issue

1

First Page

147

Last Page

168

DOI

10.1007/s10479-008-0424-0

Comments

This is the peer reviewed version. The final publication is available at Springer via http://dx.doi.org/10.1007/s10479-008-0424-0

Date of Submission

June 2014

Abstract

Classification is concerned with the development of rules for the allocation of observations to groups, and is a fundamental problem in machine learning. Much of previous work on classification models investigates two-group discrimination. Multi-category classification is less-often considered due to the tendency of generalizations of two-group models to produce misclassification rates that are higher than desirable. Indeed, producing “good” two-group classification rules is a challenging task for some applications, and producing good multi-category rules is generally more difficult. Additionally, even when the “optimal” classification rule is known, inter-group misclassification rates may be higher than tolerable for a given classification model. We investigate properties of a mixed-integer programming based multi-category classification model that allows for the pre-specification of limits on inter-group misclassification rates. The mechanism by which the limits are satisfied is the use of a reserved judgment region, an artificial category into which observations are placed whose attributes do not sufficiently indicate membership to any particular group. The method is shown to be a consistent estimator of a classification rule with misclassification limits, and performance on simulated and real-world data is demonstrated.

Rights

Copyright © Springer

Is Part Of

VCU Statistical Sciences and Operations Research Faculty Publications

Share

COinS