Comparison between SVM and Logistic Regression: Which One is Better to Discriminate?

The classification of individuals is a common problem in applied statistics. If X is a data set corresponding to a sample from an specific population in which observations belong to g different categories, the goal of classification methods is to determine to which of them a new observation will bel...

Descrizione completa

Dettagli Bibliografici
Autori principali: DIEGO ALEJANDRO SALAZAR, JORGE IVÁN VÉLEZ, JUAN CARLOS SALAZAR
Natura: Articolo
Lingua:English
Pubblicazione: Universidad Nacional de Colombia 2012-06-01
Serie:Revista Colombiana de Estadística
Soggetti:
Accesso online:http://www.scielo.org.co/scielo.php?script=sci_arttext&pid=S0120-17512012000200003&lng=en&tlng=en
Descrizione
Riassunto:The classification of individuals is a common problem in applied statistics. If X is a data set corresponding to a sample from an specific population in which observations belong to g different categories, the goal of classification methods is to determine to which of them a new observation will belong to. When g=2, logistic regression (LR) is one of the most widely used classification methods. More recently, Support Vector Machines (SVM) has become an important alternative. In this paper, the fundamentals of LR and SVM are described, and the question of which one is better to discriminate is addressed using statistical simulation. An application with real data from a microarray experiment is presented as illustration.
ISSN:0120-1751