Classification of microarray data needs a firm statistical basis. In principle, logistic regression can provide it, modeling the probability of class membership with (transforms of) linear combinations of explanatory variables. However, classical logistic regression does not work for microarrays, because there are generally far more variables than observations. One problem is multicollinearity: the estimating equations become singular and have no unique, stable solution. A second problem is over-fitting: a model may fit a training set well but perform badly when used to classify new data. We propose penalized likelihood as a solution to both problems. The values of the regression coefficients are constrained, much as in ridge regression. All variables play an equal role; there is no ad hoc selection of "most relevant" or "most expressed" genes. The dimension of the resulting system of equations is equal to the number of variables and is generally too large for most computers, but it can be reduced dramatically with the singular value decomposition of suitable matrices. The penalty is optimized with AIC (Akaike's Information Criterion), which is essentially a measure of prediction performance. We found that penalized logistic regression performs well on a public data set (the MIT ALL/AML data).
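As an illustration of the idea (this is a minimal sketch, not the authors' implementation), the following Python code fits ridge-penalized logistic regression by iteratively reweighted least squares (IRLS) on synthetic data with far more variables than observations. Because the penalized solution lies in the row space of the design matrix X, the SVD X = U S Vᵀ lets us solve an n-dimensional system for γ (with β = Vᵀᵀγ, i.e. β = V γ) instead of a p-dimensional one; the fixed penalty weight `lam` stands in for the value that AIC would select.

```python
import numpy as np

# Synthetic "microarray-like" data: n samples, p >> n variables.
rng = np.random.default_rng(0)
n, p = 40, 500
X = rng.normal(size=(n, p))
true_beta = np.zeros(p)
true_beta[:5] = 1.5                         # a few informative "genes"
y = (rng.random(n) < 1 / (1 + np.exp(-(X @ true_beta)))).astype(float)

# Reduced design: X = U diag(s) Vt, so X @ beta = R @ gamma with
# R = U diag(s) and beta = Vt.T @ gamma. Since the rows of Vt are
# orthonormal, ||beta|| = ||gamma|| and the ridge penalty is preserved.
U, s, Vt = np.linalg.svd(X, full_matrices=False)
R = U * s                                   # n x n reduced design matrix
lam = 1.0                                   # penalty weight (chosen by AIC in the paper)

# IRLS in the reduced space: solve (R'WR + lam I) gamma = R'(W eta + (y - mu)).
gamma = np.zeros(n)
for _ in range(100):
    eta = R @ gamma
    mu = 1 / (1 + np.exp(-eta))
    W = mu * (1 - mu)
    A = R.T @ (R * W[:, None]) + lam * np.eye(n)
    b = R.T @ (W * eta + (y - mu))
    gamma_new = np.linalg.solve(A, b)
    if np.max(np.abs(gamma_new - gamma)) < 1e-8:
        gamma = gamma_new
        break
    gamma = gamma_new

beta = Vt.T @ gamma                         # map back to p gene coefficients
acc = np.mean(((1 / (1 + np.exp(-(X @ beta)))) > 0.5) == (y == 1))
print(beta.shape, acc)
```

The n-by-n system replaces a p-by-p one, so the cost of each IRLS step is governed by the number of samples rather than the number of genes; the penalty term keeps the fit bounded even though the unpenalized problem is singular.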