Evaluation of the lasso and the elastic net in genome-wide association studies
Linköping University, Department of Computer and Information Science, Statistics. Linköping University, The Institute of Technology. University of Natural Resources and Life Sciences, Vienna, Austria.
University of Natural Resources and Life Sciences, Vienna, Austria.
Qualitas AG, Zug, Switzerland.
ZuchtData EDV-Dienstleistungen GmbH, Vienna, Austria.
2013 (English). In: Frontiers in Genetics, ISSN 1664-8021, E-ISSN 1664-8021, Vol. 4, no 270
Article in journal (Refereed), Published
Abstract [en]

The number of publications reporting genome-wide association studies (GWAS) has increased dramatically. Penalized regression approaches have been developed to overcome the challenges posed by high-dimensional data, but these methods are relatively new in the GWAS field. In this study we compared the statistical performance of two methods, the least absolute shrinkage and selection operator (lasso) and the elastic net, on two simulated data sets and one real data set from a 50 K genome-wide single nucleotide polymorphism (SNP) panel of 5570 Fleckvieh bulls. The first simulated data set displays moderate to high linkage disequilibrium between SNPs, whereas the second simulated data set, from the QTLMAS 2010 workshop, is biologically more complex. We used cross-validation to find the optimal value of the regularization parameter λ under two criteria: the minimum mean squared error (minMSE) and the minimum MSE plus one standard error of the minimum MSE (minMSE + 1SE). The optimal λ values were used for variable selection. Based on the first simulated data set, we found that minMSE in general selected too many SNPs. At minMSE + 1SE, the lasso did not produce any false positives, but selected too few correct SNPs. The elastic net provided the best compromise between few false positives and many correct selections when the penalty weight α was around 0.1. However, in our simulation setting, this α value did not result in the lowest minMSE + 1SE. After correction for population structure, the lasso and the elastic net selected 82 and 161 SNPs, respectively, from the QTLMAS 2010 data. In the Fleckvieh data set, after population structure correction, the lasso and the elastic net identified from 1291 to 1966 important SNPs for milk fat content, with major peaks on chromosomes 5, 14, 15, and 20. Hence, we conclude that it is important to analyze GWAS data with both the lasso and the elastic net, and that an alternative tuning criterion to minimum MSE is needed for variable selection.
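
The two tuning criteria named in the abstract can be illustrated with a minimal sketch. It assumes the common "one standard error" convention, in which the minMSE + 1SE choice is the largest λ whose mean cross-validated MSE lies within one standard error of the minimum; the abstract does not spell out this detail, so treat it as an assumption. The function name `select_lambdas` and the per-fold MSE values are hypothetical and are not taken from the paper.

```python
from math import sqrt
from statistics import mean, stdev


def select_lambdas(lambdas, fold_mse):
    """Return (lambda at minMSE, lambda at minMSE + 1SE).

    lambdas  -- candidate regularization values
    fold_mse -- fold_mse[i] holds the per-fold CV errors for lambdas[i]
    """
    # Mean CV error and its standard error for every candidate lambda.
    cv_mean = [mean(errors) for errors in fold_mse]
    cv_se = [stdev(errors) / sqrt(len(errors)) for errors in fold_mse]

    # minMSE: the lambda with the lowest mean CV error.
    i_min = min(range(len(lambdas)), key=lambda i: cv_mean[i])

    # minMSE + 1SE (assumed convention): the largest lambda whose mean
    # error does not exceed the minimum plus one standard error.
    threshold = cv_mean[i_min] + cv_se[i_min]
    lam_1se = max(lam for lam, m in zip(lambdas, cv_mean) if m <= threshold)

    return lambdas[i_min], lam_1se


# Hypothetical 3-fold CV errors for five candidate lambdas.
example_lambdas = [0.01, 0.05, 0.1, 0.5, 1.0]
example_fold_mse = [
    [1.10, 1.20, 1.15],
    [1.00, 1.10, 1.05],
    [1.02, 1.12, 1.07],
    [1.20, 1.30, 1.25],
    [1.50, 1.60, 1.55],
]
lam_min, lam_1se = select_lambdas(example_lambdas, example_fold_mse)
print(lam_min, lam_1se)
```

Because a larger λ penalizes more heavily, the minMSE + 1SE choice yields a sparser model than minMSE, which is consistent with the abstract's observation that minMSE selected too many SNPs while minMSE + 1SE selected few.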

Place, publisher, year, edition, pages
Frontiers Research Foundation, 2013. Vol. 4, no 270
Keyword [en]
GWAS, cattle, elastic net, lasso, population structure, simulation
National Category
Genetics
Identifiers
URN: urn:nbn:se:liu:diva-105345
DOI: 10.3389/fgene.2013.00270
PubMedID: 24363662
OAI: oai:DiVA.org:liu-105345
DiVA: diva2:706038
Available from: 2014-03-18 Created: 2014-03-18 Last updated: 2017-12-05
Bibliographically approved

Open Access in DiVA

fulltext (1407 kB), 175 downloads
File information
File name: FULLTEXT01.pdf
File size: 1407 kB
Checksum SHA-512: a0ca4338ba127ba1cbcdc1ee201faa5a9a35bc9bcf16595faa0f90e70ae896825421700158f403feffdabef7c55a901e5ddf7011dcdcb41aea66e82866b71f03
Type: fulltext
Mimetype: application/pdf

By author/editor
Waldmann, Patrik