Journal article

+ 1 other files

Estimating and accounting for genotyping errors in RAD-seq experiments

  • Bresadola, Luisa Department of Biology University of Fribourg Fribourg Switzerland
  • Link, Vivian Department of Biology University of Fribourg Fribourg Switzerland - Swiss Institute of Bioinformatics Fribourg Switzerland
  • Buerkle, C. Alex Department of Botany University of Wyoming Laramie WY USA
  • Lexer, Christian Department of Botany and Biodiversity Research University of Vienna Vienna Austria
  • Wegmann, Daniel Department of Biology University of Fribourg Fribourg Switzerland - Swiss Institute of Bioinformatics Fribourg Switzerland
Show more…
Published in:
  • Molecular Ecology Resources. - 2020, vol. 20, no. 4, p. 856–870
English In non-model organisms, evolutionary questions are frequently addressed using reduced representation sequencing techniques due to their low cost, ease of use, and because they do not require genomic resources such as a reference genome. However, evidence is accumulating that such techniques may be affected by specific biases, questioning the accuracy of obtained genotypes, and as a consequence, their usefulness in evolutionary studies. Here, we introduce three strategies to estimate genotyping error rates from such data: through the comparison to high quality genotypes obtained with a different technique, from individual replicates, or from a population sample when assuming Hardy-Weinberg equilibrium. Applying these strategies to data obtained with Restriction site Associated DNA sequencing (RAD- seq), arguably the most popular reduced representation sequencing technique, revealed per-allele genotyping error rates that were much higher than sequencing error rates, particularly at heterozygous sites that were wrongly inferred as homozygous. As we exemplify through the inference of genome-wide and local ancestry of well characterized hybrids of two Eurasian poplar (Populus) species, such high error rates may lead to wrong biological conclusions. By properly accounting for these error rates in downstream analyses, either by incorporating genotyping errors directly or by recalibrating genotype likelihoods, we were nevertheless able to use the RAD-seq data to support biologically meaningful and robust inferences of ancestry among Populus hybrids. Based on these findings, we strongly recommend carefully assessing genotyping error rates in reduced representation sequencing experiments, and to properly account for these in downstream analyses, for instance using the tools presented here.
Faculté des sciences et de médecine
Département de Biologie
  • English
Biological sciences
License undefined
Persistent URL
Other files


Document views: 38 File downloads:
  • weg_eag.pdf: 184
  • weg_eag_sm.pdf: 41