Linking genomics and population genetics with R.

Details

Serval ID
serval:BIB_90179BE9C8A4
Type
Article: article from journal or magazin.
Collection
Publications
Institution
Title
Linking genomics and population genetics with R.
Journal
Molecular ecology resources
Author(s)
Paradis E., Gosselin T., Goudet J., Jombart T., Schliep K.
ISSN
1755-0998 (Electronic)
ISSN-L
1755-098X
Publication state
Published
Issued date
04/2017
Volume
17
Number
1
Pages
54-66
Language
english
Notes
Publication types: Journal Article
Publication Status: ppublish
Abstract
Population genetics and genomics have developed and been treated as independent fields of study despite having common roots. The continuous progress of sequencing technologies is contributing to (re-)connect these two disciplines. We review the challenges faced by data analysts and software developers when handling very big genetic data sets collected on many individuals. We then expose how r, as a computing language and development environment, proposes some solutions to meet these challenges. We focus on some specific issues that are often encountered in practice: handling and analysing single-nucleotide polymorphism data, handling and reading variant call format files, analysing haplotypes and linkage disequilibrium and performing multivariate analyses. We illustrate these implementations with some analyses of three recently published data sets that contain between 60 000 and 1 000 000 loci. We conclude with some perspectives on future developments of r software for population genomics.

Keywords
Biostatistics/methods, Computational Biology/methods, Genetics, Population/methods, Genomics/methods, Haplotypes, Linkage Disequilibrium, Polymorphism, Single Nucleotide, Software
Pubmed
Open Access
Yes
Create date
12/01/2017 16:27
Last modification date
20/08/2019 15:53
Usage data