GenPipes: an open-source framework for distributed and scalable genomic analyses.
Details
Serval ID
serval:BIB_52A465B393FA
Type
Article: article from journal or magazin.
Collection
Publications
Institution
Title
GenPipes: an open-source framework for distributed and scalable genomic analyses.
Journal
GigaScience
ISSN
2047-217X (Electronic)
ISSN-L
2047-217X
Publication state
Published
Issued date
01/06/2019
Peer-reviewed
Oui
Volume
8
Number
6
Language
english
Notes
Publication types: Journal Article
Publication Status: ppublish
Publication Status: ppublish
Abstract
With the decreasing cost of sequencing and the rapid developments in genomics technologies and protocols, the need for validated bioinformatics software that enables efficient large-scale data processing is growing.
Here we present GenPipes, a flexible Python-based framework that facilitates the development and deployment of multi-step workflows optimized for high-performance computing clusters and the cloud. GenPipes already implements 12 validated and scalable pipelines for various genomics applications, including RNA sequencing, chromatin immunoprecipitation sequencing, DNA sequencing, methylation sequencing, Hi-C, capture Hi-C, metagenomics, and Pacific Biosciences long-read assembly. The software is available under a GPLv3 open source license and is continuously updated to follow recent advances in genomics and bioinformatics. The framework has already been configured on several servers, and a Docker image is also available to facilitate additional installations.
GenPipes offers genomics researchers a simple method to analyze different types of data, customizable to their needs and resources, as well as the flexibility to create their own workflows.
Here we present GenPipes, a flexible Python-based framework that facilitates the development and deployment of multi-step workflows optimized for high-performance computing clusters and the cloud. GenPipes already implements 12 validated and scalable pipelines for various genomics applications, including RNA sequencing, chromatin immunoprecipitation sequencing, DNA sequencing, methylation sequencing, Hi-C, capture Hi-C, metagenomics, and Pacific Biosciences long-read assembly. The software is available under a GPLv3 open source license and is continuously updated to follow recent advances in genomics and bioinformatics. The framework has already been configured on several servers, and a Docker image is also available to facilitate additional installations.
GenPipes offers genomics researchers a simple method to analyze different types of data, customizable to their needs and resources, as well as the flexibility to create their own workflows.
Keywords
bioinformatics, frameworks, genomics, pipeline, workflow, workflow management systems
Pubmed
Open Access
Yes
Create date
24/06/2019 7:11
Last modification date
30/04/2021 6:10