Journal article

+ 3 other files

DeltaRpkm: an R package for a rapid detection of differential gene presence between related bacterial genomes

  • Akarsu, Hatice Department of Biology, University of Fribourg, Fribourg, Switzerland - Swiss Institute of Bioinformatics, BUGFri group, Fribourg, Switzerland
  • Aguilar-Bultet, Lisandra Swiss Institute of Bioinformatics, BUGFri group, Fribourg, Switzerland - Institute of Veterinary Bacteriology, Vetsuisse Faculty, University of Bern, Switzerland - Graduate School for Cellular and Biomedical Sciences, University of Bern, Switzerland - Currently at the Department of Infectious Diseases and Hospital Epidemiology, University Hospital Basel, Switzerland
  • Falquet, Laurent Department of Biology, University of Fribourg, Fribourg, Switzerland - Swiss Institute of Bioinformatics, BUGFri group, Fribourg, Switzerland
    02.12.2019
Published in:
  • BMC Bioinformatics. - 2019, vol. 20, no. 1, p. 621
English Comparative genomics has seen the development of many software performing the clustering, polymorphism and gene content analysis of genomes at different phylogenetic levels (isolates, species). These tools rely on de novo assembly and/or multiple alignments that can be computationally intensive for large datasets. With a large number of similar genomes in particular, e.g., in surveillance and outbreak detection, assembling each genome can become a redundant and expensive step in the identification of genes potentially involved in a given clinical feature.Results: We have developed deltaRpkm, an R package that performs a rapid differential gene presence evaluation between two large groups of closely related genomes. Starting from a standard gene count table, deltaRpkm computes the RPKM per gene per sample, then the inter-group δRPKM values, the corresponding median δRPKM (m) for each gene and the global standard deviation value of m (sm). Genes with m >  = 2  ∗ sm (standard deviation s of all the m values) are considered as “differentially present” in the reference genome group. Our simple yet effective method of differential RPKM has been successfully applied in a recent study published by our group (N = 225 genomes of Listeria monocytogenes) (Aguilar-Bultet et al. Front Cell Infect Microbiol 8:20, 2018).Conclusions: To our knowledge, deltaRpkm is the first tool to propose a straightforward inter-group differential gene presence analysis with large datasets of related genomes, including non-coding genes, and to output directly a list of genes potentially involved in a phenotype.
Faculty
Faculté des sciences et de médecine
Department
Département de Biologie
Language
  • English
Classification
Biological sciences
License
License undefined
Identifiers
Persistent URL
https://folia.unifr.ch/unifr/documents/308329
Other files

Statistics

Document views: 47 File downloads:
  • fal_drp.pdf: 86
  • fal_drp_sm1.pdf: 58
  • fal_drp_sm2.txt: 34
  • fal_drp_sm3.pdf: 74