Email updates

Keep up to date with the latest news and content from Microbial Informatics and Experimentation and BioMed Central.

Open Access Open Badges Research

Computational genomics-proteomics and Phylogeny analysis of twenty one mycobacterial genomes (Tuberculosis & non Tuberculosis strains)

Fathiah Zakham1, Othmane Aouane23, David Ussery4, Abdelaziz Benjouad2 and Moulay Mustapha Ennaji1*

Author Affiliations

1 Laboratoire de Virologie et Hygiène & Microbiologie, Faculté des Sciences et Techniques, BP 146, Mohammedia, 20650, Morocco

2 Faculté des Sciences, Université Mohammed V-Agdal, Rabat, Morocco

3 Experimental physik, Universität des Saarlandes, Postfach 151150, 66041, Saarbrücken, Germany

4 Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, Denmark

For all author emails, please log on.

Microbial Informatics and Experimentation 2012, 2:7  doi:10.1186/2042-5783-2-7

Published: 28 August 2012



The genus Mycobacterium comprises different species, among them the most contagious and infectious bacteria. The members of the complex Mycobacterium tuberculosis are the most virulent microorganisms that have killed human and other mammals since millennia. Additionally, with the many different mycobacterial sequences available, there is a crucial need for the visualization and the simplification of their data. In this present study, we aim to highlight a comparative genome, proteome and phylogeny analysis between twenty-one mycobacterial (Tuberculosis and non tuberculosis) strains using a set of computational and bioinformatics tools (Pan and Core genome plotting, BLAST matrix and phylogeny analysis).


Considerably the result of pan and core genome Plotting demonstrated that less than 1250 Mycobacterium gene families are conserved across all species, and a total set of about 20,000 gene families within the Mycobacterium pan-genome of twenty one mycobacterial genomes.

Viewing the BLAST matrix a high similarity was found among the species of the complex Mycobacterium tuberculosis and less conservation is found with other slow growing pathogenic mycobacteria.

Phylogeny analysis based on both protein conservation, as well as rRNA clearly resolve known relationships between slow growing mycobacteria.


Mycobacteria include important pathogenic species for human and animals and the Mycobacterium tuberculosis complex is the most cause of death of the humankind. The comparative genome analysis could provide a new insight for better controlling and preventing these diseases.

BLAST matrix; Comparative genome analysis; Evolution; Mycobacterium tuberculosis; Pan- core genome; Phylogeny