CoreGenesUniqueGenes:CGUG, and enhancement of "CoreGenes3.0"

CoreGenes has been updated to allow the analysis of genes or proteins in common from sets of up to five small bacterial genomes, in addition to the viral, chloroplast and mitochondrial genomes.

This version (CGUG) includes features that allow the in silico parsing of the proteome, particularly analzying for unique proteins found in pairs of genomes, as well as providing a count of proteins in common amongst five genomes. These were developed at the request of Dr. Andrew Kropinski (Public Health Agency of Canada, Queen's University, University of Guelph). It has been validated and applied in a collaboration for the re-examination and re-classification of bacteriophages.

In brief, the new in silico proteome analysis features are as follows:

• Tallies the total number of genes in common between genomes compared.

• Displays the percent value of genes in common with a specific genome.

• Determines unique genes contained in a pair of proteomes.

• (Caveat: Currently only supports pairs of genomes to analyze for this function)

• (Multiple genome comparison for unique genes will be available in the future)

• (Applies to genomes with sizes up to ~ 2.0 Megabase)

• Note: If you provide an email address, the window can be closed and the results will be emailed to you.

NCBI Accession number for Reference genome :
NCBI Accession number for genome1:
NCBI Accession number for genome2:
NCBI Accession number for genome3:
NCBI Accession number for genome4:
Please enter blastp threshold score in the box below
For Batch Analysis, please enter your email address in the box below
Email Address:

Click here to compare genomes that are not archived at NCBI

1. Mahadevan, P., King, J.F. and Seto, D. (2009). CGUG: in silico proteome and genome parsing tool for the determination of "core" and unique genes in the analysis of genomes up to ca. 1.9 Mb. BMC Research Methods 2:168.
2. Mahadevan, P., King, J.F. and Seto, D. (2009). Data mining pathogen genomes using GeneOrder, CoreGenes and CGUG: gene order, synteny, and in silico proteomes. International Journal of Computational Biology and Drug Design 2:100-114.
3. Zafar, N., Mazumder, R. and Seto, D. (2002). CoreGenes: A computational tool for identifying and cataloging "core" genes in a set of small genomes. BMC Bioinformatics 3:12.
E-mail for help