Get a Quality Score For Your Genome

We evaluate the quality of your submitted genomes based of various measures. See Publication!
NCBI | Broad | PATRIC | KBase

30K Genomes

We examined the microbial DNA sequences available for complete, draft, and SRA genomes in GenBank as well as three other major public databases and assigned quality scores for more than 30,000 prokaryotic genome sequences.

Scoring Algorithm

Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes.

GenBank, PATRIC, Broad, KBase

We collected genomes from four major sources NCBI GenBank, Broad Institute, Virginia Bioinformatics Institute's PATRIC and DOE's KBase. We examined a total of 32000 genomes including finished and draft prokaryotic genome sequences.

Genome Quality Score Cards

Almost (~88%) of the genomes had quality scores of 0.8 or better (out of 1.0) and can be safely used for standard comparative genomics analysis, although only 6% of the genomes had a perfect quality score.