Value Addition Statistics

      

Value Addition Statistics



Accuracy Summary
 % Incorrect% SuspiciousOverall % Correct
January 20080.5%0.8%98.7%
July 20070.1%0.7%99.2%
January 2007 0.3%1.0%98.7%
 
Completeness Summary
 Functional NamesGenetic NamesGO assignmentEC #
January 200899.1%87.2%90.2%94.5%
July 200799.1%89.3%90.5%95.4%
January 2007100%78.3%78.2%55.1%
 
Consistency Summary
 Consistency Measure
January 200876.9%
July 200780.1%
January 200780.2%

Accuracy

JCVI will conduct a sampling of Pathema genes to test for accuracy. This will be done by searching the BRC genes against 100 randomly chosen TIGRFAM equivalog HMM to provide the set of genes that should have the same function. A human curator will inspect a small set of results and make a subjective assessment of the correctness of the functional name assignment. The statistic will be reported as a percentage of "correct" functional name annotations.

Completeness

JCVI will perform an exhaustive search of Pathema genes against TIGRFAM equivalog HMMs to identify sets of genes that have the same function. The TIGRFAMs members that contain functional names, genetic names, GO ids, and EC#'s will serve as the source of datatypes that are expected to appear in the BRC genes. Completeness will be measured by counting the number of functional names, genetic names, GO ids, and EC#'s that have been assigned. Note that this metric does not attempt to assess the correctness of the annotations, only that an annotation is provided. The completeness statistic will be reported as a percentage of possible annotations, based on the metric: (number of actual annotations) / (number of expected annotations).

Consistency

JCVI will perform an exhaustive search of Pathema genes against TIGRFAM equivalog HMMs to identify sets of genes that have the same function. Each set of genes will be expected to have consistent functional names. Consistency will be measured for functional name assignments within Pathema. The functional names from Pathema will only be compared to each other, the names will not be compared against the TIGRFam name. The consistency statistic will be reported as the likelihood of any 2 genes having the same annotated text string.



Contact Us | ©1999-2009 The J. Craig Venter Institute