Results
Opening the Result Table
To view the results of an analysis job:
- Select a completed analysis job in the
Jobs
tab. - Under
Edit Job:
, next to the textResult:
, click theView...
button. - A browser tab will open with the results displayed as a table. Alternatively, you can download a CSV file containing the corresponding results by clicking the link next to
View...
.
Important Table Contents
The table in the browser contains an entry for each organism that had at least one hit.
A “hit” refers to a base sequence of length 31 that uniquely belongs to the genome of the corresponding organism.
The table includes the following important columns:
Line
: Used to sort organisms primarily by relatedness.Name
: The name of the organism.Rank
: The biological rank of the organism. Typically, the ranksgenus
,species
, andstrain
are required for unambiguous identification of organisms. Higher ranks such asorder
,class
, orphylum
typically group too many related species together. An exception are viruses, which are sometimes not (clearly) structured via ranks.Tax Id
: The unique identifier of an organism from the globally used NCBI taxonomy.k-mers
: The number of hits for the corresponding organism. Important: Low values usually do not indicate the presence of the organism in the sample. Low values can instead be caused by errors or artifacts in the DNA sequencing process. For “real hits”, the number ofk-mers
is typcially at least 2 to 3 orders of magnitude above “false hit values”. Examples of “real hits” areRickettsia
,Borreliella burgdorferi
andBorrelia miyamotoi
in this result of a tick analysis.Unique k-mers
: The number of distinct hits for the corresponding organism. In contrast tok-mers
, repeated counts of the same base sequence as hits are eliminated here. The value should (except for viruses) not be considerably smaller than the value fork-mers
. Otherwise, the two values are inconsistent, and no conclusions about the presence of the organism in the sample can be drawn.U. k-mers / Exp.
: A measure assessing the consistency of the valuesUnique k-mers
andk-mers
. The measure usually resides between 0 and 1. A result close to 1 indicates high consistency. A result far below 1 indicates low consistency — this also applies to viruses.Max C. Length
: The length of the longest base sequence (a “contig”) that can be uniquely assigned to the organism. The value is at least 31, as only hits of this length or longer are considered by the analysis. Values significantly greater than 31 further support the presence of the corresponding organism in the sample.
All About the Table Columns
The GeneStrip README file contains a
detailed, technical description of all table columns for the corresponding CSV file.