Results

Opening the Result Table

To view the results of an analysis job:

Select a completed analysis job in the Jobs tab.
Under Edit Job:, next to the text Result:, click the View... button.
A browser tab will open with the results displayed as a table. Alternatively, you can download a CSV file containing the corresponding results by clicking the link next to View....

Important Table Contents

The table in the browser contains an entry for each organism that had at least one hit.
A “hit” refers to a base sequence of length 31 that uniquely belongs to the genome of the corresponding organism.

The table includes the following important columns:

Line: Used to sort organisms primarily by relatedness.
Name: The name of the organism.
Rank: The biological rank of the organism. Typically, the ranks genus, species, and strain are required for unambiguous identification of organisms. Higher ranks such as order, class, or phylum typically group too many related species together. An exception are viruses, which are sometimes not (clearly) structured via ranks.
Tax Id: The unique identifier of an organism from the globally used NCBI taxonomy.
k-mers: The number of hits for the corresponding organism. Important: Low values usually do not indicate the presence of the organism in the sample. Low values can instead be caused by errors or artifacts in the DNA sequencing process. For “real hits”, the number of k-mers is typcially at least 2 to 3 orders of magnitude above “false hit values”. Examples of “real hits” are Rickettsia, Borreliella burgdorferi and Borrelia miyamotoi in this result of a tick analysis.
Unique k-mers: The number of distinct hits for the corresponding organism. In contrast to k-mers, repeated counts of the same base sequence as hits are eliminated here. The value should (except for viruses) not be considerably smaller than the value for k-mers. Otherwise, the two values are inconsistent, and no conclusions about the presence of the organism in the sample can be drawn.
U. k-mers / Exp.: A measure assessing the consistency of the values Unique k-mers and k-mers. The measure usually resides between 0 and 1. A result close to 1 indicates high consistency. A result far below 1 indicates low consistency — this also applies to viruses.
Max C. Length: The length of the longest base sequence (a “contig”) that can be uniquely assigned to the organism. The value is at least 31, as only hits of this length or longer are considered by the analysis. Values significantly greater than 31 further support the presence of the corresponding organism in the sample.

All About the Table Columns

The GeneStrip README file contains a
detailed, technical description of all table columns for the corresponding CSV file.

Genestrip Web Help Pages

Results

Opening the Result Table

Important Table Contents

All About the Table Columns

Continue to “Further Information” …