2.5 Report File
The ROCS report file format appears as a tab-delimited file with the
following fields. Since the names of the query and the hits are of
indeterminate length, fixed size fields for these names could result
in loss of information. Unfortunately this gives a file that is hard
to read in a terminal session, but it can easily be read into a
spreadsheet program or into the data manager in VIDA.
- Name
- This is the name of the database molecule. If the database contains
multi-conformer molecules, the specific conformer index is appended to the
molecule name with an underscore.
- ShapeQuery
- The is the name of the query molecule. If the query is a
multi-conformer molecule, then the specific conformer index is appended to
the molecule name with an underscore.
- Rank
- The numerical ranking in the hitlist, based on the chosen score to
sort by. If no changes are made, this is based solely on shape Tanimoto. Can
be altered by using
-rankby command line switches. If the -stats
command line switch is used with 'best' or 'all', data is written into the
report file in the order that the search is performed. If no hitlist was used
in the calculation, this field will be 0 (zero).
- ShapeTanimoto
- This column gives the shape Tanimoto, a value between 0
and 1 as calculated by equation (1.6).
- Tversky(d)
- Tversky is calculated using equation (1.7) with
the database molecule as the main self-overlap with beta = 0.95.
- Tversky(q)
- Tversky is calculated using equation (1.7) with
the query molecule as the main self-overlap term with alpha = 0.95.
- ScaledColor
- In order to provide a color score somewhat independent
of the actual color force field definition, a scaled color value is
calculated by taking a hit's actual score value and dividing it by the
color score of the query molecule against itself. Since the query self-color
is a theoretical maximum value, the scaled color score is bounded between
0 and 1 like the Tanimoto coefficient.
- ComboScore
- To provide a score that includes both shape fit and color,
the shape Tanimoto is added to the scaled color score, resulting in the
combo score. This has a value between 0 and 2 and is the score used for
ranking the hitlist when the
-rankby combo command line switch is used.
- ColorScore
- This column provides the actual color score. Since differing
color force field files can use different strengths for the color forces and
since each molecule may have a different number of color atoms, there is
no upper bound on this score. By default, the color score is calculated
by looping over all the color atoms in the query molecule and summing the
single best color interaction with the hit molecule. This leads to scores that
mirror the one-to-one correspondence of features normally seen in pharmacophore
matching programs. To included all possible interactions in the color
score, use the
-allcolor command line switch.
- SubTan
- One additional score can be calculated by giving the
-subtan commandline argument. Since there is an additional
time cost for this calculation, it is not included by default. Subtan
is defined by taking the positions of the query and dbase molecule
at the final overlay and removing all dbase atoms greater than 1.5
Angstroms from any query atom. Then a shape Tanimoto calculation is
performed using these 2 structures and this Tanimoto coefficient is
recorded as SubTan. Note that this has the effect of raising scores for small
queries against much larger dbase molecules. In some respects, this is
similar to Tversky for a sub-shape match, but does result in
different rankings than Tversky. It is recommended that for searches
involving a small query against a dbase of large molecules that both
Tversky and SubTan be considered.
- Overlap
- This is the absolute value of volume overlap between the
query and the dbase molecule. The value is in arbitrary units, and is
most useful when using a grid as query.
- DBConformerIndex
- By default, the actual conformer index of each
hit is appended to the molecule title. If the
-conflabel
sdtag or -conflabel both is specified, the conformer index
will appear in this column.