A description of the command line interface can be obtained by
executing tautomers with -help all.
prompt> tautomers --help all
will generate the following output:
Complete parameter list
tautomer
-all : Enumerate all (up to level 5) tautomers
-can : Write tautomers in canonical SMILES
-ch3 : Tautomerize alpha methyl/methylene groups
-count : Only count rather than generate the states
-in : Input filename
-kekule : Write Kekule structures
-level : Acceptable pseudo-enegy level of tautomers
-max : Maximum number of tautomers per molecule
-out : Output filename
-param : parameter file of tautomers settings
-paramfile : Parameter filename for output
-prefix : Prefix to use to name output files
-reasonable : Returns a reasonable looking uniqie tautomer
-uniq : Return a single unique tautomer
Command line options are distinguished from real filenames by having a `-' prefix. Options can appear anywhere on the command line, i.e. before, after or in between filenames. When incompatible options are specified the last one given on the command line takes effect.
The first filename given on the command line is taken to be the input file, and the optional second filename is treated as the output file. A minus character may be used in place of the input filename to specify that the input is to be read from standard input, stdin, and in place of the output filename to specify that the output is to be written to standard output, stdout. If only one filename is specified on the command line, the output is written to stdout by default.
-level 5 and -ch3.
-all.
-can.
-all command line option. Unfortunately, the -ch3 flag may cause the calculation time to increase by many orders of magnitude for some molecules.
-count.
-all command
line option increases this value to at least five. By default, the
tautomers program enumerates all tautomers in the lowest non-empty
low energy state; first trying level zero and if no tautomers are
found increasing to one, then two and so on.
-level.
-uniq.
Consider the following input file, guanine.smi, that contains
just the following line.
c1[nH]c2c(=O)[nH]c(nc2n1)N
The following command can be used to enumerate all of the reasonably low energy tautomers of this structure.
prompt> tautomers guanine.smi output.smi
Which should write the following 15 structures to the file output.smi.
c1[nH]c2c(=O)nc([nH]c2n1)N c1[nH]c2c(=O)[nH]c(nc2n1)N c1nc2c(=O)nc([nH]c2[nH]1)N c1nc2c(=O)[nH]c(nc2[nH]1)N c1[nH]c2c(=O)[nH]c(=N)[nH]c2n1 c1nc2c(=O)[nH]c(=N)[nH]c2[nH]1 c1[nH]c2c(nc(nc2n1)N)O c1nc-2c(nc([nH]c2n1)N)O c1nc2c(nc(nc2[nH]1)N)O c1nc-2c([nH]c(nc2n1)N)O c1[nH]c2c(nc(=N)[nH]c2n1)O c1[nH]c2c([nH]c(=N)nc2n1)O c1nc2c(nc(=N)[nH]c2[nH]1)O c1nc-2c([nH]c(=N)[nH]c2n1)O c1nc2c([nH]c(=N)nc2[nH]1)O
A more exhaustive enumeration of the tautomers of guanine can be performed
using the -all (or -a) option.
prompt> tautomers -a guanine.smi output.smi
which will write a total of 300 guanine tautomers to the file
output.smi. This can be verified using the -count command
line option, sending the output to the screen.
prompt> tautomers -count -all guanine.smi 300
Finally, the canonicalization abilities of the tautomers program can be
assessed by canonicalizing all 300 of the guanine tautomers generated
above. The example below uses the UNIX command line utility ``uniq'' which
counts the number of repeated lines in a file. By piping the output of
tautomers with the -u flag to UNIX command uniq, the
output below confirms we generate 300 identical copies of the SMILES
string ``c1[nH]c2c(=O)nc([nH]c2n1)N''.
prompt> tautomers -u output.smi | /usr/bin/uniq -c 300 c1[nH]c2c(=O)nc([nH]c2n1)N