A.1 Filter 2.0

January 2007 2.0.1

This release is a broad-sweeping bug fix release with a dramatically improved introduction in the documentation. It includes many simple and complex bug fixes including; conceptual clarifications of several algorithms, fixes and improvements to the data representation, several advances in the pKa model, and significantly more documentation. While this is only a bug-fix release, it represents a significant step-forward in the refinement of the product. We hope you find it suits your needs.

Bug Fixes and Refinements:

  1. Added three additional filters that are designed around the best-selling prescription drugs

  2. Fixed two bugs in the algorithm for recognizing hydrogen-bond donors and acceptors

  3. Abbott Bioavailability Score corrected to be reported as a probability

  4. Aggregator function augmented to include more than 250 additional known aggregators

  5. Predicted aggregator QSAR model introduced to further enhance aggregator recognition capability

  6. XLogP and LogS corrected to always examine the hypervalent form of molecules (as are used in the training set)

  7. Improved data reporting and filter control for the pharmacokinetic properties

  8. Fixed category reporting for solubility calculation

  9. Changed the default TPSA calculation to ignore rather than include P and S surface area

  10. Renamed and clarified aliphatic_group filter to ``maximum connected non-ring atoms''

  11. Fixed typos in names beta_carbonyl_quat_nitrogen and t_butoxycarbonyl_tBOC

  12. Updated chloramidine filter name to the more common imidoyl_chloride

  13. Fixed error in beta_azo_carbonyl pattern

  14. Made patterns for cytochalasin, monensin, squalestatin and saponin more specific

  15. Clarified the exclusion patterns for acyclic_NCN and SCN2

  16. Dramatically improved the specificity and range of the michael_acceptor pattern

  17. Implemented an algorithm for ``total functional group count'' that takes account of all functional groups rather than only those included in the filter patterns

  18. Fixed error in alkyl_halide pattern that limited it to bromine and iodine

  19. Fixed multiple patterns that matched their function group multiple times because of symmetry

  20. Limited enamine definition to exclude exceedingly delocalized instances

  21. Added -tableFlag parameter to mark all failure values in the table file with an asterix

  22. Fix t_butyl_ether pattern to avoid matching t_butyl_alcohols

  23. Removed dependence of rigid bond count upon implicit or explicit hydrogens

  24. Extended the unique flag to allow values of ``true'', ``false'' or a filename used to pre-load a file of uniques

  25. Cleaned-up table data for the select parameter

  26. Corrected problem that MIN_HALIDE_FRACTION and ALLOWED_ELEMENTS could not properly be removed from the filter file

  27. Fixed bug so NEWRULE lines with tailing white space are not ignored

  28. Fixed crash bug that occurred when a filter file was specified that was not a valid file

  29. Added pKa for phenol groups with significant electron withdrawing groups

  30. Improved pKa value for barbituate-like compounds

  31. Clarified alpha-acyl carbon deprotonation

  32. Removed 1,2,3 triazole pKa rule

  33. Clarified tertiary amine rule

  34. Improved efficiency for applying pKa rules to multi-conformer molecules

October 2005 2.0.0

This is the first official "2.0" version of filter. It includes many bug-fixes and feature expansions based on the long beta period. Some of these include higher level filtering flags such as Lipinski and other similar measures. This release also includes a pKa model that was missed in the beta release as well as non-pH normalization capabilities. This version is makes exporting filtering data to other programs facile with both table and sdtag data export.

New Features:

  1. Neutral pKa model optionally applied to all molecules

  2. Support for molecular graph normalization

  3. Optional salt file can be used to remove salts from database

  4. Calculated properties can be attached to molecules as SD tag data

  5. All filtering data can be saved in tab-separated value format

  6. Filter on fractional halide weight

  7. Removal of known aggregator molecules

  8. Veber bioavailability filtering

  9. Lipinski violation filtering

  10. Abbott/Martin bioavailability filtering

  11. Pharmacopia bioavailability filtering

  12. Solubility class filtering

  13. Chiral center count filtering

February 2005 2.0b1

This is the first version of Filter based on OEChem. As such, it does not alter the graph of your molecules. This is a departure from Filter 1.x, that was based on OELib and applied a pH model to the molecules in an attempt to fix them in protonation state for pH=7.0.

This version of filter has also eliminated the need for cumbersome external data files while still allowing user control over the filtering process. Several new functional groups have been added to the filters and more realistic filter values have been used for several functional groups.

High-level filters such as ``total number for functional groups'' and ``carbon to heteroatom ratio'' have been added to prevent unfunctionalized molecules from slipping through the filters.

New Features:

  1. Based on OEChem

  2. Superior Data Integrity

  3. Valence checking (broken molecule filter)

  4. Substructure selection (for selection of filtered reagent lists)

  5. Not dependent on external data files

  6. Improved filters and filter properties

  7. >500 molecules/second with SMILES I/O

  8. Support for additional platforms

  9. Documentation

  10. Improved command-line interface including command-line help