1.1 Input Name Representation

The oeiupac library currently processes NUL (zero) terminated ASCII character strings, therefore greek characters, symbols, fonts and superscripts must be transliterated into the printable subset of ASCII.

When parsing compound names, the oeiupac library considers both spaces and tab characters as interchangeable, and any number of consecutive `whitespace' characters are treated as a single space.

Currently, the name parsing is case insensitive, allowing arbitrary mixing of upper and lower case characters, e.g. initial letter capitalization.

Greek characters are understood in a number of different representations. The strings `$a', `${a}', `alpha', `.alpha.', `α', `α' and `α' are all understood to represent the greek character alpha, $\alpha$.

There is no special representation for italic characters. Compound names such as `tert-butyl' and `p-aminobenzamidine' are represented as `tert-butyl' and `p-aminobenzamidine'. Both the long and short forms of prefixes can be used, allowing the above examples to also be written as `t-butyl' and `para-aminobenzamidine'.