README: This file
beta.tex: A discussion of the beta installation
bit-layout, fakelex, taglex, numbers.tex: Discussion and examples of the bit layout used in vectorization.
cl.doc: Brief description of the cleaner module
matchers: Abbreviated description of matcher rules. A rule compiler generating Pattern-Action rules from such descriptions automatically was planned, but never implemented.
mbox: A random mailbox from the days of the first live downloads.
mr.doc: Rudimentary documentation for matchers
ms.doc: Rudimentary documentation for mark-short
professiondoc: how profession tags get defined
sb.doc: Brief description of the sentence breaker module
sentence-bound: where sentence boundaries are hypothesized
tags: A fuller list of the SGML-style tags used in the analysis
template: documentation template
zipf: selected from a frequency count