A word-based model, where a word is essentially a contiguous sequence of alphanumeric characters, coded using Huffman coding.
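As a rough illustration of the idea, the following minimal sketch splits text into word tokens and assigns each distinct token a Huffman code based on its frequency. Several details are assumptions and are not stated in the description above: "alphanumeric" is taken to mean ASCII letters and digits, runs of non-alphanumeric characters are treated as symbols in the same alphabet as the words, the code is static (built from counts over the whole input), and the output is a string of '0'/'1' characters rather than packed bytes. The function names `tokenize`, `huffman_code`, `encode`, and `decode` are hypothetical.

```python
import heapq
import re
from collections import Counter
from itertools import count


def tokenize(text):
    # Split into maximal runs of ASCII alphanumerics ("words") and maximal
    # runs of everything else. Treating non-word runs as ordinary symbols
    # is an assumption made so that the text can be reconstructed exactly.
    return re.findall(r"[A-Za-z0-9]+|[^A-Za-z0-9]+", text)


def huffman_code(freqs):
    # Build a prefix code from a {symbol: count} mapping.
    if not freqs:
        return {}
    if len(freqs) == 1:
        # A single symbol still needs a code of at least one bit.
        return {sym: "0" for sym in freqs}
    tiebreak = count()  # keeps heap entries comparable when counts are equal
    heap = [(n, next(tiebreak), sym) for sym, n in freqs.items()]
    heapq.heapify(heap)
    # Repeatedly merge the two least frequent nodes into one internal node.
    while len(heap) > 1:
        n1, _, left = heapq.heappop(heap)
        n2, _, right = heapq.heappop(heap)
        heapq.heappush(heap, (n1 + n2, next(tiebreak), (left, right)))
    # Walk the tree: leaves are strings, internal nodes are 2-tuples.
    codes = {}
    stack = [(heap[0][2], "")]
    while stack:
        node, prefix = stack.pop()
        if isinstance(node, tuple):
            stack.append((node[0], prefix + "0"))
            stack.append((node[1], prefix + "1"))
        else:
            codes[node] = prefix
    return codes


def encode(text):
    # Returns the bit string and the code table needed to decode it.
    tokens = tokenize(text)
    codes = huffman_code(Counter(tokens))
    return "".join(codes[t] for t in tokens), codes


def decode(bits, codes):
    # Prefix-freeness lets us emit a token as soon as a codeword matches.
    inverse = {code: sym for sym, code in codes.items()}
    out, current = [], ""
    for bit in bits:
        current += bit
        if current in inverse:
            out.append(inverse[current])
            current = ""
    return "".join(out)
```

For example, `bits, codes = encode("the cat and the dog and the bird")` followed by `decode(bits, codes)` reproduces the input, and the frequent token `"the"` receives a codeword no longer than that of the rarer `"cat"`. A practical coder would also need to transmit the code table (or the word list and code lengths) alongside the bits, which this sketch leaves out.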