genpyt - generate the PINYIN lexicon
B<genpyt> I<lexicon-file> I<result-file> I<log-file> I<slm-file>
B<genpyt> generates the PINYIN lexicon.
It only works in the zh_CN.UTF-8 locale.
Specifies the dictionary file. It should be a line-based text file in UTF-8 encoding.
Each line looks like:
    CCC id [pinyin'pinyin'pinyin]*
A default dictionary file can be found at F</usr/share/sunpinyin/dict.utf8>.
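The line format above can be illustrated with a small parser. This is only a sketch under assumptions that this page does not confirm: fields are separated by whitespace, C<CCC> is the word itself, C<id> is a decimal integer, and each remaining field is one pronunciation whose syllables are joined by apostrophes. The names C<DictEntry> and C<parse_dict_line> are hypothetical and are not part of sunpinyin.

```python
from typing import List, NamedTuple


class DictEntry(NamedTuple):
    """One dictionary line (hypothetical type, for illustration only)."""
    word: str                          # the "CCC" field
    word_id: int                       # the "id" field, assumed to be an integer
    pronunciations: List[List[str]]    # one list of syllables per pronunciation


def parse_dict_line(line: str) -> DictEntry:
    """Parse a line of the assumed form: CCC id [pinyin'pinyin'pinyin]*"""
    fields = line.split()
    if len(fields) < 2:
        raise ValueError(f"expected at least a word and an id: {line!r}")
    word, raw_id = fields[0], fields[1]
    # Every remaining field is one pronunciation; "'" separates its syllables.
    pronunciations = [field.split("'") for field in fields[2:]]
    return DictEntry(word, int(raw_id), pronunciations)
```

For example, under these assumptions C<parse_dict_line("中国 100 zhong'guo")> yields the word C<中国>, the id C<100>, and a single pronunciation made of the syllables C<zhong> and C<guo>.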
The output binary PINYIN lexicon file. This lexicon contains a trie representing the key tree of PINYIN, and all of the candidate words are sorted using the unigram information in I<slm-file>. This file can be used with sunpinyin input method engines.
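The idea of a PINYIN key trie with candidates ordered by unigram score can be sketched in memory as follows. This does not reproduce the binary layout of I<result-file>, which this page does not describe. It assumes one trie edge per syllable, a plain mapping from word to unigram score, and descending order by score. All names here (C<PinyinTrieNode>, C<build_trie>, C<lookup>) are hypothetical.

```python
from typing import Dict, Iterable, List, Sequence, Tuple


class PinyinTrieNode:
    """A trie node; each edge is labelled with one pinyin syllable."""

    def __init__(self) -> None:
        self.children: Dict[str, "PinyinTrieNode"] = {}
        self.candidates: List[Tuple[str, float]] = []  # (word, unigram score)


def build_trie(entries: Iterable[Tuple[str, Sequence[str]]],
               unigram: Dict[str, float]) -> PinyinTrieNode:
    """Insert (word, syllables) pairs, then sort candidates by unigram score."""
    root = PinyinTrieNode()
    nodes = [root]
    for word, syllables in entries:
        node = root
        for syllable in syllables:
            if syllable not in node.children:
                node.children[syllable] = PinyinTrieNode()
                nodes.append(node.children[syllable])
            node = node.children[syllable]
        # Words missing from the model get a score of 0.0 (an assumption).
        node.candidates.append((word, unigram.get(word, 0.0)))
    for node in nodes:
        # Highest score first; the sort is stable, so ties keep insertion order.
        node.candidates.sort(key=lambda cand: cand[1], reverse=True)
    return root


def lookup(root: PinyinTrieNode, syllables: Sequence[str]) -> List[str]:
    """Return the candidate words stored under the given syllable sequence."""
    node = root
    for syllable in syllables:
        node = node.children.get(syllable)
        if node is None:
            return []
    return [word for word, _ in node.candidates]
```

With two words sharing the key C<shi'shi>, the one with the higher unigram score is returned first by C<lookup>.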
Specifies the file to which the log is written. The I<log-file> can be seen as the human-readable representation of the binary output file.
The language model from which the unigram information is retrieved. Typically, the I<slm-file> is generated by B<slmthread>.
Originally written by Phill.Zhang E<lt>phill.zhang@sun.comE<gt>.
Currently maintained by Kov.Chai E<lt>tchaikov@gmail.comE<gt>.