Forum: textsearch_ja 8.4.1 and textsearch_senna 8.4.0 released
Posted by: Takahiro Itagaki http://pgfoundry.org/frs/shownotes.php?release_id=1359 textsearch_ja is a full text search parser for Japanese text using mecab library. It supports 8.3 and 8.4. In 8.4, it can create more compact indexes compared with earlier versions because verbs are normalized in basic form. Also, utility functions hiragana(), katakana() and furigana() are added. hiragana() converts all katakana characters to hiragana, and katakana() does the reverse. furigana() converts all kanji and hiragana characters to katakana. 8.4.1 is a bugfix release: * Add encoding check between database and mecab * Fix normalization routine of handakuten. * Fix hiragana + (dakuten or handakuten) conversions. == textsearch_senna 8.4.0 == http://pgfoundry.org/frs/shownotes.php?release_id=1360 textsearch_senna is a N-gram based full-text search index using senna library. It supports 8.2, 8.3 and 8.4. If you want a character-based search, use textsearch_senna instead of textsearch_ja, that does word-based search. N-gram based search could return results more similar to LIKE.
|
Latest Newstextsearch_senna 9.0.1 supports 64bit WindowsTakahiro Itagaki - 2011-05-02 21:15 -
0 Comment Read More/Comment
textsearch_ja 8.4.1 and textsearch_senna 8.4.0 releasedTakahiro Itagaki - 2009-04-28 22:00 -
0 Comment Read More/Comment
textsearch-ja 8.3.2 and 8.4beta1 releasedTakahiro Itagaki - 2008-12-15 01:31 -
0 Comment Read More/Comment
|
Monitor Forum | 
