Forum: textsearch_ja 8.4.1 and textsearch_senna 8.4.0 released

Posted by: Takahiro Itagaki
Date: 2009-04-28 22:00
Summary: textsearch_ja 8.4.1 and textsearch_senna 8.4.0 released
Project: textsearch-ja

== textsearch_ja 8.4.1 ==
http://pgfoundry.org/frs/shownotes.php?release_id=1359

textsearch_ja is a full-text search parser for Japanese text
that uses the MeCab library. It supports PostgreSQL 8.3 and 8.4.

In 8.4, it can create more compact indexes than earlier
versions because verbs are normalized to their basic form.

The utility functions hiragana(), katakana() and furigana()
have also been added. hiragana() converts all katakana characters
to hiragana, and katakana() does the reverse. furigana() converts
all kanji and hiragana characters to katakana.
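
To illustrate what the hiragana() and katakana() conversions described above do, here is a minimal Python sketch. It is not the textsearch_ja implementation; the function names to_katakana and to_hiragana are hypothetical. It relies only on the fact that the Unicode katakana block sits 0x60 code points above the hiragana block. furigana() is not sketched because producing readings for kanji needs a dictionary such as the one MeCab provides.

```python
# Illustrative sketch only: NOT the textsearch_ja source code.
# to_katakana / to_hiragana are hypothetical names for this example.

HIRAGANA_FIRST = 0x3041  # small a (hiragana)
HIRAGANA_LAST = 0x3096   # small ke (hiragana)
KATAKANA_FIRST = 0x30A1  # small a (katakana)
KATAKANA_LAST = 0x30F6   # small ke (katakana)
KANA_OFFSET = 0x60       # katakana code point = hiragana code point + 0x60


def to_katakana(text: str) -> str:
    """Shift every hiragana character to katakana; leave the rest as-is."""
    return "".join(
        chr(ord(ch) + KANA_OFFSET)
        if HIRAGANA_FIRST <= ord(ch) <= HIRAGANA_LAST
        else ch
        for ch in text
    )


def to_hiragana(text: str) -> str:
    """Shift every katakana character to hiragana; leave the rest as-is."""
    return "".join(
        chr(ord(ch) - KANA_OFFSET)
        if KATAKANA_FIRST <= ord(ch) <= KATAKANA_LAST
        else ch
        for ch in text
    )


if __name__ == "__main__":
    print(to_katakana("ひらがな"))
    print(to_hiragana("カタカナ"))
```

Kanji, Latin letters and the long-vowel mark are outside both ranges, so the sketch passes them through unchanged.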

8.4.1 is a bugfix release:
* Add an encoding check between the database and mecab.
* Fix the normalization routine for handakuten.
* Fix hiragana + (dakuten or handakuten) conversions.

== textsearch_senna 8.4.0 ==
http://pgfoundry.org/frs/shownotes.php?release_id=1360

textsearch_senna is an N-gram based full-text search index
that uses the Senna library. It supports PostgreSQL 8.2, 8.3 and 8.4.

If you want a character-based search, use textsearch_senna
instead of textsearch_ja, which does word-based search.
N-gram based search tends to return results closer to those of LIKE.
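
The difference between character-based and word-based search can be sketched with a toy bigram index in Python. This is only an illustration of the general N-gram technique, not how Senna or textsearch_senna is implemented; the names ngrams, build_index and search are made up for the example.

```python
# Illustrative sketch only: a toy bigram index, NOT the Senna implementation.
# ngrams / build_index / search are hypothetical names for this example.


def ngrams(text, n=2):
    """Return every run of n consecutive characters in text."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def build_index(docs, n=2):
    """Map each n-gram to the set of document ids that contain it."""
    index = {}
    for doc_id, text in docs.items():
        for gram in ngrams(text, n):
            index.setdefault(gram, set()).add(doc_id)
    return index


def search(index, docs, query, n=2):
    """Return ids of documents containing query as a substring."""
    grams = ngrams(query, n)
    if not grams:
        # Query shorter than n: no n-gram to look up, so scan directly.
        return {doc_id for doc_id, text in docs.items() if query in text}
    # Candidates must contain every n-gram of the query...
    candidates = set.intersection(*(index.get(g, set()) for g in grams))
    # ...and are then verified, since the n-grams may not be adjacent.
    return {doc_id for doc_id in candidates if query in docs[doc_id]}


if __name__ == "__main__":
    docs = {1: "東京都庁", 2: "京都大学"}
    index = build_index(docs)
    print(search(index, docs, "京都"))
```

Both documents match the query because the characters 京都 appear in each of them, just as they would with LIKE '%京都%'. A word-based parser would probably split the first document into words that do not include 京都 and so might not return it.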