Egothor, a Full-Text Search Engine

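Published: 2012-11-19 15:26:19  Source: 红联  Author: empast
Egothor is an efficient, open-source full-text search engine written in Java. Thanks to Java's cross-platform nature, Egothor can run in any environment: it can be configured as a standalone search engine or embedded in your own application to provide full-text search.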

Key features:

Written in Java for cross-platform compatibility.
New dynamization algorithm for fast index updating.
Fully 64-bit kernel
Transactions (ACID)
Document revisions, Xdelta
Incremental updates
Queries can be solved in a parallel manner
Able to recognize the most common file formats: HTML, PDF, PS, and Microsoft's DOC and XLS.
High-capacity robot (crawler) that supports the robots.txt recommendation.
The best compression methods are used, i.e. Golomb, Elias-Gamma, Block coding.
Based on the extended Boolean model which can operate as the Vector or Boolean models.
Universal stemmer that processes any language.
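
Of the compression methods named in the list, Elias-Gamma coding is the simplest to illustrate. The sketch below is not Egothor's code; it is a minimal, assumed illustration of the general technique, using a hypothetical class `EliasGamma` and a string of '0'/'1' characters in place of a real bit stream. A positive integer n is written as floor(log2 n) zeros followed by the binary form of n, and a sorted list of document IDs is stored as the gaps between consecutive IDs so that most encoded numbers stay small.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration class; not part of Egothor.
class EliasGamma {
    // Encode a positive integer: (bit length - 1) zeros, then the binary digits.
    static String encode(int n) {
        if (n < 1) {
            throw new IllegalArgumentException("Elias gamma is defined only for n >= 1");
        }
        String binary = Integer.toBinaryString(n);
        StringBuilder sb = new StringBuilder();
        for (int i = 1; i < binary.length(); i++) {
            sb.append('0'); // unary prefix announcing how many bits follow the leading 1
        }
        return sb.append(binary).toString();
    }

    // Decode a concatenation of well-formed gamma codes back into integers.
    static List<Integer> decode(String bits) {
        List<Integer> out = new ArrayList<>();
        int i = 0;
        while (i < bits.length()) {
            int zeros = 0;
            while (bits.charAt(i) == '0') { // count the unary prefix
                zeros++;
                i++;
            }
            out.add(Integer.parseInt(bits.substring(i, i + zeros + 1), 2));
            i += zeros + 1;
        }
        return out;
    }

    // Encode strictly increasing, positive document IDs as gamma-coded gaps.
    static String encodePostings(int[] sortedDocIds) {
        StringBuilder sb = new StringBuilder();
        int prev = 0;
        for (int id : sortedDocIds) {
            sb.append(encode(id - prev));
            prev = id;
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        int[] postings = {3, 7, 8, 20};          // example document IDs (made up)
        String bits = encodePostings(postings);
        System.out.println(bits);                // the concatenated gamma codes
        System.out.println(decode(bits));        // the gaps between consecutive IDs
    }
}
```

Decoding returns the gaps, so the original document IDs are recovered by a running sum. A production index would pack the codes into actual bits rather than characters; the string form is used here only to keep the example readable.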

Homepage: http://www.egothor.org/

From: 开源中国社区 (OSChina)