Apache Lucene 4.5.0 Released: Java Full-Text Search Engine

jopen, 10 years ago

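Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is suitable for nearly any application that requires full-text search, especially cross-platform ones.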

Lucene offers powerful features through a simple API:

Scalable, High-Performance Indexing

  • over 150GB/hour on modern hardware
  • small RAM requirements -- only 1MB heap
  • incremental indexing as fast as batch indexing
  • index size roughly 20-30% the size of text indexed

Powerful, Accurate and Efficient Search Algorithms

  • ranked searching -- best results returned first
  • many powerful query types: phrase queries, wildcard queries, proximity queries, range queries and more
  • fielded searching (e.g. title, author, contents)
  • sorting by any field
  • multiple-index searching with merged results
  • allows simultaneous update and searching
  • flexible faceting, highlighting, joins and result grouping
  • fast, memory-efficient and typo-tolerant suggesters
  • pluggable ranking models, including the Vector Space Model and Okapi BM25
  • configurable storage engine (codecs)
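To make "ranked searching" and "fielded searching" from the list above concrete, here is a minimal toy sketch in plain Java. It is not Lucene's API or implementation: the class name ToyIndex, its methods, and the simple TF-IDF style score are all assumptions made for illustration, and it assumes document ids are assigned densely starting at 0.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Toy fielded inverted index (hypothetical, for illustration only; not Lucene).
class ToyIndex {
    // Maps "field:term" to (docId -> term frequency in that field).
    private final Map<String, Map<Integer, Integer>> postings = new HashMap<>();
    // Number of documents, assuming ids are 0..n-1.
    private int docCount = 0;

    // Indexes one field of a document; text is lowercased and split on non-alphanumerics.
    void add(int docId, String field, String text) {
        docCount = Math.max(docCount, docId + 1);
        for (String token : text.toLowerCase().split("[^a-z0-9]+")) {
            if (token.isEmpty()) continue;
            postings.computeIfAbsent(field + ":" + token, k -> new HashMap<>())
                    .merge(docId, 1, Integer::sum);
        }
    }

    // Returns ids of documents matching any query term in the given field, best score first.
    List<Integer> search(String field, String query) {
        Map<Integer, Double> scores = new HashMap<>();
        for (String term : query.toLowerCase().split("[^a-z0-9]+")) {
            Map<Integer, Integer> docs = postings.get(field + ":" + term);
            if (docs == null) continue;
            // Rarer terms get a higher weight (a simplified inverse document frequency).
            double idf = Math.log(1.0 + (double) docCount / docs.size());
            for (Map.Entry<Integer, Integer> e : docs.entrySet()) {
                scores.merge(e.getKey(), e.getValue() * idf, Double::sum);
            }
        }
        List<Integer> ranked = new ArrayList<>(scores.keySet());
        // Sort by descending score; break ties by ascending document id.
        ranked.sort((a, b) -> {
            int c = Double.compare(scores.get(b), scores.get(a));
            return c != 0 ? c : Integer.compare(a, b);
        });
        return ranked;
    }

    public static void main(String[] args) {
        ToyIndex index = new ToyIndex();
        index.add(0, "title", "Lucene in Action");
        index.add(0, "contents", "Lucene is a search library");
        index.add(1, "title", "Search basics");
        index.add(1, "contents", "Search search and more search with Lucene");
        System.out.println(index.search("contents", "search lucene")); // [1, 0]
        System.out.println(index.search("title", "lucene"));           // [0]
    }
}
```

Document 1 ranks first for the contents query because "search" occurs three times in its contents field, while the title query only matches document 0. Lucene itself uses far more refined analysis, scoring, and on-disk index structures than this sketch.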

Cross-Platform Solution

Release notes: disk-based doc values and improved filter caching.