Lucene的Field选项解释

jayghost

浏览: 443729 次
性别:
来自: 成都

最近访客更多访客>>

liangzai951

南方老牛

wanmbv

casiert123

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

博客分类：

Lucene

引用Lucene In Action第二版2.4节内容：

域索引选项：

The options for indexing (Field.Index.* ) control how the text in the field will be

made searchable via the inverted index. Here are the choices:

Index.ANALYZED—Use the analyzer to break the field’s value into a stream of separate tokens and make each token searchable. This option is useful for normal text fields (body, title, abstract, etc.).
Index.NOT_ANALYZED —Do index the field, but don’t analyze the String value. Instead, treat the Field ’s entire value as a single token and make that token searchable. This option is useful for fiel ds that you’d like to search on but that shouldn’t be broken up, such as URLs, file system paths, dates, personal names, Social Security numbers, and telephone nu mbers. This option is especially useful for enabling “exact match精确匹配” searching. We indexed the id field in listings 2.1 and 2.3 using this option.
Index.ANALYZED_NO_NORMS —A variant of Index.ANALYZED that doesn’t store norms information in the index. Norms record index-time boost information in the index but can be memory consuming when you’re searching. Section 2.5.3 describes norms in detail.
Index.NOT_ANALYZED_NO_NORMS—Just like Index.NOT_ANALYZED , but also doesn’t store norms. This option is frequently used to save index space and memory usage during searching, because single-token fields don’t need the norms information un less they’re boosted.
Index.NO —Don’t make this field’s value available for searching.

域存储选项：

The options for stored fields ( Field.Store.* ) determine whether the field’s exact

value should be stored away so that you can later retrieve it during searching:

Store.YES—Stores the value. When the value is stored, the original String in its entirety is recorded in the index and may be retrieved by an IndexReader. This option is useful for fields that you’d like to use when displaying the search results (such as a URL, title, or database primary key). Try not to store very large fields, if index size is a concern, as stored fields consume space in the index.
Store.NO —Doesn’t store the value. This option is often used along with Index.ANALYZED to index a large text field that doesn’t need to be retrieved in its original form, such as bodies of web pages, or any other type of text document.

分享到：

网摘-java面试笔试题大汇总 | 【转】mahout应用kmeans进行文本聚类2之— ...

2012-05-14 22:55
浏览 1145
评论(0)
分类:开源软件
查看更多

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

Lucene的Field选项解释

评论

发表评论

相关推荐

最近访客 更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

Lucene的Field选项解释

评论

发表评论

相关推荐

【转】将lucene索引转化成mahout输入向量

【转】深入 Lucene 索引机制

【转】lucene文件格式

最近访客更多访客>>