Solr DocValues

ljhupahu


Blog category: Solr

For sorting and faceting, storing field values with DocValues is very efficient.

Traditionally, Solr builds its index as an inverted index: it first builds a term list, and each term then points to a list of the documents that contain it. This structure makes queries very fast, because a ready-made term-to-document-list mapping is available in memory.
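The term-to-document-list structure described above can be sketched in a few lines of Python. This is only a toy illustration, not how Lucene stores its index; the function name `build_inverted_index` and the sample documents are made up for the example.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each term to the sorted list of document IDs that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}


# Hypothetical documents, keyed by document ID.
docs = {
    1: "solr uses lucene",
    2: "lucene builds an inverted index",
    3: "solr faceting",
}
inverted = build_inverted_index(docs)

# A query for one term is a single dictionary lookup.
print(inverted["lucene"])
print(inverted["solr"])
```

Looking up a term is cheap, but note that nothing here answers the opposite question ("which values does document 2 have?") without scanning every term.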

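For other search features such as sorting, faceting, and highlighting, the inverted index is not very efficient. Faceting, for example, must first look up the terms that appear in each document in order to assemble the result set, and then obtain the document IDs. All of these operations happen in memory, so with large data volumes they slow things down and consume valuable memory.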

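Starting with Lucene 4.0, DocValues were introduced as a column-oriented way of storing field values. They form a document-to-value mapping that is built at index time. This reduces the memory requirements of the fieldCache and makes faceting, sorting, and grouping faster.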
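A rough way to picture the document-to-value mapping is one list per field, indexed by document ID. The sketch below assumes that simplified layout; the column names, values, and helper functions are hypothetical and do not reflect Lucene's on-disk format.

```python
from collections import Counter

# Hypothetical columns: list position = document ID, entry = that document's field value.
category_column = ["book", "music", "book", "film", "music", "book"]
price_column = [12.5, 9.0, 30.0, 15.0, 7.5, 22.0]


def facet_counts(column, doc_ids):
    """Count field values over the matching documents, one lookup per document."""
    return Counter(column[doc_id] for doc_id in doc_ids)


def sort_by_column(column, doc_ids):
    """Order the matching documents by their value in the column."""
    return sorted(doc_ids, key=lambda doc_id: column[doc_id])


matches = [0, 1, 2, 4]  # document IDs returned by some query
facets = facet_counts(category_column, matches)
ordered = sort_by_column(price_column, matches)
print(facets)
print(ordered)
```

With this layout, faceting and sorting only need a direct lookup per matching document instead of walking the term list.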

1. Enabling DocValues

To enable DocValues, simply add docValues="true" to the field (or field type) definition in schema.xml.
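The original example did not survive on this page, so the snippet below only illustrates what such a definition might look like. It builds a field element with Python's standard library and prints the resulting line; the field name `manu_exact` and the other attribute values are assumptions chosen for illustration, and `string` is assumed to be a field type backed by StrField.

```python
import xml.etree.ElementTree as ET

# Hypothetical field definition; only docValues="true" comes from the text above.
field = ET.Element("field", {
    "name": "manu_exact",
    "type": "string",
    "indexed": "false",
    "stored": "false",
    "docValues": "true",
})

# Serialize to the single line that would be placed in schema.xml.
field_xml = ET.tostring(field, encoding="unicode")
print(field_xml)
```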

If the index was built before DocValues were enabled and you now need them, the index must be rebuilt.

DocValues only work with certain field types, and the field type determines which Lucene docValues type is used. The supported Solr field types are:

StrField and UUIDField:
    If the field is single-valued (i.e., multiValued is false), Lucene will use the SORTED type.
    If the field is multi-valued, Lucene will use the SORTED_SET type.
Numeric fields whose type names start with Trie, date fields, and EnumField:
    If the field is single-valued (i.e., multiValued is false), Lucene will use the NUMERIC type.
    If the field is multi-valued, Lucene will use the SORTED_SET type.
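The selection rules above can be restated as a small lookup function. This is a sketch of the rules as listed here, not Solr code; the function name is hypothetical, and the set of Trie class names is an assumption about which classes "fields starting with Trie" covers.

```python
STRING_TYPES = {"StrField", "UUIDField"}
# Assumed expansion of "Trie* numeric fields, date fields and EnumField".
NUMERIC_LIKE_TYPES = {
    "TrieIntField", "TrieLongField", "TrieFloatField",
    "TrieDoubleField", "TrieDateField", "EnumField",
}


def lucene_docvalues_type(field_class, multi_valued):
    """Return the Lucene docValues type name for a Solr field class."""
    if field_class not in STRING_TYPES | NUMERIC_LIKE_TYPES:
        raise ValueError(f"{field_class} is not covered by the rules above")
    if multi_valued:
        # Multi-valued fields use SORTED_SET regardless of the field class.
        return "SORTED_SET"
    return "SORTED" if field_class in STRING_TYPES else "NUMERIC"


print(lucene_docvalues_type("StrField", multi_valued=False))
print(lucene_docvalues_type("TrieIntField", multi_valued=False))
print(lucene_docvalues_type("TrieIntField", multi_valued=True))
```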

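Because multi-valued DocValues are stored as SORTED_SET, there are two consequences:

    Values are returned in sorted order, not in the order in which they were originally supplied.
    If the same value occurs more than once, only one copy is returned.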
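Both effects can be mimicked with a sorted set in plain Python. The snippet is a simplified model of the behavior described above, with made-up input values; it does not call Solr.

```python
def as_sorted_set(values):
    """Model SORTED_SET storage: duplicates collapse and values come back sorted."""
    return sorted(set(values))


# Hypothetical values supplied for one multi-valued field, in input order.
supplied = ["pear", "apple", "pear", "banana"]
returned = as_sorted_set(supplied)
print(returned)
```

The input order and the duplicate "pear" are both lost, which matters if the application relies on either.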

2. Retrieving DocValues

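Fields defined with stored="true" are returned in search results. In addition, the useDocValuesAsStored attribute controls whether the values of a docValues field can be returned as if they were stored. In other words, with useDocValuesAsStored="true", a docValues field that is not defined with stored="true" is returned as well.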
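The rule in this paragraph boils down to a simple boolean condition. The function below is a hypothetical model of that rule only; it deliberately ignores other details of real Solr behavior, such as how the fl parameter selects fields.

```python
def is_returned(stored, doc_values, use_doc_values_as_stored):
    """Return True if a field's value would come back under the rule described above."""
    return stored or (doc_values and use_doc_values_as_stored)


# Stored field: returned regardless of the docValues settings.
print(is_returned(stored=True, doc_values=False, use_doc_values_as_stored=False))
# Non-stored docValues field with useDocValuesAsStored="true": also returned.
print(is_returned(stored=False, doc_values=True, use_doc_values_as_stored=True))
# Non-stored docValues field with useDocValuesAsStored="false": not returned.
print(is_returned(stored=False, doc_values=True, use_doc_values_as_stored=False))
```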


2016-09-11 15:33

Comment 1, summer7611, 2017-04-14

Hello, how should the following sentence from cwiki.apache.com be understood?
Note that returning DocValues along with "regular" stored fields at query time has performance implications that stored fields may not because DocValues are column-oriented and may therefore incur additional cost to retrieve for each returned document.
Does it mean that setting both docValues and stored to true incurs a performance cost?
