Adjusting Lucene's Relevance Sort Order

caocao

Blog categories: Java, lucene

If you repost this article, please credit the author, caocao, and the source, http://caocao.iteye.com/.

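By default, Lucene sorts search results by relevance. This relevance sort is driven by two internal values, the Score and the DocID. The Score in turn depends on Lucene's internal scoring of the query terms and on the boost applied at indexing time. Documents with a higher Score come first; when Scores are equal, documents are ordered by index order, so the one indexed earlier comes first. Someone asked: what if I want the documents indexed earlier to come last instead? After reading the source I found that this is quite easy to do. The code below is based on Lucene 2.0.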

Look at the default constructor of Sort: the relevance order is simply the combination of SortField.FIELD_SCORE and SortField.FIELD_DOC.

Java code

/**
 * Sorts by computed relevance. This is the same sort criteria as calling
 * {@link Searcher#search(Query) Searcher#search()} without a sort criteria,
 * only with slightly more overhead.
 */
public Sort() {
    this(new SortField[] { SortField.FIELD_SCORE, SortField.FIELD_DOC });
}

So how do we build the SortField we need? Look at the following SortField constructor: its reverse parameter lets us flip the order of the result set for that field.

Java code

/** Creates a sort, possibly in reverse, by terms in the given field with the
 * type of term values explicitly given.
 * @param field Name of field to sort by. Can be <code>null</code> if
 * <code>type</code> is SCORE or DOC.
 * @param type Type of values in the terms.
 * @param reverse True if natural order should be reversed.
 */
public SortField (String field, int type, boolean reverse) {
    this.field = (field != null) ? field.intern() : field;
    this.type = type;
    this.reverse = reverse;
}

So all we need to do is build a suitable SortField[]. For example:

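Java code

// Score descending; when scores are equal, documents indexed later come first
SortField[] newestFirstOnTies = new SortField[] {
    SortField.FIELD_SCORE, new SortField(null, SortField.DOC, true) };
// Score ascending; when scores are equal, documents indexed later come first.
// This puts the least relevant results first, which is rather amusing.
SortField[] leastRelevantFirst = new SortField[] {
    new SortField(null, SortField.SCORE, true), new SortField(null, SortField.DOC, true) };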

Pass this SortField[] to the Sort constructor to get a Sort instance, then pass that instance to searcher.search(query, sort) to get the results in the order you want.
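
To make the effect of the reverse flag concrete without needing an index, here is a small plain-Java model of the same idea: a list of sort keys applied in order, each with its own reverse flag. It is only a sketch and does not use Lucene; the names Hit, Key, order and sampleHits are hypothetical and exist only for this illustration. The model assumes, as described above, that the natural order for scores is highest first and the natural order for document IDs is lowest (earliest indexed) first.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;

// Plain-Java model of multi-key sorting with per-key reverse flags.
// Hit, Key and order() are hypothetical illustration names, not Lucene classes.
class ReversibleSortDemo {
    static final int SCORE = 0; // natural order: higher score first
    static final int DOC = 1;   // natural order: lower doc ID (indexed earlier) first

    static final class Hit {
        final int docId;
        final float score;
        Hit(int docId, float score) { this.docId = docId; this.score = score; }
    }

    static final class Key {
        final int type;
        final boolean reverse;
        Key(int type, boolean reverse) { this.type = type; this.reverse = reverse; }
    }

    // Sorts a copy of the hits by the given keys, in order, and returns the doc IDs.
    static List<Integer> order(List<Hit> hits, final Key... keys) {
        List<Hit> sorted = new ArrayList<Hit>(hits);
        Collections.sort(sorted, new Comparator<Hit>() {
            public int compare(Hit a, Hit b) {
                for (Key k : keys) {
                    int c = (k.type == SCORE)
                            ? Float.compare(b.score, a.score)      // high score first
                            : Integer.compare(a.docId, b.docId);   // low doc ID first
                    if (k.reverse) c = -c;                         // flip the natural order
                    if (c != 0) return c;                          // later keys only break ties
                }
                return 0;
            }
        });
        List<Integer> ids = new ArrayList<Integer>();
        for (Hit h : sorted) ids.add(h.docId);
        return ids;
    }

    // Three documents; docs 0 and 2 have the same score.
    static List<Hit> sampleHits() {
        return Arrays.asList(new Hit(0, 0.5f), new Hit(1, 0.9f), new Hit(2, 0.5f));
    }

    public static void main(String[] args) {
        List<Hit> hits = sampleHits();
        // Default relevance order: prints [1, 0, 2]
        System.out.println(order(hits, new Key(SCORE, false), new Key(DOC, false)));
        // Score descending, later-indexed first on ties: prints [1, 2, 0]
        System.out.println(order(hits, new Key(SCORE, false), new Key(DOC, true)));
        // Score ascending, later-indexed first on ties: prints [2, 0, 1]
        System.out.println(order(hits, new Key(SCORE, true), new Key(DOC, true)));
    }
}
```

The second and third calls correspond to the two SortField[] arrays above. Only the tie between documents 0 and 2 is affected by reversing the DOC key, which is why the second key matters only when scores are equal.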

For a live example, see the search site I built: http://so.mdbchina.com.


2007-02-12 11:58

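Comment #2, NetBus, 2007-03-17

If a Lucene search is sorted by anything other than docid or score, it will be very slow. No matter how you optimize the code or how you build the index, it will not help.

I once built a search platform on Lucene. Once the index passed 500,000 records and 2 GB in size, searching and index optimization became very slow (on a P3 500 MHz CPU with 1 GB of RAM), to the point of being unacceptable.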

Comment #1, YuLimin, 2007-02-15

When sorting on a single field, calling setSort directly is more convenient:

  /**
   * Sets the sort to the terms in <code>field</code> possibly in reverse,
   * then by index order (document number).
   */
  public void setSort(String field, boolean reverse)
