Lucene_demo06_几种搜索

ewf_momo

浏览: 710844 次
性别:
来自: 北京

最近访客更多访客>>

paotong

sikewang

wswa

yufei466036941

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

博客分类：

Lucene全文索引

lucene

Lucene_demo06_几种搜索

创建searcher的过程
1、创建Directory
2、根据directory创建indexReader
3、根据indexReader创建indexSearcher
4、创建搜索的Query
5、根据searcher搜索并且返回TopDocs
6、根据TopDocs获取ScordDoc对象获取具体的Document对象
7、根据searcher和ScordDoc对象获取具体的Document对象
8、根据Document对象获取需要的值
9、关闭reader

/**
 * @see 1、关键词查询
 * @see 2、查询所有的文档 重点
 * @see 3、范围查询
 * @see 4、通配符查询 重点
 * @see 5、短语查询
 * @see 6、Boolean查询 重点
 */
public class QueryTest {
	/**
	 * 关键词查询 * 因为在创建Term对象的时候，没有分词器，所以这里的字母是区分大小写的 * Term构造函数的第二个参数指的是关键词，必须存在
	 */
	@Test
	public void testTermQuery() throws Exception {
		Term term = new Term("title", "总冠军");
		Query query = new TermQuery(term);
		this.testSearchIndex(query);
	}

	/**
	 * 查询所有的文档
	 */
	@Test
	public void testAllQuery() throws Exception {
		Query query = new MatchAllDocsQuery();
		this.testSearchIndex(query);
	}

	/**
	 * 通配符查询 说明： * 代表任意多个任意字符 ? 代表一个任意字符
	 */
	@Test
	public void testWildCardQuery() throws Exception {
		Term term = new Term("title", "*总?军");
		Query query = new WildcardQuery(term);
		this.testSearchIndex(query);
	}

	/**
	 * boolean查询 可以根据Occur的常量把好几个查询结合在一起
	 */
	@Test
	public void testBooleanQuery() throws Exception {
		Term term = new Term("title", "总冠军");
		TermQuery termQuery = new TermQuery(term);

		Term term2 = new Term("content", "2?13");
		Query wildCardQuery = new WildcardQuery(term2);
		BooleanQuery query = new BooleanQuery();
		query.add(termQuery, Occur.SHOULD);// Occur.MUST必须有、Occur.MUST_NOT必须没有、Occur.SHOULD可以有
		query.add(wildCardQuery, Occur.SHOULD);
		this.testSearchIndex(query);
	}

	/**
	 * 范围查询 查询id范围在5~15间的数据
	 */
	@Test
	public void testRangeQuery() throws Exception {
		Query query = NumericRangeQuery.newLongRange("id", 5L, 15L, true, true);
		this.testSearchIndex(query);
	}

	/**
	 * 所有的Term对象只能在同一个field中进行 如果两个以上大的关键词进行组合查询，得知道其中的位置(分词后的位置)
	 */
	@Test
	public void testPharseQuery() throws Exception {
		Term term = new Term("title", "NBA总冠军");
		Term term2 = new Term("title", "NBA总冠军");
		PhraseQuery phraseQuery = new PhraseQuery();
		phraseQuery.add(term);
		phraseQuery.add(term2);
		this.testSearchIndex(phraseQuery);
	}

	// 公共输出方法
	private void testSearchIndex(Query query) throws Exception {
		IndexSearcher indexSearcher = new IndexSearcher(LuceneUtils.directory);
		TopDocs topDocs = indexSearcher.search(query, 50);
		int count = topDocs.totalHits;// 总的记录数
		ScoreDoc[] scoreDocs = topDocs.scoreDocs;
		List<Article> articleList = new ArrayList<Article>();
		for (int i = 0; i < scoreDocs.length; i++) {
			int index = scoreDocs[i].doc;
			Document document = indexSearcher.doc(index);
			Article article = DocumentUtils.document2Article(document);
			articleList.add(article);
		}

		// 输入搜索出来的内容
		for (Article article : articleList) {
			System.out.println(article.getId());
			System.out.println(article.getTitle());
			System.out.println(article.getContent());
		}
	}

}

参考：http://my.oschina.net/winHerson/blog/82194

分享到：

ajax跨域问题 | Lucene_demo05_内存索引和文件索引

2013-06-09 21:04
浏览 1127
评论(0)
分类:编程语言
查看更多

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

Lucene_demo06_几种搜索

评论

发表评论

相关推荐

最近访客 更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

Lucene_demo06_几种搜索

评论

发表评论

相关推荐

基于 Lucene 的8 个开源搜索引擎

什么是垂直搜索引擎？

搜索引擎的工作原理

Lucene中文分词 “庖丁解牛”

Lucene_demo09_txt文件索引

Lucene_demo08_Hightlighter高亮

Lucene_demo07_Sort匹配度

Lucene简介

Lucene_demo05_内存索引和文件索引

Lucene_demo04_分页

Lucene_demo03_索引库整理

Lucene_demo00_IndexCURD

Lucene_demo02_分词

Lucene_demo01_FirstProject

最近访客更多访客>>