Trying Out Lucene Search

zhangzcz1999

Blog category: Java

lucene Apache Blog

Lucene search has a reputation for being powerful. The first step is to create an index:

import java.io.IOException;
import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.document.Field;
import org.apache.lucene.index.CorruptIndexException;
import org.apache.lucene.index.IndexWriter;
import org.apache.lucene.index.Term;
import org.apache.lucene.store.LockObtainFailedException;

public class TextFileIndexer {
	private IndexWriter indexWriter;

	public TextFileIndexer(String directory) {
		try {
			indexWriter = new IndexWriter(directory, new StandardAnalyzer(),
					true);
		} catch (CorruptIndexException e) {
			e.printStackTrace();
		} catch (LockObtainFailedException e) {
			e.printStackTrace();
		} catch (IOException e) {
			e.printStackTrace();
		}
	}

	public static void main(String[] args) throws Exception {
		TextFileIndexer indexer = new TextFileIndexer("/home/com/tmp/index");
		indexer.create("/home/sina", "axu你发三段论法的撒肥撒旦", "了脑门大量的萨考虑我的");
		indexer.create("/home/blog", "9726", "java");
//		 indexer.delete("/home/blog");
		indexer.commit();
	}

	public void create(String path, String title, String content) {
		Document document = new Document();
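		// Index the path as a single untokenized term so that delete(path) can match it;
		// a field created with Field.Index.NO cannot be found by deleteDocuments(Term).
		Field fieldPath = new Field("path", path, Field.Store.YES,
				Field.Index.UN_TOKENIZED);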
		Field fieldTitle = new Field("title", title, Field.Store.YES,
				Field.Index.TOKENIZED);
		Field fieldContent = new Field("content", content, Field.Store.NO,
				Field.Index.TOKENIZED);
		document.add(fieldPath);
		document.add(fieldTitle);
		document.add(fieldContent);
		try {
			indexWriter.addDocument(document);
		} catch (CorruptIndexException e) {
			e.printStackTrace();
		} catch (IOException e) {
			e.printStackTrace();
		}

	}

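	public void commit() {
		try {
			// Merge segments, then close the writer so that buffered documents
			// are flushed to disk and the write lock is released.
			// Call this last: the writer cannot be used after close().
			indexWriter.optimize();
			indexWriter.close();
		} catch (CorruptIndexException e) {
			e.printStackTrace();
		} catch (IOException e) {
			e.printStackTrace();
		}

	}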

	public void delete(String path) {
		Term term = new Term("path", path);
		try {
			indexWriter.deleteDocuments(term);
		} catch (CorruptIndexException e) {
			e.printStackTrace();
		} catch (IOException e) {
			e.printStackTrace();
		}
	}

	public void update(String path, String title, String content) {
		this.delete(path);
		this.create(path, title, content);
	}
}
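
To make concrete what "creating an index" means, here is a minimal sketch of an inverted index in plain Java. It is only an illustration, not how Lucene is implemented: the class `ToyInvertedIndex` and its methods are hypothetical names, and the tokenizer (lowercase, split on anything that is not a letter or digit) is a simplifying assumption that is far cruder than `StandardAnalyzer`.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Toy inverted index (hypothetical class, not part of Lucene).
class ToyInvertedIndex {
    // term -> paths of the documents whose text contains that term
    private final Map<String, Set<String>> postings = new HashMap<>();

    // Lowercase the text and split it on anything that is not a letter or digit.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    // Record every token of the text under the document's path.
    void add(String path, String text) {
        for (String token : tokenize(text)) {
            postings.computeIfAbsent(token, k -> new TreeSet<>()).add(path);
        }
    }

    // Remove the document from every posting list and drop lists that become empty.
    void delete(String path) {
        postings.values().forEach(paths -> paths.remove(path));
        postings.values().removeIf(Set::isEmpty);
    }

    // Return the paths of all documents containing the term, in sorted order.
    List<String> search(String term) {
        Set<String> paths = postings.get(term.toLowerCase(Locale.ROOT));
        return paths == null ? new ArrayList<>() : new ArrayList<>(paths);
    }
}
```

After `add("/home/blog", "9726 java")`, a call to `search("java")` returns a list containing `/home/blog`, and `delete("/home/blog")` removes it again. The lookup is a single map access per term, which is the reason for building the index up front instead of scanning every document at query time.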

With the index in place, you can search it:

import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.index.CorruptIndexException;
import org.apache.lucene.queryParser.ParseException;
import org.apache.lucene.queryParser.QueryParser;
import org.apache.lucene.search.Hits;
import org.apache.lucene.search.IndexSearcher;
import org.apache.lucene.search.Query;
import org.apache.lucene.store.LockObtainFailedException;

public class TextFileSearcher {
	private IndexSearcher searcher;

	public TextFileSearcher(String directory) {
		try {
			searcher = new IndexSearcher(directory);
		} catch (CorruptIndexException e) {
			e.printStackTrace();
		} catch (LockObtainFailedException e) {
			e.printStackTrace();
		} catch (IOException e) {
			e.printStackTrace();
		}
	}

	public static void main(String[] args) throws Exception {
		TextFileSearcher searcher = new TextFileSearcher("/home/com/tmp/index");
		List<Document> list = searcher.search("9726");
		for (Document document : list) {
			System.out.println(document.get("path"));
		}
	}

	public List<Document> search(String keyword) {
		List<Document> resultset = new ArrayList<Document>();
		Analyzer analyzer = new StandardAnalyzer();
		try {
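			// Escape query-syntax characters in the keyword and group it per field,
			// so a multi-word keyword is not parsed against a null default field.
			String escaped = QueryParser.escape(keyword);
			QueryParser parser = new QueryParser("content", analyzer);
			Query query = parser.parse("title:(" + escaped + ") OR content:("
					+ escaped + ")");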
			Hits hits = searcher.search(query);
			for (int i = 0; i < hits.length(); i++) {
				resultset.add(hits.doc(i));
			}
		} catch (ParseException e) {
			e.printStackTrace();
		} catch (IOException e) {
			e.printStackTrace();
		}
		return resultset;
	}

}
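
The string handed to `parser.parse` is plain text in Lucene's query syntax, so a keyword containing characters such as `:` or `+` would be read as operators, not as text to search for. The sketch below shows the idea of escaping a keyword before building the two-field query. `QueryStrings`, `escape` and `titleOrContent` are hypothetical names used for illustration, and the set of special characters is an assumption taken from the classic query syntax. In real code, prefer the parser's own static `QueryParser.escape`.

```java
// Hand-rolled query-string helper (hypothetical, for illustration only).
class QueryStrings {
    // Characters assumed to carry special meaning in the classic query syntax.
    private static final String SPECIAL = "\\+-!():^[]\"{}~*?|&";

    // Prefix every special character with a backslash so it is treated literally.
    static String escape(String keyword) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < keyword.length(); i++) {
            char c = keyword.charAt(i);
            if (SPECIAL.indexOf(c) >= 0) {
                sb.append('\\');
            }
            sb.append(c);
        }
        return sb.toString();
    }

    // Build a query string that looks for the keyword in either field.
    // The parentheses keep a multi-word keyword attached to its field.
    static String titleOrContent(String keyword) {
        String escaped = escape(keyword);
        return "title:(" + escaped + ") OR content:(" + escaped + ")";
    }
}
```

For the keyword used in the example, `QueryStrings.titleOrContent("9726")` yields `title:(9726) OR content:(9726)`, which matches the same documents as the concatenated query in `search`.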


2008-10-22 11:57