Lucene建立索引的例子

tonybo2006

浏览: 9248 次
性别:
来自: 大连

最近访客更多访客>>

chenjingzhong

woodding2008

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

博客分类：

全文检索---Lucene

lucene


/**
 * @author tonybo2006
 * @version 2008/11/14
 */
public class TextFileIndexer {
	private String dataFilePath = null;
	private String indexFilePath = null;
	private File dataFile = null;
	/**
	 * @param dataFilePath
	 */
	public TextFileIndexer(String dataFilePath, String indexFilePath) {
		this.dataFilePath = dataFilePath;
		this.dataFile = new File(dataFilePath);
		this.indexFilePath = indexFilePath;
	}
	/**
	 * @return StandardAnalyzer
	 */
	private Analyzer getAnalyzer() {
		return new StandardAnalyzer();
	}
	/**
	 * @return doc Document
	 */
	private IndexWriter getDocument(IndexWriter indexWriter) {
		try {
			if (dataFile.isDirectory()) {
				File[] files = dataFile.listFiles();
				for (int i = 0; i < files.length; i++) {
					Document doc = new Document();
					System.out.println("File " + files[i].getCanonicalPath() + "正在被索引....");
					// file path
					doc.add(new Field("path", files[i].getAbsolutePath(),Field.Store.YES, Field.Index.NOT_ANALYZED));
					// file modified time
					doc.add(new Field("modified", DateTools.timeToString(files[i].lastModified(),
							DateTools.Resolution.MINUTE), Field.Store.YES,Field.Index.NOT_ANALYZED));
					// file content
					doc.add(new Field("contents", new FileReader(files[i])));
					indexWriter.addDocument(doc);
				}
			} else {
				Document doc = new Document();
				System.out.println(dataFile.getCanonicalPath() + "正在被索引....");
				// file path
				doc.add(new Field("path", dataFilePath, Field.Store.YES,Field.Index.NOT_ANALYZED));
				// file modified time
				doc.add(new Field("modified", DateTools.timeToString(dataFile.lastModified(), 
						DateTools.Resolution.MINUTE),Field.Store.YES, Field.Index.NOT_ANALYZED));
				// file content
				doc.add(new Field("contents", new FileReader(new File(dataFilePath))));
				indexWriter.addDocument(doc);
			}
		} catch (UnsupportedEncodingException e) {
			e.printStackTrace();
		} catch (FileNotFoundException e) {
			e.printStackTrace();
		} catch (IOException e) {
			e.printStackTrace();
		}
		return indexWriter;
	}

	/**
	 * 建立索引。
	 */
	public void createIndex() {
		try {
			IndexWriter indexWriter = new IndexWriter(indexFilePath,getAnalyzer(), true, IndexWriter.MaxFieldLength.UNLIMITED);
			long startTime = new Date().getTime();
			System.out.println("开始索引……");
			getDocument(indexWriter);
			// 测试一下索引的时间
			long endTime = new Date().getTime();
			System.out.println("索引完成。花费了" + (endTime - startTime) + " 毫秒来把文档增加到索引里面去!" + indexFilePath);
			indexWriter.optimize();
			indexWriter.close();
		} catch (CorruptIndexException e) {
			e.printStackTrace();
		} catch (LockObtainFailedException e) {
			e.printStackTrace();
		} catch (IOException e) {
			e.printStackTrace();
		}
	}
	/**
	 * test
	 */
	public static void main(String[] args) throws Exception {
		String dataFilePath = "D:\\s";//数据文件路径
		String indexFilePath = "D:\\index";//索引文件路径
		TextFileIndexer textFileIndexer = new TextFileIndexer(dataFilePath,indexFilePath);
		textFileIndexer.createIndex();
	}
}

分享到：

Lucene查询索引的例子 | Lucene学习笔记1--建立索引

2008-11-14 14:51
浏览 1052
评论(0)
查看更多

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

Lucene建立索引的例子

评论

发表评论

相关推荐

最近访客 更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

Lucene建立索引的例子

评论

发表评论

相关推荐

Lucene查询索引的例子

Lucene学习笔记1--建立索引

Lucene 学习前期准备

最近访客更多访客>>