lucene2.0学习文档五

zjkilly

浏览: 266014 次
性别:
来自: 天津

最近访客更多访客>>

su666

mr_gx

xiaomabobo

wulugeng-2018

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

博客分类：

lucene2.0

lucene C C++C#

它的运行结果为:
总结一下：
1.设置Field的长度限制只是限制了搜索。如果用了Field.Store.YES的话还是会
全部被保存进索引目录里的。

2.为什么搜the没有搜出来呢?是因为lucene分析英文的时候不会搜索the to of 等无用的词(搜这些词是无意义的)。

3.New StandardAnlayzer()对于英文的分词是按空格和一些无用的词，而中文呢是全部的单个的字。

4.设置Field的最大长度是以0开头和数组一样。

大家还可以试一下别的，以便加深一下印象

到现在我们已经可以用lucene建立索引了
下面介绍一下几个功能来完善一下：
1．索引格式
      其实索引目录有两种格式，一种是除配置文件外，每一个Document独立成为一个文件（这种搜索起来会影响速度）。另一种是全部Document成一个文件，这样属于复合模式就快了。
2.索引文件可放的位置：
    索引可以存放在两个地方1.硬盘，2.内存。放在硬盘上可以用FSDirectory()，放在内存的用RAMDirectory()不过一关机就没了。
FSDirectory.getDirectory (File file, boolean create)
FSDirectory.getDirectory(String path, boolean create)两个工厂方法返回目录
New RAMDirectory() 就直接可以，再和IndexWriter(Directory d, Analyzer a, boolean create) 一配合就行了
如：
IndexWrtier indexWriter = new IndexWriter(FSDirectory.getDirectory(“c:\\index”,true),new StandardAnlyazer(),true);
IndexWrtier indexWriter = new IndexWriter(new RAMDirectory(),new StandardAnlyazer(),true);
3.索引的合并
    这个可用IndexWriter.addIndexes(Directory[] dirs) 将目录加进去
来看个例子:
public void UniteIndex() throws IOException{
      IndexWriter writerDisk = new IndexWriter(FSDirectory.getDirectory("c:\\indexDisk",
        true),new StandardAnalyzer(),true);
       Document docDisk = new Document();
       docDisk.add(new Field("name","程序员之家",Field.Store.YES,Field.Index.TOKENIZED));
      writerDisk.addDocument(docDisk);
       RAMDirectory ramDir = new RAMDirectory();
IndexWriter writerRam = new IndexWriter(ramDir,new StandardAnalyzer(),true);
       Document docRam = new Document();
       docRam.add(new Field("name","程序员杂志",Field.Store.YES,Field.Index.TOKENIZED));
       writerRam.addDocument(docRam);
       writerRam.close();//这个方法非常重要，是必须调用的
       writerDisk.addIndexes(new Directory[]{ramDir});
       writerDisk.close();
   }
   public void UniteSearch() throws ParseException, IOException{
       QueryParser queryParser = new QueryParser("name",new StandardAnalyzer());
       Query query = queryParser.parse("程序员");
       IndexSearcher indexSearcher =new IndexSearcher("c:\\indexDisk");
       Hits hits = indexSearcher.search(query);
       System.out.println("找到了"+hits.length()+"结果");
       for(int i=0;i< hits.length();i++){
          Document doc = hits.doc(i);
          System.out.println(doc.get("name"));
        }
   }
这个例子是将内存中的索引合并到硬盘上来.
注意：合并的时候一定要将被合并的那一方的IndexWriter的close()方法调用。

分享到：

lucene2.0学习文档六 | lucene2.0学习文档四

2009-11-04 13:50
浏览 612
评论(0)
查看更多

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论