lucene2.4源码学习9 搜索 norm

huangyunbin

浏览: 2630187 次
性别:
来自: 广州

最近访客更多访客>>

cht的大摩托

xiaoxiaoHer

zzqfsy

为了ta

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

博客分类：

lucene 2.4源码学习

 public float score() {
    int f = freqs[pointer];
    float raw =                                   // compute tf(f)*weight
      f < SCORE_CACHE_SIZE                        // check cache
      ? scoreCache[f]                             // cache hit
      : getSimilarity().tf(f)*weightValue;        // cache miss

    return raw * Similarity.decodeNorm(norms[doc]); // normalize for field
  }

计算得分的时候要用到norm。这个norm是怎么来的呢

 public Scorer scorer(IndexReader reader) throws IOException {
      TermDocs termDocs = reader.termDocs(term);

      if (termDocs == null)
        return null;

      return new TermScorer(this, termDocs, similarity,
                            reader.norms(term.field()));
    }

看到score是从reader中来的。

norm是在SegmentReader的openNorms设置的。

 private void openNorms(Directory cfsDir, int readBufferSize) throws IOException {
    long nextNormSeek = SegmentMerger.NORMS_HEADER.length; //skip header (header unused for now)
    int maxDoc = maxDoc();
    for (int i = 0; i < fieldInfos.size(); i++) {
      FieldInfo fi = fieldInfos.fieldInfo(i);
      if (norms.containsKey(fi.name)) {
        // in case this SegmentReader is being re-opened, we might be able to
        // reuse some norm instances and skip loading them here
        continue;
      }
      if (fi.isIndexed && !fi.omitNorms) {
        Directory d = directory();
        String fileName = si.getNormFileName(fi.number);
        if (!si.hasSeparateNorms(fi.number)) {
          d = cfsDir;
        }
        
        // singleNormFile means multiple norms share this file
        boolean singleNormFile = fileName.endsWith("." + IndexFileNames.NORMS_EXTENSION);
        IndexInput normInput = null;
        long normSeek;

        if (singleNormFile) {
          normSeek = nextNormSeek;
          if (singleNormStream==null) {
            singleNormStream = d.openInput(fileName, readBufferSize);
          }
          // All norms in the .nrm file can share a single IndexInput since
          // they are only used in a synchronized context.
          // If this were to change in the future, a clone could be done here.
          normInput = singleNormStream;
        } else {
          normSeek = 0;
          normInput = d.openInput(fileName);
        }

        norms.put(fi.name, new Norm(normInput, singleNormFile, fi.number, normSeek));
        nextNormSeek += maxDoc; // increment also if some norms are separate
      }
    }
  }

IndexReader：

 public void setNorm(int doc, String field, float value)
          throws StaleReaderException, CorruptIndexException, LockObtainFailedException, IOException {
    ensureOpen();
    setNorm(doc, field, Similarity.encodeNorm(value));
  }

是在这里设置norm的，但是没看到有调用这个方法的地方。

分享到：

lucene2.4源码学习10 查询 coord | lucene2.4源码学习8 得分计算方法 Weight ...

2013-05-12 12:04
浏览 1476
评论(0)
论坛回复 / 浏览 (0 / 1847)
分类:开源软件
查看更多

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

lucene2.4源码学习9 搜索 norm

评论

发表评论

相关推荐

最近访客 更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

lucene2.4源码学习9 搜索 norm

评论

发表评论

相关推荐

lucene2.4源码学习11 查询 tf

lucene2.4源码学习10 查询 coord

lucene2.4源码学习8 得分计算方法 Weight的变量部分

lucene2.4源码学习7 构建查询树 rewrite

lucene2.4源码学习6 搜索 TooManyClauses

lucene2.4源码学习5 写文件之WaitQueue

lucene2.4源码学习4 写文件的脉络

lucene2.4源码学习3 写文件的装饰者 + 责任链 模式

lucene2.4源码学习2 lucene的基本文件学习

lucene2.4源码学习1

最近访客更多访客>>

lucene2.4源码学习3 写文件的装饰者 + 责任链模式