Solr terms使用

hui_jing_880210

浏览: 43887 次
性别:
来自: 北京

最近访客更多访客>>

loginboot

dream_xz

ekinsa1

zjy_369

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

博客分类：

solr

solr

这个根据solr提供的terms

实现一：实现商品名称自动联想

 public static List<String> getSearchSuggestHjy(String selectValue,int num){
    	
    	List<String> keyValue = new ArrayList<String>();
    	SolrQuery params = new SolrQuery();
	params.set("qt", "/terms"); 
	params.set("terms.prefix", SolrjQueryValueUtil.escapeQueryChars(selectValue));		
	params.set("terms.fl", "productName");			
	        params.set("terms.mincount", "1");  
	        params.set("terms.maxcount", "100");  
	        params.set("terms.limit", num);  
	params.set("terms.raw", "true");  
	 params.set("terms.sort", "count");  
	params.set("wt","json");
			try {
				QueryResponse response =  server.query(params);
			    if(response != null ){ 
			    	 TermsResponse termsResponse = response.getTermsResponse(); 
			    	 if(termsResponse != null) {  
			                Map<String, List<TermsResponse.Term> > termsMap = termsResponse.getTermMap();  
			                for(Map.Entry<String, List<TermsResponse.Term> > termsEntry : termsMap.entrySet()) {  
			                    List<TermsResponse.Term> termList = termsEntry.getValue();  
			                    for(TermsResponse.Term term : termList) { 
			                    	keyValue.add(term.getTerm().toString());
			                    }  
			                }  
				    }
			    }
			} catch (SolrServerException e) {
				e.printStackTrace();
			}
		return keyValue;
    }

实现二：TermsComponent数据统计：

首先在solrconfig.xml里面配置

<searchComponent name="terms" class="solr.TermsComponent"/>

  <!-- A request handler for demonstrating the terms component -->
  <requestHandler name="/terms" class="solr.SearchHandler" startup="lazy">
    <lst name="defaults">
        <bool name="terms">true</bool>
        <bool name="distrib">false</bool>
    </lst>
    <arr name="components">
        <str>terms</str>
    </arr>
  </requestHandler>

通过url连接的方式请求，也可以通过如上的solrj进行查询：

方法1：url方式

http://localhost:8983/solr/test_core/terms?q=*%3A*&wt=json&indent=true&terms.fl=productName

方式2：

SolrQuery params = new SolrQuery();
	params.set("qt", "/terms"); 	
	params.set("terms.fl", "productName");	
        params.set("index", true);			 
	params.set("wt","json");

返回结果：默认按结果的出现频率倒序排序。

{
  "responseHeader":{
    "status":0,
    "QTime":31},
  "terms":{
    "field":[
      "新闻图片",4262,
      "专题",4261,
      "设计",3229,
      "凤凰网",2020,
      "凤凰新媒体",1980,
      "创意",1610,
      "艺术",1170,
      "时尚",1128,
      "nba",1025,
      "新闻",940]
    }
}

相关参数说明

erms={true|false} - Turn on the TermsComponent
terms.fl={FIELD NAME} - Required. The name of the field to get the terms from. May be specified multiple times as terms.fl=field1&terms.fl=field2...
terms.lower={The lower bound term} - Optional. The term to start at. If not specified, the empty string is used, meaning start at the beginning of the field.
terms.lower.incl={true|false} - Optional. Include the lower bound term in the result set. Default is true.
terms.mincount=<Integer> - Optional. The minimum doc frequency to return in order to be included. Results are inclusive of the mincount (i.e. >= mincount)
terms.maxcount=<Integer> - Optional. The maximum doc frequency. Default is -1 to have no upper bound. Results are inclusive of the maxcount (i.e. <= maxcount)
terms.prefix={String} - Optional. Restrict matches to terms that start with the prefix.
terms.regex={String} - Optional. Restrict matches to terms that match the regular expression. Solr3.1
terms.regex.flag={case_insensitive|comments|multiline|literal|dotall|unicode_case|canon_eq|unix_lines} - Optional. Flags to be used when evaluating the regular expression defined in the "terms.regex" parameter (see http://java.sun.com/j2se/1.5.0/docs/api/java/util/regex/Pattern.html#compile%28java.lang.String,%20int%29 fore more details). This parameter can be defined multiple times (each time with different flag) Solr3.1
terms.limit={integer} - The maximum number of terms to return. The default is 10. If < 0, then include all terms.
terms.upper={The upper bound term} - The term to stop at. Either upper or terms.limit must be set.
terms.upper.incl={true|false} - Include the upper bound term in the result set. Default is false.
terms.raw={true|false} - If true, return the raw characters of the indexed term, regardless of if it is human readable. For instance, the indexed form of numeric numbers is not human readable. The default is false.
terms.sort={count|index} - If count, sorts the terms by the term frequency (highest count first). If index, returns the terms in index order. Default is to sort by count.