Article list
To build the Carrot2 Document Clustering Workbench from source: download the Eclipse Target Platform from http://download.carrot2.org/eclipse and extract it to a local folder. Copy local.properties.example from the Carrot2 checkout folder to local.properties in the same folder. In local.properties ...
To use a custom Chinese tokenizer (e.g. jcseg), follow these steps: 1. Download the Carrot2 source code and import it into Eclipse #git clone git://github.com/carrot2/carrot2.git #cd carrot2 #ant -p #ant eclipse 2. Import jcseg into Eclipse and reference it from the carrot2-util-text subproject. ...
Deploy carrot2-webapp: 1. Download the source code #git clone git://github.com/carrot2/carrot2.git 2. Compile #cd carrot2 #ant webapp 3. Deploy #cp tmp/webapp/carrot2-webapp.war /path/to/tomcat/webapps 4. Configure Carrot2 #cd /path/to/tomcat/webapps/carrot2-webapp/WEB-INF/suites #mv ...
1. Configure clustering in solrconfig.xml <searchComponent name="clustering" enable="true" class="solr.clustering.ClusteringComponent" > <lst name="engine"> <str name="name">lingo ...

Solr: Eclipse development environment

    Blog category:
  • Solr
Import the Solr source code into Eclipse: 1. # tar xvf solr-4.10.1-src.tgz 2. #cd solr-4.10.1 3. #ant eclipse (ant, maven, and ivy must be installed, e.g. with #apt-get install ivy) 4. Import the project in Eclipse. An error occurs when #ant eclipse downloads activation-1.1.1.jar (javadoc) because ivy w ...
In my work, I ran into a situation where I wanted mapper A to read a file with two fields (questionId, questionTags) and emit key: questionId, value: questionTags, while mapper B reads a directory containing many files named by questionId, with questionContent as each file's content, and ...
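The two-mapper setup described in this excerpt can be sketched in plain Java, without Hadoop on the classpath. This is only an illustration under stated assumptions: the tab-separated input format, the `TAGS:` and `CONTENT:` source prefixes, the class name `TwoMapperSketch`, and the grouping step that stands in for the shuffle are all hypothetical, since the excerpt is cut off before it says what happens to mapper B's output. In a real job, one common way to attach a different mapper to each input path is `MultipleInputs.addInputPath`.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical stand-in for two Hadoop mappers that emit the same key type.
class TwoMapperSketch {

    // Mapper A (assumed input: "questionId<TAB>questionTags"):
    // emits key = questionId, value = tags, prefixed with the source name.
    // Throws ArrayIndexOutOfBoundsException if the line contains no tab.
    static Map.Entry<String, String> mapTagsLine(String line) {
        String[] fields = line.split("\t", 2);
        return Map.entry(fields[0], "TAGS:" + fields[1]);
    }

    // Mapper B: the file name is the questionId and the file body is the
    // question content; emits key = questionId, value = prefixed content.
    static Map.Entry<String, String> mapContentFile(String fileName, String content) {
        return Map.entry(fileName, "CONTENT:" + content);
    }

    // Stand-in for the shuffle phase: collect all values that share a key,
    // preserving the order in which the pairs were emitted.
    static Map<String, List<String>> groupByKey(List<Map.Entry<String, String>> pairs) {
        Map<String, List<String>> grouped = new TreeMap<>();
        for (Map.Entry<String, String> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>()).add(pair.getValue());
        }
        return grouped;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, String>> pairs = new ArrayList<>();
        pairs.add(mapTagsLine("42\thadoop,mapreduce"));
        pairs.add(mapContentFile("42", "How do I use two mappers in one job?"));
        // Both records for question 42 end up under the same key.
        System.out.println(groupByKey(pairs));
    }
}
```

Prefixing each value with its source lets a downstream reducer tell tags from content once both arrive under the same questionId.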
Reference: https://blogs.oracle.com/datawarehousing/entry/integrating_hadoop_data_with_o
A. Convert text files to sequence files. Class: SequenceFilesFromDirectory. Function: converts a directory of text documents into SequenceFiles of a specified chunkSize. This class takes a parent directory containing subfolders of text documents, recursively reads the files, and creates the {@ ...

Docker: introduction

http://baike.baidu.com/view/11854949.htm
http://www.tuicool.com/articles/fQbeya
References: http://research.microsoft.com/en-us/people/yangsong/tagrecommendation.aspx
    http://blog.csdn.net/java2king/article/details/6010250                 inok http://segmentfault.com/q/1010000000644951#a-1020000000645067hws
import java.io.IOException; import org.apache.commons.logging.Log; import org.apache.commons.logging.LogFactory; import org.apache.hadoop.io.NullWritable; import org.apache.hadoop.io.Text; import org.apache.hadoop.mapreduce.Reducer; import org.apache.hadoop.mapreduce.lib.output.MultipleOutp ...

MySQL: datajoin

MySQL join
http://stackoverflow.com/questions/12956488/hadoop-nosuchmethodexception