Article list
To build Carrot2 Document Clustering Workbench from source code:
Download Eclipse Target Platform from http://download.carrot2.org/eclipse and extract to some local folder.
Copy local.properties.example from the Carrot2 checkout folder to local.properties in the same folder. In local.properties ...
To use a custom Chinese tokenizer (e.g. jcseg), follow these steps:
1. Download the Carrot2 source code and import it into Eclipse
#git clone git://github.com/carrot2/carrot2.git
#cd carrot2
#ant -p
#ant eclipse
2. Import jcseg into Eclipse and reference it from the carrot2-util-text subproject. ...
Deploy carrot2-webapp
1. Download the source code
#git clone git://github.com/carrot2/carrot2.git
2. Compile
#cd carrot2
#ant webapp
3. Deploy
#cp tmp/webapp/carrot2-webapp.war /path/to/tomcat/webapps
4. Configure Carrot2
#cd /path/to/tomcat/webapps/carrot2-webapp/WEB-INF/suites
#mv ...
1. Configure clustering in solrconfig.xml
<searchComponent name="clustering"
enable="true"
class="solr.clustering.ClusteringComponent" >
<lst name="engine">
<str name="name">lingo ...
Solr: Eclipse development environment
Import the Solr source code into Eclipse
1. # tar xvf solr-4.10.1-src.tgz
2. #cd solr-4.10.1
3. #ant eclipse (requires ant, maven and ivy to be installed, e.g. via #apt-get install ivy)
4. Import the project in Eclipse
An error occurs when running #ant eclipse:
download activation-1.1.1.jar(javadoc)
because ivy w ...
In my work, I ran into a situation where I wanted mapper A to read a file with two fields (questionId, questionTags) and emit key: questionId, value: questionTags, while mapper B reads a directory containing many files, each named by its questionId with the questionContent as the file content, and ...
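The excerpt above is cut off, so the following is only a rough sketch of one plausible reading: two mappers emit values under the same questionId key, and a reducer combines them. It uses plain Java collections instead of Hadoop so that it runs on its own. The class name `TaggedJoinSketch`, the `T:`/`C:` value prefixes and the tab-separated output are all assumptions for illustration, not the author's code. In a real job, Hadoop's `MultipleInputs` class is the usual way to bind a different mapper to each input path.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, Hadoop-free sketch of a reduce-side join keyed by questionId.
class TaggedJoinSketch {

    // Stands in for mapper A: "questionId<TAB>questionTags" -> (questionId, "T:" + tags).
    static String[] mapTagsLine(String line) {
        String[] fields = line.split("\t", 2);
        String tags = fields.length < 2 ? "" : fields[1];
        return new String[] { fields[0], "T:" + tags };
    }

    // Stands in for mapper B: a file named by questionId -> (questionId, "C:" + content).
    static String[] mapContentFile(String fileName, String content) {
        return new String[] { fileName, "C:" + content };
    }

    // Stands in for the shuffle phase: groups all emitted values by key.
    static Map<String, List<String>> shuffle(List<String[]> pairs) {
        Map<String, List<String>> grouped = new TreeMap<>();
        for (String[] pair : pairs) {
            grouped.computeIfAbsent(pair[0], k -> new ArrayList<>()).add(pair[1]);
        }
        return grouped;
    }

    // Stands in for the reducer: uses the prefix to tell which mapper a value came from.
    static String reduce(String questionId, List<String> values) {
        String tags = "";
        String content = "";
        for (String value : values) {
            if (value.startsWith("T:")) {
                tags = value.substring(2);
            } else if (value.startsWith("C:")) {
                content = value.substring(2);
            }
        }
        return questionId + "\t" + tags + "\t" + content;
    }

    public static void main(String[] args) {
        List<String[]> emitted = new ArrayList<>();
        emitted.add(mapTagsLine("42\tjava,hadoop"));
        emitted.add(mapContentFile("42", "How do I join two inputs?"));
        for (Map.Entry<String, List<String>> e : shuffle(emitted).entrySet()) {
            System.out.println(reduce(e.getKey(), e.getValue()));
        }
    }
}
```

Tagging each value with its origin is what lets a single reducer tell the two inputs apart once the shuffle has merged them under one key.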
Reference
https://blogs.oracle.com/datawarehousing/entry/integrating_hadoop_data_with_o
A. Convert text files to sequence files
Class: SequenceFilesFromDirectory
Functions: Converts a directory of text documents into SequenceFiles of a specified chunkSize. This class takes in a parent directory containing subfolders of text documents, recursively reads the files, and creates the {@ ...
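To make the idea of turning a directory tree into key/value records concrete, here is a minimal sketch in plain Java, without Mahout or Hadoop. It walks a parent directory recursively and builds one (key, value) pair per file. The class name `DirToPairsSketch`, the choice of the relative path as the key, and the UTF-8 decoding are assumptions for illustration. The sketch ignores chunkSize and writes no SequenceFile.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical sketch: directory of text files -> (relative path, file content) pairs.
class DirToPairsSketch {

    static Map<String, String> readTree(Path parent) throws IOException {
        List<Path> files;
        // Files.walk descends into subfolders; keep regular files only.
        try (Stream<Path> walk = Files.walk(parent)) {
            files = walk.filter(Files::isRegularFile).collect(Collectors.toList());
        }
        Map<String, String> pairs = new TreeMap<>();
        for (Path file : files) {
            // Key: path relative to the parent, with forward slashes (an assumption of this sketch).
            String key = "/" + parent.relativize(file).toString().replace('\\', '/');
            // Value: the whole document, decoded as UTF-8.
            String value = new String(Files.readAllBytes(file), StandardCharsets.UTF_8);
            pairs.put(key, value);
        }
        return pairs;
    }

    public static void main(String[] args) throws IOException {
        Path parent = Files.createTempDirectory("docs");
        Files.createDirectories(parent.resolve("topicA"));
        Files.write(parent.resolve("topicA").resolve("1.txt"),
                "first document".getBytes(StandardCharsets.UTF_8));
        readTree(parent).forEach((k, v) -> System.out.println(k + " => " + v));
    }
}
```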
Docker: introduction
http://baike.baidu.com/view/11854949.htm
http://www.tuicool.com/articles/fQbeya
References
http://research.microsoft.com/en-us/people/yangsong/tagrecommendation.aspx
http://blog.csdn.net/java2king/article/details/6010250
inok
http://segmentfault.com/q/1010000000644951#a-1020000000645067hws
import java.io.IOException;
import org.apache.commons.logging.Log;
import org.apache.commons.logging.LogFactory;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.output.MultipleOutp ...
http://stackoverflow.com/questions/12956488/hadoop-nosuchmethodexception