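Keep the following points in mind:
1. Make sure the Windows environment variables point to the correct JDK directory.
2. Compiling build.xml fails with this error:
Nutch\nutch-0.9\build.xml:61: Specify at least one source--a file or resource collection.
To fix it, delete the lines that precede the block below, starting at line 61 of build.xml and ending at the line just before <copy todir="${conf.dir}" verbose="true">.
The block itself stays in place: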
<copy todir="${conf.dir}" verbose="true">
<fileset dir="${conf.dir}" includes="**/*.template"/>
<mapper type="glob" from="*.template" to="*"/>
</copy>
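3. Copy all files from the Nutch-1.0 directory into the nutch directory.
4. Create the project with "Java Project from Existing Ant Buildfile" and select build.xml in the Nutch-1.0 root directory. Running the build then produces the following output: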
Buildfile: G:\workspace\Nutch\build.xml
init:
[unjar] Expanding: G:\workspace\Nutch\lib\hadoop-0.19.1-core.jar into G:\workspace\Nutch\build\hadoop
[untar] Expanding: G:\workspace\Nutch\build\hadoop\bin.tgz into G:\workspace\Nutch\bin
[unjar] Expanding: G:\workspace\Nutch\lib\hadoop-0.19.1-core.jar into G:\workspace\Nutch\build
compile-core:
compile-plugins:
deploy:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: clustering-carrot2
jar:
deps-test:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: creativecommons
jar:
deps-test:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: parse-html
jar:
deps-test:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: feed
jar:
deps-test:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: protocol-file
jar:
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: index-basic
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: index-anchor
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: index-more
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: field-basic
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: field-boost
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
[echo] Copying language profiles
[echo] Copying test files
deps-jar:
compile:
[echo] Compiling plugin: language-identifier
jar:
deps-test:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: parse-html
jar:
deps-test:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: lib-http
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: lib-parsems
jar:
deps-test:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: lib-regex-filter
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: microformats-reltag
jar:
deps-test:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: ontology
jar:
deps-test:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: protocol-file
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: protocol-ftp
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: lib-http
jar:
compile:
[echo] Compiling plugin: protocol-http
jar:
deps-test:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: lib-http
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: lib-http
jar:
compile:
[echo] Compiling plugin: protocol-httpclient
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: parse-ext
jar:
deps-test:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: protocol-file
jar:
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: parse-html
jar:
deps-test:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: parse-js
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: lib-parsems
jar:
compile:
[echo] Compiling plugin: parse-msexcel
jar:
deps-test:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: lib-parsems
jar:
deps-test:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: protocol-file
jar:
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: lib-parsems
jar:
compile:
[echo] Compiling plugin: parse-mspowerpoint
jar:
deps-test:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: lib-parsems
jar:
deps-test:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: protocol-file
jar:
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: lib-parsems
jar:
compile:
[echo] Compiling plugin: parse-msword
jar:
deps-test:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: lib-parsems
jar:
deps-test:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: protocol-file
jar:
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: parse-oo
jar:
deps-test:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: parse-pdf
jar:
deps-test:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: protocol-file
jar:
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
compile:
jar:
compile:
[echo] Compiling plugin: parse-rss
jar:
deps-test:
init:
init-plugin:
compile:
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: protocol-file
jar:
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: parse-swf
jar:
deps-test:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: protocol-file
jar:
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: parse-text
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: parse-zip
jar:
deps-test:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: protocol-file
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: parse-text
jar:
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: query-basic
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: query-more
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: query-site
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: query-custom
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: query-url
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: response-json
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: response-xml
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: scoring-opic
jar:
deps-test:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: scoring-link
jar:
deps-test:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: summary-basic
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: subcollection
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: summary-lucene
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: tld
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: lib-regex-filter
jar:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: lib-regex-filter
compile-test:
compile:
[echo] Compiling plugin: urlfilter-automaton
jar:
deps-test:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: lib-regex-filter
jar:
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: urlfilter-domain
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: urlfilter-prefix
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: lib-regex-filter
jar:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: lib-regex-filter
compile-test:
compile:
[echo] Compiling plugin: urlfilter-regex
jar:
deps-test:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: lib-regex-filter
jar:
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: urlfilter-suffix
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: urlfilter-validator
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: urlnormalizer-basic
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: urlnormalizer-pass
jar:
deps-test:
deploy:
copy-generated-lib:
init:
init-plugin:
deps-jar:
compile:
[echo] Compiling plugin: urlnormalizer-regex
jar:
deps-test:
init:
init-plugin:
compile:
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
deps-test:
deploy:
copy-generated-lib:
deploy:
copy-generated-lib:
compile:
job:
BUILD SUCCESSFUL
Total time: 4 seconds
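
A log this long is hard to check by eye. The following is a minimal sketch, not part of the original build, of how the output could be summarized: it counts how often each plugin is reported as compiled, counts the [jar] warnings, and checks for the BUILD SUCCESSFUL line. The function name summarize_ant_log and the SAMPLE excerpt are hypothetical; the patterns only assume the line shapes visible in the log above.

```python
import re
from collections import Counter

# Patterns match the line shapes seen in the Ant output above.
PLUGIN_RE = re.compile(r"\[echo\] Compiling plugin: (\S+)")
JAR_WARNING_RE = re.compile(r"\[jar\] Warning: ")


def summarize_ant_log(text):
    """Return (plugin_counts, jar_warning_count, succeeded) for an Ant console log."""
    plugins = Counter()
    warnings = 0
    succeeded = False
    for raw_line in text.splitlines():
        line = raw_line.strip()
        match = PLUGIN_RE.search(line)
        if match:
            # The same plugin can be reported several times in one run.
            plugins[match.group(1)] += 1
        elif JAR_WARNING_RE.search(line):
            warnings += 1
        elif line == "BUILD SUCCESSFUL":
            succeeded = True
    return plugins, warnings, succeeded


# Hypothetical excerpt: a few lines taken from the log above, shortened for illustration.
SAMPLE = r"""compile:
[echo] Compiling plugin: parse-html
jar:
[jar] Warning: skipping jar archive G:\workspace\Nutch\build\nutch-extensionpoints\nutch-extensionpoints.jar because no files were included.
compile:
[echo] Compiling plugin: lib-http
compile:
[echo] Compiling plugin: parse-html
BUILD SUCCESSFUL
Total time: 4 seconds
"""

if __name__ == "__main__":
    plugin_counts, warning_count, ok = summarize_ant_log(SAMPLE)
    for name, count in sorted(plugin_counts.items()):
        print(name, count)
    print("jar warnings:", warning_count)
    print("build succeeded:", ok)
```

To run it against a real build, save the console output to a text file and pass the file's contents to summarize_ant_log. In the log above, the only warnings are the repeated "skipping jar archive" messages for nutch-extensionpoints.jar, and the build still ends with BUILD SUCCESSFUL.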