- Download
- 浏览: 243671 次
- 性别:
- 来自: 成都
文章分类
- 全部博客 (294)
- Hadoop (34)
- mysql (9)
- operatingsystem (13)
- Hive (8)
- Hue (8)
- Pig (11)
- oozie (8)
- ZooKeeper (1)
- HBase (1)
- Spark (4)
- Impala (1)
- Lily (0)
- Solr (41)
- RS (0)
- Sqoop (10)
- Avro (0)
- Thrift (0)
- HDP (0)
- Bigtop (0)
- Redis (7)
- Java (6)
- Tez (6)
- Ambari (1)
- Mahout (25)
- MongoDB (9)
- Lucene (9)
- Nutch (1)
- Katta (1)
- UIMA (0)
- MediaProcess (1)
- linux (1)
- Design (2)
- AI (1)
- RTR (1)
- Docker (1)
- Splunk (0)
- OpenNLP (1)
- Carrot (3)
- LingPipe (0)
- Weka (0)
- Hama (9)
- CloudStack (0)
- Helix (0)
- Rave (0)
- jclouds (0)
- Giraph (0)
- Drill (0)
- Tajo (0)
- Kafka (3)
- Samza (0)
- Storm (23)
- Flume (15)
- Sifarish (1)
- ML (3)
- android (2)
- Theory (2)
- 系统架构 (2)
- Kiji (1)
- Neo4j (16)
- spanner (0)
- Ejabberd (0)
- Dropwizard (1)
- Tigon (1)
- OrientDB (1)
- Kite (2)
- Jubatus (4)
- Logstash (2)
- Kibana (0)
- Cassandra (0)
- Curator (1)
最新评论
-
oldrat:
https://github.com/oldratlee/tr ...
Kafka: High Qulity Posts
- Build
According to official document, when run build command
$ bin/mkdistro.sh
there are some errors. By default it builds against Hadoop 1.1.1 and the build configuration enforces that JDK 1.6.* is being used to compile and run the testcases.
So do the following steps to fix the problem(You should replace 2.x.0 to your hadoop version)
#cd oozie-4.0.0/
#find . -name pom.xml | xargs sed -ri 's/(2.2.0\-SNAPSHOT)/2.x.0/'
//or you should alter pom.xml manually
//java 1.6
mvn -DskipTests=true -P hadoop-2 clean package assembly:single
//java 1.7
$ mvn clean package assembly:single -Dhadoop.version=2.x.0 \
-DjavaVersion=1.7 -DtargetJavaVersion=1.7 -DskipTests
-Dpig.version=0.12.0 \
-DgenerateDocs
Note: I can build oozie4.0.1 succesfully by the following steps
#tar -xzf oozie-4.0.1.tar.gz
#cd oozie-4.0.1
#mvn clean package assembly:single -Dhadoop.version=2.3.0 \
-DjavaVersion=1.7 -DtargetJavaVersion=1.7 -DskipTests -Dpig.version=0.12.0 \
-DgenerateDocs
You should change your system language to english, otherwise some pom or jar cannot download from
couldera site.
Server Installation
#cd oozie-4.0.1/distro/target
#cp oozie-4.0.1-distro.tar.gz /path/to/oozie-install-dir
#cp oozie-4.0.1/hadooplibs/target/oozie-4.0.1-hadooplibs.tar.gz /path/to/oozie-install-dir
#cd /path/to/oozie-install-dir
#tar -xzf oozie-4.0.1-distro.tar.gz
#tar -xzf oozie-4.0.1-hadooplibs.tar.gz
#cd oozie-4.0.1/
#mkdir libext
#cp hadooplibs/hadooplib-2.3.0.oozie-4.0.1/* libext/
#cp /path/to/ext-2.2.zip libext/
#cp /path/to/mysql-connector-java-5.1.29-bin.jar libext/
#tree -L 1
#bin/oozie-setup.sh prepare-war #bin/oozie-setup.sh sharelib create -fs hdfs://192.168.122.1:2014 #bin/oozie-setup.sh db create -run
Expand the oozie-sharelib TAR.GZ file bundled with the distribution.
The share/ directory must be copied to the Oozie HOME directory in HDFS:
$ hadoop fs -put share share
IMPORTANT: This must be done using the Oozie Hadoop (HDFS) user. If a share directory already exists in HDFS, it must be deleted before copying it again.
$ bin/oozied.sh start
$ bin/oozied.sh run
$ bin/oozie admin -oozie http://localhost:11000/oozie -status
http://zhj-c0:11000/oozie/
Client Installation
expand the oozie-client TAR.GZ file bundled with the distribution. Add the bin/ directory to the PATH .
#oozie
ref:
https://oozie.apache.org/docs/4.0.1/DG_QuickStart.html
http://oozie.apache.org/docs/4.0.0/DG_QuickStart.html
https://oozie.apache.org/docs/4.0.0/ENG_Building.html
http://archive.cloudera.com/cdh/3/oozie/AG_Install.html
http://stackoverflow.com/questions/22077120/install-oozie-on-hadoop-2-2
grepalex.com/2014/02/16/oozie-and-hadoop-2.2/
http://mail-archives.apache.org/mod_mbox/oozie-user/201312.mbox/%3CCEDDEB80.76815%25chitnis@yahoo-inc.com%3E
http://www.hadoopinrealworld.com/building-running-and-testing-apache-oozie-4-0-0-2/
https://issues.apache.org/jira/browse/OOZIE-1551
https://issues.apache.org/jira/
发表评论
-
Add third party jars in a job
2014-08-18 15:10 1336When I submit a java job (inc ... -
Oozie: sqoop 1.4.4 sharelib
2014-04-11 13:31 810ENV : oozie4.0.1 hadoop2.3.0 ... -
Oozie: Using Mysql to store oozie metadata
2014-04-04 14:50 6341. configure oozie-site.xml ... -
Oozie: configuration
2014-04-04 12:42 719---conf/oozie-site.xml--- ... -
Oozie: Run examples
2014-04-04 12:43 849#cd /path/to/oozie-4.0.1 #t ... -
oozie: common errors
2014-04-04 10:41 18721. when run oozie examples, t ... -
Oozie: Sqoop Action Extension
2014-04-03 16:04 0Refere ... -
oozie:Expression Language Functions
2014-04-03 15:50 0... -
oozie: Workflow
2014-04-03 16:30 956Workflow Definition A workfl ... -
oozie:Command Line Interface Utilities
2014-04-03 14:16 0Refere ...
相关推荐
- **Hadoop 兼容性**:Oozie 4.1 支持 Hadoop 2.x 版本,这意味着它可以与 YARN 集成,利用 YARN 的资源管理和调度能力。 - **改进的性能**:在这一版本中,Oozie 对作业提交和调度进行了优化,减少了延迟,提高了...
oozie-5.1.0.tar.gz 编译结果,受限上传大小,文件分3部分
8. 启动Oozie服务器:`start-oozie.sh` 五、Oozie使用技巧 1. 工作流设计:熟练掌握DAG(有向无环图)结构,合理规划任务间依赖关系。 2. 错误处理:利用Oozie的错误节点和重试策略,提高作业的健壮性。 3. 日志...
Oozie 是一个针对Apache Hadoop生态系统的开源工作流调度系统,用于管理和协调Hadoop集群中的各种作业和任务。在4.1.0版本中,Oozie提供了一个灵活、可扩展的框架,允许用户定义和执行复杂的作业流程,支持多种类型...
ERROR : FAILED: Execution Error, return code 30041 from org.apache.hadoop.hive.ql.exec.spark.SparkTask. 前言报错信息异常分析配置改动后记 前言 在成功消除Cloudare管理界面上那些可恶的警告之后,我又对yarn...
《Hadoop中级教程(2.x)》是一份深入学习Hadoop技术的重要资料,涵盖了Hadoop在2.x版本中的核心概念、配置以及实际操作。这份教程包括了多个子文件,如B01_Eclipse的Hadoop插件安装与使用、B02_Hadoop2.x Namenode...
Hadoop 2.6.0是Hadoop发展过程中的一个重要版本,它在Hadoop 2.x系列中引入了许多改进和优化,包括提升系统性能、增强容错性和稳定性,以及提供了更丰富的功能。CDH(Cloudera Distribution Including Apache Hadoop...
- **Oozie**:工作流调度系统,管理Hadoop作业和协调其他Hadoop相关的工具。 6. **监控与维护** - **Ambari**:一个Web界面,用于Hadoop集群的安装、配置、管理和监控。 - **Ganglia**或**Nagios**:监控系统,...
Oozie由Cloudera公司贡献给Apache的基于工作流引擎的开源框架,是用于Hadoop平台的开源的工作流调度引擎,是用来管理Hadoop作业,属于web应用程序,
- **Oozie**:工作流管理系统,协调Hadoop作业的执行。 4. **Hadoop的安装与配置**: - **环境变量配置**:设置HADOOP_HOME、PATH等环境变量,指向Hadoop安装目录。 - **配置文件修改**:如core-site.xml、hdfs-...
Apache Hadoop 中,Oozie 是一个至关重要的组件,它是一个开源的工作流调度引擎,专门设计用于管理和协调在Hadoop生态系统中的各种作业。Oozie 由Cloudera公司贡献给Apache社区,它是一个基于工作流引擎的系统,能够...
By the end of the book, you will have a good knowledge of Apache Oozie. You will be capable of using Oozie to handle large Hadoop workflows and even improve the availability of your Hadoop environment...
2. **调度器**:Oozie 提供了一个强大的调度引擎,可以按照时间间隔、数据触发或其他条件自动启动工作流。用户可以通过CRON表达式定义定时任务,实现对Hadoop作业的精确调度。 3. **工作流监控**:Oozie 提供了Web...
例如,`oozie.service.JPAService.jdbc.url`用于指定Oozie数据库的URL,`oozie.service.HadoopAccessorService.hadoop.configurations.*.configuration`用于配置Oozie访问不同Hadoop版本的安全设置。 2. **oozie-...
linux.64 下的 oozie-4.3.0.tar.gz 源码包 解压后编译即可 注意对应的版本 cd bin ./mkdistro.sh -Phadoop-2 -Dhadoop.auth.version=2.8.2 -Ddistcp.version=2.8.2 -Dhadoop.version=2.8.2 -Dsqoop.version=1.4.6 -...
2. **任务类型支持**:Oozie支持多种Hadoop任务类型,包括MapReduce、Pig、Hive、Sqoop、Java动作等,覆盖了Hadoop生态系统的大部分任务需求。 3. **表达能力**:Oozie的XML配置支持EL(Expression Language)常量...