- 浏览: 218829 次
- 性别:
- 来自: 北京
文章分类
- 全部博客 (114)
- hbase (3)
- akka (7)
- hdfs (6)
- mapreduce (1)
- hive (0)
- zookeeper (8)
- storm (0)
- geese (0)
- leaf (0)
- stormbase (0)
- scala (2)
- oozie (11)
- zeromq (1)
- netty (3)
- mongodb (0)
- sqoop (2)
- flume (3)
- mahout (1)
- redis (0)
- lucene (1)
- solr (1)
- ganglia (3)
- 分布式理论 (2)
- hadoop (42)
- others (14)
- mq (1)
- clojure (3)
- flume ng (1)
- linux (1)
- esper (0)
最新评论
-
javalogo:
[b][i][u]引用[list]
[*][*][flash= ...
什么是Flume -
leibnitz:
what are they meanings
Hadoop Ganglia Metric Item -
di1984HIT:
没用过啊。
akka 介绍-Actor 基础 -
di1984HIT:
写的不错。
Hadoop管理-集群维护 -
developerinit:
很好,基本上介绍了
什么是Flume
blocksize:35M
filesize 96M
zk-session-timeout:10s
logs:
active nn:Wed Sep 5 13:20:25 CST 2012
zk:
[zk: localhost:2181(CONNECTED) 19] get /hadoop-ha/mycluster/ActiveStandbyElectorLock
myclusternn1bd10 \ufffdF(\ufffd>
cZxid = 0xd90
ctime = Wed Sep 05 13:20:58 CST 2012
mZxid = 0xd90
mtime = Wed Sep 05 13:20:58 CST 2012
pZxid = 0xd90
cversion = 0
dataVersion = 0
aclVersion = 0
ephemeralOwner = 0x13971759a9a045a
dataLength = 28
numChildren = 0
[zk: localhost:2181(CONNECTED) 20] get /hadoop-ha/mycluster/Active
ActiveBreadCrumb ActiveStandbyElectorLock
[zk: localhost:2181(CONNECTED) 20] get /hadoop-ha/mycluster/ActiveBreadCrumb
myclusternn1bd10 \ufffdF(\ufffd>
cZxid = 0x41
ctime = Thu Aug 30 09:50:56 CST 2012
mZxid = 0xd93
mtime = Wed Sep 05 13:21:13 CST 2012
pZxid = 0x41
cversion = 0
dataVersion = 89
aclVersion = 0
ephemeralOwner = 0x0
dataLength = 28
numChildren = 0
client
copy 0 ...
Wed Sep 5 13:18:45 CST 2012
copy 1 ...
Wed Sep 5 13:18:55 CST 2012
copy 2 ...
Wed Sep 5 13:19:16 CST 2012
copy 3 ...
Wed Sep 5 13:19:50 CST 2012
copy 4 ...
Wed Sep 5 13:20:09 CST 2012
12/09/05 13:20:49 WARN retry.RetryInvocationHandler: Exception while invoking addBlock of class ClientNamenodeProtocolTranslatorPB. Trying to fail over immediately.
12/09/05 13:20:49 WARN retry.RetryInvocationHandler: Exception while invoking addBlock of class ClientNamenodeProtocolTranslatorPB after 1 fail over attempts. Trying to fail over after sleeping for 643ms.
12/09/05 13:21:09 WARN retry.RetryInvocationHandler: Exception while invoking renewLease of class ClientNamenodeProtocolTranslatorPB. Trying to fail over immediately.
12/09/05 13:21:09 WARN retry.RetryInvocationHandler: A failover has occurred since the start of this method invocation attempt.
12/09/05 13:21:12 WARN retry.RetryInvocationHandler: Exception while invoking addBlock of class ClientNamenodeProtocolTranslatorPB after 2 fail over attempts. Trying to fail over after sleeping for 1851ms.
12/09/05 13:21:15 WARN retry.RetryInvocationHandler: Exception while invoking renewLease of class ClientNamenodeProtocolTranslatorPB after 1 fail over attempts. Trying to fail over after sleeping for 549ms.
12/09/05 13:21:15 WARN retry.RetryInvocationHandler: A failover has occurred since the start of this method invocation attempt.
copy 5 ...
Wed Sep 5 13:21:18 CST 2012
copy 6 ...
Wed Sep 5 13:21:25 CST 2012
blocksize:35M
filesize 96M
zk-session-timeout:10s
Active NN:Wed Sep 5 13:51:28 CST 2012
zk:
[zk: localhost:2181(CONNECTED) 30] get /hadoop-ha/mycluster/ActiveStandbyElectorLock
myclusternn2bd09 \ufffdF(\ufffd>
cZxid = 0xd9b
ctime = Wed Sep 05 13:51:38 CST 2012
mZxid = 0xd9b
mtime = Wed Sep 05 13:51:38 CST 2012
pZxid = 0xd9b
cversion = 0
dataVersion = 0
aclVersion = 0
ephemeralOwner = 0x13971759a9a045c
dataLength = 28
numChildren = 0
[zk: localhost:2181(CONNECTED) 31] get /hadoop-ha/mycluster/ActiveBreadCrumb
myclusternn2bd09 \ufffdF(\ufffd>
cZxid = 0x41
ctime = Thu Aug 30 09:50:56 CST 2012
mZxid = 0xd9c
mtime = Wed Sep 05 13:52:07 CST 2012
pZxid = 0x41
cversion = 0
dataVersion = 91
aclVersion = 0
ephemeralOwner = 0x0
dataLength = 28
numChildren = 0
[zk: localhost:2181(CONNECTED) 32]
client:
copy 0 ...
Wed Sep 5 13:50:30 CST 2012
copy 1 ...
Wed Sep 5 13:50:42 CST 2012
copy 2 ...
Wed Sep 5 13:51:01 CST 2012
copy 3 ...
Wed Sep 5 13:51:27 CST 2012
12/09/05 13:51:49 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 1 fail over attempts. Trying to fail over after sleeping for 703ms.
12/09/05 13:52:02 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 2 fail over attempts. Trying to fail over after sleeping for 1761ms.
12/09/05 13:52:04 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 3 fail over attempts. Trying to fail over after sleeping for 2651ms.
12/09/05 13:52:09 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 4 fail over attempts. Trying to fail over after sleeping for 9203ms.
copy 4 ...
Wed Sep 5 13:52:22 CST 2012
copy 5 ...
Wed Sep 5 13:52:52 CST 2012
copy 6 ...
Wed Sep 5 13:53:07 CST 2012
copy 7 ...
Wed Sep 5 13:53:19 CST 2012
blocksize:35M
filesize 96M
zk-session-timeout:10M
Active NN shutdown:Wed Sep 5 15:46:43 CST 2012
zk:
[zk: localhost:2181(CONNECTED) 1] get /hadoop-ha/mycluster/ActiveStandbyElectorLock
myclusternn2bd09 \ufffdF(\ufffd>
cZxid = 0xdbe
ctime = Wed Sep 05 15:47:20 CST 2012
mZxid = 0xdbe
mtime = Wed Sep 05 15:47:20 CST 2012
pZxid = 0xdbe
cversion = 0
dataVersion = 0
aclVersion = 0
ephemeralOwner = 0x13971759a9a0463
dataLength = 28
numChildren = 0
[zk: localhost:2181(CONNECTED) 2] get /hadoop-ha/mycluster/ActiveBreadCrumb
myclusternn2bd09 \ufffdF(\ufffd>
cZxid = 0x41
ctime = Thu Aug 30 09:50:56 CST 2012
mZxid = 0xdbf
mtime = Wed Sep 05 15:47:26 CST 2012
pZxid = 0x41
cversion = 0
dataVersion = 99
aclVersion = 0
ephemeralOwner = 0x0
dataLength = 28
numChildren = 0
[zk: localhost:2181(CONNECTED) 3]
client:
copy 0 ...
Wed Sep 5 15:42:51 CST 2012
copy 1 ...
Wed Sep 5 15:43:00 CST 2012
copy 2 ...
Wed Sep 5 15:43:46 CST 2012
copy 3 ...
Wed Sep 5 15:43:52 CST 2012
copy 4 ...
Wed Sep 5 15:44:11 CST 2012
copy 5 ...
Wed Sep 5 15:44:47 CST 2012
copy 6 ...
Wed Sep 5 15:45:28 CST 2012
copy 7 ...
Wed Sep 5 15:45:51 CST 2012
copy 8 ...
Wed Sep 5 15:46:08 CST 2012
copy 9 ...
Wed Sep 5 15:46:35 CST 2012
12/09/05 15:47:09 WARN retry.RetryInvocationHandler: Exception while invoking addBlock of class ClientNamenodeProtocolTranslatorPB. Trying to fail over immediately.
12/09/05 15:47:09 WARN retry.RetryInvocationHandler: Exception while invoking addBlock of class ClientNamenodeProtocolTranslatorPB after 1 fail over attempts. Trying to fail over after sleeping for 971ms.
12/09/05 15:47:29 WARN retry.RetryInvocationHandler: Exception while invoking renewLease of class ClientNamenodeProtocolTranslatorPB. Trying to fail over immediately.
12/09/05 15:47:29 WARN retry.RetryInvocationHandler: A failover has occurred since the start of this method invocation attempt.
12/09/05 15:47:41 WARN retry.RetryInvocationHandler: Exception while invoking addBlock of class ClientNamenodeProtocolTranslatorPB after 2 fail over attempts. Trying to fail over after sleeping for 2610ms.
12/09/05 15:47:53 WARN retry.RetryInvocationHandler: Exception while invoking renewLease of class ClientNamenodeProtocolTranslatorPB after 1 fail over attempts. Trying to fail over after sleeping for 1440ms.
copy 10 ...
Wed Sep 5 15:47:53 CST 2012
copy 11 ...
Wed Sep 5 15:48:00 CST 2012
zkfc:
2012-09-05 15:41:57,592 INFO org.apache.hadoop.ha.ZKFailoverController: Successfully transitioned NameNode at bd09/10.1.1.83:9000 to standby state
2012-09-05 15:47:19,975 INFO org.apache.hadoop.ha.ActiveStandbyElector: Checking for any old active which needs to be fenced...
2012-09-05 15:47:20,001 INFO org.apache.hadoop.ha.ActiveStandbyElector: Old node exists: 0a096d79636c757374657212036e6e311a046264313020a84628d33e
2012-09-05 15:47:20,003 INFO org.apache.hadoop.ha.ZKFailoverController: Should fence: NameNode at bd10/10.1.1.144:9000
2012-09-05 15:47:23,272 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: bd10/10.1.1.144:9000. Already tried 0 time(s).
2012-09-05 15:47:26,274 WARN org.apache.hadoop.ha.FailoverController: Unable to gracefully make NameNode at bd10/10.1.1.144:9000 standby (unable to connect)
java.net.NoRouteToHostException: No Route to Host from bd09/10.1.1.83 to bd10:9000 failed on socket timeout exception: java.net.NoRouteToHostException: No route to host; For more details see: http://wiki.apache.org/hadoop/NoRouteToHost
at org.apache.hadoop.net.NetUtils.wrapException(NetUtils.java:756)
at org.apache.hadoop.ipc.Client.call(Client.java:1165)
at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:184)
at $Proxy8.transitionToStandby(Unknown Source)
at org.apache.hadoop.ha.protocolPB.HAServiceProtocolClientSideTranslatorPB.transitionToStandby(HAServiceProtocolClientSideTranslatorPB.java:112)
at org.apache.hadoop.ha.FailoverController.tryGracefulFence(FailoverController.java:154)
at org.apache.hadoop.ha.ZKFailoverController.doFence(ZKFailoverController.java:510)
at org.apache.hadoop.ha.ZKFailoverController.fenceOldActive(ZKFailoverController.java:501)
at org.apache.hadoop.ha.ZKFailoverController.access$1100(ZKFailoverController.java:59)
at org.apache.hadoop.ha.ZKFailoverController$ElectorCallbacks.fenceOldActive(ZKFailoverController.java:838)
at org.apache.hadoop.ha.ActiveStandbyElector.fenceOldActive(ActiveStandbyElector.java:859)
at org.apache.hadoop.ha.ActiveStandbyElector.becomeActive(ActiveStandbyElector.java:760)
at org.apache.hadoop.ha.ActiveStandbyElector.processResult(ActiveStandbyElector.java:407)
at org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:609)
at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:497)
Caused by: java.net.NoRouteToHostException: No route to host
at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:567)
at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206)
at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:524)
at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:489)
at org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:472)
at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:566)
at org.apache.hadoop.ipc.Client$Connection.access$2000(Client.java:215)
at org.apache.hadoop.ipc.Client.getConnection(Client.java:1271)
at org.apache.hadoop.ipc.Client.call(Client.java:1141)
... 13 more
2012-09-05 15:47:26,275 INFO org.apache.hadoop.ha.NodeFencer: ====== Beginning Service Fencing Process... ======
2012-09-05 15:47:26,275 INFO org.apache.hadoop.ha.NodeFencer: Trying method 1/1: org.apache.hadoop.ha.ShellCommandFencer(/opt/hadoop/etc/hadoop/fencing.sh)
2012-09-05 15:47:26,319 INFO org.apache.hadoop.ha.ShellCommandFencer: Launched fencing command '/opt/hadoop/etc/hadoop/fencing.sh' with pid 2777
2012-09-05 15:47:26,321 INFO org.apache.hadoop.ha.ShellCommandFencer: [PID 2777] /opt/had...encing.sh: OK
2012-09-05 15:47:26,321 INFO org.apache.hadoop.ha.NodeFencer: ====== Fencing successful by method org.apache.hadoop.ha.ShellCommandFencer(/opt/hadoop/etc/hadoop/fencing.sh) ======
2012-09-05 15:47:26,321 INFO org.apache.hadoop.ha.ActiveStandbyElector: Writing znode /hadoop-ha/mycluster/ActiveBreadCrumb to indicate that the local node is the most recent active...
2012-09-05 15:47:26,325 INFO org.apache.hadoop.ha.ZKFailoverController: Trying to make NameNode at bd09/10.1.1.83:9000 active...
2012-09-05 15:47:40,960 INFO org.apache.hadoop.ha.ZKFailoverController: Successfully transitioned NameNode at bd09/10.1.1.83:9000 to active state
another client @ha switch
./runRW.sh
Wed Sep 5 16:19:56 CST 2012
12/09/05 16:20:09 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 1 fail over attempts. Trying to fail over after sleeping for 958ms.
12/09/05 16:20:10 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 2 fail over attempts. Trying to fail over after sleeping for 2238ms.
12/09/05 16:20:15 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 3 fail over attempts. Trying to fail over after sleeping for 3806ms.
12/09/05 16:20:19 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 4 fail over attempts. Trying to fail over after sleeping for 6348ms.
12/09/05 16:20:29 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 5 fail over attempts. Trying to fail over after sleeping for 20625ms.
Found 6 items
-rw-r--r-- 3 peter supergroup 100291546 2012-09-05 16:15 4
drwxr-xr-x - peter supergroup 0 2012-08-24 17:03 abc
-rw-r--r-- 3 peter supergroup 100291546 2012-09-05 15:10 hadoop-2.0.0-cdh4.0.0.tar.gz
drwxr-xr-x - peter supergroup 0 2012-09-04 16:41 smallfiles
drwxr-xr-x - peter supergroup 0 2012-08-28 16:57 smallfiles1
Deleted 4
Wed Sep 5 16:20:57 CST 2012
filesize 96M
zk-session-timeout:10s
logs:
active nn:Wed Sep 5 13:20:25 CST 2012
zk:
[zk: localhost:2181(CONNECTED) 19] get /hadoop-ha/mycluster/ActiveStandbyElectorLock
myclusternn1bd10 \ufffdF(\ufffd>
cZxid = 0xd90
ctime = Wed Sep 05 13:20:58 CST 2012
mZxid = 0xd90
mtime = Wed Sep 05 13:20:58 CST 2012
pZxid = 0xd90
cversion = 0
dataVersion = 0
aclVersion = 0
ephemeralOwner = 0x13971759a9a045a
dataLength = 28
numChildren = 0
[zk: localhost:2181(CONNECTED) 20] get /hadoop-ha/mycluster/Active
ActiveBreadCrumb ActiveStandbyElectorLock
[zk: localhost:2181(CONNECTED) 20] get /hadoop-ha/mycluster/ActiveBreadCrumb
myclusternn1bd10 \ufffdF(\ufffd>
cZxid = 0x41
ctime = Thu Aug 30 09:50:56 CST 2012
mZxid = 0xd93
mtime = Wed Sep 05 13:21:13 CST 2012
pZxid = 0x41
cversion = 0
dataVersion = 89
aclVersion = 0
ephemeralOwner = 0x0
dataLength = 28
numChildren = 0
client
copy 0 ...
Wed Sep 5 13:18:45 CST 2012
copy 1 ...
Wed Sep 5 13:18:55 CST 2012
copy 2 ...
Wed Sep 5 13:19:16 CST 2012
copy 3 ...
Wed Sep 5 13:19:50 CST 2012
copy 4 ...
Wed Sep 5 13:20:09 CST 2012
12/09/05 13:20:49 WARN retry.RetryInvocationHandler: Exception while invoking addBlock of class ClientNamenodeProtocolTranslatorPB. Trying to fail over immediately.
12/09/05 13:20:49 WARN retry.RetryInvocationHandler: Exception while invoking addBlock of class ClientNamenodeProtocolTranslatorPB after 1 fail over attempts. Trying to fail over after sleeping for 643ms.
12/09/05 13:21:09 WARN retry.RetryInvocationHandler: Exception while invoking renewLease of class ClientNamenodeProtocolTranslatorPB. Trying to fail over immediately.
12/09/05 13:21:09 WARN retry.RetryInvocationHandler: A failover has occurred since the start of this method invocation attempt.
12/09/05 13:21:12 WARN retry.RetryInvocationHandler: Exception while invoking addBlock of class ClientNamenodeProtocolTranslatorPB after 2 fail over attempts. Trying to fail over after sleeping for 1851ms.
12/09/05 13:21:15 WARN retry.RetryInvocationHandler: Exception while invoking renewLease of class ClientNamenodeProtocolTranslatorPB after 1 fail over attempts. Trying to fail over after sleeping for 549ms.
12/09/05 13:21:15 WARN retry.RetryInvocationHandler: A failover has occurred since the start of this method invocation attempt.
copy 5 ...
Wed Sep 5 13:21:18 CST 2012
copy 6 ...
Wed Sep 5 13:21:25 CST 2012
blocksize:35M
filesize 96M
zk-session-timeout:10s
Active NN:Wed Sep 5 13:51:28 CST 2012
zk:
[zk: localhost:2181(CONNECTED) 30] get /hadoop-ha/mycluster/ActiveStandbyElectorLock
myclusternn2bd09 \ufffdF(\ufffd>
cZxid = 0xd9b
ctime = Wed Sep 05 13:51:38 CST 2012
mZxid = 0xd9b
mtime = Wed Sep 05 13:51:38 CST 2012
pZxid = 0xd9b
cversion = 0
dataVersion = 0
aclVersion = 0
ephemeralOwner = 0x13971759a9a045c
dataLength = 28
numChildren = 0
[zk: localhost:2181(CONNECTED) 31] get /hadoop-ha/mycluster/ActiveBreadCrumb
myclusternn2bd09 \ufffdF(\ufffd>
cZxid = 0x41
ctime = Thu Aug 30 09:50:56 CST 2012
mZxid = 0xd9c
mtime = Wed Sep 05 13:52:07 CST 2012
pZxid = 0x41
cversion = 0
dataVersion = 91
aclVersion = 0
ephemeralOwner = 0x0
dataLength = 28
numChildren = 0
[zk: localhost:2181(CONNECTED) 32]
client:
copy 0 ...
Wed Sep 5 13:50:30 CST 2012
copy 1 ...
Wed Sep 5 13:50:42 CST 2012
copy 2 ...
Wed Sep 5 13:51:01 CST 2012
copy 3 ...
Wed Sep 5 13:51:27 CST 2012
12/09/05 13:51:49 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 1 fail over attempts. Trying to fail over after sleeping for 703ms.
12/09/05 13:52:02 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 2 fail over attempts. Trying to fail over after sleeping for 1761ms.
12/09/05 13:52:04 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 3 fail over attempts. Trying to fail over after sleeping for 2651ms.
12/09/05 13:52:09 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 4 fail over attempts. Trying to fail over after sleeping for 9203ms.
copy 4 ...
Wed Sep 5 13:52:22 CST 2012
copy 5 ...
Wed Sep 5 13:52:52 CST 2012
copy 6 ...
Wed Sep 5 13:53:07 CST 2012
copy 7 ...
Wed Sep 5 13:53:19 CST 2012
blocksize:35M
filesize 96M
zk-session-timeout:10M
Active NN shutdown:Wed Sep 5 15:46:43 CST 2012
zk:
[zk: localhost:2181(CONNECTED) 1] get /hadoop-ha/mycluster/ActiveStandbyElectorLock
myclusternn2bd09 \ufffdF(\ufffd>
cZxid = 0xdbe
ctime = Wed Sep 05 15:47:20 CST 2012
mZxid = 0xdbe
mtime = Wed Sep 05 15:47:20 CST 2012
pZxid = 0xdbe
cversion = 0
dataVersion = 0
aclVersion = 0
ephemeralOwner = 0x13971759a9a0463
dataLength = 28
numChildren = 0
[zk: localhost:2181(CONNECTED) 2] get /hadoop-ha/mycluster/ActiveBreadCrumb
myclusternn2bd09 \ufffdF(\ufffd>
cZxid = 0x41
ctime = Thu Aug 30 09:50:56 CST 2012
mZxid = 0xdbf
mtime = Wed Sep 05 15:47:26 CST 2012
pZxid = 0x41
cversion = 0
dataVersion = 99
aclVersion = 0
ephemeralOwner = 0x0
dataLength = 28
numChildren = 0
[zk: localhost:2181(CONNECTED) 3]
client:
copy 0 ...
Wed Sep 5 15:42:51 CST 2012
copy 1 ...
Wed Sep 5 15:43:00 CST 2012
copy 2 ...
Wed Sep 5 15:43:46 CST 2012
copy 3 ...
Wed Sep 5 15:43:52 CST 2012
copy 4 ...
Wed Sep 5 15:44:11 CST 2012
copy 5 ...
Wed Sep 5 15:44:47 CST 2012
copy 6 ...
Wed Sep 5 15:45:28 CST 2012
copy 7 ...
Wed Sep 5 15:45:51 CST 2012
copy 8 ...
Wed Sep 5 15:46:08 CST 2012
copy 9 ...
Wed Sep 5 15:46:35 CST 2012
12/09/05 15:47:09 WARN retry.RetryInvocationHandler: Exception while invoking addBlock of class ClientNamenodeProtocolTranslatorPB. Trying to fail over immediately.
12/09/05 15:47:09 WARN retry.RetryInvocationHandler: Exception while invoking addBlock of class ClientNamenodeProtocolTranslatorPB after 1 fail over attempts. Trying to fail over after sleeping for 971ms.
12/09/05 15:47:29 WARN retry.RetryInvocationHandler: Exception while invoking renewLease of class ClientNamenodeProtocolTranslatorPB. Trying to fail over immediately.
12/09/05 15:47:29 WARN retry.RetryInvocationHandler: A failover has occurred since the start of this method invocation attempt.
12/09/05 15:47:41 WARN retry.RetryInvocationHandler: Exception while invoking addBlock of class ClientNamenodeProtocolTranslatorPB after 2 fail over attempts. Trying to fail over after sleeping for 2610ms.
12/09/05 15:47:53 WARN retry.RetryInvocationHandler: Exception while invoking renewLease of class ClientNamenodeProtocolTranslatorPB after 1 fail over attempts. Trying to fail over after sleeping for 1440ms.
copy 10 ...
Wed Sep 5 15:47:53 CST 2012
copy 11 ...
Wed Sep 5 15:48:00 CST 2012
zkfc:
2012-09-05 15:41:57,592 INFO org.apache.hadoop.ha.ZKFailoverController: Successfully transitioned NameNode at bd09/10.1.1.83:9000 to standby state
2012-09-05 15:47:19,975 INFO org.apache.hadoop.ha.ActiveStandbyElector: Checking for any old active which needs to be fenced...
2012-09-05 15:47:20,001 INFO org.apache.hadoop.ha.ActiveStandbyElector: Old node exists: 0a096d79636c757374657212036e6e311a046264313020a84628d33e
2012-09-05 15:47:20,003 INFO org.apache.hadoop.ha.ZKFailoverController: Should fence: NameNode at bd10/10.1.1.144:9000
2012-09-05 15:47:23,272 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: bd10/10.1.1.144:9000. Already tried 0 time(s).
2012-09-05 15:47:26,274 WARN org.apache.hadoop.ha.FailoverController: Unable to gracefully make NameNode at bd10/10.1.1.144:9000 standby (unable to connect)
java.net.NoRouteToHostException: No Route to Host from bd09/10.1.1.83 to bd10:9000 failed on socket timeout exception: java.net.NoRouteToHostException: No route to host; For more details see: http://wiki.apache.org/hadoop/NoRouteToHost
at org.apache.hadoop.net.NetUtils.wrapException(NetUtils.java:756)
at org.apache.hadoop.ipc.Client.call(Client.java:1165)
at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:184)
at $Proxy8.transitionToStandby(Unknown Source)
at org.apache.hadoop.ha.protocolPB.HAServiceProtocolClientSideTranslatorPB.transitionToStandby(HAServiceProtocolClientSideTranslatorPB.java:112)
at org.apache.hadoop.ha.FailoverController.tryGracefulFence(FailoverController.java:154)
at org.apache.hadoop.ha.ZKFailoverController.doFence(ZKFailoverController.java:510)
at org.apache.hadoop.ha.ZKFailoverController.fenceOldActive(ZKFailoverController.java:501)
at org.apache.hadoop.ha.ZKFailoverController.access$1100(ZKFailoverController.java:59)
at org.apache.hadoop.ha.ZKFailoverController$ElectorCallbacks.fenceOldActive(ZKFailoverController.java:838)
at org.apache.hadoop.ha.ActiveStandbyElector.fenceOldActive(ActiveStandbyElector.java:859)
at org.apache.hadoop.ha.ActiveStandbyElector.becomeActive(ActiveStandbyElector.java:760)
at org.apache.hadoop.ha.ActiveStandbyElector.processResult(ActiveStandbyElector.java:407)
at org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:609)
at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:497)
Caused by: java.net.NoRouteToHostException: No route to host
at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:567)
at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206)
at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:524)
at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:489)
at org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:472)
at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:566)
at org.apache.hadoop.ipc.Client$Connection.access$2000(Client.java:215)
at org.apache.hadoop.ipc.Client.getConnection(Client.java:1271)
at org.apache.hadoop.ipc.Client.call(Client.java:1141)
... 13 more
2012-09-05 15:47:26,275 INFO org.apache.hadoop.ha.NodeFencer: ====== Beginning Service Fencing Process... ======
2012-09-05 15:47:26,275 INFO org.apache.hadoop.ha.NodeFencer: Trying method 1/1: org.apache.hadoop.ha.ShellCommandFencer(/opt/hadoop/etc/hadoop/fencing.sh)
2012-09-05 15:47:26,319 INFO org.apache.hadoop.ha.ShellCommandFencer: Launched fencing command '/opt/hadoop/etc/hadoop/fencing.sh' with pid 2777
2012-09-05 15:47:26,321 INFO org.apache.hadoop.ha.ShellCommandFencer: [PID 2777] /opt/had...encing.sh: OK
2012-09-05 15:47:26,321 INFO org.apache.hadoop.ha.NodeFencer: ====== Fencing successful by method org.apache.hadoop.ha.ShellCommandFencer(/opt/hadoop/etc/hadoop/fencing.sh) ======
2012-09-05 15:47:26,321 INFO org.apache.hadoop.ha.ActiveStandbyElector: Writing znode /hadoop-ha/mycluster/ActiveBreadCrumb to indicate that the local node is the most recent active...
2012-09-05 15:47:26,325 INFO org.apache.hadoop.ha.ZKFailoverController: Trying to make NameNode at bd09/10.1.1.83:9000 active...
2012-09-05 15:47:40,960 INFO org.apache.hadoop.ha.ZKFailoverController: Successfully transitioned NameNode at bd09/10.1.1.83:9000 to active state
another client @ha switch
./runRW.sh
Wed Sep 5 16:19:56 CST 2012
12/09/05 16:20:09 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 1 fail over attempts. Trying to fail over after sleeping for 958ms.
12/09/05 16:20:10 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 2 fail over attempts. Trying to fail over after sleeping for 2238ms.
12/09/05 16:20:15 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 3 fail over attempts. Trying to fail over after sleeping for 3806ms.
12/09/05 16:20:19 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 4 fail over attempts. Trying to fail over after sleeping for 6348ms.
12/09/05 16:20:29 WARN retry.RetryInvocationHandler: Exception while invoking getFileInfo of class ClientNamenodeProtocolTranslatorPB after 5 fail over attempts. Trying to fail over after sleeping for 20625ms.
Found 6 items
-rw-r--r-- 3 peter supergroup 100291546 2012-09-05 16:15 4
drwxr-xr-x - peter supergroup 0 2012-08-24 17:03 abc
-rw-r--r-- 3 peter supergroup 100291546 2012-09-05 15:10 hadoop-2.0.0-cdh4.0.0.tar.gz
drwxr-xr-x - peter supergroup 0 2012-09-04 16:41 smallfiles
drwxr-xr-x - peter supergroup 0 2012-08-28 16:57 smallfiles1
Deleted 4
Wed Sep 5 16:20:57 CST 2012
发表评论
-
Hadoop TestDFSIO
2013-04-21 21:02 2432@VM [bigdata@bigdata hadoo ... -
Hadoop NNBENCH
2013-04-21 20:46 1630@VM [bigdata@bigdata hadoop]$ ... -
Hadoop 安装手册
2013-04-08 15:47 1193Hadoop 安装手册 软件准备 ... -
What do real life hadoop workloads look like
2012-09-10 15:52 832http://www.cloudera.com/blog/20 ... -
CDH4 HA 切换
2012-09-05 10:51 1382HA 切换问题 切换时间太长。。。 copy 0 ... ... -
Hadoop CDh4 Standby HA 启动过程
2012-08-02 11:40 2863根据日志: StandBy NN启动过程 1.获得Active ... -
CDH4 HA test
2012-08-01 14:55 2647场景: NN HA 设置成功,HA切换客户端出现异 ... -
Hadoop TextOutput
2012-07-29 21:08 906TextOutputFormat 分隔符参数: mapredu ... -
Hadoop SteamXMLRecordReader
2012-07-28 23:59 704StreamXmlRecordReader 设置属性 str ... -
Hadoop NLineInputFormat
2012-07-28 23:52 1646NLineInputFormat 重写了splits 设置 ... -
KeyValueTextInputFormat
2012-07-28 23:40 953key/value 分割符 mapreduce.input. ... -
Hadoop 控制split尺寸
2012-07-28 23:08 1337三个参数决定Map的Split尺寸 1.mapred.min ... -
Setting up Disks for Hadoop
2012-07-22 12:13 873Setting up Disks for Hadoop He ... -
Upgrade hadoop need think about it
2012-07-21 17:17 884Compatibility When movin ... -
Hadoop 0.23 config differ from 0.20.205
2012-07-21 17:14 922http://hadoop.apache.org/common ... -
Hadoop hdfs block 状态
2012-07-15 13:37 7231.In Service -
Hadoop 配置不当引起集群不稳
2012-07-05 15:35 1025配置不当内容 资源配置不当:内存、文件句柄数量、磁盘空间 ... -
Hadoop管理-集群维护
2012-07-03 15:27 50051.检查HDFS状态 fsck命令 1)f ... -
Hadoop Ganglia Metric Item
2012-06-27 11:13 2024dfs.FSDirectory.files_delete ... -
Hadoop 参数
2012-06-27 10:05 1012转发自:http://www.cnblogs.com/g ...
相关推荐
在CDH5.5.0中,HDFS(Hadoop Distributed File System)和YARN(Yet Another Resource Negotiator)是两个核心组件,它们在高可用性(HA)模式下的配置尤为重要。HDFS HA允许数据节点和名称节点的冗余,以确保即使单...
为了提高数据的可靠性和系统的可用性,CDH5支持HDFS的高可用性(HA)模式。这通常包括配置NameNode HA,使用JournalNode进行日志同步,以及设置Quorum-based Storage策略。配置过程中需要关注Zookeeper的角色,以及...
9. **备份与容灾**:定期备份数据,配置高可用性和故障切换方案,如HDFS的NameNode HA和Zookeeper的Quorum机制,以确保业务连续性。 10. **性能调优**:通过对硬件、网络、操作系统以及Hadoop组件的综合调优,可以...
### Hadoop之CDH:基于Cloudera的HA部署指南 #### 关于本指南 本文档旨在提供关于如何在Cloudera Distribution Including Hadoop (CDH)上配置高可用性的详细指南。CDH是由Cloudera公司提供的一个企业级Hadoop发行...
在CDH 5.10.0中,Zookeeper的集成使得Hadoop的HA(High Availability)功能得以实现,例如HDFS的NameNode热备、YARN的ResourceManager热备等。当主节点故障时,Zookeeper能够快速进行选举,确保服务的无缝切换。 总...
**高可用 CDH4:Namenode HA + HA 自动切换** 为了提高系统的可用性和可靠性,CDH4 提供了 Namenode HA 功能。这通常涉及以下步骤: 1. **配置 JournalNodes**:至少需要三个 JournalNodes 来实现日志复制。 2. **...
- 在 CDH4 之前,Hadoop 社区版仅支持单一 NameNode 的部署模式,这意味着如果 NameNode 出现故障,则整个集群将无法使用,直到 NameNode 重启为止。 - 这样的架构不仅增加了意外宕机的风险,还限制了 NameNode 的...
1.1.10 **HDFS平衡并行移动块数**:`dfs.ha.fencing.methods.parallelism`控制在HDFS HA中切换NameNode时并行移动的块数,以加快切换速度。 1.1.11 **HDFS副本数**:默认为3,`dfs.replication`参数可调整,需考虑...
1. **Hadoop中的NameNode HA**:Zookeeper用于监控和切换NameNode的主备状态,确保HDFS的高可用性。 2. **HBase的RegionServer管理**:Zookeeper负责RegionServer的注册、发现以及负载均衡。 3. **Kafka的集群协调...
本文将详细阐述在四台机器上构建Hadoop2集群的步骤,其中包括配置NameNode HA和ResourceManager HA,以提供冗余和故障切换能力。 **1. 集群配置** 集群由四台机器组成,每台机器有不同的角色: - hadoop1和hadoop2...
8.2 CDH4B1版本HDFS集群配置 8.2.1 虚拟机安装 8.2.2 nn1配置 8.2.3 dn1~dn3配置 8.2.4 HDFS集群构建 8.3 HA NameNode配置 8.3.1 nn1配置 8.3.2 其他节点配置 8.4 HA NameNode使用 8.4.1 启动HA HDFS集群 8.4.2 第1...
这份配置说明将详细介绍如何在Cloudera Data Hub (CDH)环境中实现高可用性,涵盖HDFS HA以及CDH其他组件如Hive Metastore、Hue和Impala与HDFS HA的集成。 1. **简介** Apache Hadoop集群常常承载着各种用户运行的...
Zookeeper在HDFS和YARN HA中都起到关键作用,它负责NameNode和ResourceManager的选举,确保在节点故障时能够快速切换。 5. **环境和软件版本**: - 环境:VirtualBox 6.0.24,CentOS 7,Windows 10 - 软件:JDK ...
2. **Capacity Scheduler**:按队列分配资源,保证了不同队列的资源保证,是CDH默认的调度器。 3. **Fair Scheduler**:追求资源的公平分配,初期可能独占资源,但随着任务提交,资源会得到合理调整。 参数`yarn....
Hadoop-HA(High Availability)提供了一个高可用性解决方案,允许在发生故障时快速切换到备用NameNode。Federation机制允许多个NameSpace共存,可以水平扩展NameNode,从而解决了单点故障问题。CDH(Cloudera's ...