
Hadoop 2: state machine (1)

 

1.what  

  In many scenarios we see diagrams like this: a set of state nodes, events that trigger state changes, and directed arcs connecting the states. It looks like a running machine driven by assorted commands (the events), so it is called a "state machine".

  Through these elements we get a clear picture of the flow of execution, so the model is simple and easy to understand.

 

2.why

  Based on the concept above, a state machine is meant to simplify logic, reduce coupling, and make programs easier to tune.

 

3.how

  In Hadoop YARN there are many state machines in different modules, e.g. RMAppImpl, RMAppAttemptImpl, RMContainerImpl, RMNodeImpl, etc. Judging from how they are constructed, each of them contains at least the following items:

  prestate/current state--the state the object is in before an incoming event is handled.

  post state--in contrast with the prestate, this is the next state, reached after a certain event has been triggered.

  event--the cause that triggers the state to move on.

  handler--what to do when an event comes in. In Hadoop this is called a 'callback'.

  Here is a simple outline:

prestate1 -------event1--------> post state1
          -------event2--------> post state1
          -------event3--------> post state2
prestate2 -------event2--------> post state2
          -------event2--------> post state1
... ... ...

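  So the relationship between events and post states may be N:M: several events can lead to the same post state, and one event can lead to different post states.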
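  The outline above can be written down as a small transition table. What follows is only a minimal sketch in plain Java, not Hadoop code: the class `TransitionTableSketch`, its enums and the `Handler` interface are all made-up names for illustration. It covers the single-arc rows of the outline; the case where the same prestate and event can end in more than one post state is left out here.

```java
import java.util.ArrayList;
import java.util.EnumMap;
import java.util.List;
import java.util.Map;

// Hypothetical names throughout; this is not Hadoop code.
final class TransitionTableSketch {
    enum State { PRE_STATE_1, PRE_STATE_2, POST_STATE_1, POST_STATE_2 }
    enum Event { EVENT_1, EVENT_2, EVENT_3 }

    /** Handler (callback) invoked when an arc is taken. */
    interface Handler {
        void handle(State pre, Event event, State post);
    }

    /** One directed arc: where to go and what to run on the way. */
    private static final class Arc {
        final State post;
        final Handler handler;

        Arc(State post, Handler handler) {
            this.post = post;
            this.handler = handler;
        }
    }

    // pre state -> (event -> arc)
    private final Map<State, Map<Event, Arc>> table = new EnumMap<>(State.class);
    private State current;

    TransitionTableSketch(State initial) {
        this.current = initial;
    }

    /** Registers one row of the outline: pre state, event, post state, handler. */
    TransitionTableSketch addTransition(State pre, Event event, State post, Handler handler) {
        table.computeIfAbsent(pre, s -> new EnumMap<>(Event.class)).put(event, new Arc(post, handler));
        return this;
    }

    State getCurrentState() {
        return current;
    }

    /** Looks up the arc for (current state, event), runs its handler, then moves on. */
    State doTransition(Event event) {
        Map<Event, Arc> arcs = table.get(current);
        Arc arc = (arcs == null) ? null : arcs.get(event);
        if (arc == null) {
            throw new IllegalStateException("Invalid event " + event + " at " + current);
        }
        arc.handler.handle(current, event, arc.post);
        current = arc.post;
        return current;
    }

    public static void main(String[] args) {
        List<String> log = new ArrayList<>();
        Handler record = (pre, event, post) -> log.add(pre + " --" + event + "--> " + post);

        TransitionTableSketch machine = new TransitionTableSketch(State.PRE_STATE_1)
            .addTransition(State.PRE_STATE_1, Event.EVENT_1, State.POST_STATE_1, record)
            .addTransition(State.PRE_STATE_1, Event.EVENT_2, State.POST_STATE_1, record)
            .addTransition(State.PRE_STATE_1, Event.EVENT_3, State.POST_STATE_2, record)
            .addTransition(State.PRE_STATE_2, Event.EVENT_2, State.POST_STATE_2, record);

        machine.doTransition(Event.EVENT_3);
        System.out.println(log);
    }
}
```

  An event that has no arc from the current state is rejected with an exception, so every legal move has to be listed in the table up front.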

  E.g. in RMAppImpl there is a 'stateMachine' member variable that carries out all flows from old states to new states. The related handlers are, of course, registered at startup, through StateMachineFactory#installTopology().
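  To make the "register everything once, then create a state machine per object" idea concrete, here is a rough, self-contained sketch. It only borrows the shape of the pattern and is much simpler than Hadoop's real StateMachineFactory. Everything in it (`MiniFactorySketch`, `App`, `Arc`, `addMultiArcTransition`, the states and events) is a hypothetical name, and the behaviour of `installTopology()` here (it just freezes the table) is an assumption of the sketch, not a description of Hadoop internals. It also shows one event leading to two possible post states, where the callback picks the result.

```java
import java.util.EnumMap;
import java.util.EnumSet;
import java.util.Map;
import java.util.Set;
import java.util.function.Consumer;

// Hypothetical, simplified imitation of the factory pattern; not Hadoop's API.
final class MiniFactorySketch {
    enum AppState { NEW, SUBMITTED, ACCEPTED, FAILED }
    enum AppEventType { START, SCHEDULE }

    /** Hypothetical operand: the object whose lifecycle is tracked. */
    static final class App {
        final boolean queueHasCapacity;
        int callbacksRun = 0;

        App(boolean queueHasCapacity) {
            this.queueHasCapacity = queueHasCapacity;
        }
    }

    /** Callback bound to one (pre state, event) pair; returns the post state. */
    interface Arc {
        AppState fire(App app);
    }

    /** Built once and shared by all machines; holds the transition table. */
    static final class Factory {
        private final Map<AppState, Map<AppEventType, Arc>> table = new EnumMap<>(AppState.class);
        private final AppState initial;
        private boolean installed = false;

        Factory(AppState initial) {
            this.initial = initial;
        }

        /** Single arc: run the hook, then always move to 'post'. */
        Factory addTransition(AppState pre, AppState post, AppEventType type, Consumer<App> hook) {
            return register(pre, type, app -> { hook.accept(app); return post; });
        }

        /** Multiple arc: the hook picks one of the allowed post states. */
        Factory addMultiArcTransition(AppState pre, Set<AppState> posts, AppEventType type, Arc hook) {
            return register(pre, type, app -> {
                AppState next = hook.fire(app);
                if (!posts.contains(next)) {
                    throw new IllegalStateException("Unexpected post state " + next);
                }
                return next;
            });
        }

        private Factory register(AppState pre, AppEventType type, Arc arc) {
            if (installed) {
                throw new IllegalStateException("Topology already installed");
            }
            table.computeIfAbsent(pre, s -> new EnumMap<>(AppEventType.class)).put(type, arc);
            return this;
        }

        /** In this sketch: freezes the table so no more arcs can be added. */
        Factory installTopology() {
            installed = true;
            return this;
        }

        /** Creates a per-object state machine that shares the frozen table. */
        Machine make(App app) {
            if (!installed) {
                throw new IllegalStateException("Call installTopology() first");
            }
            return new Machine(this, app);
        }
    }

    /** Per-object view: the current state plus the shared table. */
    static final class Machine {
        private final Factory factory;
        private final App app;
        private AppState current;

        Machine(Factory factory, App app) {
            this.factory = factory;
            this.app = app;
            this.current = factory.initial;
        }

        AppState getCurrentState() {
            return current;
        }

        AppState doTransition(AppEventType type) {
            Map<AppEventType, Arc> arcs = factory.table.get(current);
            Arc arc = (arcs == null) ? null : arcs.get(type);
            if (arc == null) {
                throw new IllegalStateException("Invalid event " + type + " at " + current);
            }
            current = arc.fire(app);
            return current;
        }
    }

    /** All arcs and callbacks are wired once, when the class is loaded. */
    static final Factory FACTORY = new Factory(AppState.NEW)
        .addTransition(AppState.NEW, AppState.SUBMITTED, AppEventType.START, app -> app.callbacksRun++)
        .addMultiArcTransition(AppState.SUBMITTED, EnumSet.of(AppState.ACCEPTED, AppState.FAILED),
            AppEventType.SCHEDULE,
            app -> { app.callbacksRun++; return app.queueHasCapacity ? AppState.ACCEPTED : AppState.FAILED; })
        .installTopology();

    public static void main(String[] args) {
        Machine machine = FACTORY.make(new App(true));
        machine.doTransition(AppEventType.START);
        machine.doTransition(AppEventType.SCHEDULE);
        System.out.println(machine.getCurrentState());
    }
}
```

  Keeping the table in one shared factory means the wiring cost is paid once, while each tracked object only carries its own current state.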
