as of hadoop-2.5.1,there are some important features of hadoop 2.x for me to learn these days
new key features | abstract |
NN HA (high availability) | standby meta node,failure over anytime,maybe same as
hbase-backup master |
HDFS Federation | union all nameapes from different rights |
HDFS snapshot | |
short circuit reads | |
exeuction framework YARN | run any type apps what you want on it,eg. MR,DAG... |
RM HA | same as above |
of course,relate to hadoop 1.x,increasing learnings are important,as this will save you lot of time to study from scatch,you only need to focus the key features or new components what you want to see only!
ref:
useful official docs about hadoop architecture
Hadoop 0.23.x/NameNode federation原理、编译、安装、体验
相关推荐
PXF is now certified with Hadoop 2.x and 3.1.x and Hive Server 2.x and 3.1, and bundles new and upgraded Hadoop libraries to support these versions. PXF supports Kerberos authentication to Hive ...
model = h2o.gbm(x=features, y=target, training_frame=data) ``` 最后,利用训练好的模型进行预测: ```python predictions = model.predict(new_data) ``` 以上就是关于H2O库的基本介绍及其在Python中的使用...
Wangda Tan and Wei-Chiu Chuang the current status of Apache Hadoop 3.x—how it’s used today in deployments large and small, and they dive into the exciting present and future of Hadoop 3.x—features ...
Apache Hadoop 2.7.2 is a minor release in the 2.x.y release line, building upon the previous stable release 2.7.1. Here is a short overview of the major features and improvements. Common ...
Vertica 9.1.x New Features and Changes 39 Vertica Concepts 64 Installing Vertica 125 Getting Started 293 Administrator's Guide 353 Analyzing Data 1443 Using Flex Tables 1761 Using Management Console ...
On the Spark Streaming front, two major features have been added: mapWithState to maintain state across batches and using back pressure to throttle the input rate in case of queue buildup.2 In ...
The book starts by explaining the new features implemented in Cassandra 3.x and get you set up with Cassandra. Then you'll walk through data modeling in Cassandra and the rich feature set available to...
Big Data Analytics with Spark is therefore written for busy professionals who prefer learning a new technology from a consolidated source instead of spending countless hours on the Internet trying to...
Big Data Analytics with Spark is therefore written for busy professionals who prefer learning a new technology from a consolidated source instead of spending countless hours on the Internet trying to...
- 版本8.1.x引入了新的功能和改进,这些通常会体现在官方文档的特定章节中,如“Vertica 8.1.x New Features”部分。 2. **Vertica概念**: - 此部分通常涉及基础概念介绍,帮助新用户理解Vertica的架构、设计...
- **Hadoop发行版兼容性(HadoopDistributionCompatibility)**:Greenplum 5.7可能支持与某些Hadoop发行版集成,以实现与大数据生态系统的兼容性。 **升级与迁移:** - **升级到Greenplum数据库5.7.x...