Much of this content is taken from the
HBase Overview [1] and the HBase Default Configuration [2]
.
System
- Increase the default per-process file handle limit [3] in
/etc/security/limits.conf
HDFS
- Set dfs.datanode.max.xceivers to 2047 [4] in
$HADOOP_HOME/conf/hdfs-site.xml
- Set dfs.datanode.socket.write.timeout to 0 [5]
HBase
- First, note that the default configuration values are stored atsrc/main/resources/hbase-default.xml [6] in the source tree
- For your site-specific configuration values, edit conf/hbase-site.xml
- Set hbase.rootdir to point to the directory in HDFS where HBase will put its data; e.g.
hdfs://localhost:9000/hbase
Per-Cluster
-
hfile.block.cache.size controls the amount of region server heap space to devote to the block cache. Currently defaults to 20%.
Per-Table
-
Max File Size: for clusters with lots of data, can be tuned up to 1 GB to result in less regions on the cluster.
- MemStore Flush Size
Per-Family
- Compression
- Bloom filters
Per-Region Server
-
hbase.regionserver.global.memstore.upperLimit is used to cap the amount of heap room in each region server to reserve for all MemStores [7] served by that region. It defaults to 40% of the heap.
-
hbase.hregion.memstore.flush.size is the threshold for deciding when to flush a single MemStore to disk. It defaults to 64 MB.
-
hbase.hregion.memstore.block.multiplier controls when to start blocking writes to keep the MemStore size sane. It defaults to 2 (multiplied by the memstore.flush.size). For production clusters with lots of RAM that you monitor closely, you can up to something like 8.
-
hbase.hregion.max.filesize determines how big a StoreFile is allowed to grow before splitting a region. Defaults to 256 MB.
Per-Store
-
hbase.hstore.blockingStoreFiles determines the maximum number of StoreFiles per Store to allow before blocking writes and forcing a compaction. The default is 7, but in production clusters monitored closely, it may make sense to up to 15.
[1]
http://hbase.apache.org/docs/cur...[2]
http://hbase.apache.org/docs/cur...[3]
http://wiki.apache.org/hadoop/Hb...[4]
http://wiki.apache.org/hadoop/Hb...[5]
http://hbase.markmail.org/thread...[6]
http://svn.apache.org/viewvc/hba...[7]
http://hbase.apache.org/docs/cur...
相关推荐
NAV350 报文解析 Telegram_listing_Telegrams_for_Configuring_and_Operating_the_NAV350_
Configuring HBase Chapter 2. Loading Data from Various DBs Chapter 3. Working with Large Distributed Systems Part I Chapter 4. Working with Large Distributed Systems Part II Chapter 5. Working with ...
总结起来,西门子技术指导文件《Configuring a TCP Connection for S7-300/S7-400 Industrial Ethernet CPs》为自动化领域工程师和技术人员提供了一套详尽的TCP连接配置指导,不仅包括了基本的网络和通讯处理器设置...
Configuring SAP R3 FICO The Essential Resource for Configuring the Financial and Controlling Modules
Configuring SAP R3 FICO The Essential Resource for Configuring the Financial and Controlling Modules.part3
What Are the Recovery Options? Chapter 13 Using the Oracle Enterprise Manager What Is the OEM? The DBA Toolkit and Security OEM and the Job Scheduler OEM and the Event Management System Chapter...
This application note provides code examples for the following important operations that are involved in booting a bare-metal system: • Initializing exceptions. • Initializing registers. • ...
Prepare for Microsoft Exam 70-410 – and help demonstrate your real-world mastery of implementing and configuring core services in Windows Server 2012 R2. Designed for experienced IT professionals ...
### 配置TwinCAT以控制AM335x #### 一、引言 本文主要介绍了如何利用运行在PC上的TwinCAT软件来控制基于AM335x的EtherCAT从控制器,并监控其输入信号。TwinCAT是贝加莱(Beckhoff)公司的一款软件系统,它能够将...
参考官方文档:http://wiki.pentaho.com/display/BAD/Configuring+Pentaho+for+your+Hadoop+Distro+and+Version。对于 Kettle 5.3,选择适用于 Hadoop 2.x 的 shim 包 hdp21。具体操作是在 `data-integration/...
Prepare for Microsoft Exam 70-687 – and help demonstrate your real-world mastery of configuring Windows 8.1 in the enterprise. Designed for experienced IT professionals ready to advance their status...
Syngress - Configuring IPv6 for Cisco IOS(2002) 配置IPv6
This_is_a_small_Micropython_module_for_configuring_ADS1299-x-MicroPython-Module
在介绍如何配置和调整HP ProLiant服务器以用于低延迟应用程序时,文档主要涉及了在金融服务业中常见的高性能需求,尤其是在高频交易、市场数据分发和交易所数据处理等方面。其他需要低延迟响应的行业还包括实时信号...
本文将基于《Configuring Siebel Business Applications》(版本8.1)的内容,详细介绍如何配置Siebel业务应用程序。 #### 二、软件许可与版权信息 根据文档开头部分提供的信息,该文档为Siebel业务应用程序配置...