`
sunwinner
  • 浏览: 202495 次
  • 性别: Icon_minigender_1
  • 来自: 上海
社区版块
存档分类
最新评论

Create Local Cloudera Parcels Repo to Save Your ASS

 
阅读更多

If you want to install Hadoop cluster via Cloudera Manager in CHINA, the network to cloudera parels is toooooo slow. You must create your local parcel repo to save your ass, thanks to the DAMN GFW!!

 

Follow this page from cloudera:

  • Download parcels:    
            Link to CDH4:http://archive.cloudera.com/cdh4/parcels/                                                                                 
            Link to IMPALA:http://archive.cloudera.com/impala/parcels/

  • Verify the location of the local parcel repository on your Cloudera Manager server: Go to the Administration page -> Properties tab -> Parcels category. You can change the local repository path in the Local Parcel Repository Path property. By default it is /opt/cloudera/parcel-repo.
  • Download the .parcel file for your operating system: (el5 or el6 for Red Hat 5 or 6, lucid or precise for Ubuntu and so on) and place it into the local parcel repository on your Cloudera Manager server. In my case it's CDH-4.3.0-1.cdh4.3.0.p0.22-el6.parcel and IMPALA-1.0.1-1.p0.431-el6.parcel
  • Open the manifest.json file in the same directory as the .parcel file you just copied. Find the section of the manifest that corresponds to the parcel you downloaded: In my case, I'm running RHEL 6 and copied the parcel file CDH-4.2.0-1.cdh4.2.0.p0.10-el6.parcel, then look for the section:
      {
          "parcelName": "CDH-4.3.0-1.cdh4.3.0.p0.22-el6.parcel",
          "components": [
            { "name":     "flume-ng",
              "version":  "1.3.0-cdh4.3.0",
              "pkg_version":  "1.3.0+159" 
            }
            ,{ "name":     "hadoop-0.20-mapreduce",
              "version":  "2.0.0-cdh4.3.0",
              "pkg_version":  "2.0.0+1357" 
            }
            ,{ "name":     "hadoop-hdfs",
              "version":  "2.0.0-cdh4.3.0",
              "pkg_version":  "2.0.0+1357" 
            }
            ,{ "name":     "hadoop-httpfs",
              "version":  "2.0.0-cdh4.3.0",
              "pkg_version":  "2.0.0+1357" 
            }
            ,{ "name":     "hadoop-mapreduce",
              "version":  "2.0.0-cdh4.3.0",
              "pkg_version":  "2.0.0+1357" 
            }
            ,{ "name":     "hadoop-yarn",
              "version":  "2.0.0-cdh4.3.0",
              "pkg_version":  "2.0.0+1357" 
            }
            ,{ "name":     "hadoop",
              "version":  "2.0.0-cdh4.3.0",
              "pkg_version":  "2.0.0+1357" 
            }
            ,{ "name":     "hbase",
              "version":  "0.94.6-cdh4.3.0",
              "pkg_version":  "0.94.6+96" 
            }
            ,{ "name":     "hcatalog",
              "version":  "0.5.0-cdh4.3.0",
              "pkg_version":  "0.5.0+9" 
            }
            ,{ "name":     "hive",
              "version":  "0.10.0-cdh4.3.0",
              "pkg_version":  "0.10.0+121" 
            }
            ,{ "name":     "mahout",
              "version":  "0.7-cdh4.3.0",
              "pkg_version":  "0.7+16" 
            }
            ,{ "name":     "oozie",
              "version":  "3.3.2-cdh4.3.0",
              "pkg_version":  "3.3.2+49" 
            }
            ,{ "name":     "pig",
              "version":  "0.11.0-cdh4.3.0",
              "pkg_version":  "0.11.0+28" 
            }
            ,{ "name":     "sqoop",
              "version":  "1.4.3-cdh4.3.0",
              "pkg_version":  "1.4.3+34" 
            }
            ,{ "name":     "sqoop2",
              "version":  "1.99.1-cdh4.3.0",
              "pkg_version":  "1.99.1+115" 
            }
            ,{ "name":     "whirr",
              "version":  "0.8.2-cdh4.3.0",
              "pkg_version":  "0.8.2+10" 
            }
            ,{ "name":     "zookeeper",
              "version":  "3.4.5-cdh4.3.0",
              "pkg_version":  "3.4.5+19" 
            }
            ,{ "name":     "hue",
              "version":  "2.3.0-cdh4.3.0",
              "pkg_version":  "2.3.0+136" 
            }
          ],
          "hash": "df5cc61b2d257aaf625341f709a4f8e09754038a"
        }
     
      {
          "parcelName": "IMPALA-1.0.1-1.p0.431-el6.parcel",
          "components": [
            { "name":     "impala",
              "version":  "1.0.1-SNAPSHOT",
              "pkg_version":  "1.0.1"
            }
          ],
          "hash": "992467f2e54bd394cbdd3f4ed97b6e9bead60ff0"
        }
     
  • Create a text file whose name is <parcel file name> .sha (e.g. CDH-4.2.0-1.cdh4.2.0.p0.10-el6.parcel.sha) and copy the hash code into it. In my case, I created file CDH-4.3.0-1.cdh4.3.0.p0.22-el6.parcel.sha and IMPALA-1.0.1-1.p0.431-el6.parcel.sha, then I copied the hash value into them respectively:
    # cat > CDH-4.3.0-1.cdh4.3.0.p0.22-el6.parcel.sha
    df5cc61b2d257aaf625341f709a4f8e09754038a
    ^C
     
    # cat > IMPALA-1.0.1-1.p0.431-el6.parcel.sha
    992467f2e54bd394cbdd3f4ed97b6e9bead60ff0
    ^C
     Place both of files into your local parcel repository, /opt/cloudera/parcel-repo by default, depending you settings in the above step.
  • Once these files are in place, Cloudera Manager will pick up the parcel and it will appear on the Hosts > Parcels page. Note that how quickly this occurs depends on the Parcel Update Frequency setting, set by default to 1 hour. You can change this on the Administration page -> Properties tab -> Parcels category.
分享到:
评论
1 楼 NIghtmare28 2014-07-17  
  太好用了, 谢谢

相关推荐

    cloudera-manager.repo

    用于进行环境clouderamanger相关的初始环境搭建以及过程

    cloudera-kudu.repo

    centos 7 安装 kudu 时,所需要的文件cloudera-kudu.repo

    Cloudera Manager安装手册(离线parcels存储库方式)

    Cloudera Manager安装手册(离线parcels存储库方式),基于CentOS操作系统一步步傻瓜式安装,截图记录整个部署过程。

    Centos7 安装Cloudera PDF 下载

    在/etc/yum.repos.d/目录下创建一个名为cloudera.repo的新文件,并添加以下内容: ```ini [cloudera] name=Cloudera Repository baseurl=https://archive.cloudera.com/cdh7/7.2.10/centos7/amd64/cdh gpgkey=...

    Cloudera Manager一步步详细部署文档(离线parcels存储库方式)

    Cloudera Manager安装手册(离线parcels存储库方式),基于CentOS操作系统一步步傻瓜式安装,截图记录整个部署过程。带集群功能验证,检查集群是否正常工作 目录 一、文档内容 3 二、软硬件环境 3 2.1.软件信息 3 ...

    cloudera manager 安装cdh 搭建大数据集群,详细讲解核心

    1. 在 Server 所在节点,创建 /opt/cloudera/parcel-repo 目录,将 parcel 二进制包放入。Cloudera Manager 在图形安装引导程序中会自动扫描并分发到各节点,并且安装。 优点:不需要下载。 url: ...

    clouder-cdh-6.2.1 离线安装包

    本资源包为 cdh6.2.1 redhat7 版本 # 目录结构如下 ├── cdh6 │ └── 6.2.1 ... ├── cloudera-manager.repo ├── RPM-GPG-KEY-cloudera └── RPMS └── x86_64 ├── cloudera-manage

    cloudera manager的运行机制及目录

    其他重要的目录还包括/usr/share/cmf/(Cloudera Manager的程序安装目录)、/var/lib/cloudera-scm-server-db/data(内嵌数据库目录)和/opt/cloudera/parcel-repo/(服务软件包数据下载目录)等。 Cloudera ...

    java调用Cloudera Manager Api实例

    Java调用Cloudera Manager API是一个复杂而关键的任务,它涉及到使用Java编程语言与Cloudera Manager服务器进行交互,以实现自动化管理和监控大数据集群。Cloudera Manager是管理Hadoop和其他Cloudera支持的数据处理...

    cloudera 5.12.zip

    Cloudera 5.12 是一款开源大数据管理平台,主要提供Hadoop生态系统的企业级解决方案。这个版本的Cloudera Manager和CDH(Cloudera Distribution Including Apache Hadoop)组合在一起,为用户提供了数据存储、处理和...

    cloudera manager安装

    - **CDH Parcels**: [http://archive.cloudera.com/cdh5/parcels/5.12.1/](http://archive.cloudera.com/cdh5/parcels/5.12.1/) 根据自己的需求选择合适的版本进行下载。 #### 三、配置主机名和 SSH 互信 在安装 ...

    Cloudera技术参考资料

    ### Cloudera技术参考资料知识点详解 #### 一、Cloudera概述与大数据平台的重要性 - **数据驱动行业发展:** 在当今社会,数据已经成为推动各行业发展的核心动力之一。随着技术的进步,越来越多的企业意识到数据的...

    Cloudera 5.4.x Documentation系列官方文档

    Cloudera 5.4.x Documentation系列官方文档。压缩包里面共有十个文档!分别是: cloudera-administration.pdf-配置管理文档 cloudera-datamgmt.pdf-数据管理文档 cloudera-impala.pdf-impala使用文档 cloudera-...

    Cloudera Hadoop 安装指南

    根据给定的文件信息,以下是对Cloudera Hadoop安装指南中的关键知识点的详细解析。 ### 关于Cloudera Hadoop安装指南 Cloudera Hadoop安装指南是为那些希望在自己的环境中部署并运行Cloudera Hadoop软件的用户提供...

    cloudera manager中添加hive数据库使用mysql的配置步骤

    在Cloudera Manager中配置Hive使用MySQL数据库涉及多个步骤,从卸载CentOS默认MySQL到配置完毕,下面详细说明每一步的知识点。 首先,确保在添加Hive数据库前,系统中不存在先前安装的MySQL版本。使用命令rpm -qa |...

    CLOUDERA-Manager_中文手册(全 高清)+ CDH安装手册.pdf

    "Cloudera Manager中文手册" Cloudera Manager是一款基于大数据管理平台,用于管理Hadoop集群和CDH(Cloudera Distribution of Hadoop)集群。该手册详细介绍了Cloudera Manager的产品介绍、基本功能、监控功能等...

    Cloudera Introduction官方介绍文档

    在开始介绍Cloudera之前,我们首先要了解的是Cloudera Inc. 是一家在数据管理领域内提供商业解决方案以及大数据技术的公司。Cloudera创立于2008年,并且是Hadoop生态系统中的重要成员之一。Cloudera的产品和服务主要...

    Cloudera Hive JDBC 2.5.20.1060

    The Cloudera JDBC Driver for Hive enables your enterprise users to access Hadoop data through Business Intelligence (BI) applications with JDBC support. The driver achieves this by translating calls ...

    Cloudera Manager 6.3.1 (文件1)

    cloudera-manager-agent-6.3.1-1466458.el7.x86_64.rpm, cloudera-manager-daemons-6.3.1-1466458.el7.x86_64.rpm, cloudera-manager-server-6.3.1-1466458.el7.x86_64.rpm, cloudera-manager-server-db-2-6.3.1-...

    cloudera search官网参考资料

    【Cloudera Search】是Cloudera公司提供的一个企业级搜索解决方案,它基于Apache Solr构建,能够处理大规模数据集的全文检索、分析和展示。Cloudera Search整合了SolrCloud,使得索引和查询操作能够在分布式环境中...

Global site tag (gtag.js) - Google Analytics