`
weigang.gao
  • 浏览: 488105 次
  • 性别: Icon_minigender_1
  • 来自: 上海
文章分类
社区版块
存档分类
最新评论

hadoop problem

 
阅读更多

1. 启动hadoop2.5.2遇到的datanode启动不了

日志信息如下:


 简单的操作是将这个节点上hadoop工作目录中内存删除就行了

 

 

2.Hadoop开启关闭调试信息

开启:export HADOOP_ROOT_LOGGER=DEBUG,console

关闭:export HADOOP_ROOT_LOGGER=INFO,console

具体参考:http://bufeifang.blog.163.com/blog/static/55898151201232652239324/

 

3.中文分词器

http://blog.sina.com.cn/s/blog_7663527601012vdg.html

 

4.在window下eclipse导出的jar包依赖外部jar包,因此无法使用hadoop命令执行成功。

把hadoop依赖的外部jar文件都拷贝到JAVA_HOME/jre/lib/ext/下面,就可以了

 

5.hadoop常见问题总结

http://www.sharpcloud.cn/thread-4927-1-1.html

 

6.org.apache.hadoop.yarn.exceptions.YarnException: Unauthorized request to start container

问题原因:namenode,datanode时间同步问题

解决办法:多个datanode与namenode进行时间同步,在每台服务器执行:ntpdate time.nist.gov,确认时间同步成功。

最好在每台服务器的 /etc/crontab 中加入一行:

0 2 * * * root ntpdate time.nist.gov && hwclock -w

 

查看centOS时区,

 

7.mapReduce程序中如何添加自己的日志

 

8.在hadoop2.5中怎么找不到HADOOP_HOME/logs/userlogs怎么没有

 

9.官网MapReduce实例代码详细批注

http://www.linuxidc.com/Linux/2014-10/108194.htm

 

10.创建软链

 

 

11.Hive启动报错: Found class jline.Terminal, but interface was expected

http://www.linuxidc.com/Linux/2015-10/123822.htm

 

12.linux ps命令

 有时候系统管理员可能只关心现在系统中运行着哪些程序,而不想知道有哪些进程在运行。由于一个应用程序可能需要启动多个进程。所以在同等情况下,进程的数 量要比程序多的多。为此从阅读方面考虑,管理员需要知道系统中运行的具体程序。要实现这个需求的话,就需要利用命令ps来帮忙。

要对进程进行监测和控制,首先必须要了解当前进程的情况,也就是需要查看当前进程,而 ps 命令就是最基本同时也是非常强大的进程查看命令。使用该命令可以确定有哪些进程正在运行和运行的状态、进程是否结束、进程有没有僵死、哪些进程占用了过多 的资源等等。总之大部分信息都是可以通过执行该命令得到的。

参考:http://www.cnblogs.com/wangkangluo1/archive/2011/09/23/2185938.html 

 

 

 

  • 大小: 43.2 KB
分享到:
评论

相关推荐

    hadoop常见问题及解决办法

    问题6:Problem: "No live nodes contain current block" 问题描述:Hadoop抛出"No live nodes contain current block"错误。 解决办法:该问题是由于dfs.datanode.socket.write.timeout的值太小所致。解决办法是...

    Scaling Big Data with Hadoop and Solr

    Bringing these two technologies together is helping organizations resolve the problem of information extraction from Big Data by providing excellent distributed faceted search capabilities. ...

    Hadoop in Practice

    Hadoop in Practice collects 85 Hadoop examples and presents them in a problem/solution format. Each technique addresses a specific task you'll face,

    Big Data, MapReduce, Hadoop, and Spark with Python

    The problem is, the amount of data we collect has outpaced gains in CPU performance. Therefore, scalable methods for processing data are needed. In the early 2000s, Google invented MapReduce, a ...

    hadoop-eclipse-plugin-2.6.0.jar程序文件

    It takes me two days to find the root cause for the exceptions and finally generate this jar file for hadoop 2.6.0. 1.use hadoop 2.6.0. 2.change the hadoop_home and ...Any problem,please let me know.

    Hadoop in Practice(2012)

    Hadoop in Practice collects 85 Hadoop examples and presents them in a problem/solution format. Each technique addresses a specific task you'll face, like querying big data using Pig or writing a log ...

    Pro Apache Hadoop(Apress,2ed,2014)

    Learn to solve big-data problems the MapReduce way, by breaking a big problem into chunks and creating small-scale solutions that can be flung across thousands upon thousands of nodes to analyze ...

    hadoop_the_definitive_guide_3nd_edition

    Decomposing a Problem into MapReduce Jobs 180 JobControl 182 Apache Oozie 182 6. How MapReduce Works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 187...

    hadoop-eclipse-plugin-2.5.2.jar

    Tested with eclipse jee luna R win32 x86 64 Any problem please give comments below Eclipse Java EE IDE for Web Developers Version: Luna Release 4 4 0 Build id: 20140612 0600

    hadoop-eclipse-plugin-2.4.1.jar

    try clean eclipse to see whether it works or not if any problem found.

    Big Data Made Easy - A Working Guide To The Complete Hadoop Toolset

    #### Chapter 1: The Problem with Data This chapter delves into the challenges associated with handling big data. It explains why traditional databases and processing tools are not suitable for ...

    Hadoop3.2.0集群搭建常见注意事项

    在搭建Hadoop 3.2.0集群时,有几点重要的注意事项需要关注,以确保集群的稳定运行和正确配置。以下是对这些要点的详细说明: 首先,关于端口的变化是关键。在3.2.0版本中,Namenode的Web界面端口由原来的50070更改...

    PySpark Recipes: A Problem-Solution Approach with PySpark2

    Content is presented in the popular problem-solution format. Look up the programming problem that you want to solve. Read the solution. Apply the solution directly in your own code. Problem solved! ...

    nuclei-analysis:一个用于数据分析的 hadoop-gis 项目

    ###核分析一个用于数据分析的 hadoop-gis 项目。 #####Problem 陈述:给定一组 Si 的多边形(核),根据病理图像计算特征向量 Fi。 对功能的选择要有创意。 例子: 遮罩面积、周长、形状蒙版内纹理填充区域纹理开发...

Global site tag (gtag.js) - Google Analytics