Hadoop memory error

黎明lm


Hadoop: error caused by incorrect memory settings

11/09/06 09:20:25 WARN mapred.JobClient: Error reading task outputhttp://server4:50060/tasklog?plaintext=true&taskid=attempt_201109060853_0005_r_000008_0&filter=stdout
11/09/06 09:20:25 WARN mapred.JobClient: Error reading task outputhttp://server4:50060/tasklog?plaintext=true&taskid=attempt_201109060853_0005_r_000008_0&filter=stderr
11/09/06 09:20:34 INFO mapred.JobClient: Task Id : attempt_201109060853_0005_m_000009_1, Status : FAILED
java.io.IOException: Task process exit with nonzero status of 1.
at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:418)

I got this error while running a MapReduce job. Some articles I found said the fix is to delete all the files under userlogs.

A post in English that I found:

Just an FYI, found the solution to this problem.

Apparently, it's an OS limit on the number of sub-directories that can be created in another directory.  In this case, we had 31998 sub-directories under hadoop/userlogs/, so any new tasks would fail in Job Setup.

From the unix command line, mkdir fails as well:
  $ mkdir hadoop/userlogs/testdir
  mkdir: cannot create directory `hadoop/userlogs/testdir': Too many links

Difficult to track down because the Hadoop error message gives no hint whatsoever.  And normally, you'd look in the userlog itself for more info, but in this case the userlog couldn't be created.
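
The check described in the quoted post can be sketched in shell as follows. This is only a rough sketch: `count_subdirs` and `check_subdir_limit` are hypothetical helper names, and the default limit of 31998 is simply the figure from the post, which depends on the filesystem.

```shell
#!/usr/bin/env bash
# Count the immediate subdirectories of a directory (hypothetical helper).
count_subdirs() {
  find "$1" -mindepth 1 -maxdepth 1 -type d | wc -l | tr -d ' '
}

# Report whether a directory has reached a given subdirectory limit.
# The default of 31998 is the figure quoted in the post, not a universal constant.
check_subdir_limit() {
  local dir="$1" limit="${2:-31998}"
  local n
  n=$(count_subdirs "$dir")
  if [ "$n" -ge "$limit" ]; then
    echo "FULL: $n subdirectories in $dir (limit $limit)"
    return 1
  fi
  echo "OK: $n subdirectories in $dir (limit $limit)"
}
```

For example, `check_subdir_limit hadoop/userlogs` prints the current count and returns a nonzero status once the limit is reached.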

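In my case, however, running mkdir test under userlogs succeeded, so this was not the cause, and the error persisted even after I deleted everything under userlogs.

So I looked at the files under userlogs: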

[suse@server6 userlogs]$ cd attempt_201109060853_0005_m_000009_2/

[suse@server6 attempt_201109060853_0005_m_000009_2]$ cat stdout
Error occurred during initialization of VM
Incompatible minimum and maximum heap sizes specified
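
When many attempts have failed, checking each one by hand is tedious. A minimal sketch of a scan over the userlogs directory, assuming the one-directory-per-attempt layout with stdout and stderr files seen above (`find_vm_init_failures` is a hypothetical name):

```shell
#!/usr/bin/env bash
# Print the name of every task attempt directory whose stdout or stderr
# contains the JVM startup failure message.
find_vm_init_failures() {
  local userlogs="$1" dir
  for dir in "$userlogs"/attempt_*/; do
    [ -d "$dir" ] || continue
    # -q: exit status only; -s: stay quiet if stdout or stderr is missing
    if grep -q -s "Error occurred during initialization of VM" \
        "${dir}stdout" "${dir}stderr"; then
      basename "$dir"
    fi
  done
}
```

Running `find_vm_init_failures .` from inside userlogs lists the attempts that never got a working JVM.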

The cause turned out to be an invalid JVM memory setting. The corrected configuration:

    <property>
            <name>mapred.child.java.opts</name>
            <value>-Xmx1024m -Xms1024m -Xmn192m -XX:+UseConcMarkSweepGC -XX:CMSFullGCsBeforeCompaction=5 -XX:+UseParNewGC -XX:SurvivorRatio=8 -XX:TargetSurvivorRatio=90 -XX:MaxTenuringThreshold=31 -XX:+PrintGCDetails -XX:+PrintGCTimeStamps -XX:+PrintGCApplicationStoppedTime -Xloggc:$HADOOP_HOME/logs/gc.log</value>
            <description>Java opts for the task tracker child processes.
                    The following symbol, if present, will be interpolated: @taskid@ is replaced
                    by current TaskID. Any other occurrences of '@' will go unchanged.
                    For example, to enable verbose gc logging to a file named for the taskid in
                    /tmp and to set the heap maximum to be a gigabyte, pass a 'value' of:
                    -Xmx1024m -verbose:gc -Xloggc:/tmp/@taskid@.gc
                    The configuration variable mapred.child.ulimit can be used to control the
                    maximum virtual memory of the child processes.
            </description>
    </property>

What I had originally written was:

    <property>
            <name>mapred.child.java.opts</name>
            <value>-Xmx512m -Xms1024m -Xmn192m -XX:+UseConcMarkSweepGC -XX:CMSFullGCsBeforeCompaction=5 -XX:+UseParNewGC -XX:SurvivorRatio=8 -XX:TargetSurvivorRatio=90 -XX:MaxTenuringThreshold=31 -XX:+PrintGCDetails -XX:+PrintGCTimeStamps -XX:+PrintGCApplicationStoppedTime -Xloggc:$HADOOP_HOME/logs/gc.log</value>
            <description>Java opts for the task tracker child processes.
                    The following symbol, if present, will be interpolated: @taskid@ is replaced
                    by current TaskID. Any other occurrences of '@' will go unchanged.
                    For example, to enable verbose gc logging to a file named for the taskid in
                    /tmp and to set the heap maximum to be a gigabyte, pass a 'value' of:
                    -Xmx1024m -verbose:gc -Xloggc:/tmp/@taskid@.gc
                    The configuration variable mapred.child.ulimit can be used to control the
                    maximum virtual memory of the child processes.
            </description>
    </property>

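I had mistakenly set -Xmx (512m) lower than -Xms (1024m), so the maximum heap was smaller than the initial heap and the JVM refused to start.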
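
A mistake like this can be caught before a job is submitted. The sketch below compares -Xms and -Xmx in an option string. The function names are hypothetical, the parsing is deliberately simple (whitespace-separated options, an optional k/m/g suffix), and it is not part of Hadoop.

```shell
#!/usr/bin/env bash
# Convert a JVM size such as 512m, 1g, 20000k or 1048576 to bytes.
to_bytes() {
  local v="$1" num unit
  num="${v%[kKmMgG]}"      # strip one trailing unit letter, if any
  unit="${v#"$num"}"       # whatever was stripped is the unit
  case "$unit" in
    k|K) echo $(( num * 1024 )) ;;
    m|M) echo $(( num * 1024 * 1024 )) ;;
    g|G) echo $(( num * 1024 * 1024 * 1024 )) ;;
    "")  echo "$num" ;;
  esac
}

# Print the value of the last option starting with the given prefix (e.g. -Xmx).
flag_value() {
  local prefix="$1" opts="$2" word value=""
  for word in $opts; do
    case "$word" in
      "$prefix"*) value="${word#"$prefix"}" ;;
    esac
  done
  [ -n "$value" ] && echo "$value"
}

# Fail if the initial heap (-Xms) is larger than the maximum heap (-Xmx).
check_heap_opts() {
  local opts="$1" xms xmx
  xms=$(flag_value -Xms "$opts") || return 0   # nothing to compare
  xmx=$(flag_value -Xmx "$opts") || return 0
  if [ "$(to_bytes "$xms")" -gt "$(to_bytes "$xmx")" ]; then
    echo "invalid: -Xms$xms is larger than -Xmx$xmx"
    return 1
  fi
  echo "ok: -Xms$xms <= -Xmx$xmx"
}
```

Passing the value of mapred.child.java.opts to `check_heap_opts` flags the original setting (-Xmx512m -Xms1024m) and accepts the corrected one.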

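Here is what these parameters mean.

-Xss 20000k

With this setting, every additional thread costs the JVM about 20 MB of memory. The recommended value is reportedly 128K, and the default seems to be 512k (the actual default varies by platform and JVM version).

-Xmx: the maximum size of the Java heap. The default is 1/4 of physical memory; the best value depends on how much physical memory the machine has and on what else on the machine is using memory.

-Xms: the initial size of the Java heap. On a server JVM it is best to set -Xms and -Xmx to the same value; on development and test machines the default can be kept.

-Xmn: the size of the young generation of the Java heap. If you are not familiar with it, keep the default.

-Xss: the stack size of each thread. If you are not familiar with it, keep the default.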
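
To make the cost of a large -Xss concrete, here is a small arithmetic sketch. `stack_total_mib` is a hypothetical helper, and the result is an upper bound on the space reserved for thread stacks, since a stack is generally not fully used.

```shell
#!/usr/bin/env bash
# Estimate the total space reserved for thread stacks, in MiB (integer division).
#   xss_kb:  per-thread stack size in KiB (the number given to -Xss with a k suffix)
#   threads: number of threads
stack_total_mib() {
  local xss_kb="$1" threads="$2"
  echo $(( xss_kb * threads / 1024 ))
}

stack_total_mib 20000 100   # prints 1953
stack_total_mib 128 100     # prints 12
```

With 100 threads, -Xss20000k reserves close to 2 GiB for stacks alone, while -Xss128k needs about 12 MiB.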


2011-09-06 09:58