Article List
15/12/09 16:47:52 INFO yarn.ExecutorRunnable: Setting up executor with environment: Map(CLASSPATH -> {{PWD}}<CPS>{{PWD}}/__spark__.jar<CPS>$HADOOP_CONF_DIR<CPS>$HADOOP_COMMON_HOME/share/hadoop/common/*<CPS>$HADOOP_COMMON_HOME/share/hadoop/common/lib/*<CPS>$HADOOP_ ...
  Running on a YARN cluster is straightforward: 1. set HADOOP_CONF_DIR (you can use the command export HADOOP_CONF_DIR=xx, or add it to spark-env.sh); 2. spark-submit --master yarn --class org.apache.spark.examples.JavaWordCount --verbose --deploy-mode client ~/spark/spark-1.4.1-bin-hadoop2.4/ ...
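The two steps in that excerpt can be sketched as shell commands (the HADOOP_CONF_DIR path and the input argument here are illustrative; the jar path and class name follow the excerpt):

```shell
# 1. point Spark at the Hadoop/YARN configuration directory
export HADOOP_CONF_DIR=/etc/hadoop/conf   # or put this line in conf/spark-env.sh

# 2. submit the example job to YARN in client deploy mode
spark-submit \
  --master yarn \
  --deploy-mode client \
  --class org.apache.spark.examples.JavaWordCount \
  --verbose \
  ~/spark/spark-1.4.1-bin-hadoop2.4/lib/spark-examples-1.4.1-hadoop2.4.0.jar \
  hdfs:///tmp/input.txt   # illustrative input path
```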
  Yep, you can submit an app to a Spark standalone cluster with the spark-submit command, e.g. spark-submit --master spark://gzsw-02:7077 --class org.apache.spark.examples.JavaWordCount --verbose --deploy-mode client ~/spark/spark-1.4.1-bin-hadoop2.4/lib/spark-examples-1.4.1-hadoop2.4.0.jar spark/spark-1.4.1-bin-hadoo ...
  Per-partition versions of map() and foreach(), ref: Learning Spark
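In Spark these are mapPartitions() and foreachPartition(); the point is that setup cost (e.g. opening a connection) is paid once per partition rather than once per element. A minimal pure-Scala sketch of the iterator-based shape (partitions are simulated here with grouped(); in Spark each iterator would be one RDD partition):

```scala
object PerPartitionDemo {
  // stand-in for expensive per-partition setup, e.g. opening a DB connection
  def setup(): Int => Int = { x => x * 2 }

  // in Spark this shape is: rdd.mapPartitions { iter => val f = setup(); iter.map(f) }
  def doublePerPartition(data: List[Int], partitionSize: Int): List[Int] =
    data.grouped(partitionSize).flatMap { part =>
      val f = setup()        // paid once per "partition", not once per element
      part.iterator.map(f)   // then the whole partition flows through it
    }.toList

  def main(args: Array[String]): Unit =
    println(doublePerPartition((1 to 10).toList, 5))  // List(2, 4, ..., 20)
}
```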
http://www.baidu.com/p/hejuncheng1018?from=wenku chapter 9, http://www.tuicool.com/articles/3mMz6b chapter 11, chapter 12, chapter 14, chapter 20 (this answer is not all correct :( ), chapter 17

hadoop-compression

http://blog.cloudera.com/blog/2009/11/hadoop-at-twitter-part-1-splittable-lzo-compression/ (namely: making Hadoop support splittable LZO compression) Very basic question about Hadoop and compressed input files Hadoop gzip input file using only one mapper Why can't hadoop split up a large text file and then compress t ...
all figures below are from 'Learning Spark',
answers for the book: outline; required 1; required 2 https://zhidao.baidu.com/question/1671038094187205107.html; optional 2.1; optional 2-3; required 3; required 4; required 5 -- teacher's book
  In ZooKeeper, under certain I/O pressure the client will try to reconnect to the quorum. After that, the quorum peer returns a new session timeout (aka negotiatedSessionTimeout) to the client, and the client then recomputes the real connTimeout and readTimeout from the response. The negotiatedSessionTime ...
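The recomputation that excerpt describes can be sketched as follows. The 2/3 read-timeout and per-host connect-timeout split follows the shape of the ZooKeeper client's ClientCnxn; this is a sketch of the arithmetic, not the actual class:

```scala
object ZkTimeouts {
  // After the quorum returns a negotiatedSessionTimeout, the client derives
  // its real timeouts from it (shape as in ZooKeeper's ClientCnxn):
  //   readTimeout    = negotiated * 2 / 3
  //   connectTimeout = negotiated / number-of-quorum-hosts
  def recompute(negotiatedSessionTimeoutMs: Int, quorumHosts: Int): (Int, Int) = {
    val readTimeout    = negotiatedSessionTimeoutMs * 2 / 3
    val connectTimeout = negotiatedSessionTimeoutMs / quorumHosts
    (connectTimeout, readTimeout)
  }

  def main(args: Array[String]): Unit =
    // e.g. a 30s negotiated timeout against a 3-node quorum
    println(recompute(30000, 3))  // (10000, 20000)
}
```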
  After a heavy time cost (primarily downloading a huge number of jars), the first example from the book 'Learning Spark' ran through. The source code is very simple: /** * Illustrates flatMap + countByValue for wordcount. */ package com.oreilly.learningsparkexamples.scala import org.apache. ...
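The core of that example is a flatMap over the input lines followed by countByValue. The same logic on a plain Scala collection looks like this (on an RDD the List would be replaced by sc.textFile(...), and countByValue would be the RDD action of that name):

```scala
object WordCountSketch {
  // flatMap + countByValue for wordcount, as in the Learning Spark example;
  // locally, countByValue is just groupBy + size
  def countWords(lines: List[String]): Map[String, Int] =
    lines
      .flatMap(_.split(" "))              // split every line into words
      .groupBy(identity)                  // bucket identical words together
      .map { case (w, ws) => (w, ws.size) }

  def main(args: Array[String]): Unit =
    println(countWords(List("hello spark", "hello world")))
}
```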
    As you know, HBase's data logs (aka the WAL) roll after certain intervals to speed up restoring data that is occasionally lost. And of course, both log rolling and flushing the memstore block all writes (but not reads), so decreasing the log rolling frequency can optimize cluster performance.   1. case: during the h ...
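The roll interval is controlled by hbase.regionserver.logroll.period in hbase-site.xml (default 3600000 ms, i.e. one hour). A sketch of raising it to reduce roll frequency; the 2-hour value is illustrative and should be weighed against recovery time:

```xml
<!-- hbase-site.xml: roll the WAL every 2 hours instead of the 1-hour default -->
<property>
  <name>hbase.regionserver.logroll.period</name>
  <value>7200000</value> <!-- milliseconds -->
</property>
```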
Abstract: Spark can be compiled with Maven, sbt, or IntelliJ IDEA. ref: Spark 1.0.0 source compilation and deployment package generation (Spark1.0.0 源码编译和部署包生成)     Also, if you want to load the Spark project into Eclipse, it is necessary to make an 'eclipse project' first by one of the solutions below: 1. mvn eclipse:eclipse [optional] 2. ./sbt/sbt clean compile packa ...
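The Maven route above can be sketched as shell commands, run from the Spark source root (a sketch; exact flags vary by Spark version and desired Hadoop profile):

```shell
# build Spark itself with Maven (sbt works too)
mvn -DskipTests clean package

# generate .project/.classpath so the source tree imports as an 'eclipse project'
mvn eclipse:eclipse
```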
    The flow of distributing a Scala project using the Scala sbt (Simple Build Tool) plugin and the sbt-assembly plugin, through the steps 'create scala project', 'download dependent jars', 'publish scala project'. scala eclipse sbt (Simple Build Tool) application development
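A minimal build.sbt for that flow might look like the following; the project name and versions are illustrative, and sbt-assembly would be wired in via project/plugins.sbt:

```scala
// build.sbt -- minimal sketch for an sbt-assembly-packaged Spark app
name := "my-scala-app"          // illustrative project name
version := "0.1"
scalaVersion := "2.10.4"        // the Scala line used by Spark 1.4.x

// 'download dependent jars': sbt resolves these coordinates for you
libraryDependencies += "org.apache.spark" %% "spark-core" % "1.4.1" % "provided"

// project/plugins.sbt would add (version illustrative):
// addSbtPlugin("com.eed3si9n" % "sbt-assembly" % "0.14.0")
```

With this in place, `sbt assembly` produces a single fat jar suitable for spark-submit.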
  Below you will see different IP selection for the commands 'ping' and 'telnet' in Linux:   server1@myhost18:~$ telnet host1-26 60020 Trying 192.168.1.126... Connected to host1-26. Escape character is '^]'. quit ... org.apache.hadoop.ipc.RPC$VersionMismatch: Server IPC version 3 cannot communica ...
  These days I have been learning the data warehouse framework Hive, mainly from the ebook 'Programming Hive' [1], which covers the topic in about 23 chapters of detail ;)   So below is the outline of this topic:   1. overview 2. architecture 3. features 4. Hive vs Pig, Hive vs HBase 5. use cases   1. overview   ...
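As a taste of the overview item, a minimal HiveQL session: create a delimited table, load data, and aggregate. The table name and input path are illustrative:

```sql
-- create a managed table over tab-separated data
CREATE TABLE IF NOT EXISTS logs (host STRING, bytes BIGINT)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t';

-- load a local file into it (path is illustrative)
LOAD DATA LOCAL INPATH '/tmp/access.tsv' INTO TABLE logs;

-- the query Hive compiles into MapReduce jobs under the hood
SELECT host, SUM(bytes) AS total
FROM logs
GROUP BY host;
```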