hadoop-serializations

leibnitz

浏览: 286080 次
性别:
来自: 广州

最近访客更多访客>>

eternal1025

bneliao

adapterofcoms

caipeijun666

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

博客分类：

hadoop

Hadoop

一. Writable

note:part of codes are from other's blog!here is a integrated and optimized shards.

package test;

import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.io.DefaultStringifier;

public class serializerWritable {

	/**
	 * @param args
	 */
	public static void main(String[] args) {

		Configuration conf = new Configuration();
		conf.set(
				"io.serializations",
				//TestSerializer uses Java's Serialization.
                                //if Testcase is used by that,here must be uncomment.
//				"org.apache.hadoop.io.serializer.JavaSerialization," + 
				"org.apache.hadoop.io.serializer.WritableSerialization"
				);
		TestSerializerWritable ts = new TestSerializerWritable(1, "测试呀");
		DefaultStringifier<TestSerializerWritable> ds = new DefaultStringifier<TestSerializerWritable>(
				conf, TestSerializerWritable.class);
		String s = null;
		try {
			s = ds.toString(ts);	//invoke ts's serialization method(write) automatically
		} catch (IOException e) {
			e.printStackTrace();
		}
                //if u used java serialization ,u will see the result  is space-cost much  than this
		System.out.println(s);
		TestSerializerWritable tsxp = null;
		try {
			tsxp = ds.fromString(s); //invoke deserialization method(read)
		} catch (IOException e) {
			e.printStackTrace();
		}
		System.out.println(tsxp.getA() + ":" + tsxp.getB());
	}

}

package test;

import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;

import org.apache.hadoop.io.Writable;

public class TestSerializerWritable implements Writable{

	private int a;
	private String b;
	public TestSerializerWritable( ) {
		
	}

	public TestSerializerWritable(int a, String b) {
		super();
		this.a = a;
		this.b = b;
	}

	public int getA() {
		return a;
	}

	public void setA(int a) {
		this.a = a;
	}

	public String getB() {
		return b;
	}

	public void setB(String b) {
		this.b = b;
	}

	@Override
	public void write(DataOutput out) throws IOException {
		out.writeInt(a);
		out.writeUTF(b);
		
	}

	@Override
	public void readFields(DataInput in) throws IOException {
		a = in.readInt();
		b = in.readUTF();
//		byte[] bb = new byte[1];
//		in.readFully(bb);
//		b = new String(bb);
	}
}

here is a tips to not to use java objects to serialize to SequenceFile

//TODO

References:

http://blog.sina.com.cn/s/blog_5cec1e1d0100oi8p.html

分享到：

ubuntu-base-commands | hdfs data flow-part writing

2011-03-24 23:00
浏览 1083
评论(0)
分类:非技术
查看更多

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

hadoop-serializations

评论

发表评论

相关推荐

最近访客 更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

hadoop-serializations

评论

发表评论

相关推荐

hadoop-replication written flow

hbase-export table to json file

yarn-similar logs when starting up container

hadoop-compression

hoya--hbase on yarn

compile hadoop-2.5.x on OS X(macbook)

upgrades of hadoop and hbase

how to submit jars to a map reduce job?

install snappy compression in hadoop and hbase

３。hbase rpc/ipc/proxy通信机制

hadoop-2 dfs/yarn 相关概念

hadoop 删除节点(Decommission nodes)

hadoop 2(0.23.x) 与 0.20.x比较

hadoop-2.0 alpha standalone install

hadoop源码阅读-shell启动流程-start-all

hadoop源码阅读-shell启动流程

hadoop源码阅读-第二回阅读开始

hadoop 联合 join操作

hadoop几种排序简介

nutch搜索架构关键类

最近访客更多访客>>