Compressing files on HDFS for users to download

duguyiren3476

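(Reposted) Today's task was to work out how to let users download files stored on HDFS over the network. This is essentially no different from compressing ordinary files: Java's built-in java.util.zip package handles it easily. The only difference is that each file stream is opened with FSDataInputStream from the Hadoop API, and its contents are then written, file by file, into the zip stream.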

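As for the workflow, the user first selects the files to download. For a developer with basic HTML knowledge this is no problem: a form with checkboxes does the job, and once the user confirms the selection it is submitted to a servlet for processing.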
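The checkbox values arrive in the servlet as a repeated request parameter, and `getParameterValues` returns `null` when no box is ticked. The following is a minimal sketch of cleaning that array before it is used to build HDFS paths and zip entry names. The class `SelectedFiles` and its `clean` method are hypothetical names introduced here for illustration; they are not part of the original post, and the filtering rules are only one reasonable choice.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper: filters raw checkbox values before they are used as file names. */
final class SelectedFiles {
    private SelectedFiles() {}

    static List<String> clean(String[] raw) {
        List<String> names = new ArrayList<>();
        if (raw == null) {
            return names; // no checkbox was ticked
        }
        for (String name : raw) {
            if (name == null || name.isEmpty()) {
                continue;
            }
            // Skip values that could point outside the intended base directory.
            if (name.contains("/") || name.contains("\\") || name.contains("..")) {
                continue;
            }
            // Skip repeats: ZipOutputStream rejects duplicate entry names.
            if (!names.contains(name)) {
                names.add(name);
            }
        }
        return names;
    }
}
```

A servlet could call `SelectedFiles.clean(request.getParameterValues("SelectPic"))` and loop over the returned list, which also avoids a NullPointerException when nothing is selected.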

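// Imports needed by the enclosing servlet class
import java.io.BufferedOutputStream;
import java.io.IOException;
import java.net.URI;
import java.util.zip.ZipEntry;
import java.util.zip.ZipOutputStream;
import javax.servlet.ServletException;
import javax.servlet.http.HttpServletRequest;
import javax.servlet.http.HttpServletResponse;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

protected void doPost(HttpServletRequest request, HttpServletResponse response) throws ServletException, IOException {
    String photoYear = request.getParameter("photoYear");
    String id = request.getParameter("id");

    Configuration conf = new Configuration();
    conf.set("hadoop.job.ugi", "hadoop,supergroup");

    String uriBase = String.format("hdfs://cloud:9000/%s/%s/", photoYear, id);
    String[] files = request.getParameterValues("SelectPic");

    // Set the header before any bytes are written to the response
    response.setHeader("Content-Type", "application/zip");
    // One layer of buffering is enough
    ZipOutputStream outZip = new ZipOutputStream(new BufferedOutputStream(response.getOutputStream()));

    int bytesRead;
    Path sourceFilePath;

    FileSystem fs = FileSystem.get(URI.create(uriBase), conf);
    try {
        for (int i = 0; i < files.length; i++) {
            sourceFilePath = new Path(uriBase + files[i]);

            // Open the HDFS input stream
            FSDataInputStream in = fs.open(sourceFilePath);

            // Create the zip entry for this file
            ZipEntry entry = new ZipEntry(files[i]);
            outZip.putNextEntry(entry); // position the zip stream at this entry's data

            // Copy the HDFS file contents into the zip stream;
            // read() returns -1 at end of stream
            byte[] buffer = new byte[4096];
            while ((bytesRead = in.read(buffer)) != -1) {
                outZip.write(buffer, 0, bytesRead);
            }
            outZip.closeEntry();
            in.close();
        }
        outZip.flush();
        outZip.close();
    } catch (Exception e) {
        outZip.close();
        e.printStackTrace();
    }
}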
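The copy loop in the servlet can be tried without a Hadoop cluster, since FSDataInputStream is an ordinary `java.io.InputStream` subclass. Below is a minimal sketch, using only the JDK, that writes each named input stream as one zip entry. The class `ZipCopy` and the method `zipAll` are hypothetical names introduced for this example; in the servlet, the map values would be the streams returned by `fs.open(...)` and the output would be the response stream.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipOutputStream;

/** Hypothetical helper mirroring the servlet's read/write loop with plain JDK streams. */
final class ZipCopy {
    private ZipCopy() {}

    /** Writes each named stream as one zip entry; the caller remains responsible for closing 'out'. */
    static void zipAll(Map<String, InputStream> sources, OutputStream out) throws IOException {
        ZipOutputStream zip = new ZipOutputStream(out);
        byte[] buffer = new byte[4096];
        for (Map.Entry<String, InputStream> source : sources.entrySet()) {
            // try-with-resources closes the input even if the copy fails
            try (InputStream in = source.getValue()) {
                zip.putNextEntry(new ZipEntry(source.getKey()));
                int bytesRead;
                // read() returns -1 at end of stream
                while ((bytesRead = in.read(buffer)) != -1) {
                    zip.write(buffer, 0, bytesRead);
                }
                zip.closeEntry();
            }
        }
        // finish() completes the archive without closing the underlying stream
        zip.finish();
    }
}
```

Using try-with-resources here is a design choice of the sketch rather than something the original code does; it ensures the input stream is released even when an exception interrupts the copy.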

2012-09-06 13:48