java 读取文件编码问题 -

jianFengGong

浏览: 20879 次
性别:
来自: 上海

最近访客更多访客>>

a3787928

chensh2010

2199075001

summerwaves

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

java 读取文件编码问题

博客分类：

JAVA

在项目中遇到要读取文本文件内容然后批量查询，但每当在后台读取上传文件流时，第一个内容总会有一个？如：

？test0

test1

而实际内容应该是：

test0

test1.

经过查找资料，有了下面解决方式：

BufferedReader nickContent = new BufferedReader(new UnicodeReader(mFileItem.getInputStream(),Charset.defaultCharset().name()));

UnicodeReader.java
import java.io.*;
    /**
     * Generic unicode textreader, which will use BOM mark
     * to identify the encoding to be used. If BOM is not found
     * then use a given default or system encoding.
     */
    public class UnicodeReader extends Reader {
        PushbackInputStream internalIn;
        InputStreamReader   internalIn2 = null;
        String              defaultEnc;

        private static final int BOM_SIZE = 4;

        /**
         *
         * @param in inputstream to be read
         * @param defaultEnc default encoding if stream does not have
         *                   BOM marker. Give NULL to use system-level default.
         */
        public UnicodeReader(InputStream in, String defaultEnc) {
            internalIn = new PushbackInputStream(in, BOM_SIZE);
            this.defaultEnc = defaultEnc;
        }

        public String getDefaultEncoding() {
            return defaultEnc;
        }

        /**
         * Get stream encoding or NULL if stream is uninitialized.
         * Call init() or read() method to initialize it.
         */
        public String getEncoding() {
            if (internalIn2 == null) return null;
            return internalIn2.getEncoding();
        }

        /**
         * Read-ahead four bytes and check for BOM marks. Extra bytes are
         * unread back to the stream, only BOM bytes are skipped.
         */
        protected void init() throws IOException {
            if (internalIn2 != null) return;

            String encoding;
            byte bom[] = new byte[BOM_SIZE];
            int n, unread;
            n = internalIn.read(bom, 0, bom.length);

            if ( (bom[0] == (byte)0x00) && (bom[1] == (byte)0x00) &&
                    (bom[2] == (byte)0xFE) && (bom[3] == (byte)0xFF) ) {
                encoding = "UTF-32BE";
                unread = n - 4;
            } else if ( (bom[0] == (byte)0xFF) && (bom[1] == (byte)0xFE) &&
                    (bom[2] == (byte)0x00) && (bom[3] == (byte)0x00) ) {
                encoding = "UTF-32LE";
                unread = n - 4;
            } else if ( (bom[0] == (byte)0xEF) && (bom[1] == (byte)0xBB) &&
                    (bom[2] == (byte)0xBF) ) {
                encoding = "UTF-8";
                unread = n - 3;
            } else if ( (bom[0] == (byte)0xFE) && (bom[1] == (byte)0xFF) ) {
                encoding = "UTF-16BE";
                unread = n - 2;
            } else if ( (bom[0] == (byte)0xFF) && (bom[1] == (byte)0xFE) ) {
                encoding = "UTF-16LE";
                unread = n - 2;
            } else {
                // Unicode BOM mark not found, unread all bytes
                encoding = defaultEnc;
                unread = n;
            }
            //System.out.println("read=" + n + ", unread=" + unread);

            if (unread > 0) internalIn.unread(bom, (n - unread), unread);

            // Use given encoding
            if (encoding == null) {
                internalIn2 = new InputStreamReader(internalIn);
            } else {
                internalIn2 = new InputStreamReader(internalIn, encoding);
            }
        }

        public void close() throws IOException {
            init();
            internalIn2.close();
        }

        public int read(char[] cbuf, int off, int len) throws IOException {
            init();
            return internalIn2.read(cbuf, off, len);
        }
}

分享到：

什么是云计算 | Java POI导出Excel

2014-07-22 09:51
浏览 385
评论(0)
分类:编程语言
查看更多

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

java 读取文件编码问题

评论

发表评论

相关推荐

最近访客 更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

java 读取文件编码问题

评论

发表评论

相关推荐

Java POI导出Excel

java 发送http请求

基站定位

java --枚举

redis客户端与spring整合

double 保留指定的小数位

无线定位系统的基站选择算法

基站定位算法

linux下配置redis server

readis windows servrer 搭建与Java客户端的连接

Spring 整合EhCache

坐标纠偏的实现

MIAN2 Server端与spring的整合

java项目转web项目

MIAN2客户端与spring的整合

最近访客更多访客>>