使用百度API识别图片文字

raymond.chen

浏览: 1443260 次
性别:
来自: 广州

最近访客更多访客>>

林祥纤

whzresponse

loginboot

vicento4

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

博客分类：

Java

1、注册百度账号 https://login.bce.baidu.com/

2、定位到产品服务 / 文字识别 - 概览页面

3、创建一个应用

4、下载相关的SDK包，在工程项目中引用。如果是maven工程，直接在pom.xml文件中添加依赖包

<groupId>com.baidu.aip</groupId>

</dependency>

5、编写测试代码

public class BaiduAPITest {
    public static final String APP_ID = "你的应用的 AppID";
    public static final String API_KEY = "你的应用的 API Key";
    public static final String SECRET_KEY = "你的应用的 Secret Key";
	
    public static void main(String[] args) {
    	String imagePath = "doc\\pic2.jpg";
    	System.out.println(generalString(imagePath, true));
	}
    
    /**
     * 识别图片上的文本内容，转成文字字符串返回
     * @param imagePath 图片文件的路径
     */
    private static String generalString(String imagePath, boolean isNewline){
    	try{
    		HashMap<String, String> options = new HashMap<String, String>();
            options.put("language_type", "CHN_ENG"); //CHN_ENG:中英文混合， ENG:英文
            options.put("detect_direction", "true"); //是否检测图像朝向，默认不检测，即：false
            options.put("detect_language", "true"); //是否检测语言，默认不检测。
            options.put("probability", "false"); //是否返回识别结果中每一行的置信度
            
            //通用文字识别
        	JSONObject jsonObject = getAipOcr().basicGeneral(imagePath, options); 
        	String result = mergeString(jsonObject, isNewline);
        	return result;
        }catch(Exception ex){
        	ex.printStackTrace();
        }
    	
    	return "";
    }
    
    /**
     * AipOcr最好单例模式使用
     */
    private static AipOcr getAipOcr(){
        AipOcr client = new AipOcr(APP_ID, API_KEY, SECRET_KEY);
        client.setConnectionTimeoutInMillis(2000);
        client.setSocketTimeoutInMillis(60000);
        return client;
    }
    
    /**
     * 从返回的JSONObject对象中取出需要的文字内容，组装成一个大文本内容
     * @param jsonObject JSONObject对象
     * @param isNewline 每个识别结果字符串的后面是否增加换行符
     */
    private static String mergeString(JSONObject jsonObject, boolean isNewline){
    	if(jsonObject == null){
    		return "";
    	}
    	
    	if(jsonObject.has("words_result") && jsonObject.has("words_result_num")){
    		int wordsResultNum = jsonObject.getInt("words_result_num");
    		if(wordsResultNum > 0){
    			StringBuilder sb = new StringBuilder();
    			
    			JSONArray jsonArray = jsonObject.getJSONArray("words_result");
    			int len = jsonArray.length();
    			for(Iterator<Object> it=jsonArray.iterator(); it.hasNext();){
    				JSONObject obj = (JSONObject)it.next();
    				if(isNewline){
    					sb.append(obj.get("words") + "\n");
    				}else{
    					sb.append(obj.get("words"));
    				}
    			}
    			
    			return sb.toString();
    		}
    	}else{
    		return jsonObject.toString();
    	}
    	
    	return null;
    }
}

6、准备一张jpg图片文件

7、执行测试代码，查看效果

//接口返回的内容
{
    "words_result": [
        {"words": "命令:yum- y install libpng12"},
        {"words": "做完这些我们就可以直接使用Tess4进行图片识别了,目前我只试过字母和数字的"},
        {"words": "子,有点渣,请诸位大神不要吐槽,是直接传入图片ur地址解析的"},
        {"words": "public static String imagetotel(String imgurl)"},
        {"words": "Itesseract instance=new Tesseracto);"},
        {"words": "URL url=new URL(imgurl);"}
    ],
    "direction": 0,
    "words_result_num": 6,
    "language": 3,
    "log_id": 5228016013525318579
}

//处理后的内容
命令:yum- y install libpng12
做完这些我们就可以直接使用Tess4进行图片识别了,目前我只试过字母和数字的
子,有点渣,请诸位大神不要吐槽,是直接传入图片ur地址解析的
public static String imagetotel(String imgurl)
Itesseract instance=new Tesseracto);
URL url=new URL(imgurl);

查看图片附件

分享到：

Heasy：基于Android平台的APP混合开发框 ... | HanLP自然语言处理包的使用

2018-09-21 22:41
浏览 2503
评论(0)
分类:编程语言
查看更多

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

使用百度API识别图片文字

评论

发表评论

相关推荐

最近访客 更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

使用百度API识别图片文字

评论

发表评论

相关推荐

keytool的使用

Bitset数据结构的使用

Disruptor：高性能低延迟的内存有界队列框架

java的类加载机制

ThreadLocal的使用范例

反射工具包Reflections的使用

使用CGLIB对实现类进行动态代理

基于JDK动态代理实现Mybatis的Mapper功能

Java8新特性

HanLP自然语言处理包的使用

org.apache.commons常用类的使用

图片转换为单色

Java事件机制范例

编程方式的quartz2例子

数字证书格式

Drools6使用范例

生成带logo的二维码图片

用HttpClient访问CXF的RESTful接口

commons-configuration使用范例

Websocket的使用范例

最近访客更多访客>>