tesseract应用

huangyongxing310

浏览: 508456 次
性别:
来自: 广州

最近访客更多访客>>

hiroada

lixiaoxin

u012363178

wangyy

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

博客分类：

机器学习

tesseract应用

from PIL import Image
import pytesseract
print(pytesseract.image_to_string(Image.open('test.png')))
print(pytesseract.image_to_string(Image.open('test-european.jpg'), lang='fra'))

核心代码就是image_to_string函数，该函数还支持-l eng 参数，支持-psm 参数。

--psm：指定识别对象属性，如果要识别的图像中文字的分布是只有一行，就是用 “--psm 7”

Page segmentation modes:
0 Orientation and script detection (OSD) only.
1 Automatic page segmentation with OSD.
2 Automatic page segmentation, but no OSD, or OCR.
3 Fully automatic page segmentation, but no OSD. (Default)
4 Assume a single column of text of variable sizes.
5 Assume a single uniform block of vertically aligned text.
6 Assume a single uniform block of text.
7 Treat the image as a single text line.
8 Treat the image as a single word.
9 Treat the image as a single word in a circle.
10 Treat the image as a single character.
11 Sparse text. Find as much text as possible in no particular order.
12 Sparse text with OSD.
13 Raw line. Treat the image as a single text line,

bypassing hacks that are Tesseract-specific.

image_to_string(Image.open('test.png'),lang="eng" config="-psm 7")

命令行使用
tesseract chi_sm.png result -l chi_sim
格式的意思是：软件图片名识别结果保存为result.txt -l表示选择语言最后是语言
chi_sim(简体中文)
eng(英文)

训练自己的库
jTessBoxEditor
这个东西是用来训练一个叫做teesseract智能图片识别软件的训练框架，

在进行训练之前还有几个小步骤：
1.将图片转换成tif格式，用于后面生成box文件。可以通过画图，然后另存为tif即可。(标签图像文件格式)
更改图片名字，这个是有要求的=。=
tif文面命名格式[lang].[fontname].exp[num].tif
lang是语言 fontname是字体
比如我们要训练自定义字库 mjorcen字体名normal
那么我们把图片文件重命名 mjorcen.normal.exp0.jpg在转tif。

2.生成box文件，CMD命令：
tesseract mjorcen.normal.exp0.jpg mjorcen.normal.exp0 -l chi_sim batch.nochop makebox
这里生成的box是存储这图片文字的识别位置参数，如果没有识别出任何文字，里面应该是空的，不信的可以用记事本方式打开。顺表可以随手添加几个数据，分别是字体坐标，和文字宽高，还有图片序号，因为这里只有一张图片，所以我最后就写0

https://blog.csdn.net/ProgramOfApe/article/details/78288622(jTessBoxEditor使用)

http://www.cnblogs.com/cnlian/p/5765871.html(jTessBoxEditor使用)

https://blog.csdn.net/woaipangruimao/article/details/78741022(用jTessBoxEditor自动训练3500常用汉字)

https://blog.csdn.net/Metamorpho/article/details/80835574

分享到：

灰度图像--形态学处理（腐蚀，膨胀，开、闭 ... | 卷积神经网络（CNN）

2018-10-12 14:05
浏览 543
评论(0)
分类:互联网
查看更多

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论