Python: Extracting Text from Images

March 7, 2024 · Source: qq_34802511

Environment:

1. Python 3
2. The pillow and pytesseract packages for Python 3
   Install them with pip install pillow and pip install pytesseract, or through PyCharm.
3. The tesseract-ocr recognition engine (download link)
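Before running the code, it can help to confirm that all three prerequisites are present. The following is a minimal sketch using only the standard library. The function name check_ocr_environment is made up for illustration, and the check assumes the engine's executable is named tesseract and is on PATH, so it will report "missing" if you rely solely on setting tesseract_cmd to a full path.

```python
import importlib.util
import shutil


def check_ocr_environment():
    """Report which OCR prerequisites can be found (hypothetical helper)."""
    return {
        # pillow is imported under the module name "PIL"
        "pillow": importlib.util.find_spec("PIL") is not None,
        "pytesseract": importlib.util.find_spec("pytesseract") is not None,
        # Assumption: the engine executable is called "tesseract" and is on PATH
        "tesseract": shutil.which("tesseract") is not None,
    }


for name, found in check_ocr_environment().items():
    print(name, "found" if found else "missing")
```

Anything reported as missing should be installed before continuing.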

Code:

#-*- coding:utf-8 -*-
from PIL import Image
import pytesseract

# Tell pytesseract where the tesseract executable lives (adjust to your install path)
pytesseract.pytesseract.tesseract_cmd = r'E:\Tesseract-OCR\tesseract.exe'

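# Recognize English text; lang may be omitted, since Tesseract falls back to English
text = pytesseract.image_to_string(Image.open(r'E:\project\study_tornado\mm.jpg'), lang='eng')
print(text)
# Recognize Simplified Chinese text (needs the chi_sim language data installed);
# English is also recognized, but less accurately
text = pytesseract.image_to_string(Image.open(r'E:\project\study_tornado\mm.jpg'), lang='chi_sim')
print(text)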

# Reference: https://blog.csdn.net/u010134642/article/details/78747630
# Reference: https://blog.csdn.net/helloc0de/article/details/80410250
# Reference: https://www.cnblogs.com/morwind/p/6867547.html
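For images that mix Chinese and English, Tesseract accepts several language codes joined with "+" (for example chi_sim+eng), which may work better than chi_sim alone. The sketch below shows one way to wrap that. The helper names build_lang and extract_text are hypothetical, and the language data for every code you pass is assumed to be installed.

```python
def build_lang(*codes):
    """Join Tesseract language codes with '+' (hypothetical helper)."""
    cleaned = [c.strip() for c in codes if c and c.strip()]
    if not cleaned:
        raise ValueError("at least one language code is required")
    return "+".join(cleaned)


def extract_text(image_path, *codes):
    """Run OCR on one image file with the given language codes."""
    # Imported here so build_lang stays usable without the OCR packages
    from PIL import Image
    import pytesseract

    # The context manager closes the image file once OCR has finished
    with Image.open(image_path) as img:
        return pytesseract.image_to_string(img, lang=build_lang(*codes))


print(build_lang("chi_sim", "eng"))  # chi_sim+eng
# Example call, using the image path from the code above:
# print(extract_text(r'E:\project\study_tornado\mm.jpg', 'chi_sim', 'eng'))
```

Opening the image in a with block is a small design choice: it releases the file handle promptly, which matters when processing many images in a loop.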

    Original author: qq_34802511
    Original URL: https://blog.csdn.net/qq_34802511/article/details/90208679