Batch-Extracting Text from Images with Python

Date: 2025-02-23 07:34:00

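Extracting text from images in bulk relies on optical character recognition (OCR). In Python this is commonly done with the Tesseract OCR engine, called through the pytesseract wrapper, with Pillow (the maintained fork of PIL, the Python Imaging Library) handling image loading. Hosted services such as google-cloud-vision are an alternative. The basic steps are: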

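Install the required libraries: you need the Tesseract engine itself plus the pytesseract and Pillow Python packages. Tesseract is a separate program, not a Python package, and it is often not preinstalled: on Linux it is normally installed from the distribution's package manager, and on Windows or Mac it can be installed through a package manager or downloaded from the official site. For non-English text, the matching language data (for example chi_sim for Simplified Chinese) must be installed as well. Then install the Python packages with pip, for example: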
```
pip install pytesseract pillow
```
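
Before running any OCR, it can help to confirm that the Tesseract binary is reachable and that the language data you need is present. The following is a minimal sketch that uses only the standard library and the `tesseract --list-langs` command. The helper names `parse_langs` and `installed_languages` are hypothetical and chosen for illustration, and the parsing assumes the usual output of one header line followed by one language code per line.

```python
import shutil
import subprocess


def parse_langs(output):
    """Extract language codes from `tesseract --list-langs` output."""
    lines = [line.strip() for line in output.splitlines()]
    # The first line is a header such as 'List of available languages ...'.
    return [line for line in lines if line and not line.startswith("List of")]


def installed_languages(cmd="tesseract"):
    """Return installed language codes, or None if the binary is not found."""
    path = shutil.which(cmd)
    if path is None:
        return None
    proc = subprocess.run([path, "--list-langs"], capture_output=True, text=True)
    # Older Tesseract versions may print the list to stderr, so read both
    # streams. Assumption: no extra warning lines are mixed into the output.
    return parse_langs(proc.stdout + "\n" + proc.stderr)


if __name__ == "__main__":
    langs = installed_languages()
    if langs is None:
        print("tesseract not found on PATH")
    else:
        print("chi_sim installed:", "chi_sim" in langs)
```

If `chi_sim` is reported as missing, install the corresponding language data before passing `lang='chi_sim'` to pytesseract.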

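Import the libraries and initialize: import the required libraries and set the language Tesseract should use. If the images contain non-English text, remember to specify the correct language code.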

import pytesseract
from PIL import Image
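pytesseract.pytesseract.tesseract_cmd = r'/path/to/tesseract'  # path to the Tesseract executable; only needed if it is not on PATH
lang = 'chi_sim'  # Simplified Chinese; change to another language code as needed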
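
When images mix scripts, for example Chinese text with English product names, Tesseract accepts several language codes joined with `+`. The sketch below builds such a string. `build_lang` is a hypothetical helper and not part of pytesseract, and each code it joins still needs its language data installed.

```python
def build_lang(*codes):
    """Join Tesseract language codes into the 'a+b' form accepted by lang=."""
    cleaned = [code.strip() for code in codes if code and code.strip()]
    if not cleaned:
        raise ValueError("at least one language code is required")
    return "+".join(cleaned)


# Simplified Chinese plus English; pass the result as lang=... to pytesseract.
lang = build_lang("chi_sim", "eng")
```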

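Read and process the images: iterate over the folder of images you want to extract text from, optionally preprocess each image (for example by resizing it or converting it to grayscale), then run OCR on it.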

import os

for filename in os.listdir('images_folder'):
    # Open each file in the folder and run OCR with the configured language
    with Image.open(os.path.join('images_folder', filename)) as img:
        text = pytesseract.image_to_string(img, lang=lang)
        print(f"Image {filename} content: {text}")
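
The loop above passes each image to Tesseract unchanged and stops at the first file that is not a readable image. A slightly more defensive version might filter by file extension, apply the grayscale and resize preprocessing mentioned earlier, and collect failures instead of aborting. This is a sketch under stated assumptions: the function names, the extension list, and the 1000-pixel upscaling threshold are illustrative choices, not recommendations from pytesseract or Pillow.

```python
from pathlib import Path

# Assumed set of extensions to treat as images; adjust to your data.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".bmp", ".tif", ".tiff"}


def list_images(folder):
    """Return image files in `folder`, sorted by name, skipping other files."""
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_EXTENSIONS
    )


def ocr_image(path, lang="chi_sim"):
    """Convert to grayscale, upscale small images, then OCR one file."""
    # Imported here so the file-listing helpers work without the OCR stack.
    import pytesseract
    from PIL import Image

    with Image.open(path) as img:
        gray = img.convert("L")  # "L" is Pillow's 8-bit grayscale mode
        if gray.width < 1000:  # arbitrary threshold, chosen for illustration
            gray = gray.resize((gray.width * 2, gray.height * 2), Image.LANCZOS)
        return pytesseract.image_to_string(gray, lang=lang)


def batch_ocr(folder, ocr=ocr_image):
    """Return (results, errors), each a dict keyed by file name."""
    results, errors = {}, {}
    for path in list_images(folder):
        try:
            results[path.name] = ocr(path)
        except Exception as exc:  # keep going if one file is unreadable
            errors[path.name] = str(exc)
    return results, errors


if __name__ == "__main__":
    folder = Path("images_folder")
    if folder.is_dir():
        results, errors = batch_ocr(folder)
        for name, text in results.items():
            print(f"Image {name} content: {text}")
        for name, message in errors.items():
            print(f"Image {name} failed: {message}")
```

Passing the OCR step in as the `ocr` argument keeps the folder traversal independent of Tesseract, so the same loop can be reused with a different engine or exercised without any OCR installation.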