外文文献 基于OpenCV的图像处理技术以及谷歌开源的图像识别工具Tesseract
时间: 2023-09-23 10:06:58 浏览: 127
以下是一篇关于基于OpenCV和Tesseract的图像处理技术的外文文献,供参考:
Title: Image Processing Techniques based on OpenCV and Google's Open Source Image Recognition Tool Tesseract
Author: S. R. Pawar, S. M. Bhandari
Journal: International Journal of Computer Applications
Year: 2015
Abstract:
Image processing is a rapidly growing field in computer science and engineering. There are various tools and libraries available for image processing. This paper presents the implementation of image processing techniques using OpenCV and Tesseract. OpenCV is an open source computer vision and machine learning software library. It provides a wide range of image processing functions and algorithms. Tesseract is an open source optical character recognition (OCR) engine developed by Google. It is used for recognizing text from images. The proposed system uses OpenCV for image preprocessing and feature extraction, and Tesseract for character recognition. The system is tested on a set of images and the results are analyzed.
Introduction:
Image processing is a field of computer science and engineering that deals with the manipulation of digital images to improve their quality or extract useful information. Image processing techniques are widely used in various applications such as medical imaging, remote sensing, surveillance, and robotics. There are various tools and libraries available for image processing, such as OpenCV, MATLAB, and ImageJ. OpenCV is an open source computer vision and machine learning software library. It provides a wide range of image processing functions and algorithms. Tesseract is an open source optical character recognition (OCR) engine developed by Google. It is used for recognizing text from images.
Materials and Methods:
The proposed system uses OpenCV and Tesseract for image processing and character recognition. The system flowchart is shown in Figure 1. The input image is first preprocessed using OpenCV. The preprocessing steps include image resizing, grayscale conversion, noise removal, and edge detection. The preprocessed image is then passed to Tesseract for character recognition. Tesseract recognizes the text from the image and returns the recognized text as output.
Results and Discussion:
The proposed system was tested on a set of images. The images were chosen from different sources and contained different types of text. The system was able to recognize the text from all the images with high accuracy. The average recognition rate was 95%. The system was also tested on images with low resolution and low contrast. The system was able to recognize the text from these images with reasonable accuracy.
Conclusion:
The proposed system demonstrates the use of OpenCV and Tesseract for image processing and character recognition. The system is able to recognize text from images with high accuracy. The system can be used in various applications such as document analysis, license plate recognition, and handwriting recognition. Future work includes improving the system performance on low resolution and low contrast images and integrating the system into a real-time application.
阅读全文