$ ./pytesseract.py -l eng test-english.jpg
from PIL import Image
from pytesseract import image_to_string

# OCR with the default language, then with English selected explicitly
print(image_to_string(Image.open('test.png')))
print(image_to_string(Image.open('test-english.jpg'), lang='eng'))
***********************************************************************************
$ workon cv
$ pip install pillow
$ pip install pytesseract
$ tesseract images/example_02.png stdout
Detected 32 diacritics
" Tess�ra�c't Will
Fail With Noisy
Backgrounds
$ python ocr.py --image images/example_02.png --preprocess blur
Tesseract Will
Fail With Noisy
Backgrounds
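
The ocr.py script invoked above is not reproduced here. What follows is only a minimal sketch of what such a script might look like, assuming it uses OpenCV (cv2) to load and clean up the image and pytesseract to run Tesseract. The --image and --preprocess flags and the "blur" value come from the command above. The "thresh" option, the function names parse_args, ocr_image and main, and the median-blur kernel size of 3 are assumptions made for illustration.

```python
import argparse


def parse_args(argv=None):
    """Parse the two flags seen in the command above."""
    ap = argparse.ArgumentParser(description="OCR an image with Tesseract")
    ap.add_argument("--image", required=True, help="path to the input image")
    # "blur" appears in the command above; "thresh" is an assumed alternative.
    ap.add_argument("--preprocess", choices=["thresh", "blur"], default="thresh",
                    help="preprocessing applied before OCR")
    return ap.parse_args(argv)


def ocr_image(path, preprocess="thresh"):
    """Load an image, preprocess it, and return the text Tesseract finds."""
    # Imported lazily so argument parsing works without OpenCV installed.
    import cv2
    import pytesseract

    image = cv2.imread(path)
    if image is None:
        raise FileNotFoundError(path)
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)

    if preprocess == "thresh":
        # Otsu's method chooses the binarization threshold automatically.
        gray = cv2.threshold(gray, 0, 255,
                             cv2.THRESH_BINARY | cv2.THRESH_OTSU)[1]
    elif preprocess == "blur":
        # A median blur suppresses salt-and-pepper style background noise.
        gray = cv2.medianBlur(gray, 3)

    return pytesseract.image_to_string(gray)


def main(argv=None):
    args = parse_args(argv)
    print(ocr_image(args.image, args.preprocess))
```

Saving this as ocr.py and adding a call to main() at the bottom of the file would allow it to be run like the command above. Whether blurring is enough to recover the text depends on the image; the kernel size is a starting point, not a tuned value.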