Skip to content

Instantly share code, notes, and snippets.

@deangelo200
Forked from lobstrio/captcha-solver.py
Created January 25, 2021 16:04
Show Gist options
  • Save deangelo200/91e119321562e37f43c2b6fd16756be2 to your computer and use it in GitHub Desktop.
Save deangelo200/91e119321562e37f43c2b6fd16756be2 to your computer and use it in GitHub Desktop.
Solving (simple) Captcha, using PyTesseract, PIL, and Python 3
#!/usr/bin/python3
# coding: utf-8
import pytesseract
import os
import argparse
try:
import Image, ImageOps, ImageEnhance, imread
except ImportError:
from PIL import Image, ImageOps, ImageEnhance
def solve_captcha(path):
"""
Convert a captcha image into a text,
using PyTesseract Python-wrapper for Tesseract
Arguments:
path (str):
path to the image to be processed
Return:
'textualized' image
"""
image = Image.open(path).convert('RGB')
image = ImageOps.autocontrast(image)
filename = "{}.png".format(os.getpid())
image.save(filename)
text = pytesseract.image_to_string(Image.open(filename))
return text
if __name__ == '__main__':
argparser = argparse.ArgumentParser()
argparser.add_argument("-i", "--image", required=True, help="path to input image to be OCR'd")
args = vars(argparser.parse_args())
path = args["image"]
print('-- Resolving')
captcha_text = solve_captcha(path)
print('-- Result: {}'.format(captcha_text))
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment