Open source OCR in the wild

About as sexy as an eye exam, but damn, this technology is difficult to get right. So yesterday Google announced the open sourcing of Tesseract OCR, character/text-recognition software it developed back in the 80’s that it claims is better than most of the open source alternatives (I’d believe that) but not quite as good as some of the commercially available technologies (I’d buy that too).

But hmm, isn’t there a lot that could be done with this? Personally, can’t wait until we see this make it’s way into OpenOffice among other places.

Author: Chris Messina

Head of West Coast Business Development at Republic. Ever-curious product designer and technologist. Hashtag inventor. Previously: Molly.com (YC W18), Uber, Google.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

%d bloggers like this: