Open source OCR in the wild

About as sexy as an eye exam, but damn, this technology is difficult to get right. Yesterday Google announced the open sourcing of Tesseract OCR, character/text-recognition software originally developed at HP back in the '80s and '90s. Google claims it is better than most of the open source alternatives (I'd believe that) but not quite as good as some of the commercially available technologies (I'd buy that too).

But hmm, isn't there a lot that could be done with this? Personally, I can't wait until we see this make its way into OpenOffice, among other places.

Author: Chris Messina

Product guy, friend to startups, inventor of the hashtag, proponent of bots and conversational apps; Xoogler and X Uber.
