Optical character recognition
- See Scanning for the OCR of books to create OLPC content.
Optical character recognition (OCR) is the conversion of photographs of text into editable text.
This page was created mostly in support of Test automation.
Running OCR on an XO
yum install gocr
Next, get a PNM image of some page of text.
One approach is to take a screenshot.
yum install ImageMagick
sleep 5; import -window root pageimage.png
While the sleep is running, switch back to Record and wait for the screenshot to be taken.
Convert the image to text.
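As a minimal sketch of this step, assuming the screenshot was saved as pageimage.png and that ImageMagick's convert and gocr are installed as described above, the PNG can be converted to PNM and then passed to gocr. The helper name ocr_png is hypothetical and used only for illustration; it is not part of either tool.

```shell
#!/bin/sh
# Sketch only: ocr_png is a hypothetical helper, not part of gocr or ImageMagick.
# It converts a PNG screenshot to PNM and prints the recognized text on stdout.
ocr_png() {
    ocr_src="$1"
    ocr_pnm="${ocr_src%.png}.pnm"

    # Fail early if the input image is missing.
    if [ ! -f "$ocr_src" ]; then
        echo "ocr_png: no such file: $ocr_src" >&2
        return 1
    fi

    # ImageMagick chooses the output format from the .pnm extension.
    convert "$ocr_src" "$ocr_pnm" || return 1

    # gocr reads the image named by -i and writes text to stdout.
    gocr -i "$ocr_pnm"
}

# Example usage (assumes pageimage.png exists in the current directory):
# ocr_png pageimage.png > pageimage.txt
```

Recognition quality depends heavily on font size and contrast, so the output may need manual checking before it is used in a test.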