Following notes are based on the instructions for installing OCRopus 3.0. The following notes were generated in the following environment:
- OCRopus 0.3
- iulib 0.3
- Ubuntu 8.10 (Intrepid Ibex)
Before you Begin
Make sure your Ubuntu installation has the following components installed (not installed by default):
- g++
- scons
- svn
If you have problems downloading any of the packages or libraries required, try changing the server which apt-get or Synaptic use. In our experience, mirror.csclub.uwaterloo.ca/ubuntu was good.
Installing iulib
- Download iulib 0.3 package from iulib's google code page. http://code.google.com/p/iulib/
- Get any missing libraries, run:
sudo apt-get install libpng12-dev libjpeg62-dev libtiff4-dev libavcodec-dev libavformat-dev libsdl-gfx1.2-dev libsdl-image1.2-dev
- First run
sudo scons install
. This will help avoid an error in the next step. Make sure to have scons installed. - Then run
sudo make install
- Error:
./vidio/vidio.cc:484 error: cannot convert 'ByteIOContext*' to 'ByteIOContext' for argument '1' to 'int url_fclose(ByteIOContext*)'
- Solved using solution in post here: http://code.google.com/p/iulib/issues/detail?id=2 (change Line 484)
- Error:
Installing Tesseract
- used SVN to checkout latest version of tesseract (make sure SVN is installed)
- installed tesseract according to instructions on OCRopus' install guide (http://sites.google.com/site/ocropus/install-0-3)
- compiled and installed fine.
Installing OCRopus
- ran
./configure --without-fst --without-leptonica --without-SDL
– OK - ran
make
– ERROR
/home/leviticus/ocropus/iulib/utils/dgraphics.cc:154: undefined reference to `SDL_FillRect' /home/leviticus/ocropus/iulib/utils/dgraphics.cc:155: undefined reference to `SDL_UpdateRect' /home/leviticus/ocropus/iulib/utils/dgraphics.cc:157: undefined reference to `SDL_UpdateRect' collect2: ld returned 1 exit status make[1]: *** [ocroscript] Error 1 make[1]: Leaving directory `/home/leviticus/ocropus/ocropus-0.3/ocroscript' make: *** [all-recursive] Error 1
- re-ran
./configure --without-fst --without-leptonica
(configure with SDL) – OK - ran
make
– OK - ran
sudo make install
(NOTE: OCRopus instructions only tell you to runmake install
) – OK
Running OCRopus
Run at command line:
ocroscript recognize data/pages/alive_1.png
This will print an HTML document to stdout
with the text conversion of the image. Pipe to Firefox or redirect output to file and open in browser.