Skip navigation.

Tessaract

Compile OCR Tessaract on Kubuntu

As some of you may know, I am trying to scan a whole book written by my grandfather, as I plan to publish it on the internet. It is a very inspiring book which could change our society.

So, first thing first, I need to scan the whole book and use an Optical Character Recognition (OCR) software as a first step to put it into HTML form.

This is the main documentation wiki for Ubuntu OCR.

I only had a test scan in .pdf format to test with, so I followed the instructions to use GIMP to transform it into a TIFF (.tif) file.

Syndicate content