xsane - ocr

For help or questions about the 64-bit version of MEPIS, this is the forum to use.
beckwith
Forum Regular
Posts: 475
Joined: Thu Feb 15, 2007 9:27 pm

#1 Post by beckwith » Fri Jan 09, 2015 5:16 am

This is crazy:
xsane has an option for OCR, but the icon does not appear.
I've set the OCR program to "gocr".
I want to scan a document in German, i.e. it has umlauts (ä, ö, ü)
on some words and the Eszett character (ß, which looks like a Greek beta).
Linux user 449018, since 1994, Mepis (3.4 onwards) 2006, now Debian 7.8
Desktop: HP 500-315a i54460, - Debian 7.8
Laptop: Toshiba Satellite L500D/006 - Debian 7.8
Toy: Raspberry Pi/mkBii - Raspbian 2015-01-31
ePad: Asus - Android

Utopia
Forum Veteran
Posts: 3685
Joined: Sun Apr 29, 2007 11:53 am

Re: xsane - ocr

#2 Post by Utopia » Fri Jan 09, 2015 5:50 am

I haven't done this in a long time, but I think I did it as a two-step process: first scan, then OCR.
Tesseract has a reputation for being very accurate; you have to install both the application and the German language data.
Nothing is worse than getting a lot of gibberish. I have some scanned and OCR-treated reference books that are useless, and the errors are very difficult to fix afterward.
http://code.google.com/p/tesseract-ocr/
The tips there on how to improve the quality of the output are very good.
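For what it's worth, the two-step approach (scan first, then OCR) could be scripted roughly like this. It is only a sketch under some assumptions: that SANE's scanimage and the tesseract command are both installed (on Debian the packages should be tesseract-ocr and tesseract-ocr-deu for German), that your scanner backend accepts a --resolution option, and that 300 dpi is a sensible starting value. The function names and file names are made up for illustration.

```python
import shutil
import subprocess
from pathlib import Path


def build_scan_command(resolution_dpi=300):
    """Return a scanimage command that writes a TIFF image to stdout.

    Assumption: the scanner backend supports --resolution.
    """
    return ["scanimage", "--format=tiff", "--resolution", str(resolution_dpi)]


def build_ocr_command(image_path, output_base, lang="deu"):
    """Return a tesseract command; tesseract appends .txt to output_base.

    "deu" is the language code for the German language data.
    """
    return ["tesseract", str(image_path), str(output_base), "-l", lang]


def scan_then_ocr(image_path="scan.tif", output_base="scan", lang="deu"):
    """Step 1: scan to a file. Step 2: run OCR on that file."""
    for tool in ("scanimage", "tesseract"):
        if shutil.which(tool) is None:
            raise RuntimeError(f"{tool} not found on PATH")
    # Step 1: scanimage writes the image data to stdout, so redirect to a file.
    with open(image_path, "wb") as image_file:
        subprocess.run(build_scan_command(), stdout=image_file, check=True)
    # Step 2: recognise the text using the German language data.
    subprocess.run(build_ocr_command(image_path, output_base, lang), check=True)
    return Path(f"{output_base}.txt")
```

Calling scan_then_ocr() should leave the recognised text in scan.txt. Keeping the scanned image on disk means you can re-run only the OCR step with different settings if the first result is gibberish.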
Henry
