Welcome!
Important information
-- Required MX 15/16 Repository Changes
-- Information on torrent hosting changes
-- Information on MX15/16 GPG Keys
-- Spectre and Meltdown vulnerabilities

News
-- Introducing our new Website
-- MX Linux on social media: here

Current releases
-- MX-18.2 Point Release release info here
-- Migration Information to MX-18 here
-- antiX-17.4.1 release info here

New users
-- Please read this first, and don't forget to add system and hardware information to posts!
-- Here are the Forum Rules

xsane - ocr

For help or questions about 64-bit version of MEPIS, this is the forum to use.
Post Reply
User avatar
beckwith
Forum Regular
Forum Regular
Posts: 475
Joined: Thu Feb 15, 2007 9:27 pm

xsane - ocr

#1

Post by beckwith » Fri Jan 09, 2015 5:16 am

This is crazy:
xsane has an option for ocr, but the icon does not appear.
I've set the ocr to "gocr".
I want to scan a document in German (i.e. it has umlauts ('""')
on some words and the Etzett character (look like a BETA)
Linux user 449018, since 1994, Mepis (3.4 onwards) 2006, now Debian 7.8
Desktop: HP 500-315a i54460, - Debian 7.8
Laptop: Toshiba Satellite L500D/006 - Depian 7.8
Toy: Raspberry PI/mkBii - Raspian 2015-01-31
ePad: Asus - Android

User avatar
Utopia
Forum Veteran
Forum Veteran
Posts: 3956
Joined: Sun Apr 29, 2007 11:53 am

Re: xsane - ocr

#2

Post by Utopia » Fri Jan 09, 2015 5:50 am

Haven't done this in a long time but I think I did it as a two step process. First scan, then OCR.
Tesseract has a reputation of being very accurate, you have to install both the app and the German language data.
Nothing is worse than getting a lot of gibberish, I have some scanned and OCR treated reference books that are useless. Very difficult to fix afterward.
http://code.google.com/p/tesseract-ocr/
The tips of how to improve the quality of the output are very good.
Henry

Post Reply

Return to “64-bit Regulars”