GOCR

From Encoresoup - The Ultimate Guide to Free/Open Source Software

Jump to: navigation, search
This article contains content from the Wikipedia article:
GOCR
history contributors
GOCR
Developer: Jörg Schulenburg
Stable release

0.45  (01 November 2007)

Genre: Optical character recognition
License: GNU General Public License
Website: [[Website::jocr.sourceforge.net]]


GOCR (or JOCR) is a free optical character recognition program, initially written by Jörg Schulenburg. It can be used to convert or scan image files (portable pixmap or PCX) into text files.[1]

Contents

[edit] Development

According to the program's documentation, as of version 0.44 it is still in the early stages of development.[2]

It claims to handle single-column sans-serif fonts of 20-60 pixels in height, and reports trouble with serif fonts, overlapping characters, handwritten text, heterogeneous fonts, noisy images, large angles of skew, and text in anything other than a Latin alphabet.[2]

[edit] Nomenclature

The application was originally named GOCR which stands for GNU Optical Character Recognition. When it came time to register the project on SourceForge the name GOCR was already taken so the project was registered as JOCR (Jörg's Optical Character Recognition).[1][2]

As a result of this situation the project and application are known as both GOCR and JOCR. Schulenburg admits that this is problematic.[1]

[edit] Formats

Acceptable image formats are:[2]

  • pnm
  • pbm
  • pgm
  • ppm
  • pcx (some)
  • tga

Other formats are automatically converted using netpbm-progs, gzip and bzip2 via the use of a unix pipe. These images types include:[2]

  • pnm.gz
  • pnm.bz2
  • png
  • jpg
  • tiff
  • gif
  • bmp

[edit] Barcodes

GOCR can also translate barcodes.[2]

[edit] See also

  • GNU Ocrad

[edit] References

  1. 1.0 1.1 1.2 Schulenburg, Joerg (March 2007). GOCR. Retrieved on 2008-06-25.
  2. 2.0 2.1 2.2 2.3 2.4 2.5 SfR Fresh (undated). Member "gocr-0.45/README" of archive gocr-0.45.tar.gz. Retrieved on 2008-06-25.

[edit] External links

Template:OCR


Retrieved from "http://encoresoup.net/GOCR"
Personal tools

Pico USB Flash Drive (8Gb) [ThinkGeek]The Ruby Programming Language [Amazon]Dive Into Python [Amazon]