mirror of
https://github.com/Ponce/slackbuilds
synced 2024-11-18 22:06:04 +01:00
21 lines
1.2 KiB
Text
21 lines
1.2 KiB
Text
Tesseract is a commercial quality OCR engine originally developed at HP
|
|
between 1985 and 1995. In 1995, this engine was among the top 3 evaluated
|
|
by UNLV. It was open-sourced by HP and UNLV in 2005.
|
|
|
|
You will need to get one of the language packs in order to do anything
|
|
useful with tesseract, and that language pack tarball should be present
|
|
in the same directory as the SlackBuild script when the package is created.
|
|
See http://code.google.com/p/tesseract-ocr/downloads/list for a list of
|
|
all available language packs. Note that you can install more than one
|
|
(or even all) of the language packs, as they do not conflict with each
|
|
other. The build script defaults to use English, but this is easily
|
|
changed by passing an alternate value on the command line.
|
|
|
|
Here is the relevant code from the build script:
|
|
# Language pack(s) to use
|
|
# We'll install English by default, but you can pass another one (or all)
|
|
# of them on the command line (space delimited). If you pass more than one
|
|
# (again, space delimited), you must enclose the string in quotes. Examples:
|
|
# TESSLANG=fra ./tesseract.SlackBuild
|
|
# TESSLANG="deu deu-f eng fra ita nld por spa vie" ./tesseract.SlackBuild
|
|
TESSLANG=${TESSLANG:-eng} # Default to English
|