Skip to content

Commit 3e9b0ac

Browse files
committed
Update tesseract man page
- move Tesseract 4 release note to other release notes - format command line options in text - add link to release notes (wiki) - add link to contributors (GitHub) Signed-off-by: Stefan Weil <[email protected]>
1 parent a86292b commit 3e9b0ac

File tree

1 file changed

+15
-10
lines changed

1 file changed

+15
-10
lines changed

doc/tesseract.1.asc

+15-10
Original file line numberDiff line numberDiff line change
@@ -17,12 +17,6 @@ between 1985 and 1995. In 1995, this engine was among the top 3 evaluated by
1717
UNLV. It was open-sourced by HP and UNLV in 2005, and has been developed
1818
at Google since then.
1919

20-
Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused
21-
on line recognition, but also still supports the legacy Tesseract OCR engine of
22-
Tesseract 3 which works by recognizing character patterns. Compatibility with
23-
Tesseract 3 is enabled by --oem 0. It also needs traineddata files which support
24-
the legacy engine, for example those from the tessdata repository.
25-
2620

2721
IN/OUT ARGUMENTS
2822
----------------
@@ -97,7 +91,7 @@ OPTIONS
9791
* hocr - Output in hOCR format instead of as a text file.
9892
* pdf - Output in pdf instead of a text file.
9993

100-
*Nota Bene:* The options '-l lang' and '--psm N' must occur
94+
*Nota Bene:* The options `-l lang` and `--psm N` must occur
10195
before any 'configfile'.
10296

10397

@@ -116,7 +110,7 @@ SINGLE OPTIONS
116110
Returns the current version of the tesseract(1) executable.
117111
118112
'--list-langs'::
119-
List available languages for tesseract engine. Can be used with --tessdata-dir.
113+
List available languages for tesseract engine. Can be used with `--tessdata-dir`.
120114
121115
'--print-parameters'::
122116
Print tesseract parameters.
@@ -251,7 +245,7 @@ for the following languages are in
251245
To use a non-standard language pack named *foo.traineddata*, set the
252246
*TESSDATA_PREFIX* environment variable so the file can be found at
253247
*TESSDATA_PREFIX*/tessdata/*foo*.traineddata and give Tesseract the
254-
argument '-l foo'.
248+
argument `-l foo`.
255249

256250
SCRIPTS
257251
-------
@@ -377,7 +371,15 @@ language data.
377371
Tesseract 3.02 adds BiDirectional text support, the ability to recognize
378372
multiple languages in a single image, and improved layout analysis.
379373
380-
For further details, see the file ReleaseNotes included with the distribution.
374+
Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused
375+
on line recognition, but also still supports the legacy Tesseract OCR engine of
376+
Tesseract 3 which works by recognizing character patterns. Compatibility with
377+
Tesseract 3 is enabled by `--oem 0`. It also needs traineddata files which
378+
support the legacy engine, for example those from the tessdata repository.
379+
380+
For further details, see the file ReleaseNotes in the Tesseract wiki
381+
(<https://github.com/tesseract-ocr/tesseract/wiki/ReleaseNotes>).
382+
381383
382384
RESOURCES
383385
---------
@@ -402,6 +404,9 @@ Pingping Xiu, Pong Eksombatchai (Chantat), Ranjith Unnikrishnan, Raquel
402404
Romano, Ray Smith, Rika Antonova, Robert Moss, Samuel Charron, Sheelagh
403405
Lloyd, Shobhit Saxena, and Thomas Kielbus.
404406

407+
For a list of contributors see
408+
<https://github.com/tesseract-ocr/tesseract/blob/master/AUTHORS>.
409+
405410
COPYING
406411
-------
407412
Licensed under the Apache License, Version 2.0

0 commit comments

Comments
 (0)