mirror of https://github.com/tldr-pages/tldr.git (synced 2025-08-19 12:15:44 +02:00)
tesseract: add page (#1267)
parent 85d1ab3400
commit b4f9a57471
1 changed file with 23 additions and 0 deletions
23
pages/common/tesseract.md
Normal file
@@ -0,0 +1,23 @@
# tesseract

> OCR (Optical Character Recognition) engine.

- Recognize text in an image and save it to `output.txt` (the `.txt` extension is appended automatically, so leave it out of the output name):

`tesseract {{image.png}} {{output}}`

- Specify a custom language (default is English) with an ISO 639-2 code (e.g. deu = Deutsch = German):

`tesseract -l deu {{image.png}} {{output}}`

- List the ISO 639-2 codes of available languages:

`tesseract --list-langs`

- Specify a custom page segmentation mode (default is 3):

`tesseract -psm {{0_to_10}} {{image.png}} {{output}}`

- List page segmentation modes and their descriptions:

`tesseract --help-psm`
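For scripted use, the invocations on this page can be assembled programmatically. The following is a minimal sketch, not part of the commit: `build_tesseract_cmd` and `run_ocr` are hypothetical helper names, and the flag spellings (`-l`, `-psm`) are copied from the page as written. Newer Tesseract releases may spell the segmentation flag `--psm`, so adjust it to match the installed version.

```python
import shutil
import subprocess
from typing import List, Optional


def build_tesseract_cmd(image: str, output_base: str,
                        lang: Optional[str] = None,
                        psm: Optional[int] = None) -> List[str]:
    """Build the argument list for one tesseract run (hypothetical helper).

    output_base is given without an extension; tesseract appends `.txt`.
    """
    cmd = ["tesseract"]
    if lang is not None:
        # Language code as used on the page, e.g. "deu" for German.
        cmd += ["-l", lang]
    if psm is not None:
        # Flag spelling taken from the page; may be `--psm` on newer versions.
        cmd += ["-psm", str(psm)]
    cmd += [image, output_base]
    return cmd


def run_ocr(image: str, output_base: str, **options) -> str:
    """Run tesseract and return the path of the text file it writes."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    subprocess.run(build_tesseract_cmd(image, output_base, **options), check=True)
    return output_base + ".txt"
```

Calling `run_ocr("image.png", "output", lang="deu")` would correspond to the second example on the page and, assuming Tesseract and the German language data are installed, write `output.txt`.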