Open Source OCR Engine [R package tesseract version 4.1]

Jeroen Ooms

tesseract: Open Source OCR Engine

Bindings to 'Tesseract' <https://opensource.google.com/projects/tesseract>: a powerful optical character recognition (OCR) engine that supports over 100 languages. The engine is highly configurable in order to tune the detection algorithms and obtain the best possible results.

Version:	4.1
Imports:	Rcpp (≥ 0.12.12), pdftools (≥ 1.5), curl, rappdirs, digest
LinkingTo:	Rcpp
Suggests:	magick (≥ 1.7), spelling, knitr, tibble, rmarkdown
Published:	2019-07-25
Author:	Jeroen Ooms [aut, cre] tesseract author details
Maintainer:	Jeroen Ooms <jeroen at berkeley.edu>
BugReports:	https://github.com/ropensci/tesseract/issues
License:	Apache License 2.0
URL:	https://github.com/ropensci/tesseract
NeedsCompilation:	yes
SystemRequirements:	Tesseract >= 3.03 (libtesseract-dev / tesseract-devel) and Leptonica (libleptonica-dev / leptonica-devel). On Debian you need to install the English training data separately (tesseract-ocr-eng)
Language:	en-US
Materials:	NEWS
In views:	NaturalLanguageProcessing
CRAN checks:	tesseract results

Downloads:

Reference manual:	tesseract.pdf
Vignettes:	Using the Tesseract OCR engine in R
Package source:	tesseract_4.1.tar.gz
Windows binaries:	r-devel: tesseract_4.1.zip, r-release: tesseract_4.1.zip, r-oldrel: tesseract_4.1.zip
macOS binaries:	r-release: tesseract_4.1.tgz, r-oldrel: tesseract_4.1.tgz
Old sources:	tesseract archive

Reverse dependencies:

Reverse suggests:

imagerExtra, magick, pdftools, textreadr

Linking:

Please use the canonical form https://CRAN.R-project.org/package=tesseract to link to this page.