Summary: | add support for Tesseract-OCR | ||
---|---|---|---|
Product: | [Applications] kooka | Reporter: | Marc Collin <marc.collin> |
Component: | general | Assignee: | Klaas Freitag <freitag> |
Status: | CONFIRMED --- | ||
Severity: | wishlist | CC: | esigra |
Priority: | NOR | ||
Version: | 0.44 | ||
Target Milestone: | --- | ||
Platform: | unspecified | ||
OS: | Linux | ||
Latest Commit: | Version Fixed In: | ||
Sentry Crash Report: |
Description
Marc Collin
2006-10-07 16:03:21 UTC
I agree, tesseract-ocr is now a debian package. It is the engine for Google's ocropus project. try: apt-get install tesseract-ocr tesseract-ocr --help call: tesseract inputfilename outputfilename i came here to file this wish as well.. we definitely need Tesseract OCR. It was developed by HP and google later picked it up. *** This bug has been confirmed by popular vote. *** Ocropus is a document analysis and OCR program that uses Tesseract as a block-wise recognizer. http://code.google.com/p/ocropus/ seeing as kooka does not do document analysis, kooka should piggyback on ocropus and let that gets the characters with tesseract |