1

I am using Tesseract portable version 3.02 and would like to get the hocr output for character. The problem is hocr output only shows bouding box for words but not characters, if someone know if there is an option to change in tessdata/config that would do the trick please let me know. Otherwise let me know if there is another method to get around this. I am unable to install anything on the computer so I cannot use the Tesseract API method. Only dll files can be used.

user1352777
  • 81
  • 1
  • 1
  • 10
  • Possible duplicate of [Does Tesseract's hOCR output really contain bounding boxes and confidence levels for each character?](https://stackoverflow.com/questions/15829148/does-tesseracts-hocr-output-really-contain-bounding-boxes-and-confidence-levels) – jtlz2 Sep 04 '19 at 06:57
  • Answer at https://stackoverflow.com/a/57766860/1021819 – jtlz2 Sep 04 '19 at 07:21

1 Answers1

0

I found box file which does the same thing, not necessarily in html format.

user1352777
  • 81
  • 1
  • 1
  • 10