Since HP had independently developed page layout
analysis technology that was used in products (and
was therefore not released as open source), Tesseract never
needed its own page layout analysis. Tesseract
therefore assumes that its input is a binary image with
optional polygonal text regions defined.
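
As a rough illustration of this input assumption, the sketch below models a page as a binary pixel grid plus an optional list of polygons. The names `PageInput`, `pixels`, `text_regions`, and `validate` are hypothetical and chosen only for this example; they are not part of Tesseract's API, and the checks shown are assumptions about what a well-formed input might look like.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

Point = Tuple[int, int]  # (x, y) pixel coordinates


@dataclass
class PageInput:
    """Hypothetical container for the assumed input (not Tesseract's API)."""

    pixels: List[List[int]]                            # rows of 0/1 values
    text_regions: Optional[List[List[Point]]] = None   # optional polygons

    def validate(self) -> None:
        """Raise ValueError if the page does not match the assumed contract."""
        if not self.pixels or not self.pixels[0]:
            raise ValueError("image must be non-empty")
        width = len(self.pixels[0])
        height = len(self.pixels)
        for row in self.pixels:
            if len(row) != width:
                raise ValueError("all rows must have the same length")
            if any(value not in (0, 1) for value in row):
                raise ValueError("image must be binary (0 or 1)")
        # Regions are optional: None is treated as "no regions supplied".
        for polygon in self.text_regions or []:
            if len(polygon) < 3:
                raise ValueError("a polygon needs at least three vertices")
            for x, y in polygon:
                if not (0 <= x < width and 0 <= y < height):
                    raise ValueError("polygon vertex lies outside the image")
```

With this sketch, a page without regions, such as `PageInput(pixels=[[0, 1], [1, 0]])`, passes validation, which mirrors the fact that the polygonal regions are optional.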