Skip to content

Regression: gif not scraping in 5.2 that was OK in 5.1 #3940

@jacekkopecky

Description

@jacekkopecky

Environment

  • Tesseract Version: v5.2.0.20220712
  • Platform: Windows 10, x64

Current Behavior:

tesseract.exe v 5.2 only scrapes top line (window title) from this gif image:

image-screenshot-pdf.gif

Expected Behavior:

It should scrape all text.

Tesseract v5.1.0.20220510 worked as expected, and both 5.1 and 5.2 work as expected with this equivalent png image:

image-screenshot-pdf.png

Suggested Fix:

It might have something to do with a different DPI estimation: 5.2 estimates resolution 132, while 5.1 estimated 168. However, running 5.2 with --dpi 168 does not seem to fix anything.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions