Extract text from annotated shapes and use it in thumbnail alt text #7077

robertknight · 2025-05-14T13:11:21Z

When the area selected for a rect or pin annotation contains PDF text, extract that text and store it in the shape selector. The stored text is then used to generate better alt text for the thumbnail in the sidebar.

The length of the extracted text is capped to stop the selector JSON from becoming too large.
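
As a rough illustration of the extraction and capping described above, here is a minimal sketch. It is not the code from this PR: the `Rect` and `Word` types, the `textInShape` helper, the 256-character default cap and the choice to stop at a word boundary are all assumptions made for the example. Coordinates are assumed to have `top < bottom`, and a pin is modeled as a zero-size rect.

```typescript
// Hypothetical types for illustration; not the client's actual API.
type Rect = { left: number; top: number; right: number; bottom: number };
type Word = { text: string; rect: Rect };

/**
 * Return true if `a` and `b` overlap. A zero-size rect (a point) matches
 * when it lies strictly inside the other rect.
 */
function intersects(a: Rect, b: Rect): boolean {
  return (
    a.left < b.right && b.left < a.right && a.top < b.bottom && b.top < a.bottom
  );
}

/**
 * Join the words whose boxes intersect `shape`, in the order given,
 * stopping before the result would exceed `maxLength` characters.
 * Cutting at a word boundary is an assumption of this sketch.
 */
function textInShape(words: Word[], shape: Rect, maxLength = 256): string {
  let result = '';
  for (const word of words) {
    if (!intersects(word.rect, shape)) {
      continue;
    }
    const next = result ? `${result} ${word.text}` : word.text;
    if (next.length > maxLength) {
      break;
    }
    result = next;
  }
  return result;
}
```

Working at word level means a shape that clips part of a word still yields the whole word, which tends to read better in alt text than a fragment.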

In future we plan to enable users to enter a text description for the thumbnail. That description will either replace or be combined with the extracted text.
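
For the current behavior (extracted text only), the alt text step could look roughly like the sketch below. The `ShapeSelector` type as written, the `thumbnailAltText` helper and the exact wording of the alt strings are hypothetical; only the existence of a text field on the selector comes from this PR.

```typescript
// Hypothetical selector type; only the `text` field is described in this PR.
type ShapeSelector = { type: 'ShapeSelector'; text?: string };

/**
 * Build alt text for a thumbnail. Falls back to a generic label when the
 * selector has no extracted text. The wording is illustrative only.
 */
function thumbnailAltText(selector: ShapeSelector): string {
  const text = selector.text?.trim();
  return text ? `Thumbnail. Contains text: ${text}` : 'Thumbnail';
}
```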

codecov · 2025-05-14T14:12:39Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 99.39%. Comparing base (bbed920) to head (0a79f88).
Report is 6 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #7077   +/-   ##
=======================================
  Coverage   99.38%   99.39%           
=======================================
  Files         277      278    +1     
  Lines       11273    11314   +41     
  Branches     2719     2727    +8     
=======================================
+ Hits        11204    11245   +41     
  Misses         69       69

robertknight added the project: image annotations label May 14, 2025

robertknight force-pushed the shape-selector-text branch from 0ef295f to 2c72202 Compare May 14, 2025 13:12

robertknight force-pushed the shape-selector-text branch 4 times, most recently from d641c24 to a35e540 Compare May 15, 2025 05:54

robertknight added 2 commits May 15, 2025 07:24

Add text field to shape selectors

ae56a86

Add a field to shape selectors containing the text that intersects a shape, using word-level granularity.

Include text from shape selector in thumbnail alt text

0a79f88

Include extracted document text from shape selectors in the alt text of thumbnails.

robertknight force-pushed the shape-selector-text branch from a35e540 to 0a79f88 Compare May 15, 2025 06:24

robertknight marked this pull request as ready for review May 15, 2025 10:24
