The output is being sorted randomly for the 'Extract Text from Images in Folder...' feature. #402

Open
amymor opened this issue Dec 8, 2023 · 1 comment
Labels

  • bug: Something isn't working
  • Edit Text Window: Anything to do with the Edit Text Window or functions within it
  • General Processing: Relating to the processing of images to some type of text output

Comments

amymor commented Dec 8, 2023

Describe the bug
I want to extract text from a series of photos and the order of the images is important to me. Let's say I have 1000 images named screenshot (1).png, screenshot (2).png, etc. The extracted text should follow this order by name, but it appears to be extracted randomly.

Where is the bug

  • Edit Text Window
  • OCR Output

To Reproduce
Steps to reproduce the behavior:

  1. Go to 'Edit Text Window'
  2. Click on 'Capture' > 'Extract Text from Images in Folder...'
  3. Select Folder
  4. See that the extracted text does not follow the file-name order

Screenshots
[Screenshot: 0070]

Where did you get Text Grab?

  • GitHub releases
    • Self-contained

Desktop (please complete the following information):

  • OS: Windows 10
  • Version 22H2
  • Text Grab Version v4.2.0

Additional context
I also tried the drag and drop method but the result is the same.
Thanks in advance.❤️

amymor added the bug label on Dec 8, 2023
TheJoeFin (Owner) commented

Extracting text from many images is multi-threaded to speed things up. Because of this, the threads return at different times depending on how long the OCR takes for each image, so the results arrive out of order. The only ways to keep everything in order would be to:

  1. Force single-threaded operation (much slower when running OCR on a large batch).
  2. Add order tracking and insertion logic so results are returned and kept in order (complex and time-consuming to write).
  3. Hold all results until the batch is fully complete (no intermediate feedback, so it might seem slower).
    This could be a good option when OCRing things like pages in a book.
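
For illustration only, here is a minimal Python sketch of what option 2 could look like. It is not Text Grab's code: `fake_ocr`, `natural_key`, and `ocr_in_order` are hypothetical names, and the OCR call is a stand-in that just sleeps for a random time. The idea is to sort the file names naturally (so `screenshot (2).png` comes before `screenshot (10).png`), run the work in parallel, buffer each finished result by its index, and release a result only once every earlier one has been released.

```python
import random
import re
import time
from concurrent.futures import ThreadPoolExecutor, as_completed


def natural_key(name):
    """Sort key that compares embedded numbers numerically, not as text."""
    return [int(p) if p.isdigit() else p.lower() for p in re.split(r"(\d+)", name)]


def fake_ocr(name):
    """Hypothetical stand-in for a real OCR call; finishes after a random delay."""
    time.sleep(random.uniform(0, 0.01))
    return f"text of {name}"


def ocr_in_order(names, ocr=fake_ocr, workers=4):
    """Run OCR in parallel but yield (name, text) pairs in natural name order."""
    ordered = sorted(names, key=natural_key)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Remember which position each submitted job belongs to.
        futures = {pool.submit(ocr, n): i for i, n in enumerate(ordered)}
        buffer = {}
        next_index = 0
        for fut in as_completed(futures):
            buffer[futures[fut]] = fut.result()
            # Release every result whose predecessors have all been released.
            while next_index in buffer:
                yield ordered[next_index], buffer.pop(next_index)
                next_index += 1


if __name__ == "__main__":
    files = [f"screenshot ({i}).png" for i in range(1, 13)]
    random.shuffle(files)
    for name, text in ocr_in_order(files):
        print(name, "->", text)
```

This keeps partial feedback: early pages appear as soon as they and everything before them are done. If no intermediate feedback is needed (option 3), Python's `ThreadPoolExecutor.map` already returns results in input order, which would be simpler than the manual buffer.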

TheJoeFin added the Edit Text Window and General Processing labels on Apr 18, 2024