Image object not recognized #4578
-
Description of the bugI'm extracting images from scientific papers. For this PDF I'm having troubles to extract Fig. 3 on page 10 - this image object is not included in I have the same issue in pypdf, see pypdf#3335. How to reproduce the bugRun the extract-images scripts on the given PDF file. PyMuPDF version1.26.1 Operating systemLinux Python version3.13 |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 2 replies
-
This not a bug. There is exactly one image on the page (above "Fig. 2"), which is correctly recognized. |
Beta Was this translation helpful? Give feedback.
-
BTW, what is your underlying problem? Want to extract what looks like an image, no matter what it technically is? |
Beta Was this translation helpful? Give feedback.
-
In contrast to many other packages, PyMuPDF can
Taking together the above, you can find the rectangle that covers Fig. 3, and make a picture / image of this area. |
Beta Was this translation helpful? Give feedback.
This not a bug. There is exactly one image on the page (above "Fig. 2"), which is correctly recognized.
You probably believe that "Fig. 3" also is an image - which is not: it is a vector graphic.