Extract images from PDF without resampling, in python?
You can use the module PyMuPDF. This outputs all images as .png files, but worked out of the box and is fast. import fitz doc = fitz.open(“file.pdf”) for i in range(len(doc)): for img in doc.getPageImageList(i): xref = img[0] pix = fitz.Pixmap(doc, xref) if pix.n < 5: # this is GRAY or RGB pix.writePNG(“p%s-%s.png” % (i, … Read more