@willjobs
Last active March 27, 2020 20:21
Extract PDF - Python script to extract pages of PDF into separate files
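import os

# Uses the PyPDF2 1.x/2.x API; PyPDF2 3.0 removed PdfFileReader and PdfFileWriter.
from PyPDF2 import PdfFileReader, PdfFileWriter

orig = input("enter path to file:\n")
pages_at_a_time = int(input('how many pages per file?\n'))

# Write the output files to a "split" folder next to the source PDF.
path, filename = os.path.split(orig)
path = os.path.join(path, 'split')
filename = os.path.splitext(filename)[0]  # drop the extension

if not os.path.exists(path):
    os.makedirs(path)

# The second positional argument of PdfFileReader is `strict`, not a file mode.
pdf_reader = PdfFileReader(orig)
num_pages = pdf_reader.getNumPages()

i = 0  # number of pages processed so far
pdf_writer = PdfFileWriter()
for page in range(num_pages):
    pdf_writer.addPage(pdf_reader.getPage(page))
    i += 1
    # Flush when the chunk is full or the last page has been added.
    if i % pages_at_a_time == 0 or i == num_pages:
        # The file name carries the zero-based index of the chunk's last page.
        with open(os.path.join(path, f'{filename}_{page}.pdf'), 'wb') as out:
            pdf_writer.write(out)
        pdf_writer = PdfFileWriter()

print('Done')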
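
The intended page grouping can be checked without a PDF or PyPDF2 installed. The sketch below is not part of the gist: `chunk_ranges` is a hypothetical helper that only computes which zero-based pages should land in each output file, assuming the last file simply takes whatever pages remain.

```python
def chunk_ranges(num_pages, pages_per_file):
    """Return (first, last) zero-based page indices for each output file.

    Hypothetical helper for illustration; it does not read or write PDFs.
    """
    if pages_per_file < 1:
        raise ValueError("pages_per_file must be at least 1")
    ranges = []
    for first in range(0, num_pages, pages_per_file):
        # The final chunk may be shorter than pages_per_file.
        last = min(first + pages_per_file, num_pages) - 1
        ranges.append((first, last))
    return ranges


if __name__ == "__main__":
    # A 5-page PDF split 2 pages at a time gives three files.
    print(chunk_ranges(5, 2))  # [(0, 1), (2, 3), (4, 4)]
```

The second number in each pair corresponds to the `page` value the script puts in the output file name, so a 5-page `report.pdf` split two pages at a time would be expected to produce `report_1.pdf`, `report_3.pdf`, and `report_4.pdf`.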