@yob
Created September 26, 2021 12:34
pdf-reader extract characters
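#!/usr/bin/env ruby
# Dump the individual characters pdf-reader collects for each page of a PDF.

require 'pdf/reader'

PDF::Reader.open("foo.pdf") do |pdf|
  pdf.pages.each do |page|
    receiver = PDF::Reader::PageTextReceiver.new
    page.walk(receiver)
    # @characters is internal state of the receiver, not public API,
    # so this may break between pdf-reader versions.
    characters = receiver.instance_variable_get("@characters")
    puts characters.inspect
  end
end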
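
A possible next step once the characters are extracted is to group them back into lines of text. The sketch below is only an illustration and does not come from the gist: `Char` is a hypothetical stand-in struct, `chars_to_lines` is a made-up helper name, and it assumes each extracted character responds to `text`, `x` and `y` (as `PDF::Reader::TextRun` is believed to). It needs no PDF file and no gem to run.

```ruby
# Hypothetical stand-in for the objects held in @characters; the real ones
# are assumed to respond to #text, #x and #y.
Char = Struct.new(:text, :x, :y)

# Group characters into lines: characters whose y rounds to the same value
# share a line. Lines are ordered top of page first (PDF y coordinates grow
# upwards) and characters within a line are ordered left to right by x.
def chars_to_lines(characters)
  characters
    .group_by { |c| c.y.round }
    .sort_by { |y, _| -y }
    .map { |_, row| row.sort_by(&:x).map(&:text).join }
end

# Made-up sample data, deliberately out of order.
sample = [
  Char.new("i", 16.0, 700.0),
  Char.new("H", 10.0, 700.2),
  Char.new("k", 16.0, 680.0),
  Char.new("o", 10.0, 680.0),
]

puts chars_to_lines(sample)
```

Rounding `y` is a crude way to decide that two characters share a baseline; real documents with superscripts or slightly skewed text would likely need a tolerance instead.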