@miyagawa
Created October 20, 2013 02:51
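#!/usr/bin/env ruby
# Concatenate every chapter linked from index.html into a single UTF-8 HTML
# document and print it to stdout.

# The source files are ISO-2022-JP; transcode them to UTF-8 while reading.
MODE = "r:iso-2022-jp:UTF-8"

content = File.open("index.html", MODE) { |file| file.read }

all_html = <<HTML
<html>
<head>
<meta http-equiv="Content-Type" content="text/html;charset=utf-8">
<meta name="Author" content="Aoki Minero">
<title>Ruby Hacking Guide</title>
</head>
<body>
HTML

# Each table-of-contents entry looks like <li><a href="FILE">TITLE</a>.
content.scan(%r[<li><a href="(.*?)">(.*?)</a>]) do |filename, title|
  html = File.open(filename, MODE) { |chap| chap.read }
  # Drop everything before the first <h1> and mark it as a chapter heading.
  html.sub!(/^.+?<h1>/m, %q[<h1 class="chapter">])
  # Replace each opening <h2> tag with <h3>.
  html.gsub!(/<h2>/, "<h3>")
  # Drop the chapter's own closing </body> and whatever follows it.
  html.sub!(%r[</body>.*]m, '')
  all_html << html
end

all_html << '</body></html>'
puts all_html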
@miyagawa

Save the output to all.html and then run:

    ebook-convert all.html rhg.mobi --mobi-keep-original-image --mobi-file-type both

and you get the MOBI.

By the way, module.html seems to contain invalid characters, which you have to remove before running the script.
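
One way to locate such characters is sketched below. This is not part of the original script: the helper name `untranscodable_lines` is hypothetical, and checking line by line assumes each line of the file returns to ASCII before its newline, which is customary for ISO-2022-JP text but not guaranteed. It reports the line numbers that Ruby cannot transcode from ISO-2022-JP to UTF-8.

```ruby
#!/usr/bin/env ruby
# Hypothetical helper (not in the original gist): return the 1-based numbers
# of the lines in +bytes+ that cannot be transcoded from ISO-2022-JP to UTF-8.
# Assumption: every line ends in ASCII mode, so lines can be checked one by one.
def untranscodable_lines(bytes)
  bad = []
  bytes.each_line.with_index(1) do |line, number|
    begin
      # The source encoding is given explicitly, so the string's own
      # (binary) encoding tag is ignored.
      line.encode("UTF-8", "ISO-2022-JP")
    rescue Encoding::InvalidByteSequenceError, Encoding::UndefinedConversionError
      bad << number
    end
  end
  bad
end

# Usage: ruby check.rb module.html
if __FILE__ == $PROGRAM_NAME && ARGV[0]
  # Read raw bytes so that no transcoding happens before the check.
  untranscodable_lines(File.binread(ARGV[0])).each do |number|
    puts "#{ARGV[0]}:#{number}: cannot be transcoded to UTF-8"
  end
end
```

The reported lines still have to be fixed by hand; the sketch only narrows down where to look.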
