bugenhagen

A collection of scripts to help with generating ebooks.

`add_titles`

Reads a tab-separated file of (filename, title) pairs, then calls the sub-script add_to_start_of_element for each pair to create a new H1 element inside the entry-content div. Useful for adding titles to otherwise title-less articles.
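
The following Python sketch illustrates the idea; it is not the Perl implementation. It assumes the target div is marked with `class="entry-content"` (the README does not say whether it is a class or an id), and the names `add_title` and `read_title_table` are hypothetical.

```python
import csv
import io
import re
from html import escape

# Assumption: the article body lives in <div class="entry-content">.
ENTRY_DIV = re.compile(r'<div\b[^>]*\bclass="[^"]*\bentry-content\b[^"]*"[^>]*>')


def add_title(html_text, title):
    """Insert an <h1> right after the first opening entry-content div tag."""
    heading = "<h1>" + escape(title) + "</h1>"
    # count=1 so that only the first matching div is modified.
    return ENTRY_DIV.sub(lambda m: m.group(0) + heading, html_text, count=1)


def read_title_table(tsv_text):
    """Parse tab-separated (filename, title) rows into a list of pairs."""
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t",
                        quoting=csv.QUOTE_NONE)
    return [(row[0], row[1]) for row in reader if len(row) >= 2]
```

A regular expression is enough for a sketch; a real HTML parser is more robust against unusual attribute quoting.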

`extract_article_content.perl`

Extracts the HTML content of the entry-content div.
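
A minimal Python sketch of this kind of extraction, using only the standard library. It assumes the div carries `class="entry-content"`; the class `EntryContentExtractor` and the function `extract_entry_content` are hypothetical names, and HTML comments inside the div are dropped for brevity.

```python
from html.parser import HTMLParser


class EntryContentExtractor(HTMLParser):
    """Collects the inner HTML of the first-level entry-content div."""

    def __init__(self):
        # Keep entities such as &amp; untouched instead of decoding them.
        super().__init__(convert_charrefs=False)
        self.depth = 0   # div nesting depth inside the target; 0 means outside
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if self.depth:
            if tag == "div":
                self.depth += 1
            self.parts.append(self.get_starttag_text())
        elif tag == "div":
            classes = (dict(attrs).get("class") or "").split()
            if "entry-content" in classes:
                self.depth = 1

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are copied through once.
        if self.depth:
            self.parts.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if not self.depth:
            return
        if tag == "div":
            self.depth -= 1
            if self.depth == 0:
                return  # closing tag of the target div itself
        self.parts.append("</" + tag + ">")

    def handle_data(self, data):
        if self.depth:
            self.parts.append(data)

    def handle_entityref(self, name):
        if self.depth:
            self.parts.append("&" + name + ";")

    def handle_charref(self, name):
        if self.depth:
            self.parts.append("&#" + name + ";")


def extract_entry_content(html_text):
    parser = EntryContentExtractor()
    parser.feed(html_text)
    parser.close()
    return "".join(parser.parts)
```

Tracking div depth is what lets nested divs inside the article survive without ending the extraction early.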

`extract_article_dates.perl`

Creates tab-separated output of article dates. The real date is read from a META tag in each article.
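
The README does not say which META tag holds the date. The Python sketch below assumes, purely for illustration, a tag whose `name` or `property` is `article:published_time`; `MetaDateFinder` and `date_rows` are hypothetical names.

```python
from html.parser import HTMLParser


class MetaDateFinder(HTMLParser):
    """Remembers the content of the first <meta> tag matching a given key."""

    def __init__(self, key):
        super().__init__()
        self.key = key
        self.date = None

    def handle_starttag(self, tag, attrs):
        if tag != "meta" or self.date is not None:
            return
        attrs = dict(attrs)
        if self.key in (attrs.get("name"), attrs.get("property")):
            self.date = attrs.get("content")


def date_rows(pages, key="article:published_time"):
    """Return 'filename<TAB>date' lines for a {filename: html} mapping.

    The default key is an assumption, not taken from the original script.
    """
    rows = []
    for filename, html_text in pages.items():
        finder = MetaDateFinder(key)
        finder.feed(html_text)
        if finder.date is not None:
            rows.append(filename + "\t" + finder.date)
    return rows
```

Pages without the tag are simply skipped, so the output contains one row per dated article.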

`extract_article_titles.perl`

Creates the tab-separated input file that can later be used by add_titles.

`generate_manifest_and_spine.perl`

Creates manifest and spine entries in the OPF file for the ebook, in EPUB 3 format.
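
For orientation, here is a Python sketch of what manifest and spine entries look like in an EPUB 3 package document. The function name `manifest_and_spine` and the `item1`, `item2`, ... id scheme are assumptions; the original script may name ids differently. A complete EPUB 3 manifest also needs a navigation document item, which is left out here.

```python
from xml.sax.saxutils import quoteattr


def manifest_and_spine(filenames):
    """Build <manifest> and <spine> XML fragments for XHTML content files."""
    items, itemrefs = [], []
    for index, name in enumerate(filenames, start=1):
        item_id = "item%d" % index  # hypothetical id scheme
        # quoteattr adds the surrounding quotes and escapes special characters.
        items.append(
            "  <item id=%s href=%s media-type=\"application/xhtml+xml\"/>"
            % (quoteattr(item_id), quoteattr(name))
        )
        # Spine order is reading order; each itemref points at a manifest id.
        itemrefs.append("  <itemref idref=%s/>" % quoteattr(item_id))
    manifest = "<manifest>\n" + "\n".join(items) + "\n</manifest>"
    spine = "<spine>\n" + "\n".join(itemrefs) + "\n</spine>"
    return manifest, spine
```

The manifest lists every resource in the book, while the spine only fixes the order in which content documents are read.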

`strip_images.perl`

Strips some unneeded tags from the article content. This is a postprocessing step that can be run after the extract_article_content step.
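
The README does not list which tags are removed. Going only by the script name, the Python sketch below assumes `img` tags are the target; `strip_images` here is a hypothetical function, not a port of the Perl script.

```python
import re

# Assumption: "useless tags" means <img> elements, as the script name suggests.
IMG_TAG = re.compile(r"<img\b[^>]*>", re.IGNORECASE)


def strip_images(html_text):
    """Remove every <img ...> tag, leaving the surrounding markup untouched."""
    return IMG_TAG.sub("", html_text)
```

Because `img` is a void element with no closing tag, deleting the tag itself is enough to keep the remaining markup well-formed.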

`extract_article_v0.perl`

A previous version of extract_article_content that expects to locate the article content in a different HTML structure; specifically it just takes the content of the first article element verbatim.

`driver.perl`

A wrapper script for extract_article_v0. It runs the tidy program on the HTML before and after extraction, to eliminate spurious changes caused by parsing and re-parsing the output.

