Skip to content

Some general information that applies all repositories in GutenbergSource.

Notifications You must be signed in to change notification settings

GutenbergSource/Information

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 

Repository files navigation

About these Repositories

The repositories in GutenbergSource are all my source files for ebooks I’ve submitted to Project Gutenberg. These source files are in the TEI format, and to be more precise in the now obsolete SGML format defined by the P3 version of TEI (A version in XML is also included).

Each of these repositories has the following contents (some of these are optional).

  • <name>-<version>.tei – The source file itself.

  • metadata.xml – automatically generated metadata in RDF format.

  • README.adoc – automatically generated readme in AsciiDoc format.

  • good_words.txt – File generated by PGDP site, containing words marked as 'good'.

  • bad_words.txt – File generated by PGDP site, containing words marked as 'bad'.

  • project<hex-number>-comments.html – Instructions as used at PGDP site.

  • tei2html.config – Configuration for my tei2html tooling, used to create derived versions.

  • Processed – a directory with processed results

  • <name>.html – HTML file derived from the source file.

  • <name>.xml – XML file in TEI format, derived from the source file.

  • <name>.txt – Latin1 plain text file, manually derived from the source file.

  • <name>-utf8.txt – UTF8 plain text file, manually derived from the source file.

  • images – directory with illustrations (either this, or the two directories listed below).

  • images@1 – directory with illustrations, at 144 DPI, maximum dimension 720px on longest edge.

  • images@2 – directory with illustrations, at 288 DPI, maximum dimension 1440px on longest edge.

The source file can be processed with the tooling made available in https://github.com/jhellingman/tei2html.

Full instructions on how to use the scripts can be found in that repository.

About

Some general information that applies all repositories in GutenbergSource.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages