A beautiful demo is worth a thousand words:
- Bible de Genève, 1564 (fonts and typography): HTML / PDF
- Cheat Sheet (math formulas): HTML / PDF
- Scientific Paper (text and figures): HTML / PDF
- Full Circle Magazine (read while downloading): HTML / PDF
- Git Manual (CJK support): HTML / PDF
- Try your own files
pdf2htmlEX renders PDF files in HTML, utilizing modern Web technologies. It aims to provide an accurate rendering while remaining optimized for Web display.
pdf2htmlEX works best with text-based PDF files, for example scientific papers with complicated formulas and figures. Text, fonts and formatting are preserved natively in HTML, so you can still search and copy text. The generated HTML file is static, with optional features powered by JavaScript.
pdf2htmlEX is not only a converter, but also an online publishing tool that is flexible enough for many different use cases. Learn more about who should use pdf2htmlEX and why.
- Precise and native text in HTML
- Flexible Output
- Moderate Size
- More PDF features that you love: links, outlines & printing
- SVG background output & Type 3 font conversion
Learn more
Compare with others
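As a rough illustration of how the features above map onto a conversion run, here is a minimal Python sketch that wraps the `pdf2htmlEX` command line. The helper names `build_command` and `convert`, the default `out` directory, and the file name `paper.pdf` are hypothetical and chosen only for this example. The sketch assumes the `--zoom`, `--dest-dir` and `--bg-format` options are available in your build, and that the output file takes the input's base name with an `.html` extension; check `pdf2htmlEX --help` to confirm.

```python
import shutil
import subprocess
from pathlib import Path


def build_command(pdf_path, dest_dir="out", zoom=1.3, svg_background=False):
    """Build the pdf2htmlEX argument list without running anything.

    The helper and its defaults are illustrative, not part of pdf2htmlEX.
    """
    cmd = ["pdf2htmlEX", "--zoom", str(zoom), "--dest-dir", str(dest_dir)]
    if svg_background:
        # Assumes the installed build supports SVG backgrounds.
        cmd += ["--bg-format", "svg"]
    cmd.append(str(pdf_path))
    return cmd


def convert(pdf_path, **options):
    """Run pdf2htmlEX if it is installed and return the expected HTML path."""
    if shutil.which("pdf2htmlEX") is None:
        raise RuntimeError("pdf2htmlEX was not found on PATH")
    subprocess.run(build_command(pdf_path, **options), check=True)
    # Assumption: output defaults to <input base name>.html inside dest_dir.
    dest_dir = Path(options.get("dest_dir", "out"))
    return dest_dir / (Path(pdf_path).stem + ".html")


if __name__ == "__main__":
    # Only prints the command; call convert("paper.pdf") to actually run it.
    print(" ".join(build_command("paper.pdf", svg_background=True)))
```

Keeping command construction separate from execution makes it possible to inspect or log the exact invocation before any file is converted.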
- Lead author: 王璐 (Lu Wang) coolwanglu+no.support.for.pdf2htmlEX@gmail.com or @coolwanglu
- Questions about pdf2htmlEX? Use the mailing list instead.
- Accepting messages in 中文, English or 日本語
pdf2htmlEX, as a whole package, is licensed under GPLv3.
Some resource files are released under more relaxed licenses; read LICENSE
for more details.
pdf2htmlEX is made possible thanks to the following projects:
pdf2htmlEX is inspired by the following projects:
- pdftops & pdftohtml from poppler
- MuPDF
- PDF.js
- Crocodoc
- Google Doc
Special thanks to:
- Hongliang Tian
- Wanmin Liu