pdf2htmlEX, 32-bit Windows build. Original source: https://blog.csdn.net/weixin_44603744/article/details/86596082
Download the Windows executable build from:
http://soft.rubypdf.com/software/pdf2htmlex-windows-version
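Usage:
- Put the PDF file you want to convert into the directory where pdf2htmlEX was extracted.
- Open a command prompt and change to the pdf2htmlEX directory:
    cd d:\pdfex
    d:
- Run pdf2htmlex to convert the file:
    pdf2htmlex --zoom 1.8 abc.pdf
- When the command finishes, an HTML file with the same name as the PDF is generated in the same directory.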
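The manual steps above can also be scripted. The following is a minimal sketch in Python, not part of the original post. It assumes the executable inside the extracted folder is named pdf2htmlEX.exe (a guess; adjust it to the actual file name), and the helper names `build_command` and `convert` are hypothetical.

```python
import subprocess
from pathlib import Path


def build_command(install_dir, pdf_name, zoom=1.8):
    """Build the argument list for the command shown in the steps above."""
    # Assumed executable name; check the extracted folder for the real one.
    exe = Path(install_dir) / "pdf2htmlEX.exe"
    return [str(exe), "--zoom", str(zoom), pdf_name]


def convert(install_dir, pdf_name, zoom=1.8):
    """Run the conversion inside install_dir and return the expected HTML path."""
    # cwd stands in for the manual "cd" step: the PDF is expected to sit
    # in the extracted pdf2htmlEX folder, as described above.
    subprocess.run(build_command(install_dir, pdf_name, zoom),
                   cwd=install_dir, check=True)
    # The output has the same name as the PDF, with an .html extension.
    return Path(install_dir) / Path(pdf_name).with_suffix(".html").name
```

For example, `convert(r"d:\pdfex", "abc.pdf")` would run the same command as the steps above and return the path of the generated abc.html, raising `subprocess.CalledProcessError` if the tool exits with an error.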
Parameters
--zoom: the zoom factor. The conversion follows the PDF file's default settings; if the result is hard to read, adjust --zoom to scale the text.
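The pdf2htmlEX option reference says that --zoom, --fit-width and --fit-height can be combined, that the minimum one is used, and that pages are rendered at 72 DPI when none is given. The sketch below illustrates one reading of that rule. It assumes page sizes are given in PDF points (1 point = 1 pixel at 72 DPI) and that each fit option is turned into a ratio against the page size; the function `effective_zoom` is hypothetical and is not pdf2htmlEX's actual code.

```python
def effective_zoom(page_width_pt, page_height_pt,
                   zoom=None, fit_width=None, fit_height=None):
    """Return the zoom factor implied by the given options (illustrative only)."""
    candidates = []
    if zoom is not None:
        candidates.append(zoom)                        # --zoom is used directly
    if fit_width is not None:
        candidates.append(fit_width / page_width_pt)   # pixels per point
    if fit_height is not None:
        candidates.append(fit_height / page_height_pt)
    # No option given: 72 DPI, which is a factor of 1.0 under the assumption above.
    return min(candidates) if candidates else 1.0
```

Under these assumptions, a 600 x 800 point page with `fit_width=1200` and `fit_height=1200` gets a factor of 1.5, because the height limit is the tighter of the two.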
More options: https://github.com/coolwanglu/pdf2htmlEX/wiki/Command-Line-Options
Project on GitHub: https://github.com/coolwanglu/pdf2htmlEX
OPTIONS

Pages

  -f, --first-page <num> (Default: 1)
      Specify the first page to process

  -l, --last-page <num> (Default: last page)
      Specify the last page to process

Dimensions

  --zoom <ratio>, --fit-width <width>, --fit-height <height>
      --zoom specifies the zoom factor directly; --fit-width/height specifies the maximum width/height of a page, the values are in pixels.
      If multiple values are specified, the minimum one will be used.
      If none is specified, pages will be rendered as 72DPI.

  --use-cropbox <0|1> (Default: 1)
      Use CropBox instead of MediaBox for output.

  --hdpi <dpi>, --vdpi <dpi> (Default: 144)
      Specify the horizontal and vertical DPI for images

Output

  --embed <string>
  --embed-css <0|1> (Default: 1)
  --embed-font <0|1> (Default: 1)
  --embed-image <0|1> (Default: 1)
  --embed-javascript <0|1> (Default: 1)
  --embed-outline <0|1> (Default: 1)
      Specify which elements should be embedded into the output HTML file.
      If switched off, separated files will be generated along with the HTML file for the corresponding elements.
      --embed accepts a string as argument. Each letter of the string must be one of `cCfFiIjJoO`, which corresponds to one of the --embed-*** switches. Lower case letters for 0 and upper case letters for 1. For example, `--embed cFIJo` means to embed everything but CSS files and outlines.

  --split-pages <0|1> (Default: 0)
      If turned on, the content of each page is stored in a separated file.
      This switch is useful if you want pages to be loaded separately & dynamically -- a supporting server might be necessary.
      Also see --page-filename.

  --dest-dir <dir> (Default: .)
      Specify destination folder.

  --css-filename <filename> (Default: <none>)
      Specify the filename of the generated css file, if not embedded.
      If it's empty, the file name will be determined automatically.

  --page-filename <filename> (Default: <none>)
      Specify the filename template for pages when --split-pages is 1
      A %d placeholder may be included in `filename` to indicate where the page number should be placed. The placeholder supports a limited subset of normal numerical placeholders, including specified width and zero padding.
      If `filename` does not contain a placeholder for the page number, the page number will be inserted directly before the file extension. If the filename does not have an extension, the page number will be placed at the end of the file name.
      If --page-filename is not specified, <input-filename> will be used for the output filename, replacing the extension with .page and adding the page number directly before the extension.

      Examples

        pdf2htmlEX --split-pages 1 foo.pdf
            Yields page files foo1.page, foo2.page, etc.

        pdf2htmlEX --split-pages 1 foo.pdf --page-filename bar.baz
            Yields page files bar1.baz, bar2.baz, etc.

        pdf2htmlEX --split-pages 1 foo.pdf --page-filename page%dbar.baz
            Yields page files page1bar.baz, page2bar.baz, etc.

        pdf2htmlEX --split-pages 1 foo.pdf --page-filename bar%03d.baz
            Yields page files bar001.baz, bar002.baz, etc.

  --outline-filename <filename> (Default: <none>)
      Specify the filename of the generated outline file, if not embedded.
      If it's empty, the file name will be determined automatically.

  --process-nontext <0|1> (Default: 1)
      Whether to process non-text objects (as images)

  --process-outline <0|1> (Default: 1)
      Whether to show outline in the generated HTML

  --printing <0|1> (Default: 1)
      Enable printing support. Disabling this option may reduce the size of CSS.

  --fallback <0|1> (Default: 0)
      Output in fallback mode, for better accuracy and browser compatibility, but the size becomes larger.

  --tmp-file-size-limit <limit> (Default: -1)
      This limits the total size (in KB) of the temporary files which will also limit the total size of the output file. This is an estimate and it will stop after a page, once the total temporary files size is greater than this number.
      -1 means no limit and is the default.

Fonts

  --embed-external-font <0|1> (Default: 1)
      Specify whether the local matched fonts, for fonts not embedded in PDF, should be embedded into HTML.
      If this switch is off, only font names are exported such that web browsers may try to find proper fonts themselves, and that might cause issues about incorrect font metrics.

  --font-format <format> (Default: woff)
      Specify the format of fonts extracted from the PDF file.

  --decompose-ligature <0|1> (Default: 0)
      Decompose ligatures. For example 'fi' -> 'f''i'.

  --auto-hint <0|1> (Default: 0)
      If set to 1, hints will be generated for the fonts using fontforge.
      This may be preceded by --external-hint-tool.

  --external-hint-tool <tool> (Default: <none>)
      If specified, the tool will be called in order to enhanced hinting for fonts, this will precede --auto-hint.
      The tool will be called as '<tool> <in.suffix> <out.suffix>', where suffix will be the same as specified for --font-format.

  --stretch-narrow-glyph <0|1> (Default: 0)
      If set to 1, glyphs narrower than described in PDF will be stretched; otherwise space will be padded to the right of the glyphs

  --squeeze-wide-glyph <0|1> (Default: 1)
      If set to 1, glyphs wider than described in PDF will be squeezed; otherwise it will be truncated.

  --override-fstype <0|1> (Default: 0)
      Clear the fstype bits in TTF/OTF fonts.
      Turn this on if Internet Explorer complains about 'Permission must be Installable' AND you have permission to do so.

  --process-type3 <0|1> (Default: 0)
      If turned on, pdf2htmlEX will try to convert Type 3 fonts such that text can be rendered natively in HTML. Otherwise all text with Type 3 fonts will be rendered as image.
      This feature is highly experimental.

Text

  --heps <len>, --veps <len> (Default: 1)
      Specify the maximum tolerable horizontal/vertical offset (in pixels).
      pdf2htmlEX would try to optimize the generated HTML file moving Text within this distance.

  --space-threshold <ratio> (Default: 0.125)
      pdf2htmlEX would insert a whitespace character ' ' if the distance between two consecutive letters in the same line is wider than ratio * font_size.

  --font-size-multiplier <ratio> (Default: 4.0)
      Many web browsers limit the minimum font size, and many would round the given font size, which results in incorrect rendering.
      Specify a ratio greater than 1 would resolve this issue, however it might freeze some browsers.
      For some versions of Firefox, however, there will be a problem when the font size is too large, in which case a smaller value should be specified here.

  --space-as-offset <0|1> (Default: 0)
      If set to 1, space characters will be treated as offsets, which allows a better optimization.
      For PDF files with bad encodings, turning on this option may cause losing characters.

  --tounicode <-1|0|1> (Default: 0)
      A ToUnicode map may be provided for each font in PDF which indicates the 'meaning' of the characters. However often there is better "ToUnicode" info in Type 0/1 fonts, and sometimes the ToUnicode map provided is wrong.
      If this value is set to 1, the ToUnicode Map is always applied, if provided in PDF, and characters may not render correctly in HTML if there are collisions.
      If set to -1, a customized map is used such that rendering will be correct in HTML (visually the same), but you may not get correct characters by select & copy & paste.
      If set to 0, pdf2htmlEX would try its best to balance the two methods above.

  --optimize-text <0|1> (Default: 0)
      If set to 1, pdf2htmlEX will try to reduce the number of HTML elements used for text. Turn it off if anything goes wrong.

Background Image

  --bg-format <format> (Default: png)
      Specify the background image format. Run `pdf2htmlEX -v` to check all supported formats.

PDF Protection

  -o, --owner-password <password>
      Specify owner password

  -u, --user-password <password>
      Specify user password

  --no-drm <0|1> (Default: 0)
      Override document DRM settings
      Turn this on only when you have permission.

Misc.

  --clean-tmp <0|1> (Default: 1)
      If switched off, intermediate files won't be cleaned in the end.

  --data-dir <dir> (Default: /usr/local/share/pdf2htmlEX)
      Specify the folder holding the manifest and other files (see below for the manifest file)

  --tmp-dir <dir> (Default: /tmp)
      Specify the temporary folder to use for temporary files

  --css-draw <0|1> (Default: 0)
      Experimental and unsupported CSS drawing

  --debug <0|1> (Default: 0)
      Print debug information.

Meta

  -v, --version
      Print copyright and version info

  --help
      Print usage information
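
The naming rules described for --page-filename in the option reference above are easy to misread, so here is a small Python sketch that reproduces them. It is an illustration only, not pdf2htmlEX's own code: the function `page_filename` is hypothetical, and it assumes the placeholder is limited to the form %d with an optional width and zero padding (for example %03d).

```python
import os
import re

# Matches %d, %3d, %03d and similar; assumed to cover the "limited subset"
# of numeric placeholders mentioned in the option reference.
_PLACEHOLDER = re.compile(r"%(\d*)d")


def page_filename(input_pdf, page_no, template=None):
    """Return the page file name for page_no under the documented rules."""
    if template is None:
        # No --page-filename: use the input name with a .page extension.
        stem, _ = os.path.splitext(os.path.basename(input_pdf))
        template = stem + ".page"
    match = _PLACEHOLDER.search(template)
    if match:
        # Substitute the page number, honouring width and zero padding.
        number = format(page_no, match.group(1) + "d")
        return template[:match.start()] + number + template[match.end():]
    # No placeholder: put the number before the extension,
    # or at the end of the name if there is no extension.
    stem, ext = os.path.splitext(template)
    return f"{stem}{page_no}{ext}"
```

Called with the templates from the examples in the reference, `page_filename("foo.pdf", 1)` gives foo1.page and `page_filename("foo.pdf", 1, "bar%03d.baz")` gives bar001.baz.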