|
|
пре 1 година | |
|---|---|---|
| pdf2zh | пре 1 година | |
| .gitignore | пре 1 година | |
| LICENSE | пре 1 година | |
| README.md | пре 1 година | |
| setup.py | пре 1 година |
PDF scientific paper translation and bilingual comparison based on font rules and deep learning, preserving formula and figure layout.
Retain formulas and charts.
Preserve table of contents.
Support multiple translation services.
pip install pdf2zh
Execute the translation command in the command line to generate the translated document example-zh.pdf and the bilingual document example-dual.pdf in the current directory.
pdf2zh example.pdf
pdf2zh example.pdf -p 1-3,5
See Languages Codes.
pdf2zh example.pdf -li en -lo ja
pdf2zh example.pdf -s gemma2
pdf2zh BDA3.pdf -f "(CM[^RT].*|MS.*|XY.*|MT.*|BL.*|.*0700|.*0500|.*Italic)" -c "(\(|\||\)|\+|=|\d|[\u0080-\ufaff])"
Document merging: PyMuPDF
Document parsing: Pdfminer.six
Document extraction: MinerU
Multi-threaded translation: MathTranslate
Layout parsing: DocLayout-YOLO