PDFix SDK
From EverybodyWiki Bios & Wiki
PDFix SDK is a cross-platform library for Portable Document Format (PDF) processing.
History[edit]
- Version 1.0 [2016] - Initial release. Logical content extraction, PDF to HTML conversion in fixed and responsive layout. Language support: C++, Java
- Version 2.0 [2016] - PDF Form (AcroForm) support in HTML
- Version 3.0 [2017] - PDF Autotag and OCR. Language support: C#
- Version 4.0 [2018] - PDF to PDF/UA conversion, Language support: Python
Features [1][edit]
PDF Standard Features[edit]
- PDF page rendering
- Document metadata
- Commenting
- Watermarks and Stamps
- Links and Actions
- Bookmarks
- PDF Form fields and form filling
- Printing
- OCR
- Read access to low level PDF objects
- Read access to PDF page objects
Logical Content Extraction[edit]
- Document layout and structure recognition
- Inteligent data extraction
- Text paragraph detection
- Image and graphics extraction
- Annotation extraction
- White space detection
- Table detection (rows and columns)
- Table of contents detection
- Header and footer detection
- Regular expressions and pattern matching
Conversion[edit]
- PDF to HTML (original fixed and responsive layout)
- PDF to XML, JSON, CSV, TXT, PNG, JPEG
Accessibility[edit]
- Autotag
- Font embedding
- Make document PDF/UA
PDF Forms[edit]
- PDF Form to HTML Form
- AcroForm JavaScript (ECMAScript for PDF) support in HTML
See also[edit]
External Links[edit]
References
Re-submitting - fixed content after declined submissin[edit]
This article "PDFix SDK" is from Wikipedia. The list of its authors can be seen in its historical and/or the page Edithistory:PDFix SDK. Articles copied from Draft Namespace on Wikipedia could be seen on the Draft Namespace of Wikipedia and not main one.