You can edit almost every page by Creating an account. Otherwise, see the FAQ.

PDFix SDK

From EverybodyWiki Bios & Wiki



PDFix SDK is a cross-platform library for Portable Document Format (PDF) processing.

History[edit]

  • Version 1.0 [2016] - Initial release. Logical content extraction, PDF to HTML conversion in fixed and responsive layout. Language support: C++, Java
  • Version 2.0 [2016] - PDF Form (AcroForm) support in HTML
  • Version 3.0 [2017] - PDF Autotag and OCR. Language support: C#
  • Version 4.0 [2018] - PDF to PDF/UA conversion, Language support: Python

Features [1][edit]

PDF Standard Features[edit]

  • PDF page rendering
  • Document metadata
  • Commenting
  • Watermarks and Stamps
  • Links and Actions
  • Bookmarks
  • PDF Form fields and form filling
  • Printing
  • OCR
  • Read access to low level PDF objects
  • Read access to PDF page objects

Logical Content Extraction[edit]

  • Document layout and structure recognition
  • Inteligent data extraction
  • Text paragraph detection
  • Image and graphics extraction
  • Annotation extraction
  • White space detection
  • Table detection (rows and columns)
  • Table of contents detection
  • Header and footer detection
  • Regular expressions and pattern matching

Conversion[edit]

  • PDF to HTML (original fixed and responsive layout)
  • PDF to XML, JSON, CSV, TXT, PNG, JPEG

Accessibility[edit]

  • Autotag
  • Font embedding
  • Make document PDF/UA

PDF Forms[edit]

See also[edit]

External Links[edit]

References

Re-submitting - fixed content after declined submissin[edit]


This article "PDFix SDK" is from Wikipedia. The list of its authors can be seen in its historical and/or the page Edithistory:PDFix SDK. Articles copied from Draft Namespace on Wikipedia could be seen on the Draft Namespace of Wikipedia and not main one.