You can edit almost every page by Creating an account. Otherwise, see the FAQ.

pdftotext

From EverybodyWiki Bios & Wiki

pdftotext is an open-source command-line utility for converting PDF files to plain text files—i.e. extracting text data from PDF-encapsulated files. It is freely available and included by default with many Linux distributions, and is also available for Windows as part of the Xpdf Windows port. Such text extraction is complicated as PDF files are internally built on page drawing primitives, meaning the boundaries between words and paragraphs often must be inferred based on their position on the page.

pdftotext is part of the Xpdf software suite. Poppler, which is derived from Xpdf, also includes an implementation of pdftotext. On most Linux distributions, pdftotext is included as part of the poppler-utils package.[1]

See also[edit]

References[edit]

  1. "Poppler". Freedesktop.org. Retrieved 2022-10-27.

External links[edit]

  • Lua error in Module:Official_website at line 90: attempt to index field 'wikibase' (a nil value).



This article "Pdftotext" is from Wikipedia. The list of its authors can be seen in its historical and/or the page Edithistory:Pdftotext. Articles copied from Draft Namespace on Wikipedia could be seen on the Draft Namespace of Wikipedia and not main one.