PDF Text Extraction with Python

less than 1 minute read

PDF Text Extraction with Python

Portable Document Files (PDFs) originated during the Wild West of Word Processing. Competitors created innumerable file formats, which only their proprietary applications could decipher. Popular cross-platform applications like Microsoft Word provided no relief, so it was not uncommon for a Mac user to open a doc written on PC, and vice versa, only to find the file had been inexplicably converted to alien script.