Process PDF’s fast with PyPDF2 and Pdfminer3k

Getting text from PDF files can be an absolute pain. Fortunately the quality of scans are getting better which makes parsing them a little bit easier. In this post we will show you two Python packages for working with PDF files. Neither of them is perfect, but you can get decent results with them. In … Continue reading “Process PDF’s fast with PyPDF2 and Pdfminer3k”