Tag: PDF

Thumbnail for How to extract text from images using Python?

This article discusses the fascinating concept of extracting text from images using Python and Pytesseract. It covers the installation process of Pytesseract and PIL, and provides a simple code example to extract text from an image. The article also highlights the limitations of Pytesseract and...... Read More


Keywords: python Pytesseract OCR optical character recognition image processing text extraction PIL accuracy configuration automation PDF
Thumbnail for How To Convert PDF documents to images using python

Learn how to extract the first page from a PDF file as a PNG image file using PyMuPDF's fitz module. This article explains the features of the fitz module and provides a Python script for extracting PDF pages as images. A bash script is also provided to demonstrate how to activate a local...... Read More


Keywords: PyMuPDF fitz module PDF image extraction virtual environment Python script bash script bash script. python python