From 80a7409afbbbf1043e2adc9f5004b50ea0838931 Mon Sep 17 00:00:00 2001 From: Yury Fedoseev Date: Mon, 16 Feb 2026 13:40:57 -0800 Subject: [PATCH] Add pdf_oxide to PDF section Fast PDF library for text extraction, image extraction, and markdown conversion. Rust-powered via PyO3 with Python 3.8-3.14 support. MIT/Apache-2.0 licensed. https://github.com/yfedoseev/pdf_oxide https://pypi.org/project/pdf-oxide/ --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index f89e853f6..99248a99e 100644 --- a/README.md +++ b/README.md @@ -984,6 +984,7 @@ _Libraries for parsing and manipulating specific text formats._ - [XlsxWriter](https://github.com/jmcnamara/XlsxWriter) - A Python module for creating Excel .xlsx files. - [xlwings](https://github.com/ZoomerAnalytics/xlwings) - A BSD-licensed library that makes it easy to call Python from Excel and vice versa. - PDF + - [pdf_oxide](https://github.com/yfedoseev/pdf_oxide) - A fast PDF library for text extraction, image extraction, and markdown conversion, powered by Rust. - [pdfminer.six](https://github.com/pdfminer/pdfminer.six) - Pdfminer.six is a community maintained fork of the original PDFMiner. - [pikepdf](https://github.com/pikepdf/pikepdf) - A powerful library for reading and editing PDF files, based on qpdf. - [PyPDF2](https://github.com/mstamy2/PyPDF2) - A library capable of splitting, merging and transforming PDF pages.