17 packages tagged with “PdfPig”
Reads text content from PDF documents and supports document creation. Apache 2.0 licensed.
Extends Verify (https://github.com/VerifyTests/Verify) to allow verification via PdfPig.
Extract tables from PDF files (port of tabula-java using PdfPig).
PdfPig implementation of the JBIG2 filter, based on pdfbox-jbig2.
PdfPig implementation of the DCT (Jpeg) filter, based on JpegLibrary.
Render pdf documents as images using PdfPig and SkiaSharp.
Extract tables from PDF files (port of tabula-java using PdfPig). Json writer.
PdfPig implementation of the JPX (Jpeg2000) filter, based on OpenJpegDotNet.
PdfPig implementation of the JPX (Jpeg2000) filter, based on OpenJPEG.net.
Extract tables from PDF files (port of tabula-java using PdfPig). Csv and Tsv writers.
SlapKit.PDF provides full API compatibility with the famous PdfPig library. Adding API for layout building and hyperlinks in this version. SlapKit.PDF enhances the essentials of PDF creation, editing, and content extraction, taking your PDF tasks to new heights. Dive into creating captivating documents, enhancing current ones, or quickly pulling out essential content. Experience a blend of familiarity and groundbreaking enhancements with SlapKit.PDF for all your PDF endeavors!
PDF document reader for the Summarizers pipeline. Extracts text with page markers, metadata, and paragraph normalization using PdfPig.
PDF document loaders for Mythosia.AI RAG pipeline. Includes PdfDocumentLoader and PdfPig parser.
A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig). Contains OpenCvSharp4 for image processing used in the Lattice parser. Maintained fork with fixes for multi-column text splitting.
A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig). Maintained fork with fixes for multi-column text splitting.
PDF table data extraction plugin for the Unio library. Extract strongly-typed data from PDF tables with automatic column detection and row parsing. Built on PdfPig for reliable, open-source PDF processing without commercial dependencies.
PDF processing extensions for Cyclotron.Maf.AgentSdk. Provides PDF image extraction, content analysis, and markdown conversion using PdfPig. Supports vision model integration and document workflow orchestration.