./nugetz

#Extract

335 packages tagged with “Extract”

BitMiracle.Docotic.Pdf

C# PDF library to create, edit, draw and print PDF files. Docotic.Pdf is the .NET PDF library for .NET Core, ASP.NET, Windows Forms, WPF, Xamarin, Blazor, Unity, and HoloLens applications.
Use Docotic.Pdf to:
* Create PDF documents using Canvas API.
* Generate PDF reports, invoices, etc. using the fluent API provided by the Layout add-on.
* Extract text from PDF documents.
* Convert HTML to PDF in C# with the help of the HTML to PDF converter add-on.
* Convert PDF to image and print PDF documents.
* Add digital signatures to PDF documents.
* Encrypt and decrypt PDF files.
* OCR PDF files.
* Merge PDF in C# or VB.NET code. Split PDF documents.
* Compress PDF files.
* Linearize PDF files. Optimize for Fast Web View.
* Edit PDF files.
The SDK is a 100% managed assembly without unsafe blocks. The assembly has no external dependencies. Docotic.Pdf supports .NET 8, .NET 7, .NET 6, .NET 5, .NET Standard / .NET Core, and .NET 4.x frameworks. You can use the library in .NET on Windows, Linux, macOS, Android, and iOS. It works in Azure, AWS and other cloud environments, and can be used from a Docker container.
To test the library, visit https://bitmiracle.com/pdf-library/ and get a free time-limited license key. For documentation, sample code, and API reference, visit https://bitmiracle.com/pdf-library/help
There are add-ons to the library:
* HTML to PDF add-on https://www.nuget.org/packages/BitMiracle.Docotic.Pdf.HtmlToPdf/
* Layout add-on https://www.nuget.org/packages/BitMiracle.Docotic.Pdf.Layout/
* Gdi add-on https://www.nuget.org/packages/BitMiracle.Docotic.Pdf.Gdi/
* Logging add-on https://www.nuget.org/packages/BitMiracle.Docotic.Pdf.Logging
We offer royalty-free licenses for Docotic.Pdf. Eligible projects and/or people can receive a free license.

v9.8.186347.1M
PDF pdf-library pdf-to-text pdf-to-image create-pdf

Aspose.HTML

Aspose.HTML for .NET is a cross-platform class library that works as a headless browser, seamlessly integrating with .NET, C#, VB.NET, and ASP.NET applications. It supports HTML5, CSS3, SVG, and Canvas while building a Document Object Model (DOM) based on the WHATWG standard. Developers can navigate and manipulate HTML documents using DOM traversal, XPath, CSS selectors, or JavaScript. Along with a wide range of functions for programmatic work with HTML content, the library allows users to load, read, convert, and render SVG, MHTML, Markdown, and EPUB documents. Aspose.HTML for .NET provides robust data extraction capabilities, enabling you to parse and extract information from HTML documents. It also supports binding data from XML or JSON sources to HTML templates, making it ideal for generating dynamic content. Additional features include CSS extraction, document sandboxing, SVG file management, support for asynchronous operations, custom output stream handling, real-time DOM observation using MutationObserver, an HTML form editor, comprehensive web accessibility testing, and more. Aspose.HTML for .NET provides comprehensive format conversion support, enabling your applications to convert from HTML, XHTML, SVG, EPUB, MHTML, and Markdown documents to various formats, including PDF, XPS, DOCX, images, etc. It is optimized for handling complex and large-scale documents, making it ideal for web automation, content creation and management.

v26.2.0 4.3M
Aspose.HTML C# .NET HTML-to-Image HTML-to-PDF

Apitron.PDF.Kit

It is 100% managed code and does not require special manipulations to run with any .NET framework version starting from 2.0. PDF standard versions supported are: ALL versions. Files can be normal, linearized, password-protected, signed, incrementally updated.
We support many PDF content manipulation scenarios. Below are a few that are worth mentioning:
- Extract, modify and add graphics (text, images, drawings)
- Split or merge PDF documents
- Extract PDF text to HTML, Tagged or Raw format
- Fill, sign or create PDF forms
- Add or remove document fields
- Examine resources within a document: fonts, embedded files, XML (ZUGFeRD)
- Digitally sign and check existing signatures on PDF documents
- Search for specific text
- Protect document with a password
- Work with navigation objects, e.g. create bookmarks or links
- Full support for annotations
- Full support for PDF actions
- All fonts defined by specification are supported
- Various colorspaces and color profiles are supported, e.g. you may draw in RGB, CMYK, gray, or whatever colorspace you like.
- Files can be saved to other subtypes of PDF, for example Linearized or PDF/A.
- If you require a specific functionality and are unsure whether it is supported, please review our online help or contact support so we will be able to handle this.
- Fixed layout API, implemented to be 100% PDF specification compatible, unlocks the full power of PDF for you. Any complex PDF creation or manipulation task can be completed instantly.
- Flow layout API, a styles-driven content generation API similar to HTML+CSS, gives you the ability to create stunning documents, reports, bills, catalogues and more in minutes. Compact and easy to use, it supports creation of XML templates and much more.

v2.0.571.8M
Apitronpdf.kit.NETcore3.1

HiQPdf

HiQPdf Library for .NET (Classic) is a fast and flexible tool for creating high-quality PDF documents and converting HTML to PDF in .NET Framework, .NET Core and .NET Standard applications. The library uses the Classic rendering engine to convert HTML to PDF, images and SVG. You can also create, stamp, secure, merge and split PDF documents, extract text and images from PDF documents, search text in PDF, convert PDF pages to images or HTML.
This package is compatible with .NET Framework, .NET Core and .NET Standard 2.0 on Windows platforms. For applications that need to run on both Windows and Linux platforms, the HiQPdf.Next.HtmlToPdf package provides a newer and highly accurate rendering engine designed for modern HTML, CSS and JavaScript content. The full HiQPdf.Next package allows you to create, edit and merge PDF documents, convert HTML to PDF or images, convert Word, Excel, RTF and Markdown to PDF, convert PDF to text or images.
The compatibility list includes the following .NET versions, platforms and application types:
* .NET Framework 2.0, 3.5, 4.0 and above
* .NET 10, 9, 8, 7, 6
* .NET Standard 2.0
* Windows platforms
* Azure Cloud Services and Azure Virtual Machines
* Web, Console and Desktop applications
Library Features:
* HTML to PDF to quickly create PDF documents from HTML
* HTML to Image and HTML to SVG converters
* PDF to Image to rasterize PDF document pages to images
* PDF to HTML to create HTML documents from PDF pages
* PDF to Text to extract text from PDF documents
* Search text in PDF documents
* Extract images from PDF documents
* Create PDF documents with text, HTML, SVG, images and graphics
* Create encrypted, password-protected, digitally signed PDF documents
* Create PDF documents with forms, text notes, links and JavaScript actions
* Merge multiple PDF documents into a single one
* Stamp PDF with HTML, text and images
Documentation and code samples: https://www.hiqpdf.com/hiqpdf-dotnet

v18.0.22.3M
hiqpdf html-to-pdf url-to-pdf web-to-pdf html-to-image

Spire.OCR

For a guide to using Spire.OCR, see: https://www.e-iceblue.com/Tutorials/NET/Spire.OCR-for-.NET/Program-Guide/Recognize-Text/C-Extract-Text-from-Images-using-the-New-Model-of-Spire.OCR-for-.NET.html
Spire.OCR for .NET is a professional OCR library to read text from images in JPG, PNG, GIF, BMP and TIFF formats. Developers can easily add OCR functionality to .NET applications in C# and VB.NET. It supports commonly used image formats and provides features such as reading multiple characters and fonts from images, bold and italic styles, scanning of the whole image and much more.
Spire.OCR for .NET provides a very easy way to read text from images. With just one line of code in C# or VB.NET, it reads text from various common image formats, such as Bitmap, JPG, PNG, TIFF and GIF. Spire.OCR recognizes text in popular fonts such as Arial, Times New Roman, Courier New, Verdana, Tahoma and Calibri, in regular, bold and italic text styles. It supports multiple languages such as English, Chinese, French, German, Japanese and Korean.
Multiple languages supported
• English
• Chinese
• Japanese
• Korean
• German
• French
Popular fonts supported
• Arial
• Times New Roman
• Courier New
• Verdana
• Tahoma
• Calibri
Font styles supported
• Regular
• Bold
• Italic
It supports the OCR feature on Mac, Windows, Linux, Azure and Docker for:
• .NET Framework 2.0 +
• .NET Standard 2.0 +
• .NET Core 2.0 +
• .NET 5
• Mono for macOS and Linux
• Xamarin for macOS

v2.2.263.3K
OCR image extract scan text

Apitron.PDF.Kit.Silverlight

It’s 100% managed code. PDF standard versions supported are: ALL versions. Files can be normal, linearized, password-protected, signed, incrementally updated.
We support many PDF content manipulation scenarios. Below are a few that are worth mentioning:
- Extract, modify and add graphics (text, images, drawings)
- Split or merge PDF documents
- Fill or create PDF forms
- Add or remove document fields
- Examine resources within a document: fonts, embedded files
- Digitally sign and check existing signatures on PDF documents
- Search for specific text
- Protect document with a password
- Work with navigation objects, e.g. create bookmarks or links
- Full support for annotations
- Full support for PDF actions
- All fonts defined by specification are supported
- Various colorspaces and color profiles are supported, e.g. you may draw in RGB, CMYK, gray, or whatever colorspace you like.
- Files can be saved to other “subtypes” of PDF, for example Linearized or PDF/A.
- If you require a specific functionality and are unsure whether it is supported, please review our online help or contact support so we'll be able to handle this.
- Fixed layout API, implemented to be 100% PDF specification compatible, unlocks the full power of PDF for you. Any complex PDF creation or manipulation task can be completed instantly.
- Flow layout API, a styles-driven content generation API similar to HTML+CSS, gives you the ability to create stunning documents, reports, bills, catalogues and more in minutes. Compact and easy to use, it supports creation of XML templates and much more.

v1.0.96166.1K
Apitron.Pdf.Kit.NETcreatepdfPDF