Found 642 packages
Tesseract 5 adds a new neural net (LSTM) based OCR engine which is focused on line recognition, but also still supports the legacy Tesseract OCR engine of Tesseract 3 which works by recognizing character patterns. Compatibility with Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0). It also needs traineddata files which support the legacy engine, for example those from the tessdata repository.
Adds support for interop with System.Drawing to Tesseract such as passing Bitmap to Tesseract.
Tesseract 5.5.0 adds a new neural net (LSTM) based OCR engine which is focused on line recognition, but also still supports the legacy Tesseract OCR engine of Tesseract 3 which works by recognizing character patterns. Compatibility with Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0). It also needs traineddata files which support the legacy engine, for example those from the tessdata repository.
English language data files for Tesseract OCR v4.0
Tesseract.NET SDK it's a class library based on the tesseract-ocr project for embedding ocr capability in your .net project.
Recognize text from image and save the recognition results to a text file or searchable PDF document. Export text recognition result into HOCR format.
Tesseract OCR for Xamarin and Xamarin.Forms.
Google Tesseract OCR engine train data files for Polish language.
IronOCR is an advanced OCR (Optical Character Recognition) library for C# and .NET It provides Tesseract OCR on Mac, Windows, Linux, Azure and Docker for: * .NET Framework 4.6.2 + * .NET Standard 2.0 + * .NET Core 2.0 + * .NET 5 * .NET 6 * .NET 7 * .NET 8 * .NET 9 * .NET 10 * Mono for MacOS and Linux * Xamarin for MacOS IronOCR reads Text, Barcodes & QR from all major image and PDF formats using the latest Tesseract 5 engine. This library adds OCR functionality to Desktop, Console and Web applications in minutes. IronOCR's Unique Features: * Pure .Net OCR API * All OCR tasks run locally (no SAAS) * 125 languages * Barcode & QR Code reading * Corrects low quality, noisy and distorted scans * Performance tuned above and beyond any other known build of Tesseract OCR. * Reads PDFs * Reads multi-page TIFFs * Can save any OCR Scan to a searchable PDF document or XHTML Data output options include: Plain Text, Barcode Data and an OCR Result class containing paragraphs, lines, words, and characters. Language Support: 125 Languages including Arabic, Chinese, English, Finnish, French, German, Hebrew, Italian, Japanese, Korean, Portuguese, Russian, Spanish... Custom language packs can also be created. Licensing & Support available for commercial deployments. Email: support@ironsoftware.com For code examples, documentation & more visit http://ironsoftware.com/csharp/ocr/
.NET Wrapper for tessaract v5.0.0.20190623
Tesseract-OCR
This helps to read simple text (string or number) from the images using Tesseract without additional configuration. IMPORTANT : Change the properties of all the files in the "tessdata" folder for "Copy To Output Directory" as "Copy always". Sample Project : https://github.com/rohitvipin/TesseractHelper.Demo
Tesseract is probably the most accurate open source OCR engine available.
Use this package to add Tesseract 3 OCR engine files to your application.
Use this library with the Atalasoft OCR library to add Google's Tesseract engine to the useable engines.
Use this library with the Atalasoft OCR library to add Google's Tesseract engine to the useable engines.
Tesseract wrapper for .Net Maui Windows, iOS and Android platforms.
Tesseract-OCR binaries for NAPS2.Sdk
~ OCR Tesseeract plugin for PDFix.SDK ~ - OCR PDF document, page a portion of a page - embed fonts into the document Free trial: https://pdfix.net/download Code samples: http://pdfix.net/docs/_c_s__samples.html Related packages: PDFix.SDK ~ https://www.nuget.org/packages/PDFix.SDK/ PDFix.PdfToHtml ~ https://www.nuget.org/packages/PDFix.PdfToHtml/
Use this library with the Atalasoft OCR library to add Google's Tesseract v3 engine to the usable engines.