IronOCR is an advanced OCR (Optical Character Recognition) library for C# and .NET It provides Tesseract OCR on Mac, Windows, Linux, Azure and Docker for: * .NET Framework 4.6.2 + * .NET Standard 2.0 + * .NET Core 2.0 + * .NET 5 * .NET 6 * .NET 7 * .NET 8 * .NET 9 * .NET 10 * Mono for MacOS and Linux * Xamarin for MacOS IronOCR reads Text, Barcodes & QR from all major image and PDF formats using the latest Tesseract 5 engine. This library adds OCR functionality to Desktop, Console and Web applications in minutes. IronOCR's Unique Features: * Pure .Net OCR API * All OCR tasks run locally (no SAAS) * 125 languages * Barcode & QR Code reading * Corrects low quality, noisy and distorted scans * Performance tuned above and beyond any other known build of Tesseract OCR. * Reads PDFs * Reads multi-page TIFFs * Can save any OCR Scan to a searchable PDF document or XHTML Data output options include: Plain Text, Barcode Data and an OCR Result class containing paragraphs, lines, words, and characters. Language Support: 125 Languages including Arabic, Chinese, English, Finnish, French, German, Hebrew, Italian, Japanese, Korean, Portuguese, Russian, Spanish... Custom language packs can also be created. Licensing & Support available for commercial deployments. Email: support@ironsoftware.com For code examples, documentation & more visit http://ironsoftware.com/csharp/ocr/
PM> Install-Package IronOCR
using IronOcr;
var ocr = new IronTesseract();
using (var ocrInput = new OcrInput())
{
ocrInput.LoadImage("image.png");
ocrInput.LoadPdf("document.pdf");
// Optionally Apply Filters if needed:
// ocrInput.Deskew(); // use only if image not straight
// ocrInput.DeNoise(); // use only if image contains digital noise
var ocrResult = ocr.Read(ocrInput);
Console.WriteLine(ocrResult.Text);
}
Dive deeper with our extensive documentation and examples:
Tutorials: Step-by-step guides to help you build your first OCR application.
Code Examples: Practical examples of executable code for common tasks.
How-To Guides: Practical, goal-oriented instructions to solve specific problems.
Demo: Demonstrating guides that showcase how IronOCR works the way it does.
API Reference: Detailed technical descriptions of the API and its components.
Document Reading: Accurately extract text from common documents like scans, photographs, and receipts.
Specialized Reading: Utilize advanced reading functions to read formatted data such as passports (MRZ) and checks (MICR)
Preprocessing: Improve OCR accuracy with a suite of image preprocessing filters, including deskew, denoise, and binarization.
Configurations & Languages: Fine-tune the OCR engine for specific scenarios and read documents with 125 international languages supported.
OCR Results: Access structured output data by pages, paragraphs, lines, etc. Support the export to searchable PDFs
Platforms: .NET 10, .NET 9, .NET 8, .NET 7, .NET 6, .NET 5, Core 2x & 3x, Standard 2
Framework: .NET Framework 4.6.2 (and above)
App Models: Console, Web, and Desktop Apps
Operating Systems: Windows, macOS, Linux (Debian, CentOS, Ubuntu), iOS, Android
IDEs: Microsoft Visual Studio or Jetbrains ReSharper & Rider
IronOCR is a commercially licensed product.
Have a question or running into an issue?
Email Support: Reach out to our team directly at support@ironsoftware.com.
Live Chat Support: https://ironsoftware.com/csharp/ocr/#helpscout-support
Report a Bug: https://ironsoftware.com/ticket-submission/
Community: https://ironsoftware.com/company/iron-slack-community/