XImage.OCR is a C# Optical Character Recognition library to read, extract text contents from images, scanned PDFs, multi-page TIFF files in .NET projects. XImage.OCR from RasterEdge is an advanced OCR library : * Allow characters recognition and extraction from images captured by digital camera, scanned PDF document and image-only PDF * Support multiple languages, including English, French, German, Portuguese, Spanish, Russian, Italian, Dutch, Arabic, Korean, etc * Support user-defined image and document OCR, like full-page, auto and manual zonal OCR recognition * Able to read QR Code, barcode data Compatible with * .NET Standard 2.0 * .NET 8, .NET 7, .NET 6, .NET 5, .NET Core 3.x & 2.x * .NET Framework 4.x * Windows, MacOS, Linux, Docker, Azure Online Documents * C# How to Guide : http://www.rasteredge.com/how-to/csharp-imaging/ocr-sdk/ * Email : support@rasteredge.com
$ dotnet add package XImage.OCROnce installed the package, you can use the following C# code to scan, extract text content from images in C# application
// Set the training data path. Please put eng.traineddata (for English) under the directory you specified.
OCRHandler.SetTrainResourcePath(@"c:\source\");
//Load an image.
Bitmap img = new Bitmap(@"C:\page.jpeg");
// Recognize characters from this image. Default language is English.
OCRPage ocrPage = OCRHandler.Import(img);
ocrPage.Recognize();
Console.WriteLine(ocrPage.GetText());