KeepAutomation.Ocr is a C# OCR (Optical Character Recognition) library to scan, read Chinese Simplified text content from TIFF, PDF, raster image files in your .NET projects. This Tesseract language package includes the following training data: * Chinese Simplified language This OCR nuget package includes : * Chinese Simplified language supported * Tesseract OCR engine to read, export image, multi-page TIFF, PDFs to editable text message * Allow characters recognition and extraction from images captured by digital camera, scanned PDF document and image-only PDF * Support multiple languages with special trained data, including English, German, Chinese, French, Spanish, Russian, Arabic, Korean, Japanese etc * Able to read, recognize QR Code, Data Matrix, Code 128, UPC/EAN and other 20+ barcode data message Compatible with * .NET Standard 2.0 * .NET 9, 8, 7, 6, 5, .NET Core 3.x & 2.x * .NET Framework 4.6.1 Information * Email : support@keepautomation.com
License
—
Deps
24
Install Size
—
Vulns
✓ 0
Published
Feb 26, 2026
$ dotnet add package KeepAutomation.Ocr.Languages.Chinese.SimplifiedKeepAutomation.Ocr is a C# OCR library to scan, read Chinese Simplified text content from TIFF, PDF, raster image files in your .NET projects.
This Tesseract language package includes the following training data:
High quality and easy to use library.