4 packages tagged with “extracting”
The Crawler-Lib Engine is a general purpose workflow enabled task processor. It has evolved from a web crawler over data mining and information retrieval. It is throughput optimized and can perform thousands of tasks per second on standard hardware. Due to its workflow capabilities it allows to structure and parallelize even complex kind of work. Please visit the project page for the complete view of the Crawler-Lib Engine. A license for the Anonymous Edition is included in the package. A license for the more powerful free Community Edition can be generated on the project page. A unrestricted license is available too.
Library allows you to extract different data from text or web sources, analyze text content, extract different information and write your own data mining & extracting applications and services.
A .NET library for extracting millions of words from any language that exists in the universe, with just one line of code.
C# library for scraping text content from websites.