⚠ Deprecated: Legacy
This package has been deprecated in favour of the official Stanford CoreNLP package on Maven. Read more at https://github.com/sergey-tihon/Stanford.NLP.NET.
Tokenization of raw text is a standard pre-processing step for many NLP tasks. For English, tokenization usually involves punctuation splitting and separation of some affixes like possessives. Other languages require more extensive token pre-processing, which is usually called segmentation.
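As a rough illustration of the punctuation splitting and possessive separation described above, here is a minimal sketch using plain regular expressions. This is a hypothetical, simplified example of the concept only; it does not use the Stanford segmenter's API and makes no attempt at the robustness of a real tokenizer.

```python
import re

# Hypothetical toy tokenizer (not the Stanford segmenter's API):
# match the possessive clitic 's as its own token, then runs of
# word characters, then any single punctuation character.
TOKEN_RE = re.compile(r"'s\b|\w+|[^\w\s]")

def tokenize(text):
    """Split text into word, possessive, and punctuation tokens."""
    return TOKEN_RE.findall(text)

print(tokenize("The dog's owner said, \"Sit!\""))
# → ['The', 'dog', "'s", 'owner', 'said', ',', '"', 'Sit', '!', '"']
```

Real tokenizers handle many more cases (abbreviations, hyphenation, URLs, contractions such as "don't"), which is why a trained segmenter is preferred for languages where whitespace alone does not delimit tokens.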
$ dotnet add package Stanford.NLP.Segmenter