Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Using the OpenNLPTokenizer class

OpenNLP possesses a Tokenizer interface that is implemented by three classes: SimpleTokenizer, TokenizerME, and WhitespaceTokenizer. This interface supports two methods:

tokenize: This is passed a string to tokenize and returns an array of
tokens as strings.
tokenizePos: This is passed a string and returns an array of Span
objects. The Span class is used to specify the beginning and ending
offsets of the tokens.

Each of these classes is demonstrated in the following sections.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Using the OpenNLPTokenizer class

Create new playlist

Sign In

Sign Up

Table of Contents for
Using the OpenNLPTokenizer class