public interface Tokenizer extends Serializable
Modifier and Type | Method and Description |
---|---|
List<String> |
tokenize(String input)
Breaks the input string into a series of tokens that may be used as
features for a classifier.
|
void |
tokenize(String input,
StringBuilder workSpace,
List<String> storageSpace)
Breaks the input string into a series of tokens that may be used as
features for a classifier.
|
List<String> tokenize(String input)
input
- the string to tokenizevoid tokenize(String input, StringBuilder workSpace, List<String> storageSpace)
input
- the string to tokenizeworkSpace
- an already allocated (but empty) string builder than can
be used as a temporary work space.storageSpace
- an already allocated (but empty) list to place the
tokens intoCopyright © 2017. All rights reserved.