array(string) tokenize(string in)
Tokenizes the input string and returns the tokens as an array of strings. Note: call normalize on the string first, then pass the result to tokenize.
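
The call order in the note can be sketched as follows. This is a minimal illustration in Python, not the library's implementation: the bodies of `normalize` and `tokenize` here are hypothetical stand-ins (NFKC folding plus lowercasing, and splitting on runs of word characters), since the source does not show what either function does internally. The helper `tokens_for` is also an assumed name.

```python
import re
import unicodedata


def normalize(text: str) -> str:
    """Hypothetical stand-in: Unicode NFKC normalization plus lowercasing."""
    return unicodedata.normalize("NFKC", text).lower()


def tokenize(text: str) -> list[str]:
    """Hypothetical stand-in: return each run of word characters as a token."""
    return re.findall(r"\w+", text)


def tokens_for(raw: str) -> list[str]:
    """Assumed helper: normalize first, then tokenize the normalized string."""
    return tokenize(normalize(raw))


if __name__ == "__main__":
    print(tokens_for("Hello, World!"))
```

Wrapping the two calls in one helper is one way to make sure tokenize never receives an unnormalized string.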