segmentCN {Rwordseg} | R Documentation |
A function to segment Chinese text into words.
segmentCN(strwords, analyzer = c("default", "hmm", "jiebaR", "fmm", "coreNLP"), nature = FALSE, nosymbol = TRUE, returnType = c("vector", "tm"), ...)
strwords |
A charactor vector of Chinese sentence. |
analyzer |
One of 'default', 'jiebaR', 'hmm', 'fmm' and 'coreNLP'. Default is 'hmm'. |
nature |
Whether to recognise the nature of the words. |
nosymbol |
Whether to keep symbols in the sentence. Default is TRUE, means no symbols kept. |
returnType |
Default is a string vector but we also can choose 'tm'
to output a single string separated by space so that it can be used by |
... |
Other arguments. |
a vector of words (list if input is vecter) which have been segmented.
Jian Li <rweibo@sina.com>
segmentCN("hello world!")