createDict {Rwordseg}    R Documentation

Create a dictionary file from a corpus.

Description

Reads a character vector of corpus text and generates a dictionary data frame of words, their frequencies, and their natures.

Usage

createDict(trainvec, dicfile = NULL, wordsplit = "\\s+",
  natruesplit = "/")

Arguments

trainvec

A character vector containing the corpus text.

dicfile

The path of the output file. Default is NULL.

wordsplit

A character string containing the regular expression used to split the text into words.

natruesplit

A character string containing the regular expression used to separate each word from its nature tag.
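
As a rough illustration of how the two default patterns behave, the base R sketch below splits one tagged line by hand. The sample string, the tags, and the variable names are made up for this example, and this is not necessarily how createDict processes its input internally.

```r
# Hypothetical tagged line: tokens separated by whitespace, each written as word/nature.
line <- "the/r weather/n  is/v nice/a"

# The default wordsplit pattern "\\s+" breaks the line on runs of whitespace.
tokens <- strsplit(line, "\\s+")[[1]]

# The default natruesplit pattern "/" separates each word from its nature.
parts <- strsplit(tokens, "/")
words <- vapply(parts, `[`, character(1), 1)
natures <- vapply(parts, `[`, character(1), 2)

print(words)
print(natures)
```

Because "\\s+" matches one or more whitespace characters, the double space in the sample line does not produce an empty token.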

Value

A data frame with the following columns:

word

Word.

freq

Frequency.

nature

Nature.
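
To make the shape of the result concrete, the sketch below assembles a data frame with the same three column names using base R only. The input vectors are invented, and the counting approach is an assumption for illustration rather than the package's own code; the column types and row order actually returned by createDict may differ.

```r
# Hypothetical word/nature pairs, as they might look after splitting a corpus.
words   <- c("market", "open", "market", "price", "open", "market")
natures <- c("n", "v", "n", "n", "v", "n")

# Count how often each (word, nature) pair occurs.
pairs <- data.frame(word = words, nature = natures, freq = 1L,
                    stringsAsFactors = FALSE)
dict <- aggregate(freq ~ word + nature, data = pairs, FUN = sum)

# Reorder columns to word, freq, nature and sort by descending frequency.
dict <- dict[order(-dict$freq), c("word", "freq", "nature")]
rownames(dict) <- NULL

print(dict)
```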

Author(s)

Jian Li <rweibo@sina.com>

Examples

library(Rwordseg)
data(PD980105)
# Build a dictionary from the first 10 elements of the corpus
d1 <- createDict(PD980105[1:10])
head(d1)


[Package Rwordseg version 0.3-2 Index]