Efforts are made to make Chinese text mining easier, faster, and robust to errors. Document term matrix can be generated by only one line of code; detecting encoding, segmenting and removing stop words are done automatically. Some convenient tools are also supplied.
Version: | 0.2.2 |
Depends: | R (≥ 3.6.0) |
Imports: | jiebaR, NLP, tm (≥ 0.7), stringi, slam (≥ 0.1-37), Matrix, purrr |
Published: | 2020-05-09 |
Author: | Jiang Wu [aut, cre] (from Capital Normal University) |
Maintainer: | Jiang Wu <textidea at sina.com> |
License: | GPL-3 |
URL: | https://github.com/githubwwwjjj/chinese.misc/blob/master/README.md |
NeedsCompilation: | no |
CRAN checks: | chinese.misc results |
Reference manual: | chinese.misc.pdf |
Package source: | chinese.misc_0.2.2.tar.gz |
Windows binaries: | r-devel: chinese.misc_0.2.2.zip, r-release: chinese.misc_0.2.2.zip, r-oldrel: chinese.misc_0.2.2.zip |
macOS binaries: | r-release: chinese.misc_0.2.2.tgz, r-oldrel: chinese.misc_0.2.2.tgz |
Old sources: | chinese.misc archive |
Please use the canonical form https://CRAN.R-project.org/package=chinese.misc to link to this page.