User-friendly functions for extracting a data table (row for each match, column for each group) from non-tabular text data using regular expressions, and for melting columns that match a regular expression. Patterns are defined using a readable syntax that makes it easy to build complex patterns in terms of simpler, re-usable sub-patterns. Named R arguments are translated to column names in the output; capture groups without names are used internally in order to provide a standard interface to three regular expression C libraries (PCRE, RE2, ICU). Output can also include numeric columns via user-specified type conversion functions. RE2 engine (re2r package) was removed from CRAN in Mar 2020 so must be installed from github.
| Version: | 2020.5.13 |
| Depends: | R (≥ 2.14) |
| Imports: | data.table |
| Suggests: | testthat, re2r, stringi, ggplot2, tidyr (≥ 1.0.0), cdata, reshape2, knitr, R.utils |
| Published: | 2020-05-14 |
| Author: | Toby Dylan Hocking |
| Maintainer: | Toby Dylan Hocking <toby.hocking at r-project.org> |
| BugReports: | https://github.com/tdhock/nc/issues |
| License: | GPL-3 |
| URL: | https://github.com/tdhock/nc |
| NeedsCompilation: | no |
| Materials: | NEWS |
| CRAN checks: | nc results |
| Reference manual: | nc.pdf |
| Vignettes: |
vignette 1: capture first match vignette 2: capture all matches vignette 3: capture melt vignette 4: comparisons with other packages |
| Package source: | nc_2020.5.13.tar.gz |
| Windows binaries: | r-devel: nc_2020.5.13.zip, r-release: nc_2020.5.13.zip, r-oldrel: nc_2020.5.13.zip |
| macOS binaries: | r-release: nc_2020.5.13.tgz, r-oldrel: nc_2020.5.13.tgz |
| Old sources: | nc archive |
Please use the canonical form https://CRAN.R-project.org/package=nc to link to this page.