Load WARC (Web ARChive) files into Apache Spark using 'sparklyr'. This makes it possible to read files from the Common Crawl project <http://commoncrawl.org/>.
Version: 0.1.1
Imports: sparklyr, DBI
Published: 2017-01-13
Author: Javier Luraschi [aut, cre]
Maintainer: Javier Luraschi <javier at rstudio.com>
BugReports: https://github.com/javierluraschi/sparkwarc
License: Apache License 2.0
NeedsCompilation: no
Materials: README
CRAN checks: sparkwarc results
Reference manual: sparkwarc.pdf
Package source: sparkwarc_0.1.1.tar.gz
Windows binaries: r-devel: sparkwarc_0.1.1.zip, r-release: sparkwarc_0.1.1.zip, r-oldrel: sparkwarc_0.1.1.zip
macOS binaries: r-release: sparkwarc_0.1.1.tgz, r-oldrel: sparkwarc_0.1.1.tgz
Old sources: sparkwarc archive
Please use the canonical form https://CRAN.R-project.org/package=sparkwarc to link to this page.