NEWS | R Documentation |
This package is an effort to provide a simple way to deal with out of memory data sets within R. The tasks hdd does is to import and perfom simple manipulations: like subsetting, creating new variables, etc.
As in other data base management systems (DBMS), the data is split into several chunks that fit in memory, and operations are performed "chunk-wise". These operations are hidden to the user.
Why creating this package? First, it avoids having to switch to alternative software like DBMS to deal with out of memory data sets (you can also use DBMS in R but it is cumbersome). Second, it allows the user to use the full power of R to create complex new variables. Finally, and maybe most importantly, the syntax is very simple and requires little effort from the user.