V 0.4.3

V 0.4.2

V 0.4.1

V 0.4.0

WARNING:
- In aggregateByKey, the generated column names have changed.
- In aggregateByKey, the column generated for character columns is different.

V 0.3.9

V 0.3.8

V 0.3.7

- Code quality:
  - Improved code quality using lintr
  - Removed some useless code
  - Met the new covr standard
  - Improved the log of setColAsXXX

V 0.3.6

V 0.3.5

WARNING:
- one_hot_encoder now requires you to run build_encoding first.
- aggregateByKey now requires functions to be passed by character name.

This version makes transformations reproducible (as far as possible) across train and test sets, in preparation for a future pipeline feature.

V 0.3.4

WARNING:
- whichAreIncluded: in the case of a bijection (col1 is a bijection of col2), each column is included in the other; the choice of which one to drop might have changed in this version.

V 0.3.3

WARNING:
- The date3 column in the messy_adult data set has changed in order to illustrate the recognition of dates stored as character even if there are leading and/or trailing white spaces.
- The date4 column in the messy_adult data set has changed in order to illustrate the recognition of dates stored as character even if there are multiple separators.

V 0.3.2

V 0.3.1

V 0.3

WARNING:
- The date1 column in the messy_adult data set has changed in order to illustrate the recognition of dates stored as character even if the leading “0” is missing from the month or day part.

V 0.2

WARNING:
- If you were using diffDates, it is now called generateDateDiffs.
- The date2 column in the messy_adult data set has changed in order to illustrate new timestamp features.
- setColAsFactorOrLogical doesn’t exist anymore: it has been split into setColAsFactor and generateFromCat.
- Considering all those changes, shapeSet and prepareSet don’t give the same result anymore.

V 0.1: release on CRAN, July 2017