This is a set of simple utilities for various data manipulation and testing tasks. The goal is to use core R tools well, without bringing in many dependencies. Main areas of interest are semi-automated data frame manipulation, such as converting factors in multiple binary indicator columns. There are testing functions which provide ‘testthat’ expectations to permute arguments to function calls. There are functions and data to test extreme numbers, dates, and bad input of various kinds which should allow testing failure and corner cases, which can be used for fuzzing your functions. The test suite has many examples of usage.
jwutil
has some minimal features to allow fuzz and stress testing functions. This is a complicated topic, but at a minimum, the following functions allow you to to give enexpected input to a function, with the expectation it should still work. The packge does more, but does different things.
Maximum magnitude positive and negative doubles and integers
## [1] 2.147484e+09 -2.147484e+09 1.000000e+00 -1.000000e+00
## [5] 2.225074e-308 1.797693e+308 -2.225074e-308 -1.797693e+308
There are several types of zero in R (excluding string representations):
## [[1]]
## [1] 0
##
## [[2]]
## [1] 0
##
## [[3]]
## [1] 0+0i
## [1] "integer" "numeric" "complex"
If the order of arguments in a function should not matter, the result should be the same each time. expect_that_comabine_all_args
tests this.
is equivalent to
This becomes more interesting with named arguments, where the order also should not matter. Beware of testing features of R, rather than your code: R has a complicated but standard way of matching arguments, described in Writing R Extensions. If your function does complicated argument manipulation, and maybe uses ...
, this function could verify that named argument order doesn’t matter.
res <- gsub(pattern = "the", replacement = "le", x = "Pierre the chat")
expect_that_combine_all_args(
gsub(pattern = "the", replacement = "le", x = "Pierre the chat"),
testthat::equals(res)
)
For functions where the first argument is a vector, and the order of elements in the vector should not matter, expect_that_combine_first_arg
will permute all combinations of the first argument, and expect the same result. A large number of elements will take a very long time, factorial with the number of items.
lsf
lists all the functions in a package, which may be useful to start an attack on all functions with a range of absurd data, looking perhaps for crashes in C/C++ code rather than R errors.
## [1] "%ine%" "%nin%" "add_time_to_date"
## [4] "affixFields" "areIntegers" "areNumeric"
## [7] "as_char_no_warn" "as_integer_nowarn" "as_numeric_nowarn"
## [10] "binary_col_names"