Getting started with randomNames

Damian W Betebenner

March 7th, 2019

Introduction

The randomNames package contains a single function allowing users to generate proportionally correct, gender and ethnicity specific first and last names. The package contains an embedded data set of names that is based upon a large scale data set of names from the United States where the ethnicity and gender of the person was identified. The embedded data set includes information about ethnicity, gender (first name), and the probability of the name occurring in the data set. Random names are often useful in situations where one needs to share personal information but needs to obscure the identity of the person to whom the data belongs.

Use

After installing the package from either CRAN or GitHub, it’s simple to generate random names using the following simple instructions.

By default, the randomNames function supplies a single random last and first name separated by a comma:

> randomNames()
[1] "Contino, Preston"

To generate more random names, just supply a positive integer to the randomNames function:

> randomNames(5)
[1] "Das, Laneece"      "Sanchez, Brittany" "el-Shahidi, Ulyaa" "Brown, Marissa"    "Wartman, Cherokee"

Additional output control

The randomNames function accepts several arguments including n, gender, ethnicity, which.names, name.order, and name.sep.

randomNames(n,
            gender,
            ethnicity,
            which.names="both",
            name.order="last.first",
            name.sep=", ",
            sample.with.replacement=TRUE,
            return.complete.data=FALSE)

For complete documentation on values accepted for arguments, see the function documentation

Examples

The first argument, n, controls the number of names returned by the randomNames function:

The second argument, gender, controls the gender (0=male, 1=female) of the first names returned. This argument can be a vector up to the same length as the number of names requested.

The third argument, ethnicity, controls the ethnicity of the names returned. The following integer codes/ethnicities are accepted:

  1. American Indian or Native Alaskan
  2. Asian or Pacific Islander
  3. Black (not Hispanic)
  4. Hispanic
  5. White (not Hispanic)
  6. Middle-Eastern, Arabic

The fourth argument, which.names controls which names are returned. The argument accepts the values: both (the default), last, first, or complete.data

The fifth argument, name.order, controls the order in which the names are returned. The argument is only relevant if which.names=both. The argument accepts the values: last.first (the default) and first.last.

The sixth argument, name.sep, is a character string that controls the separator used when both names are returned. The default separator is ,.

Contributions & Requests

If you have contribution or a feature request for the randomNames package, don’t hesitate to write or set up an issue on GitHub.