| okc_text | OkCupid Text Data |
| step_lda | Calculates lda dimension estimates |
| step_sequence_onehot | Generate the basic set of text features |
| step_stem | Stemming of list-column variables |
| step_stopwords | Filtering of stopwords from a list-column variable |
| step_textfeature | Generate the basic set of text features |
| step_texthash | Term frequency of tokens |
| step_tf | Term frequency of tokens |
| step_tfidf | Term frequency-inverse document frequency of tokens |
| step_tokenfilter | Filter the tokens based on term frequency |
| step_tokenize | Tokenization of character variables |
| step_tokenmerge | Generate the basic set of text features |
| step_untokenize | Untokenization of list-column variables |
| step_word_embeddings | Pretrained word embeddings of tokens |
| tidy.step_lda | Calculates lda dimension estimates |
| tidy.step_sequence_onehot | Generate the basic set of text features |
| tidy.step_stem | Stemming of list-column variables |
| tidy.step_stopwords | Filtering of stopwords from a list-column variable |
| tidy.step_textfeature | Generate the basic set of text features |
| tidy.step_texthash | Term frequency of tokens |
| tidy.step_tf | Term frequency of tokens |
| tidy.step_tfidf | Term frequency-inverse document frequency of tokens |
| tidy.step_tokenfilter | Filter the tokens based on term frequency |
| tidy.step_tokenize | Tokenization of character variables |
| tidy.step_tokenmerge | Generate the basic set of text features |
| tidy.step_untokenize | Untokenization of list-column variables |
| tidy.step_word_embeddings | Pretrained word embeddings of tokens |