okc_text                OkCupid Text Data
step_lda                Calculates lda dimension estimates
step_sequence_onehot    Generate the basic set of text features
step_stem               Stemming of list-column variables
step_stopwords          Filtering of stopwords from a list-column
                        variable
step_textfeature        Generate the basic set of text features
step_texthash           Term frequency of tokens
step_tf                 Term frequency of tokens
step_tfidf              Term frequency-inverse document frequency of
                        tokens
step_tokenfilter        Filter the tokens based on term frequency
step_tokenize           Tokenization of character variables
step_tokenmerge         Generate the basic set of text features
step_untokenize         Untokenization of list-column variables
step_word_embeddings    Pretrained word embeddings of tokens
