Extra 'Recipes' for Text Processing


[Up] [Top]

Documentation for package ‘textrecipes’ version 0.3.0

Help Pages

rcpp_ngram ngram generator
step_lda Calculates lda dimension estimates
step_lemma Lemmatization of tokenlist variables
step_ngram Generate ngrams from tokenlist
step_pos_filter Part of speech filtering of tokenlist variables
step_sequence_onehot Generate the basic set of text features
step_stem Stemming of tokenlist variables
step_stopwords Filtering of stopwords from a tokenlist variable
step_textfeature Generate the basic set of text features
step_texthash Term frequency of tokens
step_text_normalization text_normalizationming of tokenlist variables
step_tf Term frequency of tokens
step_tfidf Term frequency-inverse document frequency of tokens
step_tokenfilter Filter the tokens based on term frequency
step_tokenize Tokenization of character variables
step_tokenmerge Generate the basic set of text features
step_untokenize Untokenization of tokenlist variables
step_word_embeddings Pretrained word embeddings of tokens
tidy.step_lda Calculates lda dimension estimates
tidy.step_lemma Lemmatization of tokenlist variables
tidy.step_ngram Generate ngrams from tokenlist
tidy.step_pos_filter Part of speech filtering of tokenlist variables
tidy.step_sequence_onehot Generate the basic set of text features
tidy.step_stem Stemming of tokenlist variables
tidy.step_stopwords Filtering of stopwords from a tokenlist variable
tidy.step_textfeature Generate the basic set of text features
tidy.step_texthash Term frequency of tokens
tidy.step_text_normalization text_normalizationming of tokenlist variables
tidy.step_tf Term frequency of tokens
tidy.step_tfidf Term frequency-inverse document frequency of tokens
tidy.step_tokenfilter Filter the tokens based on term frequency
tidy.step_tokenize Tokenization of character variables
tidy.step_tokenmerge Generate the basic set of text features
tidy.step_untokenize Untokenization of tokenlist variables
tidy.step_word_embeddings Pretrained word embeddings of tokens
tokenlist Create tokenlist object