Create a tidy data.frame of unigrams.
tidify_to_unigrams_at(data = NULL, text = "text", rgx_pattern, rgx_replacement, rgx_unnest, stopwords = TRUE, stopwords_lexicon, rgx_ignore_custom) tidify_to_unigrams(..., text)
data | data.frame. Not in 'tidy' format. |
---|---|
text | bare for NSE; character for SE. Name of column in |
rgx_pattern | character. Regular expression to substitute. |
rgx_replacement | character. Regular expression used as replacement. |
rgx_unnest | character. Regular expression to use in |
stopwords | logical. Whether or not to remove stopwords. Default: TRUE |
stopwords_lexicon | character. If not specified, then uses all stop words
available in |
rgx_ignore_custom | character. Custom regular expression to remove. |
... | dots. Parameters to pass directly to |
data.frame.
Heavily influenced by Text Mining with R. Creates output column 'word'.
Text Mining with R. https://www.tidytextmining.com/ngrams.html. https://www.tidytextmining.com/twitter.html.