Create a tidy data.frame of unigrams.
tidify_to_bigrams_at(data = NULL, text = NULL, rgx_pattern, rgx_replacement, stopwords = TRUE, stopwords_lexicon, rgx_ignore_custom) tidify_to_bigrams(..., text)
| data | data.frame. |
|---|---|
| text | bare for NSE; character for SE. Name of column in |
| rgx_pattern | character. Regular expression to substitute. |
| rgx_replacement | character. Regular expression used as replacement. |
| stopwords | logical. Whether or not to remove stopwords. Default: TRUE |
| stopwords_lexicon | character. If not specified, then uses all stop words
available in |
| rgx_ignore_custom | character. Custom regular expression to remove. |
| ... | dots. Parameters to pass directly to |
data.frame.
Heavily influenced by Text Mining with R. Creates output columns 'word', 'word1', and 'word2'. ('word' is simply 'word1' and 'word2' toegether.)