Create a tidy data.frame of bigrams.
tidify_to_bigrams_at(data = NULL, text = NULL, rgx_pattern, rgx_replacement, stopwords = TRUE, stopwords_lexicon, rgx_ignore_custom)
tidify_to_bigrams(..., text)
data | data.frame. |
---|---|
text | bare for NSE; character for SE. Name of column in |
rgx_pattern | character. Regular expression to substitute. |
rgx_replacement | character. Replacement for text matched by rgx_pattern. |
stopwords | logical. Whether or not to remove stopwords. Default: TRUE |
stopwords_lexicon | character. If not specified, then uses all stop words available in |
rgx_ignore_custom | character. Custom regular expression to remove. |
... | dots. Parameters to pass directly to |
data.frame.
Heavily influenced by Text Mining with R. Creates output columns 'word', 'word1', and 'word2'. ('word' is simply 'word1' and 'word2' pasted together.)
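The shape of the output columns can be illustrated with a minimal base R sketch. This is not the package's implementation: `sketch_bigrams()` and its `stops` argument are hypothetical names introduced only for illustration, and the rule of dropping a bigram when either of its words is a stop word is an assumption borrowed from the bigram chapter of Text Mining with R.

```r
# Hypothetical helper, NOT part of this package: builds bigram rows in base R.
sketch_bigrams <- function(text, stops = character(0)) {
  # Lowercase, then split on runs of characters that are not letters,
  # digits, or apostrophes.
  tokens <- strsplit(tolower(text), "[^[:alnum:]']+")[[1]]
  tokens <- tokens[nzchar(tokens)]

  # Fewer than two tokens means there are no bigrams to return.
  if (length(tokens) < 2) {
    return(data.frame(word = character(0), word1 = character(0),
                      word2 = character(0), stringsAsFactors = FALSE))
  }

  # Pair each token with the one that follows it.
  word1 <- tokens[-length(tokens)]
  word2 <- tokens[-1]

  # Assumed rule: drop a bigram if either word is a stop word.
  keep <- !(word1 %in% stops) & !(word2 %in% stops)
  word1 <- word1[keep]
  word2 <- word2[keep]

  # 'word' is 'word1' and 'word2' pasted together with a space.
  data.frame(word = paste(word1, word2), word1 = word1, word2 = word2,
             stringsAsFactors = FALSE)
}

bigrams <- sketch_bigrams("The quick brown fox")
filtered <- sketch_bigrams("The quick brown fox", stops = c("the"))
print(bigrams)
```

Keeping 'word1' and 'word2' as separate columns makes it straightforward to filter or count on either half of the bigram, while 'word' stays convenient for display.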