Return a list of regular expression to used to 'tidify' a
rtweet
data.frame (i.e. with the tidify_to_
functions).
get_tweet_rgx_tidiers(rgx_unnest = "([^A-Za-z_\\d#@']|'(?![A-Za-z_\\d#@]))", rgx_pattern = "https://t.co/[A-Za-z\\d]+|http://[A-Za-z\\d]+|&|<|>|RT|https", rgx_ignore_custom = "^[0-9f][0-9a-f]+$")
rgx_unnest | character. Regular expression. |
---|---|
rgx_pattern | character. Regular expression. |
rgx_ignore_custom | character. Regular expression. |
list
This function is primarily a 'convenience' function for saving
regular expressions that are good to use with a rtweet
data.frame.