You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I find textstat_summary() very useful to compare the textual features between two or more groups.
However, given that the number of puncts, URLs, numbers, symbols, tags, and emojis can be explained by the number of characters/tokens/types, I usually end up writing a custom function to normalize based on length (that is, the longer texts would anyway have more URLs, for example, so normalization is required to compare between texts).
I usually end up writing a function for this, and think it'd be super useful to have this as a feature in textstat_summary.
The text was updated successfully, but these errors were encountered:
I find textstat_summary() very useful to compare the textual features between two or more groups.
However, given that the number of puncts, URLs, numbers, symbols, tags, and emojis can be explained by the number of characters/tokens/types, I usually end up writing a custom function to normalize based on length (that is, the longer texts would anyway have more URLs, for example, so normalization is required to compare between texts).
I usually end up writing a function for this, and think it'd be super useful to have this as a feature in textstat_summary.
The text was updated successfully, but these errors were encountered: