character maps to <undefined> #5
Comments
What exact commands did you run to get this? It looks like a file you are reading may not be in UTF-8, so the error could come from how that input file was saved.
twarc2 timeline-archive Users.txt /UserTweets
Users.txt is in UTF-8.
Can you attach the |
Here it is. I have tried different files, with different users, and I keep encountering the same issue.
Alright, I think I figured it out: the issue is that the file has a BOM (Byte Order Mark), which breaks the input for twarc. The file is UTF-8, but it is UTF-8 with a BOM character at the start. This usually happens on Windows when you save output from the command line or with other tools; how you get rid of the BOM depends on how you created the file. Ideally, twarc should check for this and either strip the BOM or show a better warning. The fix is to re-save the file as UTF-8 without a BOM, like this one:
Yes, that solved the problem. Thanks.
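For anyone who cannot easily re-save the file from an editor, the fix described above (re-saving as UTF-8 without a BOM) can be sketched in a few lines of Python. This is only an illustration, not part of twarc: the helper names `has_utf8_bom` and `strip_utf8_bom` and the output file name are made up for this example, and it assumes the input is otherwise valid UTF-8.

```python
import codecs
from pathlib import Path


def has_utf8_bom(path):
    """Return True if the file starts with the 3-byte UTF-8 BOM (EF BB BF)."""
    with open(path, "rb") as f:
        return f.read(len(codecs.BOM_UTF8)) == codecs.BOM_UTF8


def strip_utf8_bom(src, dst):
    """Copy src to dst, dropping a leading UTF-8 BOM if one is present.

    Works on raw bytes, so line endings and all other content are left
    untouched. Hypothetical helper for illustration only.
    """
    data = Path(src).read_bytes()
    if data.startswith(codecs.BOM_UTF8):
        data = data[len(codecs.BOM_UTF8):]
    Path(dst).write_bytes(data)
```

Calling `strip_utf8_bom("Users.txt", "Users_nobom.txt")` and then passing the new file to the same command should avoid the BOM problem; writing to a separate file keeps the original intact in case something else is wrong with it.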
Hi,
I am trying to use the plugin in the command line, and I'm not sure what the fix is for this error: