Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

charmap decode error - Whatsapp #41

Open
strod opened this issue Jan 17, 2020 · 4 comments · Fixed by #50
Open

charmap decode error - Whatsapp #41

strod opened this issue Jan 17, 2020 · 4 comments · Fixed by #50
Labels

Comments

@strod
Copy link

strod commented Jan 17, 2020

after run "python parse.py whatsapp

Traceback (most recent call last):
File "parse.py", line 83, in
ArgParse()
File "parse.py", line 41, in init
getattr(self, args.command)()
File "parse.py", line 79, in whatsapp
main(args.own_name, args.file_path, args.max, args.infer_datetime)
File "C:\Users\rodrigo.teixeira\Documents\GitHub\Chatistics-master\parsers\whatsapp.py", line 62, in main
data = parse_messages(files, own_name, infer_datetime)
File "C:\Users\rodrigo.teixeira\Documents\GitHub\Chatistics-master\parsers\whatsapp.py", line 85, in parse_messages
regex_message = infer_datetime_regex(f_path)
File "C:\Users\rodrigo.teixeira\Documents\GitHub\Chatistics-master\parsers\whatsapp.py", line 24, in infer_datetime_regex
for c, line in enumerate(f):
File "C:\ProgramData\Anaconda3\envs\chatistics\lib\encodings\cp1252.py", line 23, in decode
return codecs.charmap_decode(input,self.errors,decoding_table)[0]
UnicodeDecodeError: 'charmap' codec can't decode byte 0x90 in position 4095: character maps to

@strod strod changed the title charmap decode error - hangouts charmap decode error - Whatsapp Jan 17, 2020
@hodanli
Copy link

hodanli commented Jan 17, 2020

i get the similar error.

(chatistics) C:\Users\User\Desktop\Chatistics-master>python parse.py whatsapp --own-name Owner
2020-01-17 17:18:49,410 [INFO ] [parsers.what]: Parsing Whatsapp data...
2020-01-17 17:18:49,410 [INFO ] [parsers.what]: Reading raw_data/whatsapp\whatsapp.txt
Traceback (most recent call last):
  File "parse.py", line 83, in <module>
    ArgParse()
  File "parse.py", line 41, in __init__
    getattr(self, args.command)()
  File "parse.py", line 79, in whatsapp
    main(args.own_name, args.file_path, args.max, args.infer_datetime)
  File "C:\Users\User\Desktop\Chatistics-master\parsers\whatsapp.py", line 62, in main
    data = parse_messages(files, own_name, infer_datetime)
  File "C:\Users\User\Desktop\Chatistics-master\parsers\whatsapp.py", line 85, in parse_messages
    regex_message = infer_datetime_regex(f_path)
  File "C:\Users\User\Desktop\Chatistics-master\parsers\whatsapp.py", line 24, in infer_datetime_regex
    for c, line in enumerate(f):
  File "C:\Users\User\Anaconda3\envs\chatistics\lib\encodings\cp1252.py", line 23, in decode
    return codecs.charmap_decode(input,self.errors,decoding_table)[0]
UnicodeDecodeError: 'charmap' codec can't decode byte 0x8d in position 2485: character maps to <undefined>

@mar-muel
Copy link
Collaborator

Is it possible that you are using Python 2? Python 3 should open text files in UTF-8 encoding by default.

@omennemo
Copy link

I was getting the same error. I saved the .txt file using notepad as Unicode instead of UTF-8.
The error message goes away but I think nothing is getting parsed.

@MasterScrat
Copy link
Owner

@strod @hodanli @omennemo can you check if #50 solved the issue for you?

@MasterScrat MasterScrat reopened this Jan 22, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

5 participants