UnicodeDecodeError: 'gbk' codec can't decode byte 0xa8 in position 0: incomplete multibyte sequence

柔光的暖阳◎ 2022-12-11 02:30

Problem

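When opening a .txt file with Python's open(), the following error is raised:

UnicodeDecodeError: 'gbk' codec can't decode byte 0xa8 in position 0: incomplete multibyte sequence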

The code is as follows:

    # encoding=utf-8
    import jieba
    import jieba.posseg as pseg

    # Both files are opened as GBK; reading raises UnicodeDecodeError
    # when the bytes in 1.txt are not valid GBK.
    p = open(r'1.txt', 'r', encoding='gbk')
    q = open(r'2.txt', 'w', encoding='gbk')
    for line in p.readlines():
        words = pseg.cut(line)
        for word, flag in words:
            q.write(str(word) + str(flag) + " ")
        q.write('\n')
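The failure can be reproduced without any files. The sketch below is only an illustration: the byte string `raw` and the helper `try_decode` are made up for this example and are not taken from the original 1.txt. It decodes a single byte 0xa8 as GBK; in GBK that byte can only start a two-byte character, so with nothing after it the decoder has an incomplete sequence and gives up.

```python
# Illustrative input: one GBK lead byte with no trail byte after it.
# This is an assumption for demonstration, not the content of 1.txt.
raw = b"\xa8"


def try_decode(data, encoding):
    """Return (text, None) on success or (None, error) on failure."""
    try:
        return data.decode(encoding), None
    except UnicodeDecodeError as exc:
        return None, exc


text, err = try_decode(raw, "gbk")

# err carries the codec name and the offset of the offending byte,
# the same details that appear in the traceback message.
print(type(err).__name__, err.encoding, err.start, err.reason)
```

The same kind of failure can occur when a file saved in another encoding, such as UTF-8, is read with encoding='gbk'.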

The fix is to change the encoding from 'gbk' to 'utf-8' in both open() calls:

    # encoding=utf-8
    import jieba
    import jieba.posseg as pseg

    # Open both files as UTF-8; the with block closes them automatically.
    with open(r'1.txt', 'r', encoding='utf-8') as p, \
         open(r'2.txt', 'w', encoding='utf-8') as q:
        for line in p:
            # pseg.cut yields (word, part-of-speech flag) pairs
            words = pseg.cut(line)
            for word, flag in words:
                q.write(str(word) + str(flag) + " ")
            q.write('\n')
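When the encoding of an input file is not known in advance, one option is to try a short list of candidates in order. The sketch below is a hedged illustration, not part of the original fix: the helper name `read_text_with_fallback`, the candidate order, and the temporary sample files are all assumptions. UTF-8 is tried first because its decoder is strict and rejects most non-UTF-8 input, whereas GBK can sometimes decode foreign bytes into wrong characters without raising.

```python
import os
import tempfile


def read_text_with_fallback(path, encodings=("utf-8", "gbk")):
    """Hypothetical helper: return (text, encoding) for the first
    candidate encoding that decodes the whole file without error."""
    for enc in encodings:
        try:
            with open(path, "r", encoding=enc) as f:
                return f.read(), enc
        except UnicodeDecodeError:
            continue  # try the next candidate
    raise ValueError(f"none of {encodings} could decode {path!r}")


# Usage: write one sample file per encoding, then read both back.
with tempfile.TemporaryDirectory() as tmp:
    gbk_path = os.path.join(tmp, "gbk_sample.txt")
    utf8_path = os.path.join(tmp, "utf8_sample.txt")
    with open(gbk_path, "w", encoding="gbk") as f:
        f.write("中文")
    with open(utf8_path, "w", encoding="utf-8") as f:
        f.write("中文")

    gbk_text, gbk_enc = read_text_with_fallback(gbk_path)
    utf8_text, utf8_enc = read_text_with_fallback(utf8_path)

print(gbk_enc, utf8_enc)
```

A successful decode is not proof that the guess was right, so it is still worth checking the text by eye when the source of the file is uncertain.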
