UnicodeDecodeError: 'gb2312' codec can't decode bytes in position 723269-723270: ille

Chinese text shows up garbled in the PyCharm console. The target web page is encoded in gb2312.

Many posts online suggest decoding the gb2312 bytes to unicode first and then encoding the result as utf-8.

html = response.read().decode("gb2312").encode("utf-8")

This fails at runtime:
UnicodeDecodeError: 'gb2312' codec can't decode bytes in position 723269-723270: illegal multibyte sequence
Removing

.decode("gb2312").encode("utf-8")

and reading the response directly:

html = response.read()

Running python index.py in a Linux terminal then displays the page correctly.
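The workaround above skips decoding entirely. If the decoded text is still needed, one possible approach is sketched below. It assumes the error comes from characters that the page contains but GB2312 cannot represent, which is a common cause since GBK and GB18030 both extend GB2312. The helper name decode_page and the fallback order are illustrative assumptions, not part of the original script.

```python
def decode_page(raw, declared="gb2312"):
    """Decode page bytes, falling back to GB18030 if the declared codec fails.

    Hypothetical helper: the name and the fallback order are assumptions.
    """
    for encoding in (declared, "gb18030"):
        try:
            return raw.decode(encoding)
        except UnicodeDecodeError:
            continue
    # Last resort: keep going and mark undecodable bytes with U+FFFD.
    return raw.decode("gb18030", errors="replace")


# "龍" is a traditional character that GBK can encode but GB2312 cannot,
# so decoding these bytes as gb2312 raises UnicodeDecodeError.
sample = u"龍".encode("gbk")
text = decode_page(sample)
```

In the script this would be used as html = decode_page(response.read()), followed by .encode("utf-8") only if UTF-8 bytes are actually required.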

Next, I looked into the encoding of the PyCharm IDE console.

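Open the settings with Ctrl+Alt+S, go to Editor, File Encodings, and set Global Encoding to GBK. After that the output displays correctly.
My guess: the console encoding needs to match the encoding used by Windows.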
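To check that guess on a given machine, a small sketch like the following prints the encodings Python sees. The function name console_encodings is made up for illustration, and the values differ between the PyCharm console and a plain terminal, so no expected output is shown.

```python
import locale
import sys


def console_encodings():
    """Collect the encodings that affect how printed text is displayed."""
    return {
        # Encoding Python uses when writing to the console.
        "stdout": sys.stdout.encoding,
        # Encoding preferred by the operating system locale.
        "preferred": locale.getpreferredencoding(False),
        # Encoding used for file names.
        "filesystem": sys.getfilesystemencoding(),
    }


for name, value in sorted(console_encodings().items()):
    print("%s = %s" % (name, value))
```

If stdout reports a different encoding in PyCharm than in the terminal where the page displays correctly, that mismatch is a likely source of the garbled text.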
