报错：http.client.IncompleteRead: IncompleteRead(180224 bytes read, 39396 more exp

缺乏、安全感 2022-05-09 02:14 276阅读 0赞

在我爬取某网站时出现了该错误，但是只要重新运行一下程序还是请求成功。

我搜索了一下百度，没有发现类似的答案，不过在stackoverflow有类似的疑问。等会后面有链接。

可能出现这个问题的原因：这里执行urllib的read（）函数时候，它会捕获任何不完整的读取异常。因此出现了报错。

我们可以不让它捕获异常，因此当读取链接的时候我们可以用try / catch来抛出异常。

我之前的一段代码，不完整。

request = urllib.request.Request('https://****.org/'+ url)
        res = urllib.request.urlopen(request)
        buffer = res.read()             #这里
        if os.path.isfile(r'E:\pythonxm\zhengze\%s.pdf'%(filename)):
            print("已存在%s文件" %(filename))
            i = i+1
            continue
        fo =open(filename+".pdf","wb+")
        fo.write(buffer)
        fo.close()
        i = i+1

改过之后：

try:                                           #添加了try语句
            buffer = res.read()
        except http_client.IncompleteRead as e:
            buffer = e.partial
        if os.path.isfile(r'E:\pythonxm\zhengze\%s.pdf'%(filename)):
            print("已存在%s文件" %(filename))
            i = i+1
            continue
        fo =open(filename+".pdf","wb+")
        fo.write(buffer)
        fo.close()
        i = i+1

我这里用的是python3，python2中的urllib2不能再使用，由urllib.request代替。

参考网页链接：https://stackoverflow.com/questions/14442222/how-to-handle-incompleteread-in-python

_______________________分割线——————————————————————————————

别人的类似文章，不知道有没有用，先记下链接，仅供参考：

此篇重在分析出错原因：https://blog.csdn.net/woshiaotian/article/details/40297239

这篇给出了其他的解决方案：https://blog.csdn.net/haoli001/article/details/40863433

—————————————————————————再次更新—————————

IncompleteRead打出这个关键字可以发现很多，，，，，，尴尬，，，，尴尬

发表评论取消回复

表情：

评论列表（有 0 条评论，276人围观）

还没有评论，来说两句吧...

相关阅读

相关【异常】报错：TypeError: Cannot read properties of undefined (reading ‘init‘)

一、报错内容在vue当中引用echarts，控制台报错： “TypeError: Cannot read properties of undefined (readin

曾经终败给现在/ 2024年02月17日 11:35/ 0 赞/ 180 阅读

相关解决Oracle exp数据导出时编码报错

一、问题：这个是因为导出端的数据库编码和导入端的数据库编码不一致，我的导出端a是ZHS16GBKUNIX这种编码格式，而导入端b的是AL32UTF8这种编码格式，从而如果从a导

Bertha 。/ 2023年10月06日 08:13/ 0 赞/ 67 阅读

相关 elasticsearch报错index read-only

背景线上服务器的Elasticsearch服务大量报错，查询数据没问题，但是新增或者修改数据时，返回如下错误： { "error": {

冷不防/ 2023年02月13日 11:28/ 0 赞/ 132 阅读

相关 mysql 报错：Can not read response from server. Expected to read 4 bytes, read 0 bytes be

记录最近开发遇到一个这样的错误，Can not read response from server. Expected to read 4 bytes, read 0 byte

绝地灬酷狼/ 2023年01月22日 04:54/ 0 赞/ 93 阅读

相关 Java Can not read response from server.Expected to read bytes,read bytes before connection问题解决

问题描述： Cause: java.sql.SQLException: Can not read response from server. Expected to re

港控/mmm°/ 2022年09月11日 14:25/ 0 赞/ 298 阅读

相关 InputStream中read()与read(byte[] b)

这两个方法在抽象类InputStream中都是作为抽象方法存在的， JDK API中是这样描述两者的： read() : 从输入流中读取数据的下一个字

古城微笑少年丶/ 2022年06月06日 09:23/ 0 赞/ 257 阅读

相关 MongoDB报错,Sort operation used more than the maximum 33554432 bytes of RAM.Add an index

最近项目中用到了mongodb,文档型数据库,nosql,还是第一次接触到. 最大的不习惯就在于,所有的增删改查全部走的是一套API,函数调用就出来了,不用写sql语句查询

清疚/ 2022年05月29日 03:20/ 0 赞/ 307 阅读

相关报错：http.client.IncompleteRead: IncompleteRead(180224 bytes read, 39396 more exp

在我爬取某网站时出现了该错误，但是只要重新运行一下程序还是请求成功。我搜索了一下百度，没有发现类似的答案，不过在stackoverflow有类似的疑问。等会后面有链接。

缺乏、安全感/ 2022年05月09日 02:14/ 0 赞/ 277 阅读

相关 thrift.transport.TTransport.TTransportException: TSocket read 0 bytes报错解决

一、问题描述 htrift版本：2.0.0-cdh6.0.1 hbase版本：1.2.0-cdh5.7.0 使用 thrift client with py

太过爱你忘了你带给我的痛/ 2022年01月29日 01:59/ 0 赞/ 1171 阅读

相关 SQL2012报错：cannot find one or more cpmponents

一、错误情况 ![70][] 二、错误原因小编出现这个错误是在删除VS2015时，误将属于SQL的插件删除了，导致了这种情况的发生，但是，具体是

古城微笑少年丶/ 2021年09月15日 14:06/ 0 赞/ 593 阅读