Problem with GoogleImageCrawler #117

Open
drchristophreuter opened this issue Oct 24, 2023 · 4 comments
@drchristophreuter

When searching for images using GoogleImageCrawler, I always get an error message even though I have limited the search to 100 images. Does anyone have a solution for this problem?

Error message:
2023-10-24 15:36:37,554 - INFO - icrawler.crawler - start crawling...
2023-10-24 15:36:37,557 - INFO - icrawler.crawler - starting 1 feeder threads...
2023-10-24 15:36:37,561 - INFO - feeder - thread feeder-001 exit
2023-10-24 15:36:37,561 - INFO - icrawler.crawler - starting 1 parser threads...
2023-10-24 15:36:37,569 - INFO - icrawler.crawler - starting 4 downloader threads...
2023-10-24 15:36:38,158 - INFO - parser - parsing result page https://www.google.com/search?q=cat&ijn=0&start=0&tbs=isz%3Al%2Cic%3Aspecific%2Cisc%3Aorange%2Csur%3Afmc%2Ccdr%3A1%2Ccd_min%3A01%2F01%2F2017%2Ccd_max%3A11%2F30%2F2017&tbm=isch
Exception in thread parser-001:
Traceback (most recent call last):
File "/home/reuter/anaconda3/lib/python3.11/threading.py", line 1038, in _bootstrap_inner
self.run()
File "/home/reuter/anaconda3/lib/python3.11/threading.py", line 975, in run
self._target(*self._args, **self._kwargs)
File "/home/reuter/anaconda3/lib/python3.11/site-packages/icrawler/parser.py", line 94, in worker_exec
for task in self.parse(response, **kwargs):
TypeError: 'NoneType' object is not iterable
2023-10-24 15:36:42,574 - INFO - downloader - no more download task for thread downloader-001
2023-10-24 15:36:42,575 - INFO - downloader - no more download task for thread downloader-004
2023-10-24 15:36:42,577 - INFO - downloader - thread downloader-004 exit
2023-10-24 15:36:42,576 - INFO - downloader - thread downloader-001 exit
2023-10-24 15:36:42,576 - INFO - downloader - no more download task for thread downloader-003
2023-10-24 15:36:42,582 - INFO - downloader - thread downloader-003 exit
2023-10-24 15:36:42,575 - INFO - downloader - no more download task for thread downloader-002
2023-10-24 15:36:42,584 - INFO - downloader - thread downloader-002 exit
2023-10-24 15:36:43,576 - INFO - icrawler.crawler - Crawling task done!

runfile('/home/reuter/untitled0.py', wdir='/home/reuter')
2023-10-24 15:37:25,544 - INFO - icrawler.crawler - start crawling...
2023-10-24 15:37:25,545 - INFO - icrawler.crawler - starting 1 feeder threads...
2023-10-24 15:37:25,546 - INFO - feeder - thread feeder-001 exit
2023-10-24 15:37:25,546 - INFO - icrawler.crawler - starting 1 parser threads...
2023-10-24 15:37:25,554 - INFO - icrawler.crawler - starting 4 downloader threads...
2023-10-24 15:37:26,032 - INFO - parser - parsing result page https://www.google.com/search?q=cat&ijn=0&start=0&tbs=isz%3Al%2Cic%3Aspecific%2Cisc%3Aorange%2Csur%3Afmc%2Ccdr%3A1%2Ccd_min%3A01%2F01%2F2022%2Ccd_max%3A11%2F30%2F2022&tbm=isch
Exception in thread parser-001:
Traceback (most recent call last):
File "/home/reuter/anaconda3/lib/python3.11/threading.py", line 1038, in _bootstrap_inner
self.run()
File "/home/reuter/anaconda3/lib/python3.11/threading.py", line 975, in run
self._target(*self._args, **self._kwargs)
File "/home/reuter/anaconda3/lib/python3.11/site-packages/icrawler/parser.py", line 94, in worker_exec
for task in self.parse(response, **kwargs):
TypeError: 'NoneType' object is not iterable
2023-10-24 15:37:30,562 - INFO - downloader - no more download task for thread downloader-001
2023-10-24 15:37:30,563 - INFO - downloader - thread downloader-001 exit
2023-10-24 15:37:30,568 - INFO - downloader - no more download task for thread downloader-002
2023-10-24 15:37:30,571 - INFO - downloader - thread downloader-002 exit
2023-10-24 15:37:30,575 - INFO - downloader - no more download task for thread downloader-003
2023-10-24 15:37:30,576 - INFO - downloader - thread downloader-003 exit
2023-10-24 15:37:30,576 - INFO - downloader - no more download task for thread downloader-004
2023-10-24 15:37:30,579 - INFO - downloader - thread downloader-004 exit
2023-10-24 15:37:31,576 - INFO - icrawler.crawler - Crawling task done!
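
For readers following the traceback: the `TypeError` is raised because the `for` loop in `worker_exec` receives `None` from `self.parse(...)` instead of an iterable. The sketch below reproduces that failure mode in isolation and shows one defensive way to handle it. It is not icrawler's code; `parse_without_matches` and `iter_tasks` are hypothetical names invented for illustration, and the assumption that the parser found nothing to extract from the result page is not confirmed in this thread.

```python
from typing import Iterable, Iterator, Optional


def parse_without_matches(response: str) -> Optional[Iterable[str]]:
    """Hypothetical parser that implicitly returns None when nothing matches."""
    if "img" in response:
        return [response]
    # No return statement on this path, so the caller receives None.


def iter_tasks(parse, response: str) -> Iterator[str]:
    """Iterate over the result of parse(), treating None as 'no tasks'."""
    tasks = parse(response)
    if tasks is None:
        return
    yield from tasks


# Looping directly over the parser result fails the same way as the traceback.
try:
    for task in parse_without_matches("<html></html>"):
        pass
except TypeError as exc:
    print(exc)  # 'NoneType' object is not iterable

# The guarded version simply produces no tasks.
print(list(iter_tasks(parse_without_matches, "<html></html>")))  # []
```

A guard like this only hides the symptom: the downloaders would still receive no tasks, which matches the "no more download task" lines in the log.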

@drchristophreuter changed the title from "Problem with GooglImagCrawler" to "Problem with GoogleImageCrawler" on Oct 24, 2023
@ran5omware

ran5omware commented Oct 26, 2023

I have the same problem for any number of images. How can it be fixed?

@Neptune-Trojans

Same problem here.

@bretdavi

bretdavi commented Dec 7, 2023

There is some more information in issue #107.

ZhiyuanChen added a commit to ZhiyuanChen/icrawler that referenced this issue May 15, 2024
@ZhiyuanChen
Collaborator

Please let me know if 0.6.8 fixes this issue~
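
Before retesting, it may help to confirm which icrawler version is actually installed. The snippet below is a minimal sketch using the standard library's `importlib.metadata`; `version_tuple` and `has_release` are hypothetical helper names, and the simple numeric comparison assumes plain dotted version strings such as `0.6.8`. Whether 0.6.8 resolves the error is not confirmed in this thread.

```python
from importlib import metadata


def version_tuple(version: str) -> tuple:
    """Turn '0.6.8' into (0, 6, 8), stopping at the first non-numeric part."""
    parts = []
    for piece in version.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def has_release(installed: str, minimum: str = "0.6.8") -> bool:
    """Return True if the installed version is at least the minimum version."""
    return version_tuple(installed) >= version_tuple(minimum)


try:
    installed = metadata.version("icrawler")
except metadata.PackageNotFoundError:
    print("icrawler is not installed in this environment")
else:
    print(f"icrawler {installed}, at least 0.6.8: {has_release(installed)}")
```

If the reported version is older, `pip install --upgrade icrawler` should pull the latest release from PyPI.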
