Web scraping script for Google's search results page, built using Python and Scrapy.

google-scraper

A scraper for Google's search results, built with Python and Scrapy.

About The Project

Scraping data from Google search results presents two significant challenges. First, the HTML of Google's results pages changes frequently, which makes it difficult to keep data extraction consistent and accurate. Second, Google applies stringent bot detection measures, which are a substantial obstacle to automated scraping.

To address these challenges, I've developed specialized web scraping software with the following key features:

  1. Dynamic Web Page Navigation: My script includes instructions tailored to navigate Google's dynamic web pages effectively, allowing it to parse the ever-changing HTML content.

  2. Bypassing Bot Detection Measures: I've integrated proxy network services into the script so that its requests are routed through proxies and are less likely to be flagged as bot traffic by Google.
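The two features above can be illustrated with a minimal sketch. It is not the code from this repository: the proxy URLs, the CSS selectors, and the helper names `proxy_meta_cycle` and `first_match` are all hypothetical, chosen only to show the general idea of rotating proxies and falling back through several selectors when the page markup changes.

```python
from itertools import cycle
from typing import Callable, Dict, Iterable, Iterator, Optional

# Hypothetical proxy endpoints; replace with the ones from your proxy provider.
PROXIES = [
    "http://user:pass@proxy-a.example.com:8000",
    "http://user:pass@proxy-b.example.com:8000",
]

# Hypothetical CSS selectors for a result title, most specific first.
TITLE_SELECTORS = ["div.g h3::text", "a h3::text", "h3::text"]


def proxy_meta_cycle(proxies: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Yield request meta dicts, rotating through the given proxies forever."""
    for proxy in cycle(list(proxies)):
        # Scrapy's built-in HttpProxyMiddleware reads the "proxy" meta key.
        yield {"proxy": proxy}


def first_match(
    extract: Callable[[str], Optional[str]],
    selectors: Iterable[str],
) -> Optional[str]:
    """Try each selector in order and return the first non-empty result."""
    for selector in selectors:
        value = extract(selector)
        if value:
            return value
    return None  # No selector matched; the page layout may have changed.
```

In a Scrapy spider, one way to use these helpers would be `scrapy.Request(url, meta=next(metas))` with `metas = proxy_meta_cycle(PROXIES)`, and `first_match(lambda s: result.css(s).get(), TITLE_SELECTORS)` inside the parse callback. Keeping the selectors in an ordered list means a markup change can often be handled by adding one selector rather than rewriting the parsing logic.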

For a complete project demo, please visit the Google Scraper Notion Page.


Built With

  • Python
  • Scrapy
  • Pandas


License

Distributed under the MIT License. See LICENSE.txt for more information.


Contact

Musa Yohanes - [email protected]

Project Link: https://github.com/Muyoouu/google-scraper

