Skip to content

GabrielDan92/eMag_Altex-WebScraping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 

Repository files navigation

eMag & Altex WebScraping

The Python script extracts all the pages from a given eMag link in less than 20 seconds and saves the data in an Excel output file. In the below example the script extracted all the 25 pages with aprox 1500 products from the mobile phones eMag page (https://www.emag.ro/telefoane-mobile/c).

By sending a different header with each request using https://httpbin.org/user-agent, I managed to bypass the anti spyder/scraping tools eMag is currently using. Otherwise, the website would lock me out after several requests.

Python tools used:

  • Pandas
  • Requests
  • Threading
  • BeautifulSoup
  • Regular Expressions/Regex

eMag

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages