Web scraper that scrapes all products and their price, description, color, and more from supremenewyork.com (Supreme New York) and exports the extracted product information into Excel and CSV files.

huyszn/supreme-products-scraper

Supreme Products Scraper

Web scraper that scrapes all products and their price, description, colors, sizes, stock levels, and more from supremenewyork.com (Supreme New York). It then exports the extracted product information into three files: an Excel file with all products in one sheet, a second Excel file with the products separated by category into multiple sheets, and a CSV file containing all products. The scraper works for the North America, Europe, and Japan stores.
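The export layout described above can be sketched with the standard library alone. This is a minimal illustration, not the scraper's actual code: the column names, the sample rows, and the helpers `write_csv` and `group_by_category` are all hypothetical. Writing the real Excel files would need an additional library, which is left out here.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows; the real scraper's columns may differ.
products = [
    {"category": "Jackets", "name": "Example Jacket", "color": "Black", "price": "198"},
    {"category": "Hats", "name": "Example Cap", "color": "Red", "price": "48"},
    {"category": "Jackets", "name": "Example Parka", "color": "Olive", "price": "298"},
]


def write_csv(rows, handle):
    """Write every product to a single CSV stream, one row per product."""
    writer = csv.DictWriter(handle, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)


def group_by_category(rows):
    """Group rows by category, mirroring the one-sheet-per-category layout."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["category"]].append(row)
    return dict(grouped)


# "All products" output: one flat table.
buffer = io.StringIO()
write_csv(products, buffer)

# "Separated by category" output: one list of rows per would-be sheet.
sheets = group_by_category(products)
```

Each key of `sheets` corresponds to one sheet in the multi-sheet Excel file, while the flat CSV corresponds to the single-sheet file.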

Here is an example of the CSV file produced by the scraper.

Multiple sheets of products in an Excel file

Excel File with products separated into multiple sheets by category

One sheet of products in an Excel file

Excel File with all products in one sheet

Requirements

Clone this repository, cd into it, and install dependencies:

git clone https://github.com/huyszn/supreme-products-scraper.git
cd supreme-products-scraper
pip install -r requirements.txt

Usage

Run with a proxy

When run with a proxy, the script will most likely scrape the North America (NA) site.

python3 supreme-scraper.py -p

or

python3 supreme-scraper.py --proxy
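A boolean flag with a short and a long form, like `-p` / `--proxy` above, is commonly defined with `argparse`. The sketch below shows that pattern. It is an assumption about how such a flag could be wired up, not the script's actual argument handling, and the `parse_args` helper is hypothetical.

```python
import argparse


def parse_args(argv=None):
    """Parse command-line arguments; argv=None falls back to sys.argv."""
    parser = argparse.ArgumentParser(description="Scrape Supreme products.")
    # store_true makes the flag False by default and True when present.
    parser.add_argument(
        "-p", "--proxy",
        action="store_true",
        help="route requests through a proxy",
    )
    return parser.parse_args(argv)


# Passing an explicit list makes the parser easy to exercise without a shell.
args = parse_args(["--proxy"])
use_proxy = args.proxy
```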

Proxy Success

Scraped Supreme products with a proxy

If you get a proxy error, then rerun the script or try to run the script without a proxy.
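The manual advice above (rerun, or drop the proxy) can also be expressed as a small retry loop. This is only a sketch of the idea under stated assumptions: `ProxyError`, `scrape_with_fallback`, and the `scrape` callable are hypothetical names, and nothing here reflects how the script itself handles proxy failures.

```python
class ProxyError(Exception):
    """Hypothetical error for a banned or unreachable proxy."""


def scrape_with_fallback(scrape, proxies, allow_direct=True):
    """Try each proxy in turn; optionally fall back to a direct run.

    `scrape` is a hypothetical callable that takes a proxy URL,
    or None to run without a proxy, and returns the scraped data.
    """
    for proxy in proxies:
        try:
            return scrape(proxy)
        except ProxyError:
            continue  # this proxy failed, move on to the next one
    if allow_direct:
        return scrape(None)  # last resort: no proxy
    raise ProxyError("all proxies failed")
```

Setting `allow_direct=False` keeps your own IP out of the picture, at the cost of failing outright when every proxy is banned or unreachable.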

Proxy Banned

Proxy is banned from Supreme

Proxy Error Connecting

Error connecting to the proxy

Proxy Error Connecting while scraping

Error connecting to the proxy while scraping products

Run without a proxy

This is not recommended unless at least one of the following applies:

  • You want to scrape products from the EU or JP site with your own IP
  • You have a proxy that can access the EU or JP site
  • None of the free proxies are working

python3 supreme-scraper.py

No Proxy Success

Scraped Supreme products without a proxy

If you get a message saying you are banned, then rerun the script on a different IP or run the script with the -p or --proxy argument.

No Proxy Banned

Your IP is banned from Supreme

Things to work on

  • Add desktop links to products and their colors to the Excel/CSV files

License

This project is licensed under the GNU General Public License v3.0 (GPL-3.0). See the LICENSE file for details.
