Skip to content

Java Selenium based scraper to collect all media details from IMDb website.

Notifications You must be signed in to change notification settings

kumarAnand05/IMDb-Data-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

IMDb Data Scraper

By Anand Kumar

Features

  • Entire Database Collection : Extracts all the media data stored on IMDb website.
  • CSV Output : Extracts the data in CSV format.

Instructions

After you have downloaded the project files. Follow the instructions below to setup your machine to make code functional.

Downloading/Installing dependencies

Of course you need JDK and an IDE like VSCode, IntelliJ etc. installed on your machine.

Download Dependencies

Open the project in your IDE and connect to internet. Using the pom.xml file present in the project directory, download the dependencies by performing specific actions for the IDE that you are using.

Dos and Don'ts

Do's

  • You can use your machine during the process.
  • You can keep the browser and IDE in background.

Don'ts

  • Do not click on any element of the webpage as it can lead to termination of the code.
  • Do not use console during the process.
  • Do not turn off internet or close the automated browser session.

About

Java Selenium based scraper to collect all media details from IMDb website.

Topics

Resources

Stars

Watchers

Forks

Languages