obedjunias19/Cricket-Statistics

Cricket-Statistics

Statistics

This project scrapes data from a website using Beautiful Soup (bs4) in Python 3.

Modules used are:

  1. Beautiful Soup
  2. Pandas
  3. re (Regular Expressions)

Methods Used and Code Overview:

  1. getData(): This method takes a single parameter, the URL of the page to scrape. It gathers the data, cleans it with regular expressions, and stores it in an Excel file.
  2. getData() is then called in a loop to gather data from all available pages.
  3. The gathered data is then cleaned more thoroughly using pandas.
  4. Finally, the cleaned data is stored as an Excel file.
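
The steps above can be sketched roughly as follows. This is a minimal illustration and not the repository's actual code: the function names (parse_table, get_data, scrape_pages, clean_frame), the use of urllib for fetching, the assumption that the data sits in the first HTML table on each page, and the "Runs" column are all hypothetical.

```python
import re
from urllib.request import urlopen

import pandas as pd
from bs4 import BeautifulSoup


def parse_table(html):
    """Return the rows of the first <table> in html as lists of cleaned strings."""
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find("table")
    rows = []
    if table is None:
        return rows
    for tr in table.find_all("tr"):
        # Collapse runs of whitespace with a regular expression, then trim.
        cells = [
            re.sub(r"\s+", " ", cell.get_text()).strip()
            for cell in tr.find_all(["th", "td"])
        ]
        if cells:
            rows.append(cells)
    return rows


def get_data(url):
    """Fetch one page and return its table rows (hypothetical stand-in for getData)."""
    with urlopen(url) as response:
        html = response.read().decode("utf-8", errors="replace")
    return parse_table(html)


def scrape_pages(urls):
    """Call get_data for each page; keep the header row from the first page only."""
    all_rows = []
    for url in urls:
        page_rows = get_data(url)
        if all_rows and page_rows:
            page_rows = page_rows[1:]  # assumes every page repeats the header row
        all_rows.extend(page_rows)
    return all_rows


def clean_frame(rows, numeric_columns=()):
    """Build a DataFrame (first row is the header) and apply pandas cleaning."""
    if not rows:
        return pd.DataFrame()
    header, *body = rows
    df = pd.DataFrame(body, columns=header)
    df = df.drop_duplicates().reset_index(drop=True)
    for column in numeric_columns:
        # Drop non-digit characters (e.g. a trailing "*"), then convert;
        # values that end up empty become NaN.
        digits = df[column].str.replace(r"[^\d]", "", regex=True)
        df[column] = pd.to_numeric(digits, errors="coerce")
    return df
```

With a list of page URLs in hand, one possible usage is `df = clean_frame(scrape_pages(urls), numeric_columns=["Runs"])` followed by `df.to_excel("cricket_stats.xlsx", index=False)`. The file name is a placeholder, and writing .xlsx files with pandas requires an Excel engine such as openpyxl to be installed.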

@author: Obed Junias