NewsScrapper

What is Web Scraping?

Web scraping refers to the extraction of data from a website.This information is collected and then exported into a format that is more useful for the user.

Libraries Used

Beautiful Soup : Beautiful Soup provides simple methods for navigating, searching, and modifying a parse tree in HTML, XML files. It transforms a complex HTML document into a tree of Python objects. It also automatically converts the document to Unicode, so you don’t have to think about encodings. This tool not only helps you scrape but also to clean the data.

To install Beautiful Soup run the following command in your conda environment : pip install beautifulsoup4
Flask : Flask is Python’s micro-framework for web app development.Flask consists of Werkzeug WSGI toolkit and Jinja2 template engine.Web Server Gateway Interface (WSGI) is the standard for Python web application development and Jinja 2 renders the web pages for the server with any specified custom content given to it by the webserver. Flask renders its HTML based templates using Jinja 2.

To install Flask run the following command in your conda environment : pip install Flask

About the Project

In this project , News are Scrapped from 3 different websites and displayed on our own designed html page.

The three websites used for scrapping news are :

Hindi News website - AmarUjala
Cricket website - Cricbuzz
Technology News website - Gadget360

After Scrapping the news from websites these news are displayed on html page.

This project can be deployed on Cloud Platforms like Heroku , AWS , GCP , Azure etc. In our case, App was deployed on Heroku cloud platform and runs on this url https://news-scapper-site.herokuapp.com/

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
static		static
templates		templates
Procfile		Procfile
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NewsScrapper

What is Web Scraping?

Libraries Used

About the Project

About

Releases

Packages

Languages

sangwanamit621/NewsScrapping

Folders and files

Latest commit

History

Repository files navigation

NewsScrapper

What is Web Scraping?

Libraries Used

About the Project

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages