Skip to content

aadium/scrapify-vite

Repository files navigation

Web Scraping App with React and Vite

This project is a web scraping application built using React.js with Vite, designed to scrape data from URLs specified by the user. It leverages modern web technologies to provide a seamless and efficient scraping experience. Additionally, the app includes AI-driven features such as summarization, categorization, and sentiment analysis to enhance the data processing capabilities. Users can view the scraped data on a separate output page and download it in JSON, CSV, or XML formats.

Features

  • Web Scraping: Input a URL and selectors to scrape data from web pages.
  • AI-Driven Analysis: Perform summarization, categorization, and sentiment analysis on the scraped data.
  • Data Export: Download the output in JSON, CSV, or XML formats.
  • Output Page: View the scraped and processed data on a separate output page.
  • Modern Tech Stack: Built with React.js and Vite for fast development and HMR (Hot Module Replacement).

Getting Started

Prerequisites

  • Node.js (18 or above)
  • npm or yarn

Installation

  1. Clone the repository:
    git clone https://github.com/aadium/scrapify-vite.git
  2. Navigate to the project directory
    cd scrapify-vite
  3. Install dependencies
    npm install
    # or
    yarn install
  4. Go to this website: GroqCloud. Create an account there, and then generate an API key. Create a .env file in the root directory, create an environment variable named VITE_GROQ_API, and add that API key as the value. Also, create another variable named VITE_API_URL, and enter this as the value: https://web-scraping-demo-8p7f.onrender.com.
  5. Start the development server
    npm run dev
    # or
    yarn dev

Contributing

Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated. Check the CONTRIBUTING file for more details

  1. Create an issue explaining your intended changes and wait till you are assigned the issue, or request to be assigned to an existing issue.
  2. Fork the Project
  3. Create your own branch (git checkout -b feature/AmazingFeature)
  4. Commit your changes (git commit -m 'your-message')
  5. Push the commits to the reote branch (git push origin feature/AmazingFeature)

License

This project is licensed under the MIT License. See the LICENSE file for details.

About

Scraping the web one page at a time

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages