INECrawler

INE Crawler is a tool to extract data from the National Statistics Institute.


A tool to crawl data from open data portals into your projects
Report Bug · Request Feature

Table of Contents
  1. About The Project
  2. Getting Started
  3. Examples for usage
  4. License
  5. Contact

About The Project

INE Crawler is a tool to extract data from the National Statistics Institute.

Features:

  • Download datasets from INE portal
  • Download metadata from resources
  • Filter by year
  • Filter by topic (only 'tourism' for now)

Getting Started

To get a local copy up and running, follow these steps.

Requirements

  • Python 3.9 must be installed

  • Clone the repo

    git clone https://github.com/wake-ua/INECrawler.git

  • Install the requirements from requirements.txt

    pip3 install -r requirements.txt
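Before installing the requirements, it can help to confirm which interpreter is in use. The sketch below is only an illustration: `python_is_supported` is a hypothetical helper, not part of INECrawler, and it assumes that "Python 3.9" means 3.9 or newer, which the project does not state.

```python
import sys

# Assumption: the "Python 3.9" requirement is read as "3.9 or newer".
MINIMUM_VERSION = (3, 9)


def python_is_supported(version_info=sys.version_info, minimum=MINIMUM_VERSION):
    """Return True when the (major, minor) pair meets the assumed minimum."""
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    if not python_is_supported():
        # sys.version starts with the version number, e.g. "3.8.10 (default, ...)"
        sys.exit("INECrawler expects Python 3.9; found " + sys.version.split()[0])
    print("Python version OK")
```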

Examples

Download all metadata from the INE portal:

    python inecrawler

Download all datasets with their metadata:

    python inecrawler -d

Download a specific category. Only 'Turismo' is available:

    python inecrawler -c Turismo

Download a specific year. For example, 2022:

    python inecrawler -y 2022

Download a specific operation. For example, 62:

    python inecrawler -id 62

Download to a specific path:

    python inecrawler -p /my/example/path/

Help with all possible commands:

    python inecrawler -h
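When the crawler is driven from another script, the flags above could be assembled programmatically. The following is a minimal sketch under stated assumptions: `build_command` and `run_crawler` are hypothetical helpers that are not part of INECrawler, and the sketch assumes the flags can be combined in a single invocation, which `python inecrawler -h` should confirm.

```python
import subprocess


def build_command(download=False, category=None, year=None,
                  operation_id=None, path=None):
    """Assemble an inecrawler command line using only the flags listed above."""
    cmd = ["python", "inecrawler"]
    if download:
        cmd.append("-d")                      # datasets plus metadata
    if category is not None:
        cmd += ["-c", category]               # e.g. "Turismo"
    if year is not None:
        cmd += ["-y", str(year)]              # e.g. 2022
    if operation_id is not None:
        cmd += ["-id", str(operation_id)]     # e.g. 62
    if path is not None:
        cmd += ["-p", path]                   # output directory
    return cmd


def run_crawler(**options):
    """Run the crawler from the repository root; raises if it exits non-zero."""
    return subprocess.run(build_command(**options), check=True)


if __name__ == "__main__":
    # Only prints the command; call run_crawler(...) to actually execute it.
    print(" ".join(build_command(download=True, category="Turismo", year=2022)))
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when the output path contains spaces.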

For more examples, please refer to the Documentation

INE site

* Works with restrictions or download limitations

See the open issues for a full list of proposed features (and known issues).

License

Distributed under the MIT License. See LICENSE for more information.

Collaborators

🙋‍♂️Alberto Berenguer Pastor
📱@aberenguerpas
✉️ [email protected]

Contact

🙋‍♂️Paula González Martínez
📱@lucyleia28
✉️ [email protected]
