Skip to content

CrawlyOEG/CrawlyWeb

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

WebCrawler

CrawlyWeb is a website that facilitates Crawly Project operations.

© 2018 Jorge Galán - OEG-UPM. Available under Apache License 2.0. See LICENSE.

Features

  • Know the belonging percentage of an article to alight pollution topic in the discover section
  • Extract the resources from a pdf in the extract section
  • Create your own model in the make your own section

Requirements

  • If you want to build the code from source, you need all the libraries.

Download

Github does not allow uploading files larger than 10mb, so it is necessary to use it from the source code

Usage

From jar run:

$java -jar Crawly-1.0.jar folderTopics folderPDF folderLibrarly

Where files uploaded in uploadFile will be temporarily saved in folderTopics, checkPDF files uploaded will be temporarily saved in folderPDF and makeModel files will be temporarily saved in folderLibrarly

Use the page at localhost:8080

Building from Source

Clone this repo and run:

mvn clean compile assembly:single

Then, get your own version of the jar in the project's target folder.

OEG Laboratory STARS4ALL