Skip to content

cyx1231st/supider

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A simple framework for crawlers

Runtime Features

  • Progress visualization
  • Suspend crawling when meet errors or network issues and can continue later on
  • Depth first crawling to maximize parallelism
  • Parallelization controlled by thread pool to save memory
  • Decoupled connector to control connection behavior

Develop Features

  • Assemble crawl logics in a very flexible style like pipelines
  • Decouple crawl logics and collected contents
  • Debuggability: Automatically collect exceptions during crawling
  • Test mode (TODO)

About

A simple framework for crawlers

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages