Skip to content

z-Wind/getNovel

Repository files navigation

getNovel - download novel to txt

GoDoc

Table of Contents

Installation

Please note that because of the goquery dependency, getNovel requires Go1.1+.

$ go get github.com/z-Wind/getNovel

To build with two ways

$ cd $GOPATH/src/github.com/z-Wind/getNovel
$ make

(optional) To run unit tests:

$ cd $GOPATH/src/github.com/z-Wind/getNovel
$ make test

(optional) To clean all except source code:

$ cd $GOPATH/src/github.com/z-Wind/getNovel
$ make clean

Examples

$ cd $GOPATH/src/github.com/z-Wind/getNovel
$ ./getNovel -url url_novel_contents

Anti Cloudflare

Currently, only one website has cloudflare installed. You will need to do something extra to crawl from it.

First, visit the site (e.g. https://czbooks.net/) and pass the cloudflare check. After that, open the browser developer tool and check the http request's Request Headers.

You will need to copy the following two values represented by <xxx>.

Cookie: cf_clearance=<base64 encoded string>; ...
User-Agent: <user agent>

Then, run the program with new arguments.

./getNovel -url URL -cf <base64 encoded string> -ua <user agent>

Support

Adding new files to test_dataset

$ cd test_dataset; wget2 -kx URL