skribd / README.md
rkwyu
Add modes
acf17aa
|
raw
history blame
2.43 kB

Scribd-dl nodedotjs npm

License: GPLv3

About

Scribd-dl helps downloading documents on scribd.com without membership / sign-in.
2 modes are available:

  • default: the .pdf file is generated by chromium's print function
  • image-based the .pdf file is generated by image snapshots taken for pages

It is prefer to use the default mode as it gives a better performance in generation time and file size.
image-based mode is a backup solution in case the default mode doesn't work as expected.

Friendly reminder:

  1. The .pdf generated by image-based mode is formed by images, so it does NOT contain any text.

Development Plan

Scribd obfuscates the .pdf files, the texts copied from the documents will become strange garbled message.
De-obfuscating will be the next stage.

Prerequisites

Please make sure the following tool(s) / application(s) are properly setup and ready to use:

Setup

  1. Download repository
git clone https://github.com/rkwyu/scribd-dl
  1. Install dependencies
cd ./scribd-dl
npm install

Configuration

[SCRIBD]
rendertime=100

[DIRECTORY]
output=output

Configuration can be altered in config.ini.
rendertime is the waiting time in millisecond for single page rendering, it is only applicable for default mode. (too short might cause missing images)
output is the ouput directory for generated .pdf files.

Usage (CLI)

Usage: npm start [options] url
Options:  
  /d            default: generated by chromium's print function
  /i        image-based: generated by image snapshots taken for pages

Example 1: Download 《The Minds of Billy Milligan》

npm start https://www.scribd.com/doc/249398282/The-Minds-of-Billy-Milligan-Daniel-Keyes

Example 2: Download 《The Minds of Billy Milligan》 using image-based method####

npm start /i https://www.scribd.com/doc/249398282/The-Minds-of-Billy-Milligan-Daniel-Keyes

License

GNU GPL v3.0