Skip to content

tupolev/crabgrassbk

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Crabgrassbk

Crabgrass page backup crawler

Description

This a crawler to save all created pages within a crabgrass group, intially on riseup.net. It uses selenium webdriver (python dependency) and phantomJS (included).

Features

  • Can download page images
  • Can download page attachments
  • Can generate Zip file package with the backup directory
  • File extensions for downloadable attachments is configurable

Requirements

  • Python 3.5
  • Pip
  • Check requirements.txt for complete list of requirements

Setup

  • Get a release package or clone the repository.
  • Using a virtualenv is strongly recommended.
  • pip install -r requirements.txt

Execution

Just configure properly the conf/config.yml file and run python crabgrassbk.py

Credits

Written by tupolev for Oficina Precaria Berlín organization.

RiseUpLabs Crabgrass repository: https://0xacab.org/riseuplabs/crabgrass

Rise Up Crabgrass: https://we.riseup.net

License

Crabgrasbk is licensed under the GNU General Public License v3.0