This is a crawler that saves all pages created within a Crabgrass group, initially targeting riseup.net. It uses Selenium WebDriver (a Python dependency) and PhantomJS (included).
- Can download page images
- Can download page attachments
- Can generate a ZIP package of the backup directory
- File extensions for downloadable attachments are configurable
- Python 3.5
- Pip
- Check requirements.txt for complete list of requirements
- Get a release package or clone the repository.
- Using a virtualenv is strongly recommended.
pip install -r requirements.txt
Edit conf/config.yml to match your setup, then run: python crabgrassbk.py
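As an illustration of two of the features listed earlier (configurable attachment extensions and ZIP packaging of the backup directory), here is a minimal sketch. It is not taken from crabgrassbk.py: the names `ALLOWED_EXTENSIONS`, `is_downloadable` and `zip_backup` are hypothetical, and the actual keys in conf/config.yml may be named and structured differently.

```python
import shutil
import tempfile
from pathlib import Path

# Hypothetical setting: the real option in conf/config.yml may differ.
ALLOWED_EXTENSIONS = {".pdf", ".odt", ".png"}


def is_downloadable(filename, allowed=ALLOWED_EXTENSIONS):
    """Return True if the file extension is in the allowed set (case-insensitive)."""
    return Path(filename).suffix.lower() in allowed


def zip_backup(backup_dir, archive_base):
    """Pack backup_dir into <archive_base>.zip and return the archive path."""
    return shutil.make_archive(str(archive_base), "zip", root_dir=str(backup_dir))


if __name__ == "__main__":
    # Build a throwaway backup directory and package it.
    with tempfile.TemporaryDirectory() as tmp:
        backup = Path(tmp) / "backup"
        backup.mkdir()
        (backup / "page.html").write_text("<h1>Example page</h1>")
        print(zip_backup(backup, Path(tmp) / "backup_package"))
```

Keeping the extension list as a set makes the check cheap, and lowercasing the suffix means that `report.PDF` and `report.pdf` are treated the same way.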
Written by tupolev for the Oficina Precaria Berlín organization.
RiseUpLabs Crabgrass repository: https://0xacab.org/riseuplabs/crabgrass
Rise Up Crabgrass: https://we.riseup.net
Crabgrassbk is licensed under the GNU General Public License v3.0.