Skip to content

Cranix is a powerfull web crawler for crawling in scope urls from the websites.

License

Notifications You must be signed in to change notification settings

nullr3x/Crawnix

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Crawnix

Crawnix is a python tool designed to crawl all the web pages from the website. Cranix uses Beautiful Soup to crawl web pages. It only crawl those webpages which are matching with base url. Crawnix uses url.netloc function for extract the base url and using regular expression it will check for the urls with base url and show on the display.

Screenshot

alt text

Installation

git clone http://github.com/Mehra1998/Crawnix.git

Recomended Python Version :

Crawnix currently supported Python3.x.

  • The recommended version for Python3 is 3.x

Dependencies :

Crawnix depends on the colorama, subprocess, codecs, Beautiful Soup, lxml python modules.

These dependencies can be installed using the requirements file:

  • Installation in Windows:
c:\python27\python.exe -m pip install -r requirements.txt
  • Installation on Linux:
$ sudo pip install -r requirements.txt

Features :

  • Crawl all Web Pages with in scope urls.

Usage :

$ python3 crawnix.py

License :

Crawnix is licensed under the GNU GPL license. take a look at the LICENSE for more information

About

Cranix is a powerfull web crawler for crawling in scope urls from the websites.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages