Collect keywords for subdomain or folder discovery by scraping a website and processing its text with NLP. The results can be used as a wordlist for FUZZ (Bug Bounty).
- txt export
- parameter export
- words export
- export only parameters or only words
- git clone https://github.com/Proviesec/nlp-website-string-miner-for-fuzzing.git
- cd nlp-website-string-miner-for-fuzzing
- pip install -r requirements.txt (or pip3 install -r requirements.txt)
- python -m textblob.download_corpora
- py .\name-miner-crawler.py url deep range export?
- py .\name-miner-crawler.py https://google.com 1 3 1
- Deep: the number of link levels to follow. With deep 2, all links on the first page are visited, and then all links found on those pages are visited as well.
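The meaning of the deep value can be illustrated with a minimal sketch of a depth-limited, breadth-first link walk. This is not the tool's own code: the `crawl` function, the `get_links` callback, and the in-memory `SITE` map are hypothetical names used only for illustration, and the sketch assumes that deep counts the link levels followed from the start page.

```python
from collections import deque


def crawl(start, depth, get_links):
    """Return pages in visit order, following links up to `depth` levels from `start`."""
    seen = {start}
    queue = deque([(start, 0)])
    visited = []
    while queue:
        page, level = queue.popleft()
        visited.append(page)
        if level == depth:
            # Depth limit reached: do not follow links from this page.
            continue
        for link in get_links(page):
            if link not in seen:
                seen.add(link)
                queue.append((link, level + 1))
    return visited


# Hypothetical in-memory site standing in for real HTTP requests.
SITE = {
    "/": ["/a", "/b"],
    "/a": ["/a/1"],
    "/b": ["/"],
    "/a/1": ["/a/1/x"],
}


def site_links(page):
    """Look up the outgoing links of a page in the fake site."""
    return SITE.get(page, [])


print(crawl("/", 1, site_links))  # start page plus its direct links
print(crawl("/", 2, site_links))  # one more level of links
```

In a real crawler, `get_links` would fetch the page and parse its anchors; the `seen` set keeps pages that link back to each other from being visited twice.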
Screen:
py -3 .\dir-name-crawler.py http://github.com
Output:
{'longer', 'contributors', 'twitter', 'octopus', 'enhance', 'javascript', 'secrets', 'partners', 'discussions', 'ohmyzshohmyzsh', 'laptops', 'actions', 'tensorflow', 'dev', 'conflicts', 'support', 'emptybody', 'flow', 'month', 'zero', 'maintain', 'swift', 'repo', 'octocat', 'arm', 'press', 'collaboration', 'jobs', 'zsh', 'cat', 'browser', 'requests', 'ship', 'contribution', 'fortune', 'efficient', 'codespaces', 'react', 'linkedin', 'npm', 'fields', 'software', 'addstatusscreens', 'infrastructure', 'core', 'system', 'eslint', 'plans', 'delightful', 'mag', 'story', 'environment', 'record', 'builds', 'space', 'jump', 'operating', 'topics', 'keeps', 'github', 'atom', 'help', 'kuberneteskubernetes', 'services', 'experiment', 'confirmation', 'requestsconversations', 'jasonetcooctocatclassifier', 'accelerate', 'flutter', 'youtube', 'security', 'source', 'fix', 'program', 'anything', 'inclusion', 'customer', 'host', 'gh', 'desktop', 'automate', 'indexjs', 'secure', 'data', 'vault', 'ohmyzsh', 'company', 'rust', 'pull', 'propose', 'session', 'blog', 'productiongrade', 'sign', 'connect', 'integrations', 'impact', 'build', 'secret', 'voice', 'clicli', 'set', 'onthe', 'automation', 'organization', 'reviews', 'commits', 'stop', 'containers', 'rewind', 'githubthe', 'cloud', 'kubernetes', 'language', 'control', 'handle', 'search', 'rustlangrust', 'heart', 'companies', 'repository', 'privacy', 'depend', 'sync', 'remote', 'install', 'oauth', 'someone', 'paid', 'container', 'hashicorpterraform', 'vm', 'world', 'gatsbyjsgatsby', 'development', 'technologies', 'value', 'input', 'interfaces', 'details', 'sales', 'fast', 'service', 'choice', 'web', 'sponsors', 'tokens', 'puts', 'joshaber', 'macos', 'explore', 'repositories', 'classifier', 'terms', 'simple', 'questions', 'create', 'readme', 'minutes', 'production', 'tensorflowtensorflow', 'conversations', 'thanks', 'funding', 'passing', 'days', 'machine', 'management', 'scale', 'careers', 's', 'vulnerability', 'user', 'desire', 'apps', 
'confirmations', 'building', 'tests', 'small', 'check', 'contributions', 'jasonetco', 'flutterflutter', 'homeassistantcore', 'risk', 'developer', 'declarative', 'merge', 'partner', 'branch', 'tab', 'keys', 'stars', 'number', 'developers', 'team', 'site', 'terraform', 'enables', 'gatsby', 'bugs', 'depends', 'top', 'installation', 'home', 'resources', 'enterprise', 'events', 'change', 'others', 'forum', 'ci', 'issues', 'current', 'appleswift', 'compare', 'share', 'review', 'community', 'organizations', 'steps', 'shop', 'vulnerabilities', 'stories', 'marketplace', 'lab', 'date', 'setup', 'cli', 'beautiful', 'product', 'matrix', 'identifies', 'everything', 'level', 'want', 'git', 'map', 'facebookreact', 'guides', 'window', 'api', 'pmarsceill', 'mobile', 'it', 'package', 'invalidate', 'browse', 'results', 'contact', 'size', 'ask', 'sophshep', 'fees', 'features', 'communitydriven', 'configuration', 'workflow', 'experiments', 'stickers', 'answer', 'reach', 'x', 'notify', 'roadmap', 'projects', 'refresh', 'environments', 'npmcli', 'blazing', 'found', 'library', 'changes', 'everyone', 'write', 'docs', 'packages', 'codebase', 'script', 'websites', 'place', 'photographs', 'project', 'public', 'work', 'token', 'manager', 'cover', 'inventions', 'matters', 'code', 'reload', 'squash', 'electron', 'teams', 'pr', 'player', 'platform', 'education', 'supports', 'suggestion', 'key', 'built', 'framework', 'profile', 'green', 'millions', 'octocatclassifier', 'status', 'windows', 'facebook', 'https', 'studio', 'starts', 'push', 'origin', 'request', 'checks', 'registry', 'collections', 'readmemd', 'octocats', 'contribute', 'fund'}
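A set like the one above is usually turned into a one-word-per-line text file before it is handed to a fuzzer. The sketch below shows one way to do that. It is an assumption about how a txt export could look, not the tool's actual export code, and `write_wordlist` is a hypothetical helper name.

```python
from pathlib import Path


def write_wordlist(words, path):
    """Write unique, non-empty entries to `path`, sorted, one per line."""
    # Strip surrounding whitespace and drop entries that end up empty.
    cleaned = sorted({w.strip() for w in words if w.strip()})
    Path(path).write_text("".join(w + "\n" for w in cleaned), encoding="utf-8")
    return cleaned
```

Calling `write_wordlist({'api', 'docs', 'blog'}, 'words.txt')` would create `words.txt` with the three entries in alphabetical order. Sorting makes the file stable between runs, since Python sets have no guaranteed order.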
TextBlob installation guide: https://textblob.readthedocs.io/en/dev/install.html
Please use this tool very carefully. The authors are not responsible for any consequences.