A lightweight Python bot that scrapes any website URL and returns the most frequently used keywords found in the page's visible content. Ideal for basic SEO analysis, keyword research, or content auditing.
β
Extracts visible text from any given URL
β
Automatically filters out HTML, JavaScript, and CSS content
β
Removes common English stopwords (using NLTK)
β
Displays the top N most-used keywords
β
Fully terminal-based, no complex setup needed
- Python 3.7+
- pip (Python package manager)
requestsβ for fetching the web pagebeautifulsoup4β for parsing and cleaning HTMLnltkβ for stopword filtering and text processing
- Clone or extract the project
- Navigate to the folder in terminal/command prompt
- Install the dependencies:
pip install -r requirements.txt
python scraper.py
Sample Output:
**yaml
Copy
Edit
Top keywords:
development: 18
web: 14
design: 12
seo: 10
services: 9**