
lstoltz/keyword-checker

Keyword checker

Version: 1.0
Author: Linus Stoltz

A simple web scraper that searches a given URL for keywords. It accepts several configuration parameters and exports structured JSON for each scrape event.


Requirements

You can install all dependencies with:

pip install -r requirements.txt

Usage

python scrape_webpage.py https://example.com [OPTIONS]
| Option | Description |
| --- | --- |
| -k, --keywords | One or more keywords or phrases to search for. Separate entries with spaces and put multi-word phrases in quotes. |
| -f, --keywords-file | Path to a text file with one keyword or phrase per line. Mutually exclusive with -k. |
| -d, --depth | Maximum crawl depth (0 = only the start page). Defaults to 1. |
| -t, --timeout | HTTP request timeout in seconds. Defaults to 10. |
| --json | Path to the output JSON file. Defaults to <timestamp>_matches.json. |
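
To make the option semantics concrete, here is a minimal sketch of how a command line like the one above could be declared with Python's standard argparse module. It is not taken from scrape_webpage.py: the function names build_parser and default_json_path, the json_path attribute, and the timestamp format are all assumptions made for illustration.

```python
import argparse
from datetime import datetime


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented options."""
    parser = argparse.ArgumentParser(
        prog="scrape_webpage.py",
        description="Search a web page for keywords.",
    )
    parser.add_argument("url", help="Start URL to scrape")

    # -k and -f are documented as mutually exclusive.
    source = parser.add_mutually_exclusive_group()
    source.add_argument(
        "-k", "--keywords", nargs="+",
        help="Keywords or quoted phrases, separated by spaces",
    )
    source.add_argument(
        "-f", "--keywords-file",
        help="Text file with one keyword or phrase per line",
    )

    parser.add_argument("-d", "--depth", type=int, default=1,
                        help="Maximum crawl depth (0 = only the start page)")
    parser.add_argument("-t", "--timeout", type=float, default=10,
                        help="HTTP request timeout in seconds")
    # dest name is an assumption; None means "use the timestamped default".
    parser.add_argument("--json", dest="json_path", default=None,
                        help="Path to the output JSON file")
    return parser


def default_json_path(now=None):
    """Build a <timestamp>_matches.json name (timestamp format is assumed)."""
    now = now or datetime.now()
    return f"{now:%Y%m%d_%H%M%S}_matches.json"


# Example with an explicit argument list instead of sys.argv.
args = build_parser().parse_args(
    ["https://example.com", "-k", "privacy", "data retention", "-d", "2"]
)
print(args.keywords, args.depth, args.json_path or default_json_path())
```

Because -k takes one or more values, a quoted phrase such as "data retention" arrives as a single list entry, and passing -k together with -f makes argparse exit with a usage error.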

About

A tool that scrapes a webpage and reports each keyword match with its URL and a snippet of the surrounding text.
