Clawdit: Skill Audit Toolkit

Clawdit is a small toolkit for pulling OpenClaw skills from multiple sources, auditing SKILL.md instructions with an LLM for finding malware.

This repo is intentionally simple: a few Python collectors + one Vite frontend.

What’s in this repo

fetch_clawhub_skills.py
- Pulls skills from a ClawHub-style API (/api/v1/skills + /api/v1/download?slug=).
- Downloads ZIPs.
- Extracts SKILL.md.
- Audits with a selectable LLM provider (minimax default, openai optional).
- Saves audit output incrementally after each skill attempt.
fetch_skills_sh_skills.py
- Scrapes skills.sh list pages, resolves skill pages, builds ZIP artifacts with SKILL.md, and can run the same audit flow.
fetch_skillsmp_skills.py
- Pulls from SkillsMP API-style endpoints with pagination/retries, builds ZIP artifacts with SKILL.md, and can run the same audit flow.
src/ + Vite/Tailwind config
- React dashboard for browsing skill_audit_report.json.
- Includes search, filtering, and sorting.

Requirements

Python 3.10+
Node 18+ and npm
MINIMAX_API_KEY (audit mode default) or OPENAI_API_KEY (if using --llm-provider openai)
OPENAI_API_KEY (only needed for audit mode)
Convex account/deployment (only needed for Convex DB mode)

Quick start

Install frontend deps:

npm install

Run the dashboard:

npm run dev

Open the local Vite URL and inspect loaded data from:

public/skill_audit_report.json (autoload)
or upload a JSON file from the UI

Convex database mode (Python + Convex)

This repo now supports storing audit records in Convex and loading them from the dashboard.

Set up Convex in this repo (first time only):

npx convex dev

Install the Python Convex client:

pip install convex

Sync your local report into Convex:

python3 sync_audit_report_to_convex.py \
  --input skill_audit_report.json \
  --convex-url https://YOUR-DEPLOYMENT.convex.cloud \
  --clear-first

Point the frontend to Convex via .env.local:

VITE_CONVEX_URL=https://YOUR-DEPLOYMENT.convex.cloud
# Optional override (default already matches this repo)
VITE_CONVEX_QUERY_PATH=skillAudits:list

When VITE_CONVEX_URL is set, the UI loads from Convex first. If that fails, it falls back to public/skill_audit_report.json.

Core workflow

Fetch skills list.
Download each skill ZIP with delay.
Extract SKILL.md.
Audit via LLM.
Save report incrementally.
Visualize in frontend.

This lets you stop/restart long runs without losing prior audit entries.

Alert email notifications (critical/high)

All pullers now support SMTP alert emails when an audit result contains critical/high findings (or matching risk level).

Configure via flags (or equivalent env vars):

--alert-email-to (ALERT_EMAIL_TO)
--alert-email-from (ALERT_EMAIL_FROM)
--alert-email-smtp-host (ALERT_EMAIL_SMTP_HOST)
--alert-email-smtp-port (ALERT_EMAIL_SMTP_PORT, default 587)
--alert-email-smtp-user (ALERT_EMAIL_SMTP_USER, optional)
--alert-email-smtp-password (ALERT_EMAIL_SMTP_PASSWORD, optional)
--alert-email-use-ssl (ALERT_EMAIL_USE_SSL, default false)
--alert-email-use-starttls / --no-alert-email-use-starttls (ALERT_EMAIL_USE_STARTTLS, default true)
--alert-levels (ALERT_LEVELS, default critical,high)
--alert-email-subject-prefix (ALERT_EMAIL_SUBJECT_PREFIX, default [Puller Alert])

Example:

OPENAI_API_KEY=... \
ALERT_EMAIL_TO=you@example.com \
ALERT_EMAIL_FROM=bot@example.com \
ALERT_EMAIL_SMTP_HOST=smtp.example.com \
ALERT_EMAIL_SMTP_PORT=587 \
ALERT_EMAIL_SMTP_USER=bot@example.com \
ALERT_EMAIL_SMTP_PASSWORD=... \
python3 fetch_clawhub_skills.py \
  --download-all-from-list \
  --audit-skill-md

Collector usage

1) ClawHub-compatible collector

Single list pull:

python3 fetch_clawhub_skills.py \
  --base-url https://wry-manatee-359.convex.site \
  --limit 100 \
  --output clawhub_skills.json

Full sequential scan + audit:

MINIMAX_API_KEY=... python3 fetch_clawhub_skills.py \
  --base-url https://wry-manatee-359.convex.site \
  --limit 100 \
  --output clawhub_skills.json \
  --download-all-from-list \
  --download-dir skill_zips \
  --delay 1.5 \
  --audit-skill-md \
  --audit-output skill_audit_report.json

Single slug test:

MINIMAX_API_KEY=... python3 fetch_clawhub_skills.py \
  --base-url https://wry-manatee-359.convex.site \
  --skip-list-fetch \
  --download-slug gifgrep \
  --download-dir skill_zips \
  --audit-skill-md \
  --audit-output skill_audit_report.json

Use GitHub repo source mode (for openclaw/skills style repos):

OPENAI_API_KEY=... python3 fetch_clawhub_skills.py \
  --github-repo-url https://github.com/openclaw/skills \
  --github-ref main \
  --github-skills-path skills \
  --limit 12000 \
  --download-all-from-list \
  --download-dir skill_zips \
  --delay 0.2 \
  --audit-skill-md \
  --audit-output skill_audit_report.json

Single name match test:

python3 fetch_clawhub_skills.py \
  --base-url https://wry-manatee-359.convex.site \
  --limit 100 \
  --download-name "gifgrep"

2) skills.sh collector

MINIMAX_API_KEY=... python3 fetch_skills_sh_skills.py \
  --limit 100 \
  --output skills_sh_skills.json \
  --download-all-from-list \
  --download-dir skill_zips \
  --delay 1.5 \
  --audit-skill-md \
  --audit-output skill_audit_report.json

3) skillsmp.com collector

MINIMAX_API_KEY=... python3 fetch_skillsmp_skills.py \
  --category backend \
  --sort-by recent \
  --max-pages 5 \
  --output skillsmp_skills.json \
  --download-all-from-list \
  --download-dir skill_zips \
  --delay 1.5 \
  --audit-skill-md \
  --audit-output skill_audit_report.json

Output files

clawhub_skills.json, skills_sh_skills.json, skillsmp_skills.json
- Raw/discovered skill entries per source.
skill_zips/*.zip
- Downloaded or generated ZIP artifacts.
skill_audit_report.json
- Main audit dataset used by the dashboard.
- Updated after each processed attempt in audit mode.
sync_audit_report_to_convex.py
- Pushes JSON audit records into Convex (skillAudits table) via upsert.
convex/schema.js, convex/skillAudits.js
- Convex schema/functions for storing and querying audit records.
public/skill_audit_report.json
- Frontend autoload copy.

Frontend features

Search by slug, summary, or finding titles
Filter by risk and status
Sort by:
- risk
- finding count
- name
- source (skills.sh first / clawhub.ai first)
Source pill logic:
- slug contains / -> skills.sh (green)
- otherwise -> clawhub.ai (orange)

Safety notes

Collectors do static instruction analysis (SKILL.md) and metadata handling.
Do not execute unknown scripts from downloaded archives on your host machine.
Keep delays (--delay) non-zero to reduce rate-limit churn.

Troubleshooting

HTTP 429 / rate limits:
- increase --delay
- lower --limit
- run smaller batches
Missing LLM audits:
- confirm MINIMAX_API_KEY is set (or OPENAI_API_KEY if --llm-provider openai)
- verify network access from your runtime
Frontend shows no data:
- ensure public/skill_audit_report.json exists
- or upload your latest report manually

Dev notes

Vite config: vite.config.js
Tailwind config: tailwind.config.js
Main app: src/App.jsx

If you change report schema, update src/App.jsx mapping logic first.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
convex		convex
public		public
src		src
.gitignore		.gitignore
README.md		README.md
alert_mail.py		alert_mail.py
clawhub_skills.json		clawhub_skills.json
context.md		context.md
fetch_clawhub_skills.py		fetch_clawhub_skills.py
fetch_skills_sh_skills.py		fetch_skills_sh_skills.py
fetch_skillsmp_skills.py		fetch_skillsmp_skills.py
ideas.md		ideas.md
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
skill_audit_report.json		skill_audit_report.json
skills_sh_skills.json		skills_sh_skills.json
styles.css		styles.css
sync_audit_report_to_convex.py		sync_audit_report_to_convex.py
tailwind.config.js		tailwind.config.js
vite.config.js		vite.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Clawdit: Skill Audit Toolkit

What’s in this repo

Requirements

Quick start

Convex database mode (Python + Convex)

Core workflow

Alert email notifications (critical/high)

Collector usage

1) ClawHub-compatible collector

2) skills.sh collector

3) skillsmp.com collector

Output files

Frontend features

Safety notes

Troubleshooting

Dev notes

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Clawdit: Skill Audit Toolkit

What’s in this repo

Requirements

Quick start

Convex database mode (Python + Convex)

Core workflow

Alert email notifications (critical/high)

Collector usage

1) ClawHub-compatible collector

2) skills.sh collector

3) skillsmp.com collector

Output files

Frontend features

Safety notes

Troubleshooting

Dev notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages