ChoopScoop is a professional-grade, Playwright-powered web auditing and tag detection tool.
Developed by Jerusha Gray as part of her MarTech and Data Strategy portfolio, under IdeoPraxis Collective LLC, DBA GetFunnelCaked.
ChoopScoop automates the auditing of websites to detect analytics and marketing tags, identify underlying technologies, analyze dataLayer events, and generate structured reports.
It's designed for accuracy, transparency, and performance, which makes it well suited to marketing operations professionals, analysts, and engineers who want actionable insights into digital ecosystems.
Version 2.1 (MVP) focuses on stability, accuracy, and scalability, setting the foundation for future visualization and compliance modules.
- Modern Playwright crawler with asynchronous performance
- Comprehensive tag detection: GA4, GTM, Facebook, LinkedIn, TikTok, Adobe, Segment, and more
- DataLayer analysis: Automatically parses GA4 and ecommerce events
- Performance metrics: Load time, first contentful paint, DOM timings
- Cross-platform: Works on macOS, Windows, and Linux
- Clean exports: JSON, CSV, and an interactive HTML dashboard
- Resumable crawls: State management for large audits
- Low memory footprint: Smart flush-to-disk and batch processing
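As a rough illustration of the tag detection idea above, the sketch below matches the URLs a page requests against a small table of regular expressions. The pattern table and the `detect_tags` function are hypothetical names invented for this example; ChoopScoop's actual patterns live in tag_patterns.py and may look quite different.

```python
import re

# Hypothetical pattern table for illustration only; the real patterns
# are defined in tag_patterns.py and may differ.
TAG_PATTERNS = {
    "GA4": re.compile(r"googletagmanager\.com/gtag/js\?id=G-[A-Z0-9]+"),
    "GTM": re.compile(r"googletagmanager\.com/gtm\.js\?id=GTM-[A-Z0-9]+"),
    "Facebook Pixel": re.compile(r"connect\.facebook\.net/[^/]+/fbevents\.js"),
    "LinkedIn Insight": re.compile(
        r"snap\.licdn\.com/li\.lms-analytics/insight\.min\.js"
    ),
    "TikTok Pixel": re.compile(r"analytics\.tiktok\.com/"),
}


def detect_tags(request_urls):
    """Return the sorted names of tags whose pattern matches any URL."""
    found = set()
    for url in request_urls:
        for name, pattern in TAG_PATTERNS.items():
            if pattern.search(url):
                found.add(name)
    return sorted(found)


if __name__ == "__main__":
    sample = [
        "https://www.googletagmanager.com/gtm.js?id=GTM-ABC123",
        "https://www.googletagmanager.com/gtag/js?id=G-XYZ789",
        "https://example.com/static/app.js",
    ]
    print(detect_tags(sample))  # ['GA4', 'GTM']
```

Matching on request URLs rather than page source is one possible design: it also catches tags injected at runtime by a tag manager, at the cost of needing a real browser session to collect the requests.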
git clone https://github.com/<your-handle>/choopscoop.git
cd choopscoop
pip install -r requirements_v2.txt
bash install_v2.1.sh
Once published or locally packaged:
pip install .
After installation, run this once to install Playwright browsers (required for audits):
choopscoop setup
This step ensures the Chromium browser engine is properly configured.
choopscoop https://example.com
choopscoop https://example.com --max-pages 200 --max-depth 3 --format all
You can also define settings in config.yaml for reusable crawl parameters.
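To give a feel for what page and depth limits do to a crawl, here is a minimal breadth-first sketch over an in-memory link graph. It assumes that depth counts link hops from the start URL and that the page limit caps the total number of visited pages; the README does not spell out the exact semantics of `--max-pages` and `--max-depth`, and `bounded_crawl` is a hypothetical helper, not part of ChoopScoop.

```python
from collections import deque


def bounded_crawl(start, links, max_pages=200, max_depth=3):
    """Breadth-first walk over `links` (url -> list of linked urls).

    Stops once `max_pages` pages have been visited and does not follow
    links found on pages that sit at depth `max_depth`.
    """
    visited = []
    seen = {start}
    queue = deque([(start, 0)])
    while queue and len(visited) < max_pages:
        url, depth = queue.popleft()
        visited.append(url)
        if depth >= max_depth:
            continue  # at the depth limit: record the page, skip its links
        for nxt in links.get(url, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, depth + 1))
    return visited


if __name__ == "__main__":
    # Hypothetical site structure used only for this example.
    site = {"/": ["/a", "/b"], "/a": ["/a/1"], "/a/1": ["/a/1/x"]}
    print(bounded_crawl("/", site, max_pages=10, max_depth=1))
    # ['/', '/a', '/b']
```

Breadth-first order means a tight page limit still covers the pages closest to the start URL before any deep ones.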
ChoopScoop automatically generates three export formats:
| Format | File | Description |
|---|---|---|
| JSON | site-audit.json | Full crawl data including tags, technologies, and metrics |
| CSV | site-audit.csv | Summarized audit metrics |
| HTML | site-audit.html | Interactive dashboard for visual review |
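For a quick look at the CSV export from a script, something like the following may be enough. It makes no assumption about which columns ChoopScoop writes: it only reads the header row and counts the data rows. `summarize_csv` is a hypothetical helper, and UTF-8 encoding is assumed.

```python
import csv


def summarize_csv(path):
    """Return (column_names, row_count) for a CSV file with a header row."""
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        row_count = sum(1 for _ in reader)  # data rows, header excluded
        columns = reader.fieldnames or []   # empty list for an empty file
    return list(columns), row_count
```

Calling `summarize_csv("site-audit.csv")` after an audit would return the column names and the number of rows, which is a reasonable first check before loading the file into a spreadsheet or notebook.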
choopscoop/
├── choopscoop_site_auditor_v2_1.py
├── tag_patterns.py
├── config.yaml
├── requirements_v2.txt
├── install_v2.1.sh
├── LICENSE.md
├── ROADMAP.md
├── CONTRIBUTING.md
└── docs/
    ├── DELIVERY-SUMMARY-v2.1.md
    ├── PATCH-NOTES-v2.1.md
    ├── VERSION-COMPARISON.md
    └── QUICK-START-v2.1.md
ChoopScoop evolves thoughtfully; see ROADMAP.md for planned enhancements and version milestones.
This project is maintained as a personal portfolio artifact, but it follows open documentation and structure standards to support long-term maintainability.
See CONTRIBUTING.md for details on project principles and conventions.
Jerusha Gray
Marketing Operations, MarTech & Data Strategy
IdeoPraxis Collective LLC, DBA GetFunnelCaked
Licensed under the MIT License.
© 2025 IdeoPraxis Collective LLC, DBA GetFunnelCaked