Automated content curation, LLM-powered analysis, and blog generation for modern growth teams
Workflows • Features • Quick Start • Deployment • Development
Workflow A: GitHub Issues Sync

Status: ✅ Active | Purpose: Sync GitHub issues to local storage
# Manual execution
uv run python scripts/sync_github_issues.py

Features:
- GitHub CLI wrapper (`gh issue list`)
- Timestamp-based upsert logic
- Issue state tracking (open/closed)
- Atomic file operations
Output: `data/github/issues.jsonl`
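The actual logic lives in `scripts/sync_github_issues.py`; as a rough sketch of the upsert idea only (field names such as `number` and `updatedAt` follow the `gh` JSON output and are illustrative, not copied from the script):

```python
import json
import os
import tempfile
from pathlib import Path

ISSUES_PATH = Path("data/github/issues.jsonl")

def upsert_issues(fetched: list[dict]) -> None:
    """Merge freshly fetched issues into the local cache, keyed by issue number."""
    existing: dict[int, dict] = {}
    if ISSUES_PATH.exists():
        with ISSUES_PATH.open() as f:
            for line in f:
                issue = json.loads(line)
                existing[issue["number"]] = issue

    for issue in fetched:
        old = existing.get(issue["number"])
        # Timestamp-based upsert: keep whichever copy was updated more recently.
        if old is None or issue["updatedAt"] > old["updatedAt"]:
            existing[issue["number"]] = issue

    # Atomic write: dump everything to a temp file, then rename over the original.
    fd, tmp = tempfile.mkstemp(dir=ISSUES_PATH.parent, suffix=".tmp")
    with os.fdopen(fd, "w") as f:
        for issue in existing.values():
            f.write(json.dumps(issue, ensure_ascii=False) + "\n")
    os.replace(tmp, ISSUES_PATH)
```

Because the whole cache is rewritten to a temp file and renamed into place, an interrupted run never leaves `issues.jsonl` half-written.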
Workflow B: Content Intelligence & Blog Generation

Status: ✅ Active | Purpose: Ingest, curate, and generate content
# Manual execution
uv run python -m growth_agent.main run workflow-b

Three-Stage Pipeline:
1. Ingestion Stage
   - Fetch from X/Twitter creators (20 tweets per creator)
   - Fetch from RSS feeds (20 articles per feed)
   - Store in `data/inbox/items.jsonl`
   - Index in LanceDB for semantic search
2. Curation Stage
   - LLM evaluates each item (score 0-100)
   - Filter by minimum score (default: 60)
   - Select top-K items (default: 10)
   - Store in `data/curated/{date}_ranked.jsonl`
3. Generation Stage
   - LLM generates blog post from curated items
   - YAML frontmatter with metadata
   - Save as `data/blogs/{ID}_{slug}.md`
Output:
- `data/inbox/items.jsonl`
- `data/curated/{YYYY-MM-DD}_ranked.jsonl`
- `data/blogs/*.md`
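For orientation, a generated post in `data/blogs/` starts with YAML frontmatter roughly like the following (the values are made up; the fields mirror the blog post schema documented later in this README):

```markdown
---
id: "3fa85f64"
slug: "example-post-slug"
title: "Example Post Title"
date: "2026-02-05"
summary: "A 50-300 character summary of what the post covers."
tags: ["growth", "ai"]
author: "growth-agent"
---

Post body in GitHub-flavored markdown...
```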
Workflow C: Metrics Tracking

Status: ✅ Active | Purpose: Track engagement metrics across multiple platforms
# Manual execution - X/Twitter metrics
uv run python scripts/sync_metrics.py --source x
# Google Search Console metrics
uv run python scripts/sync_metrics.py --source gsc --days 7
# PostHog product analytics
uv run python scripts/sync_metrics.py --source posthog --days 1
# Sync all data sources
uv run python scripts/sync_metrics.py --source all

Features:
- X/Twitter: Fetch latest tweets and engagement metrics (likes, retweets, replies)
- Google Search Console: Search analytics, CTR, ranking positions, Core Web Vitals
- PostHog: User behavior events, insights, funnels, feature flags
- Separate JSONL files per platform (`stats.jsonl`, `gsc_stats.jsonl`, `posthog_stats.jsonl`)
- Overwrite mode (keeps latest data only)
Output:
- `data/metrics/stats.jsonl` - X/Twitter metrics
- `data/metrics/gsc_stats.jsonl` - Google Search Console data
- `data/metrics/posthog_stats.jsonl` - PostHog analytics data
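Overwrite mode means each platform file is a snapshot, not a log: every sync rewrites the file with only the latest data. A minimal sketch of that storage behavior (the record fields shown are hypothetical, not the real schema):

```python
import json
from pathlib import Path

METRICS_DIR = Path("data/metrics")

def write_platform_stats(filename: str, records: list[dict]) -> None:
    """Overwrite-mode storage: each platform file holds only the most recent sync."""
    METRICS_DIR.mkdir(parents=True, exist_ok=True)
    with (METRICS_DIR / filename).open("w") as f:  # "w" truncates the old snapshot
        for record in records:
            f.write(json.dumps(record, ensure_ascii=False) + "\n")

# Hypothetical record shape, one file per platform as listed above.
write_platform_stats("stats.jsonl", [{"tweet_id": "1", "likes": 42, "retweets": 3}])
```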
Workflow D: PuppyOne Social Listener

Status: ✅ Integrated | Purpose: Discover daily social opportunities and blog ideas, optionally render images, and post to Discord
# Initialize the default social listener configs
python -m growth_agent.main init
# Run the social listener manually
python -m growth_agent.main run workflow-d
# Handle x1 / b1 style image regeneration commands
python -m growth_agent.main social-reply x1
python -m growth_agent.main social-reply b1 --force

What it does:
- Fetches RSS / X-RSS sources from `data/social_listener/config/sources.json`
- Fetches blog-material sources from `data/social_listener/config/blog_sources.json`
- Scores social post opportunities and SEO blog ideas with PuppyOne-specific prompts
- Saves JSON / Markdown / text reports to `data/social_listener/reports/`
- Optionally renders top images via qwen-image-2.0
- Optionally sends a daily digest and top items to Discord via webhook
- Multi-Source Ingestion
  - X/Twitter creators via RapidAPI
  - RSS feed subscriptions
  - LanceDB vector indexing for semantic search
- AI-Powered Curation
  - LLM-based content evaluation and scoring
  - Quality filtering (configurable thresholds)
  - Top-K selection for high-value content
- Automated Blog Generation
  - YAML frontmatter with metadata
  - GitHub-flavored markdown output
  - Daily scheduled execution (8 AM Beijing)
- GitHub CLI integration
- Automatic issue synchronization
- Timestamp-based upsert logic
- Local caching with JSONL storage
- X/Twitter: Engagement metrics (likes, retweets, replies, impressions)
- Google Search Console: SEO performance, search analytics, Core Web Vitals
- PostHog: Product analytics, user events, funnels, insights
- Separate storage per platform for efficient querying
- OAuth 2.0 and API Key authentication support
- Configuration: Pydantic-settings with environment variables (see the sketch after this list)
- Storage: File-system database with JSONL format (separate per platform)
- Scheduling: Linux cron jobs for production deployments
- Logging: Structured logging to files and console
- Security: Atomic file operations, OAuth 2.0, API Key authentication
- Multi-Platform: X/Twitter, GitHub, Google Search Console, PostHog integration
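Configuration is loaded with pydantic-settings from the environment and `.env` file; a minimal sketch of how the documented variables could map onto a settings object (the real `config.py` will differ in detail):

```python
from pydantic_settings import BaseSettings, SettingsConfigDict

class Settings(BaseSettings):
    """Illustrative settings class; field names mirror the .env keys documented below."""
    model_config = SettingsConfigDict(env_file=".env", extra="ignore")

    x_rapidapi_key: str
    openrouter_api_key: str
    github_token: str | None = None
    llm_model: str = "anthropic/claude-3.5-sonnet"
    llm_temperature: float = 0.3
    llm_max_tokens: int = 2000

settings = Settings()  # values are read from the environment and .env
```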
- Python 3.10 or higher
- uv (recommended) or pip
- API Keys:
- X/Twitter RapidAPI Key
- OpenRouter API Key
- GitHub Token (optional, for Workflow A)
# Clone the repository
git clone https://github.com/HYPERVAPOR/growth-agent.git
cd growth-agent
# Install dependencies with uv (recommended)
uv sync
# Or with pip
pip install -e .

# Copy environment template
cp .env.example .env
# Edit configuration
vim .env

Required environment variables:
# API Keys
X_RAPIDAPI_KEY=your_x_api_key_here
OPENROUTER_API_KEY=your_openrouter_key_here
# Optional - Workflow A (GitHub)
GITHUB_TOKEN=your_github_token_here
REPO_PATH=puppyone-ai/puppyone
# Optional - Workflow C (Google Search Console)
GSC_ENABLED=true
GSC_SITE_URL=https://example.com
# Option 1: Use service account file
GSC_SERVICE_ACCOUNT_PATH=path/to/service-account.json
# Option 2: Use environment variables (recommended for deployments)
GSC_CLIENT_EMAIL=your-service-account@project-id.iam.gserviceaccount.com
GSC_PRIVATE_KEY="-----BEGIN PRIVATE KEY-----\n...\n-----END PRIVATE KEY-----\n"
# Optional - Workflow C (PostHog)
POSTHOG_ENABLED=true
POSTHOG_API_KEY=phx_your_project_api_key_here # Use Project API Key, not Personal
POSTHOG_HOST=app.posthog.com
POSTHOG_PROJECT_ID=your_project_id
# Optional - Workflow D (PuppyOne Social Listener)
SOCIAL_LISTENER_ENABLED=true
SOCIAL_LISTENER_DISCORD_WEBHOOK_URL=https://discord.com/api/webhooks/...
SOCIAL_LISTENER_RENDER_IMAGES=false
SOCIAL_LISTENER_IMAGE_COUNT=1
# Optional - qwen-image-2.0 rendering
DASHSCOPE_API_KEY=your_dashscope_api_key_here
DASHSCOPE_BASE_URL=https://dashscope.aliyuncs.com/api/v1
# LLM Configuration
LLM_MODEL=anthropic/claude-3.5-sonnet
LLM_TEMPERATURE=0.3
LLM_MAX_TOKENS=2000

X/Twitter RapidAPI:
- Visit RapidAPI
- Subscribe to Twitter API v2
- Copy your API key to `.env`
OpenRouter:
- Visit OpenRouter
- Create an account and get API key
- Add to `.env`
Google Search Console:
- Create Google Cloud Project
- Enable Search Console API
- Create service account with JSON key
- Add service account email to GSC property permissions
- Configure in `.env` (see above)
PostHog:
- Login to PostHog
- Navigate to Settings → Project → API Keys
- Copy Project API Key (not Personal API Key)
- Add to `.env`
# Initialize data directory
uv run python -m growth_agent.main init
# Add subscriptions
vim data/subscriptions/x_creators.jsonl
vim data/subscriptions/rss_feeds.jsonl
# Run Workflow B immediately
uv run python -m growth_agent.main run workflow-b
# Start scheduler daemon (Ctrl+C to stop)
uv run python -m growth_agent.main schedule
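The `schedule` command runs an APScheduler-based daemon (`core/scheduler.py`). A minimal sketch of the documented daily 8 AM Beijing run of Workflow B, with the job wiring simplified and the import path assumed rather than taken from the source:

```python
from apscheduler.schedulers.blocking import BlockingScheduler

from growth_agent.workflows.workflow_b import WorkflowB  # import path assumed

def run_workflow_b() -> None:
    WorkflowB().run()  # hypothetical entry point; the real call may differ

scheduler = BlockingScheduler(timezone="Asia/Shanghai")
# Daily execution at 8 AM Beijing time, matching the documented schedule.
scheduler.add_job(run_workflow_b, "cron", hour=8, minute=0)
scheduler.start()  # blocks until interrupted (Ctrl+C)
```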
growth-agent/
├── src/growth_agent/
│   ├── core/                      # Core infrastructure
│   │   ├── schema.py              # Pydantic data models
│   │   ├── storage.py             # File-system database
│   │   ├── llm.py                 # LLM client (OpenRouter)
│   │   ├── vector_store.py        # LanceDB integration
│   │   ├── logging.py             # Logging configuration
│   │   └── scheduler.py           # APScheduler setup
│   ├── workflows/                 # Workflow orchestration
│   │   ├── base.py                # Abstract workflow base
│   │   ├── workflow_a.py          # GitHub sync
│   │   ├── workflow_b.py          # Content intelligence
│   │   └── workflow_c.py          # Metrics tracking
│   ├── ingestors/                 # Data ingestion
│   │   ├── x_twitter.py           # X/Twitter API client
│   │   ├── rss_feed.py            # RSS feed parser
│   │   ├── github.py              # GitHub CLI wrapper
│   │   ├── metrics.py             # Metrics collector (X/Twitter)
│   │   ├── gsc_search_console.py  # Google Search Console API
│   │   └── posthog.py             # PostHog analytics API
│   ├── processors/                # Data processing
│   │   ├── curator.py             # LLM content evaluator
│   │   ├── ranker.py              # Content ranking
│   │   └── blog_generator.py      # Blog post generator
│   ├── config.py                  # Configuration management
│   └── main.py                    # CLI entry point
├── data/                          # File-system database
│   ├── subscriptions/             # X/RSS subscriptions
│   ├── inbox/                     # Raw ingested items
│   ├── curated/                   # LLM-evaluated content
│   ├── blogs/                     # Generated blog posts
│   ├── github/                    # GitHub issues cache
│   ├── metrics/                   # Social media metrics
│   ├── logs/                      # Execution logs
│   └── index/                     # LanceDB vector store
├── scripts/                       # Utility scripts
│   ├── sync_github_issues.py      # Manual Workflow A trigger
│   ├── sync_metrics.py            # Manual Workflow C trigger
│   └── test_posthog.py            # PostHog API validation
├── tests/                         # Test suite
├── pyproject.toml                 # Project configuration
└── .env.example                   # Environment template
1. Clone & Install
# Clone repository
git clone https://github.com/HYPERVAPOR/growth-agent.git
cd growth-agent
# Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh
# Install dependencies
uv sync
# Initialize data directory
uv run python -m growth_agent.main init

2. Configure Environment
# Copy environment template
cp .env.example .env
# Edit configuration (add API keys)
vim .env

Required environment variables:
# API Keys
X_RAPIDAPI_KEY=your_x_api_key_here
OPENROUTER_API_KEY=your_openrouter_key_here
# Optional - Workflow A (GitHub)
GITHUB_TOKEN=your_github_token_here
REPO_PATH=puppyone-ai/puppyone
# Optional - Workflow C (GSC & PostHog)
GSC_ENABLED=true
GSC_SITE_URL=https://example.com
GSC_CLIENT_EMAIL=your-service-account@project-id.iam.gserviceaccount.com
GSC_PRIVATE_KEY="-----BEGIN PRIVATE KEY-----\n...\n-----END PRIVATE KEY-----\n"
POSTHOG_ENABLED=true
POSTHOG_API_KEY=phx_your_project_api_key_here
POSTHOG_HOST=app.posthog.com
POSTHOG_PROJECT_ID=your_project_id
# LLM Configuration
LLM_MODEL=anthropic/claude-3.5-sonnet
LLM_TEMPERATURE=0.3
LLM_MAX_TOKENS=2000

3. Setup Cron Jobs
# Edit crontab
crontab -e

Add the following cron jobs:
# Workflow A: GitHub Issues Sync (every 2 hours)
0 */2 * * * cd /path/to/growth-agent && /usr/local/bin/uv run python scripts/sync_github_issues.py >> data/logs/cron_workflow_a.log 2>&1
# Workflow B: Content Intelligence & Blog Generation (daily at 8 AM)
0 8 * * * cd /path/to/growth-agent && /usr/local/bin/uv run python -m growth_agent.main run workflow-b >> data/logs/cron_workflow_b.log 2>&1
# Workflow C: X/Twitter Metrics (every 6 hours)
0 */6 * * * cd /path/to/growth-agent && /usr/local/bin/uv run python scripts/sync_metrics.py --source x >> data/logs/cron_workflow_c.log 2>&1
# Workflow C: Google Search Console (daily at 9 AM)
0 9 * * * cd /path/to/growth-agent && /usr/local/bin/uv run python scripts/sync_metrics.py --source gsc --days 7 >> data/logs/cron_workflow_c.log 2>&1
# Workflow C: PostHog Analytics (every 6 hours)
0 */6 * * * cd /path/to/growth-agent && /usr/local/bin/uv run python scripts/sync_metrics.py --source posthog --days 1 >> data/logs/cron_workflow_c.log 2>&1

Important:
- Replace `/path/to/growth-agent` with your actual project path
- Replace `/usr/local/bin/uv` with your uv executable path (find with `which uv`)
- Adjust schedule times based on your timezone and needs
- Logs are written to `data/logs/cron_workflow_*.log`
4. Verify Cron Jobs
# List current cron jobs
crontab -l
# Check cron service status
sudo systemctl status cron
# View cron logs (Ubuntu/Debian)
sudo grep CRON /var/log/syslog
# View application logs
tail -f data/logs/cron_workflow_b.log

5. Monitor Execution
# View workflow logs
tail -f data/logs/$(date +%Y-%m-%d).log
# View specific cron job logs
tail -f data/logs/cron_workflow_a.log # GitHub sync
tail -f data/logs/cron_workflow_b.log # Content intelligence
tail -f data/logs/cron_workflow_c.log # Metrics tracking
# Check last execution time
ls -lh data/blogs/ # Workflow B output
ls -lh data/metrics/ # Workflow C output
ls -lh data/github/ # Workflow A output

To update an existing deployment:

# Pull latest code
git pull origin main
# Reinstall dependencies (if needed)
uv sync
# Test workflows manually
uv run python -m growth_agent.main run workflow-b
uv run python scripts/sync_metrics.py --source all

If you prefer Docker over cron jobs:
# Build image
docker build -t growth-agent .
# Run with environment file
docker run -d \
--env-file .env \
-v $(pwd)/data:/app/data \
--name growth-agent \
growth-agent

# Install development dependencies
uv sync --all-extras
# Run tests
pytest
# Run with coverage
pytest --cov=src/growth_agent --cov-report=html
# View coverage report
open htmlcov/index.html

# Format code
black src/ tests/
# Check linting
ruff check src/ tests/
# Type checking
mypy src/

# Enable verbose logging
export LOG_LEVEL=DEBUG
# Run with verbose output
uv run python -m growth_agent.main run workflow-b --verbose

InboxItem: Base schema for all ingested content.
Fields:
- `id`: Unique identifier
- `source`: "x" or "rss"
- `content_type`: "post" or "article"
- `url`: Original URL
- `content`: Text content
- `author_name`: Author display name
- `title`: Content title
- `published_at`: ISO 8601 timestamp
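These map onto Pydantic models in `core/schema.py`; a rough reconstruction of the InboxItem shape from the fields above (types and defaults are assumptions, not the actual class):

```python
from datetime import datetime
from typing import Literal, Optional

from pydantic import BaseModel

class InboxItem(BaseModel):
    """Illustrative reconstruction of the ingested-content schema described above."""
    id: str
    source: Literal["x", "rss"]
    content_type: Literal["post", "article"]
    url: str
    content: str
    author_name: str
    title: Optional[str] = None   # assumed optional; tweets may not carry a title
    published_at: datetime        # ISO 8601 timestamp
```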
LLM-evaluated content with quality scores.
Fields:
- All InboxItem fields
- `score`: Quality rating (0-100)
- `summary`: AI-generated summary
- `comment`: AI evaluation comment
- `rank`: Position in ranked list
Generated blog post with YAML frontmatter.
Fields:
- `id`: Unique blog ID (UUID first 8 chars)
- `slug`: URL-friendly slug
- `title`: Blog title
- `date`: Publication date
- `summary`: Brief summary (50-300 chars)
- `tags`: List of tags
- `author`: Author name
- `content`: Markdown content
See data/schemas/ for detailed documentation.
JSONL (JSON Lines) provides:
- ✅ Simple version control with git
- ✅ Human-readable format
- ✅ Easy debugging and manual inspection
- ✅ No database dependencies
- ✅ Atomic writes prevent corruption
- ✅ AI-friendly structure for LLM analysis
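Because everything is plain JSONL, manual inspection needs nothing beyond the standard library, e.g. to eyeball a day's curated scores (the path and date are illustrative):

```python
import json

# Quick manual inspection: print score and title of a day's curated items.
with open("data/curated/2026-02-05_ranked.jsonl") as f:  # date is illustrative
    for line in f:
        item = json.loads(line)
        print(item["score"], item["title"])
```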
Edit your crontab:
crontab -e

Modify the cron schedule format: `minute hour day month weekday`
Examples:
- `0 8 * * *` - Daily at 8 AM
- `0 */6 * * *` - Every 6 hours
- `0 9 * * 1` - 9 AM every Monday
- Create Google Cloud Project
- Enable Search Console API
- Create service account & download JSON key
- Add service account email to GSC property permissions
- Configure in `.env` (use environment variables for security)
Helper script available:
# Create GSC credentials JSON from environment variables
uv run python scripts/create_gsc_creds.py
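In essence the helper assembles a service-account style JSON from the `GSC_*` variables; a simplified sketch (output file name and exact fields may differ from the real script):

```python
import json
import os

# Assemble a service-account style credentials file from the GSC_* variables,
# so the Search Console client can load it like a downloaded key file.
creds = {
    "type": "service_account",
    "client_email": os.environ["GSC_CLIENT_EMAIL"],
    "private_key": os.environ["GSC_PRIVATE_KEY"].replace("\\n", "\n"),
    "token_uri": "https://oauth2.googleapis.com/token",
}

with open("service-account.json", "w") as f:
    json.dump(creds, f, indent=2)
```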
If PostHog sync is not working, you're likely using a Personal API Key instead of a Project API Key.

Fix:
- Login to PostHog
- Go to Settings → Project → API Keys
- Copy the Project API Key (starts with `phx_`)
- Update `.env`: `POSTHOG_API_KEY=phx_...`
Verify:
uv run python scripts/test_posthog.py

Deduplication behavior per workflow:
- Workflow A (GitHub): Issue number as unique key, upsert based on `updated_at`
- Workflow B (Content): No deduplication (daily snapshots with timestamps)
- Workflow C (Metrics): Overwrite mode per platform (always latest data)
Yes! Add them to `data/subscriptions/x_creators.jsonl`:
{"id": "123456", "username": "elonmusk", "followers_count": 1000000, "subscribed_at": "2026-02-05T10:00:00Z", "last_fetched_at": null}
{"id": "789012", "username": "puppyone_ai", "followers_count": 1000, "subscribed_at": "2026-02-05T10:00:00Z", "last_fetched_at": null}

Yes! Manual execution:
# Workflow A
uv run python scripts/sync_github_issues.py
# Workflow B
uv run python -m growth_agent.main run workflow-b
# Workflow C (all platforms)
uv run python scripts/sync_metrics.py --source all
# Workflow C (specific platform)
uv run python scripts/sync_metrics.py --source gsc --days 7

MIT License - see LICENSE for details.
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (`git checkout -b feature/AmazingFeature`)
- Commit your changes (`git commit -m 'Add some AmazingFeature'`)
- Push to the branch (`git push origin feature/AmazingFeature`)
- Open a Pull Request
- Email: support@hypervapor.com
- Issues: GitHub Issues
- Documentation: `data/schemas/`
Built with ❤️ by HYPERVAPOR