🧠 Big Mood Detector

Objective mood-episode risk from your Apple Watch — clinically informed, privacy-first, open-source.

The clinical problem: No objective tool exists for predicting mood episodes or distinguishing unipolar from bipolar depression or borderline personality disorder. Clinicians rely on subjective recall; patients often seek help after crises begin.

Big Mood Detector analyzes your Apple Health data to predict mood episode risk using AI. Two complementary models:

PAT transformer screens for current depression.
XGBoost predicts tomorrow's depression/mania/hypomania risk.

When used together (--ensemble), see how your mood state evolves from today to tomorrow.

Built by a clinical psychiatrist, implementing published research. Runs 100% on-device, no cloud, no signup.

Current status: Research prototype — the first of its kind, but not yet clinically validated.

Why Use Big Mood Detector?

Our breakthrough:

Early detection: Spot mood episode risk before symptoms spiral
Two applications: Current depression screening (PAT) + next-day episode prediction (XGBoost)
Objective data: Complement clinical assessment with continuous behavioral biomarkers
Research foundation: First implementation combining two breakthrough papers:
- PAT: Dartmouth Foundation Model (arXiv)
- XGBoost: Nature Digital Medicine 2024 (GitHub)
Privacy-first: Runs entirely on your device — your data never leaves your machine

For researchers: Validate these approaches across populations, build the evidence base for digital mental health. See PAT model training details for replication.

🚀 Quick Start

Option 1: Docker (Recommended)

Consistent environment with all dependencies

# 1. Create security credentials
cat > .env << EOF
SECRET_KEY=$(python3 -c 'import secrets; print(secrets.token_urlsafe(32))')
API_KEY_SALT=$(python3 -c 'import secrets; print(secrets.token_urlsafe(32))')
EOF

# 2. Start services
docker-compose up -d api redis

# 3. Export Apple Health data (Settings → Health → Export)
#    Place export.xml in: data/input/apple_export/

# 4. Analyze your data
docker run --rm \
  -e BIGMOOD_DATA_DIR=/app/data \
  -v "$(pwd)/data:/app/data" \
  -v "$(pwd)/model_weights:/app/model_weights:ro" \
  big-mood-detector:latest \
  predict /app/data/input/apple_export/export.xml --report

Option 2: Local Installation

Requires Python 3.12+

# 1. Install (numpy first to avoid conflicts)
pip install 'numpy<2.0'
pip install big-mood-detector

# 2. Export Apple Health data (Settings → Health → Export)
#    Place export.xml in: data/input/apple_export/

# 3. For large files (>100MB), scan first to check data availability
big-mood predict data/input/apple_export/export.xml --scan

# 4. Analyze your data (research purposes)
big-mood predict data/input/apple_export/export.xml --report

# 5. With temporal analysis (NOW vs TOMORROW)
big-mood predict data/input/apple_export/export.xml --ensemble --report

# 6. Auto-window selection (enabled by default in v0.5.6+)
# Automatically finds optimal data windows for both models
big-mood predict data/input/apple_export/export.xml --report

# 7. To disable auto-window and specify dates manually
big-mood predict data/input/apple_export/export.xml --no-auto-window --date-range 2024-01-01:2024-03-31

See full output in data/output/clinical_report.txt

📦 Model Weights for Contributors & Researchers

Need model weights? Please contact me for a convenient Google Drive link containing:

Pre-trained PAT transformer weights
Fine-tuned depression detection head
XGBoost model files
Sample data for testing

This ensures you can run the application immediately without training models from scratch.

How It Works

Your Apple Health Data
      ↓
┌─────────────────────┐
│   Past 7 Days       │ ← PAT (transformer) analyzes activity patterns
├─────────────────────┤
│   Past 30 Days      │ ← XGBoost models circadian rhythms  
└─────────────────────┘
      ↓
Research Risk Scores (Not Diagnostic)

PAT = transformer AI, XGBoost = gradient boosting, ensemble = enhanced reliability.

Working with Large Files (v0.5.8+)

New! Fast data scanning to preview what's available before processing:

# Scan a large export (takes seconds, not minutes)
big-mood predict export.xml --scan

# Example output:
📊 Data Summary:
• Total records: 8,755,251
• Heart Rate: 5,074,424 records
• Step Count: 3,513,756 records
• Sleep Analysis: 3,608 records

✅ Available Features:
• depression_risk: Depression risk prediction (XGBoost)
• mania_risk: Mania/hypomania risk prediction (XGBoost)

⚠️ Unavailable Features:
• hrv_analysis: Missing required type: HKQuantityTypeIdentifierHeartRateVariabilitySDNN

Files over 100MB will automatically prompt to scan first. This prevents waiting 10+ minutes only to discover missing data.

Technical Features

Component	Status	Performance
Apple Health XML parsing	✅	33MB/s, <100MB RAM
XML data scanning	✅	12s for 545MB file
PAT transformer model	✅	0.593 AUC depression (NHANES)
XGBoost circadian model	✅	0.80-0.98 AUC (Korean cohort)
Privacy-first processing	✅	100% local, no data sharing
Clinical feature extraction	✅	DSM-5 aligned thresholds
REST API	✅	Real-time predictions

Performance Benchmarks

Processing Speed:

365 days of data: 17 seconds
Memory usage: <100MB for any file size
Parsing throughput: 33MB/s

Model Accuracy (from original research):

Current depression screening (PAT, general population):
- Depression detection: 0.593 AUC (US NHANES 2013-14)¹
Next-day episode prediction (XGBoost, mood disorder patients):
- Depression episodes: 0.80 AUC (Korean cohort, MDD+BD patients)²
- Manic episodes: 0.98 AUC (Korean cohort, BD patients)²
- Hypomanic episodes: 0.95 AUC (Korean cohort, BD patients)²

¹Ruan et al., 2024 | ²Lim et al., 2024

What Makes This Different

Clinical innovation: First open-source tool combining two state-of-the-art approaches to predict mood episodes from wearable data
Scientific rigor: Faithful implementation of published algorithms:
- XGBoost circadian model from Seoul National University (Nature Digital Medicine 2024)
- PAT transformer from Dartmouth (first foundation model for actigraphy)
Privacy breakthrough: No cloud dependency, no data collection — your mental health data stays private
Intelligent data handling: Auto-window selection (v0.5.6+) automatically finds optimal analysis periods:
- Handles sparse Apple Watch data (gaps are OK for XGBoost)
- Prioritizes overlapping windows where both models can run
- Clear feedback on data availability and model selection
Open research: Complete transparency enables validation, improvement, and trust

⚠️ Research Limitations

Population specificity:

XGBoost: Trained on 168 Korean adults (18-35y) with mood disorders
PAT: Pre-trained on 21,538 US adults, fine-tuned on 2,800 with PHQ-9 scores

Performance constraints:

Current depression screening: Moderate accuracy (0.593 AUC)
Next-day episode prediction: High accuracy but limited to Korean cohort (0.80-0.98 AUC)
No validation across ethnicities, age groups, or comorbid conditions
Research tool only — not FDA approved or clinically validated

📚 Documentation

Audience	Start Here
Users	Quick Start Guide
Developers	Architecture Overview
Researchers	Clinical Requirements • PAT Training Details
AI Assistants	CLAUDE.md

Contributing

Critical research needs:

🏥 Clinical validation across diverse populations
🌍 Multi-ethnic, multi-age cohort studies
📱 Integration with additional wearable devices
🧪 Improving transformer model accuracy

See CONTRIBUTING.md for guidelines.

License

Apache 2.0 - Use freely, even commercially. See LICENSE.

Acknowledgments

Built on pioneering research:

XGBoost models: Seoul National University, Korea University, KAIST (Lim et al., 2024)
PAT foundation model: Dartmouth College (Ruan et al., 2024)

Have feedback? Open an issue or see discussions.

Name		Name	Last commit message	Last commit date
Latest commit History 1,051 Commits
.github		.github
LICENSES		LICENSES
alembic		alembic
analysis/pat_training		analysis/pat_training
config		config
data		data
docker		docker
docs		docs
model_weights		model_weights
scripts		scripts
src/big_mood_detector		src/big_mood_detector
tests		tests
.cursorrules		.cursorrules
.dmypy.json		.dmypy.json
.dockerignore		.dockerignore
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
EMAIL-RESPONSE.md		EMAIL-RESPONSE.md
GITHUB_ISSUE_TRAIN_TEST_BUG.md		GITHUB_ISSUE_TRAIN_TEST_BUG.md
LICENSE		LICENSE
Makefile		Makefile
NHANES_SAMPLE_SIZE_INVESTIGATION.md		NHANES_SAMPLE_SIZE_INVESTIGATION.md
NOTICE		NOTICE
PAT_DATA_LEAKAGE_FINDINGS.md		PAT_DATA_LEAKAGE_FINDINGS.md
PAT_TRAINING_COMPREHENSIVE_FINDINGS.md		PAT_TRAINING_COMPREHENSIVE_FINDINGS.md
README.md		README.md
ROADMAP_TO_MVP_V1.0.md		ROADMAP_TO_MVP_V1.0.md
TRAINING_NUMBERS_COMPARISON.md		TRAINING_NUMBERS_COMPARISON.md
TRAIN_TEST_SPLIT_BUG_INVESTIGATION.md		TRAIN_TEST_SPLIT_BUG_INVESTIGATION.md
TRAIN_TEST_SPLIT_FIX_COMPLETE.md		TRAIN_TEST_SPLIT_FIX_COMPLETE.md
alembic.ini		alembic.ini
docker-compose.yml		docker-compose.yml
make-wsl.sh		make-wsl.sh
mkdocs.yml		mkdocs.yml
mypy.ini		mypy.ini
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🧠 Big Mood Detector

Why Use Big Mood Detector?

🚀 Quick Start

Option 1: Docker (Recommended)

Option 2: Local Installation

📦 Model Weights for Contributors & Researchers

How It Works

Working with Large Files (v0.5.8+)

Technical Features

Performance Benchmarks

What Makes This Different

⚠️ Research Limitations

📚 Documentation

Contributing

License

Acknowledgments

About

Uh oh!

Releases 15

Packages

Contributors 2

Languages

License

Clarity-Digital-Twin/big-mood-detector

Folders and files

Latest commit

History

Repository files navigation

🧠 Big Mood Detector

Why Use Big Mood Detector?

🚀 Quick Start

Option 1: Docker (Recommended)

Option 2: Local Installation

📦 Model Weights for Contributors & Researchers

How It Works

Working with Large Files (v0.5.8+)

Technical Features

Performance Benchmarks

What Makes This Different

⚠️ Research Limitations

📚 Documentation

Contributing

License

Acknowledgments

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 15

Packages 0

Contributors 2

Languages

Packages