Framework for AI safety forecasting

--- Initial skeleton set up in collaboration with Claude. ---

AI safety forecasting with trajectory analysis, using LLMs to synthesize expert opinions and track progress in AI capabilities.

2025.08.17: Initial draft - WIP!

The motivation for this project is threefold:

Explore forecasting.
Explore the current landscape of AI safety forecasting.
Test the performance of different LLMs in this domain.

Primary future goals for the project include automatic and timely database updates, comprehensive coverage of key data sources, and prompt optimization, alongside development of benchmarks that aren't represented in other forecasting datasets and dashboards.

Tech stack

PostgreSQL: persistent storage
Redis: cache query data
Streamlit: low-effort frontend/viz, no-frills deploy
- if this grows beyond being a toy app, move to a more performant frontend
Docker: self-contained deploy
Airflow: data scheduler
MCP/CrewAI: orchestrate LLM agents (separate from target data source API polling)
- analyze websites and newsletters for relevant new/updated datasets
- analyze expert chatter and evaluate the system (LLM as judge of the forecasting system)
dbt: data transformation management/versioning (possibly overkill)
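
The PostgreSQL plus Redis pairing above is commonly used as a cache-aside read path: check the cache first, fall back to the database on a miss, then populate the cache. The sketch below is only an illustration of that pattern, not this project's code. So that it runs without any services, it uses an in-memory dictionary with expiry as a stand-in for Redis and sqlite3 as a stand-in for PostgreSQL. The table name `forecasts`, its columns, and the helper names are all hypothetical.

```python
import sqlite3
import time


class TTLCache:
    """In-memory stand-in for Redis: entries expire after ttl_seconds."""

    def __init__(self, ttl_seconds=60.0, clock=time.monotonic):
        self.ttl = ttl_seconds
        self.clock = clock
        self._store = {}

    def get(self, key):
        entry = self._store.get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if self.clock() >= expires_at:
            # Entry is stale: drop it and report a miss.
            del self._store[key]
            return None
        return value

    def set(self, key, value):
        self._store[key] = (value, self.clock() + self.ttl)


def make_demo_db():
    """Build a throwaway sqlite3 database (stand-in for PostgreSQL)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE forecasts (question TEXT PRIMARY KEY, probability REAL)"
    )
    # Placeholder row for illustration only, not real forecast data.
    conn.execute(
        "INSERT INTO forecasts VALUES (?, ?)", ("placeholder question", 0.25)
    )
    conn.commit()
    return conn


def get_probability(conn, cache, question):
    """Cache-aside read: cache first, then database, then populate the cache."""
    cached = cache.get(question)
    if cached is not None:
        return cached, "cache"
    row = conn.execute(
        "SELECT probability FROM forecasts WHERE question = ?", (question,)
    ).fetchone()
    if row is None:
        return None, "miss"
    cache.set(question, row[0])
    return row[0], "database"


conn = make_demo_db()
cache = TTLCache(ttl_seconds=60.0)
print(get_probability(conn, cache, "placeholder question"))  # served by the database
print(get_probability(conn, cache, "placeholder question"))  # served by the cache
```

A short expiry keeps dashboard reads cheap while still letting scheduled updates show up without explicit cache invalidation; the right TTL depends on how often the underlying data changes.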

Use

git clone git@github.com:msyvr/forecast-aisafety
cd forecast-aisafety
# Set up environment
cp .env.example .env
# Edit .env with your configuration

# Initialize database and load seed data
python run_app.py

# Explore data
jupyter notebook notebooks/data_exploration.ipynb

Tech stack: dataframe libraries

fireducks: drop-in replacement for pandas with an identical API (literally, just import fireducks.pandas as pd, no other changes)
- performance vs polars, duckdb, pandas
polars: better known but, coming from pandas, the API takes getting used to
- multithreaded on a single node (for distributed processing, use Apache Spark); pandas is single-threaded
ibis: notes from a fan
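
The drop-in claim can be sketched as a backend selector: prefer `fireducks.pandas` (the import path FireDucks documents) and fall back to plain pandas when it is not installed, leaving the rest of the code unchanged. The helper name `load_dataframe_backend` and the example column names are hypothetical, and the values are placeholders rather than real data.

```python
import importlib


def load_dataframe_backend(candidates=("fireducks.pandas", "pandas")):
    """Return the first importable pandas-compatible module, or None."""
    for name in candidates:
        try:
            return importlib.import_module(name)
        except ImportError:
            # Not installed: try the next candidate.
            continue
    return None


pd = load_dataframe_backend()

if pd is None:
    print("No pandas-compatible backend is installed.")
else:
    # Identical pandas-style code regardless of which backend was loaded.
    df = pd.DataFrame({"year": [2023, 2024], "score": [0.5, 0.7]})  # placeholder values
    print(pd.__name__, float(df["score"].mean()))
```

Because only the import differs, switching backends for a performance comparison does not require touching the analysis code.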

