SafeSABR

English | 中文

Risk-calibrated adaptive bitrate streaming over Starlink networks.

SafeSABR is a learned adaptive bitrate (ABR) framework for high-bitrate video streaming over volatile Starlink access links. It targets a practical failure mode that is easy to miss with average QoE alone: a learned ABR policy may keep requesting aggressive high-bitrate chunks during handover-induced throughput drops, causing severe session-level rebuffering.

Why SafeSABR?

Starlink makes high-bitrate video streaming possible in areas where terrestrial broadband is unavailable, but its access throughput can change rapidly because of satellite mobility and handovers. This creates a QoE--severe-risk tradeoff: aggressive bitrate selection improves average quality when the link is strong, but a few bad decisions during abrupt drops can drain the playback buffer and produce long stalls.

SafeSABR addresses this with a three-stage design:

Behavior-cloning pretraining learns a high-QoE ABR prior from an expert policy.
Risk-calibrated RL fine-tuning penalizes severe rebuffering tails so the policy becomes less prone to high-risk actions.
Runtime safety auditing checks the policy-requested bitrate against a safe-capacity estimate before execution.

Repository Scope

This repository provides the core SafeSABR implementation:

SafeSABR training code for behavior-cloning pretraining and risk-calibrated PPO fine-tuning.
Runtime safety-auditing code for safe-capacity-guided bitrate correction.
StarNet-to-SABR trace conversion utilities.
Synthetic high-bitrate video-size generation utilities for 4K/8K-style ABR experiments.
Paper figures for explaining the problem setting and SafeSABR design.

Repository Layout

SafeSABR/
├── assets/                    # Paper-style figures for README and project page
├── docs/                      # Data and reproduction notes
├── safesabr/                  # Core SafeSABR code
│   ├── train_sabr.py          # Entry point for BC + RL fine-tuning
│   ├── train_sabr_logged.py   # Structured training/logging implementation
│   ├── evaluate_action_shield.py
│   ├── test_ppo_sb.py         # Evaluation loop with runtime safety auditing
│   ├── sim_env/               # ABR simulator wrappers
│   ├── rl/                    # Behavior-cloning / DAgger utilities
│   ├── utils_tool/            # Evaluation and logging helpers
│   └── build_env_c_plus/      # C++ expert backend for behavior cloning
└── tools/
    ├── prepare_starlink_traces.py
    ├── make_synthetic_video_size.py
    └── export_predictor_safe_caps.py

Data

SafeSABR uses Starlink measurement traces derived from the StarNet dataset:

https://github.com/ConnectedSystemsLab/StarNet

The dataset is not redistributed in this repository. Please download or prepare the StarNet processed throughput files separately, then follow docs/DATA.md to convert them into SABR replay traces.

Quick Start

Create the environment:

conda env create -f environment.yml
conda activate safesabr

Generate the synthetic high-bitrate video-size files:

python tools/make_synthetic_video_size.py

Prepare StarNet-derived SABR traces after placing processed StarNet files under data/starnet_pkl:

python tools/prepare_starlink_traces.py --source-root data/starnet_pkl

Build the C++ expert backend used by behavior-cloning pretraining:

cd safesabr
bash build_env_c_plus/build_rl.sh

Train a SafeSABR policy:

python train_sabr.py 1 0 4 100000 \
  --seed 42 \
  --risk-mode cvar_rebuf \
  --risk-alpha 0.95 \
  --risk-lambda 20 \
  --run-name safesabr_seed42

Evaluate with runtime safety auditing:

python evaluate_action_shield.py \
  --log-root ./experiment_logs/starlink_high \
  --model-dir ./experiment_logs/starlink_high/<run-id>/models/rl_model/ppo \
  --model-label safesabr \
  --shield external_predictor \
  --external-predictor-file ./experiment_logs/starlink_high/predictor_safe_caps/BG-CFQS_safe_caps.csv \
  --external-predictor-name BG-CFQS \
  --shield-margin 0.90 \
  --shield-low-buffer-margin 0.90

See docs/REPRODUCE.md for the intended full workflow.

Result Snapshot

The paper evaluates SafeSABR by the QoE--severe-risk operating point rather than average QoE alone.

Citation

If you find SafeSABR useful, please cite:

@misc{xie2026safesabrriskcalibratedadaptivebitrate,
      title={SafeSABR: Risk-Calibrated Adaptive Bitrate Streaming over Starlink Networks}, 
      author={Hongjun Xie and Jiahang Zhu and Zhiming Shao and Chao Fan and Zenghui Zhang and Genke Yang and Pengcheng Luo},
      year={2026},
      eprint={2605.23560},
      archivePrefix={arXiv},
      primaryClass={eess.SY},
      url={https://arxiv.org/abs/2605.23560}, 
}

Acknowledgements

SafeSABR builds on the broader open-source ABR research ecosystem. We thank the authors of Comyco-Lin and Pensieve Retrain, whose public implementations were valuable references for ABR simulation, training, and evaluation workflows.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
assets		assets
docs		docs
safesabr		safesabr
scripts		scripts
tools		tools
.gitignore		.gitignore
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
README_zh-CN.md		README_zh-CN.md
environment.yml		environment.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SafeSABR

Why SafeSABR?

Repository Scope

Repository Layout

Data

Quick Start

Result Snapshot

Citation

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SafeSABR

Why SafeSABR?

Repository Scope

Repository Layout

Data

Quick Start

Result Snapshot

Citation

Acknowledgements

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages