Repository for codes of 'Deep Reinforcement Learning'
Updated Oct 4, 2019 - Python
Curiosity-driven Exploration by Self-supervised Prediction for Street Fighter III Third Strike
Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)
Curiosity-driven Exploration by Self-supervised Prediction
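Mathematical modeling skill for three competitions (CUMCM, MCM, and 电工杯, the Electrician Cup): harness-agnostic, supports both Claude Code and Codex CLI, question-and-answer driven throughout (Friendly Mode), with 10 stages, 4 feedback layers, per-Qi weighted aggregation, problem-type dimension weighting, and anchoring to empirically measured quantiles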
PyTorch implementation of intrinsic curiosity module with proximal policy optimization
Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) on Pyramid env, Unity ML
Image denoising using Markov random fields.
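RAG question-answering engine for mathematical modeling (MCM/ICM) Outstanding Winner (O award) papers, developed by a Finalist (F award) winner on 2026 Problem C: built on LlamaIndex section-level retrieval, it supports filtering by year, problem number, and section, and breaks papers down by modeling method, sensitivity analysis, and writing structure, helping competitors quickly review high-scoring MCM papers while preparing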
Curiosity-driven PPO agent for BipedalWalker-v3 with Intrinsic Curiosity Module (ICM) for improved exploration in continuous control environments.
2025 ICM Problem F - Cyber Strong?
A Claude Code template for running a content/design business as a solo operator. One orchestrator + several worker sandboxes; briefs as handoff contracts; ICM-aligned context layers (L0/L1/L3/L4); weekly + monthly maintenance routines; four hooks, two skills, one canonical example brief.
Git repository archaeology framework — mine commit history, detect signals, run 6 analysis vectors, and generate engineering reports. Python CLI + AI-agent ready.
Reusable Interpretable Context Methodology (ICM) workspace template for filesystem-based AI agent workflows
ICM skill for Claude Code: one-shot bootstrap, 9 stages, parallel subagents, automatic drift gate