Skip to content
View CraKane's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report CraKane

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
42 results for source starred repositories
Clear filter

《互联网大厂推荐算法实战》资料库

Python 317 56 Updated Apr 25, 2023

Inference code for Llama models

Python 59,119 9,829 Updated Jan 26, 2025
Python 99 3 Updated Jun 12, 2024

Generative Agents: Interactive Simulacra of Human Behavior

20,515 2,834 Updated Aug 5, 2024

Explanation to key concepts in ML

8,499 696 Updated Jun 30, 2025

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 917 122 Updated Mar 23, 2024

My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides) 我不间断更新的机器学习,概率模型和深度学习的讲义(2000+页)和视频链接

Jupyter Notebook 9,577 1,769 Updated Jan 11, 2026

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Python 12,159 1,959 Updated Dec 6, 2024

FinRL®: Financial Reinforcement Learning. 🔥

Jupyter Notebook 13,892 3,132 Updated Jan 30, 2026

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,881 364 Updated Jul 18, 2024

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Python 526 81 Updated Jul 21, 2023

An improved version of EOI on Starcraft II task so_many_baneling. (The Emergence of Individuality)

Python 17 4 Updated Oct 28, 2021

清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University

HTML 36,612 7,847 Updated Jan 19, 2026

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,922 1,211 Updated Jul 25, 2024

Deep RL algorithm in pytorch

Jupyter Notebook 315 65 Updated Sep 5, 2023

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,648 4,955 Updated Aug 1, 2024

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Python 2,725 820 Updated Apr 9, 2024

Scalable Multi-Agent RL Training School for Autonomous Driving

Python 1,107 212 Updated Jan 31, 2025

Python Multi-Agent Reinforcement Learning framework

Python 2,154 407 Updated Dec 8, 2022

SMAC: The StarCraft Multi-Agent Challenge

Python 1,325 236 Updated Feb 18, 2024

智能巡逻机器人,可以以一定距离主动跟随人,主动前进后退,可以是别人的骨架和手势,同时根据人体的衣服颜色分辨身份,敌人还是朋友,敌人为红色,朋友为蓝色。

C++ 6 1 Updated Mar 1, 2020

An elegant PyTorch deep reinforcement learning library.

Python 10,117 1,261 Updated Dec 1, 2025

豆瓣读书的爬虫

Python 2,766 1,294 Updated Apr 8, 2020

一个股票数据(沪深)爬虫和选股策略测试框架

Python 1,485 626 Updated Aug 14, 2020

推荐系统实践书及笔记

31 12 Updated Oct 16, 2017

DGN Code

Python 362 87 Updated Mar 25, 2023

Data has 6 features & 1 output. the target for this is regression by at least 3 machine learning methods. And at least 1 NN method.

MATLAB 2 Updated Mar 8, 2020

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。

Jupyter Notebook 17,473 4,652 Updated Jan 9, 2026
Next