🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …

Python 15,569 1,234 Updated Oct 6, 2025

ChangWinde / PiCor

[AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".

Python 5 Updated Jul 26, 2025

continuedev / continue

⏩ Ship faster with Continuous AI. Build and run custom agents across your IDE, terminal, and CI

TypeScript 29,221 3,605 Updated Oct 9, 2025

bytedance / deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 17,383 2,260 Updated Oct 5, 2025

zilliztech / deep-searcher

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 7,016 678 Updated Jul 10, 2025

jina-ai / node-DeepResearch

Keep searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)

TypeScript 4,911 451 Updated Oct 6, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,116 2,515 Updated Oct 9, 2025

jennyzzt / dgm

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,688 358 Updated Aug 13, 2025

codelion / openevolve

Open-source implementation of AlphaEvolve

Python 4,077 589 Updated Oct 9, 2025

MingshiYangUIUC / AI-Doudizhu

easydou

Python 5 Updated Dec 22, 2024

deecamp2019-group20 / RuleBasedModelV2

Improved rule-based version

Python 9 4 Updated Aug 14, 2019

1310183534 / DouDiZhu

Python 13 7 Updated Sep 14, 2021

kuleshov-group / bd3lms

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Python 844 46 Updated Jul 10, 2025

CUHK-ARISE / GAMABench

Benchmarking LLMs' Gaming Ability in Multi-Agent Environments

Jupyter Notebook 88 1 Updated May 1, 2025

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 10,301 2,771 Updated Oct 9, 2025

apexrl / Diff4RLSurvey

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

618 25 Updated Nov 29, 2024

opendilab / awesome-diffusion-model-in-rl

A curated list of Diffusion Model in RL resources (continually updated)

1,354 68 Updated Sep 12, 2025

qordmlwls / WDPOP

Forked from OpenRLHF/OpenRLHF

Jupyter Notebook 1 Updated May 7, 2025

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

This repository provides a valuable reference for researchers in the field of multimodality. Start exploring RL-based Reasoning MLLMs here!

1,204 56 Updated Oct 1, 2025

YuxiXie / MCTS-DPO

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 327 37 Updated Aug 6, 2024

Hongming Zhang initial-h

Stars

langgenius / dify

Physical-Intelligence / openpi

11cafe / jaaz

BunsenFeng / model_swarm

Wuyxin / collabllm

AI-Research-TeamX / SEER

HW-whistleblower / True-Story-of-Pangu

tingaicompass / AI-Compass

thu-coai / CharacterGLM-6B

LC1332 / Chat-Haruhi-Suzumiya

xming521 / WeClone