Skip to content
View initial-h's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report initial-h

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Production-ready platform for agentic workflow development.

TypeScript 116,031 17,894 Updated Oct 9, 2025

The world's first open-source multimodal creative assistant This is a substitute for Canva and Manus that prioritizes privacy and is usable locally.

TypeScript 4,925 414 Updated Sep 24, 2025
Python 23 4 Updated Dec 4, 2024

(ICML'25 Outstanding) CollabLLM: From Passive Responders to Active Collaborators

Jupyter Notebook 232 24 Updated Sep 25, 2025

Self-Guided Function Calling in Large Language Models via Stepwise Experience Recall

HTML 5 Updated Sep 29, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,392 1,372 Updated Jul 9, 2025

“AI-Compass”将为社区指引在 AI 技术海洋中航行的方向,无论你是初学者还是进阶开发者,都能在这里找到通往 AI 各大方向的路径。旨在帮助开发者系统性地了解 AI 的核心概念、主流技术、前沿趋势,并通过实践掌握从理论到落地的全过程。

320 32 Updated Sep 28, 2025

[EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

Python 480 36 Updated Oct 2, 2025

Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.

Jupyter Notebook 2,017 180 Updated Aug 13, 2024

🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …

Python 15,569 1,234 Updated Oct 6, 2025

[AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".

Python 5 Updated Jul 26, 2025

⏩ Ship faster with Continuous AI. Build and run custom agents across your IDE, terminal, and CI

TypeScript 29,221 3,605 Updated Oct 9, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 17,383 2,260 Updated Oct 5, 2025

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 7,016 678 Updated Jul 10, 2025

Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)

TypeScript 4,911 451 Updated Oct 6, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,116 2,515 Updated Oct 9, 2025

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,688 358 Updated Aug 13, 2025

Open-source implementation of AlphaEvolve

Python 4,077 589 Updated Oct 9, 2025

easydou

Python 5 Updated Dec 22, 2024

改进过的rule版本

Python 9 4 Updated Aug 14, 2019
Python 13 7 Updated Sep 14, 2021

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Python 844 46 Updated Jul 10, 2025

Benchmarking LLMs' Gaming Ability in Multi-Agent Environments

Jupyter Notebook 88 1 Updated May 1, 2025

A framework for few-shot evaluation of language models.

Python 10,301 2,771 Updated Oct 9, 2025

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

618 25 Updated Nov 29, 2024

A curated list of Diffusion Model in RL resources (continually updated)

1,354 68 Updated Sep 12, 2025
Jupyter Notebook 1 Updated May 7, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,204 56 Updated Oct 1, 2025

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 327 37 Updated Aug 6, 2024
Next