Skip to content
View monitor-small's full-sized avatar

Block or report monitor-small

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-bench lite and 46.2% tasks (pass@1) in SWE-bench verified with…

Python 3,040 333 Updated Apr 24, 2025

Memory for AI Agents in 6 lines of code

Python 10,396 953 Updated Dec 19, 2025

[NeurIPS 2025] A Graph-based LLM Framework for Real-world SE Tasks

Python 496 48 Updated Sep 19, 2025

This is a cleanroom deobfuscation of the official Claude Code npm package.

TypeScript 758 387 Updated Mar 1, 2025

Being-VL-0.5: Unified Multimodal Understanding via Byte-Pair Visual Encoding

Python 45 2 Updated Sep 4, 2025

Neural Code Intelligence Survey 2024-25; Reading lists and resources

279 15 Updated Jul 24, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,629 2,853 Updated Dec 19, 2025

[ACL 2024] Long-Context Language Modeling with Parallel Encodings

Python 166 10 Updated Jun 13, 2024

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,436 121 Updated Dec 19, 2025

Various extensions for the Eino framework: https://github.com/cloudwego/eino

Go 538 229 Updated Dec 19, 2025

本仓库包含对 Claude Code v1.0.33 进行逆向工程的完整研究和分析资料。包括对混淆源代码的深度技术分析、系统架构文档,以及重构 Claude Code agent 系统的实现蓝图。主要发现包括实时 Steering 机制、多 Agent 架构、智能上下文管理和工具执行管道。该项目为理解现代 AI agent 系统设计和实现提供技术参考。

JavaScript 11,674 3,034 Updated Jul 19, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,438 1,991 Updated Nov 1, 2025

DeepSeek Native Sparse Attention pytorch implementation

Jupyter Notebook 109 10 Updated Dec 17, 2025

Our paper about robust LLM fingerprints.

1 3 Updated Jul 4, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 8,037 700 Updated Feb 10, 2025

飞书文档导出服务

C# 636 68 Updated Jul 8, 2024

一键命令下载飞书文档为 Markdown(寻找维护者)

Go 1,867 181 Updated Dec 2, 2025

用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.

Python 2,884 348 Updated May 21, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

21,924 2,082 Updated May 19, 2025

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 2,112 227 Updated Aug 17, 2024

The official implementation of LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning

Python 15 Updated Mar 14, 2025

人工精调的中文对话数据集和一段chatglm的微调代码

Jupyter Notebook 1,197 97 Updated May 3, 2025

A quick guide (especially) for trending instruction finetuning datasets

3,328 226 Updated Nov 28, 2023

Awesome papers about unifying LLMs and KGs

2,523 176 Updated May 2, 2025

This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide practical guidance for researchers and practitioners.

Python 1,212 72 Updated Dec 19, 2025

LangGPT: Empowering everyone to become a prompt expert! 🚀 📌 结构化提示词(Structured Prompt)提出者 📌 元提示词(Meta-Prompt)发起者 📌 最流行的提示词落地范式 | Language of GPT The pioneering framework for structured & meta-prompt…

Jupyter Notebook 11,266 891 Updated Nov 29, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 81,322 12,143 Updated Dec 19, 2025

The human face subset of LAION-400M for large-scale face pretraining.

Python 316 16 Updated Feb 1, 2023

中文 CSL 样式 - Zotero 中文社区

XML 5,921 900 Updated Dec 12, 2025
Next