Skip to content
View chaojiewang94's full-sized avatar

Block or report chaojiewang94

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE

Python 51 1 Updated Nov 5, 2025

🙌 OpenHands: AI-Driven Development

Python 65,800 8,086 Updated Dec 20, 2025

various experiments for scaling inference time compute with small reasoning models

Python 17 4 Updated Jan 16, 2025

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,219 369 Updated Sep 11, 2025

[ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents

Python 220 25 Updated Jun 16, 2025

A series of math-specific large language models of our Qwen2 series.

Python 1,054 151 Updated Jan 11, 2025

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

Python 78 4 Updated Oct 16, 2024

Fast and memory-efficient exact attention

Python 21,199 2,232 Updated Dec 20, 2025

Boosting the AI research efficiency

Python 149 22 Updated Sep 24, 2024

Ongoing research training transformer models at scale

Python 14,654 3,398 Updated Dec 20, 2025

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

137 7 Updated Jun 12, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,828 12,088 Updated Dec 20, 2025

Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,338 54 Updated Oct 15, 2025

[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.

Python 291 26 Updated Nov 16, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 60,960 7,527 Updated Oct 4, 2025

A natural language interface for computers

Python 61,143 5,244 Updated Dec 5, 2025

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,710 449 Updated May 29, 2024

NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRaven-13B and baselines.

Python 318 24 Updated Sep 29, 2023

NLP超强入门指南,包括各任务sota模型汇总(文本分类、文本匹配、序列标注、文本生成、语言模型),以及代码、技巧

1,837 305 Updated Oct 16, 2022

Generative Flow Networks

Python 663 79 Updated Feb 28, 2023

[ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control

HTML 63 11 Updated Jan 11, 2025

An Open-Ended Embodied Agent with Large Language Models

JavaScript 6,537 623 Updated Apr 3, 2024

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…

Python 1,459 142 Updated Mar 7, 2025

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

3,367 234 Updated Sep 22, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,160 395 Updated Jul 11, 2024

The last data dump of Freebase with introductory explanation of its schema

Python 100 18 Updated Jul 16, 2023

diffusion-based layout-to-image generation model

Python 323 23 Updated Apr 12, 2025

Rocket.Chat mobile clients

TypeScript 2,302 1,382 Updated Dec 19, 2025

A library for efficient similarity search and clustering of dense vectors.

C++ 38,483 4,157 Updated Dec 19, 2025
Next