Stars
MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE
various experiments for scaling inference time compute with small reasoning models
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
[ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents
A series of math-specific large language models of our Qwen2 series.
LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models
Fast and memory-efficient exact attention
Ongoing research training transformer models at scale
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
A high-throughput and memory-efficient inference and serving engine for LLMs
Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
A natural language interface for computers
a state-of-the-art-level open visual language model | 多模态预训练模型
NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRaven-13B and baselines.
NLP超强入门指南,包括各任务sota模型汇总(文本分类、文本匹配、序列标注、文本生成、语言模型),以及代码、技巧
[ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control
An Open-Ended Embodied Agent with Large Language Models
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
The last data dump of Freebase with introductory explanation of its schema
diffusion-based layout-to-image generation model
Rocket.Chat mobile clients
A library for efficient similarity search and clustering of dense vectors.