We present the first systematic study on the scaling property of raw agents instantiated by LLMs. We find that performance scales with the increase in the number of agents, using the simple(st) way…

Python 143 14 Updated Oct 8, 2024

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,989 4,775 Updated Apr 3, 2026

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 41,371 4,519 Updated Mar 30, 2026

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 15,925 3,786 Updated Apr 5, 2026

Harvard-AI-and-Robotics-Lab / FairFedMed

[IEEE Medcial Imaging 2025] FairFedMed: Benchmarking Group Fairness in Federated Medical Imaging with FairLoRA

Python 16 4 Updated Oct 28, 2025

zai-org / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,596 1,271 Updated Nov 4, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ 101,580 16,394 Updated Apr 5, 2026

unslothai / unsloth

Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally.

Python 59,551 5,054 Updated Apr 5, 2026

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python 8,833 927 Updated Apr 1, 2026

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 13,018 1,583 Updated Feb 27, 2026

serengil / deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 22,485 3,051 Updated Mar 1, 2026

eseckel / ai-for-grant-writing

A curated list of resources for using LLMs to develop more competitive grant applications.

Python 4,115 511 Updated Mar 1, 2024

Tyrrrz / YoutubeDownloader

Downloads videos and playlists from YouTube

C# 14,588 1,812 Updated Apr 3, 2026

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,260 1,818 Updated Feb 26, 2025

sign / translate

Effortless Real-Time Sign Language Translation

TypeScript 741 153 Updated Mar 18, 2026

deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

5,005 536 Updated Sep 25, 2024

EleutherAI / gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,410 1,101 Updated Feb 3, 2026

lllyasviel / ControlNet

Let us control diffusion models!

Python 33,783 3,002 Updated Feb 25, 2024

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 27,044 1,968 Updated Jan 9, 2026

eloialonso / diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 2,005 149 Updated Dec 6, 2024

ultralytics / ultralytics

Ultralytics YOLO 🚀

Python 55,442 10,672 Updated Apr 5, 2026

wmj142326 / PVCP

NeurIPS 2024 | 🤸‍♂️💥🚗Pedestrian-Centric 3D Pre-collision Pose and Shape Estimation from Dashcam Perspective

Python 18 Updated Sep 5, 2025

QiWang233 / DailyDVS-200

[ECCV-2024] DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition

Jupyter Notebook 35 2 Updated May 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bruce X.B. Yu bruceyo

Achievements

Achievements

Block or report bruceyo

Stars

openclaw / openclaw

openai / tiktoken

HumanMLLM / IRG-MotionLLM

libsdl-org / SDL

modelcontextprotocol / servers

QwenLM / Qwen-Agent

ZJUI-AI4H / Hulu-Med

MoreAgentsIsAllYouNeed / AgentForest