-
Thoughtworks
- Singapore
- https://cemse.kaust.edu.sa/people/person/fangyuan-yu
-
-
-
unidisc Public
Forked from alexanderswerdlow/unidiscUniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, and inpainting.
Python UpdatedApr 2, 2025 -
Agent-S Public
Forked from simular-ai/Agent-SAgent S: an open agentic framework that uses computers like a human
Python Apache License 2.0 UpdatedApr 2, 2025 -
-
-
CharacterLM Public
vocabulary curriculum + LLM
-
FAR Public
Forked from showlab/FARCode for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"
Python MIT License UpdatedMar 26, 2025 -
STEVE-R1 Public
Forked from FanbinLu/STEVE-R1R1-like Computer-use Agent
Python UpdatedMar 21, 2025 -
json_repair Public
Forked from mangiucugna/json_repairA python module to repair invalid JSON from LLMs
Python MIT License UpdatedMar 19, 2025 -
-
VisualThinker-R1-Zero Public
Forked from turningpoint-ai/VisualThinker-R1-ZeroExplore the Multimodal “Aha Moment” on 2B Model
Python UpdatedMar 16, 2025 -
-
chitu Public
Forked from thu-pacman/chituHigh-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
Python Apache License 2.0 UpdatedMar 14, 2025 -
Tiny-GRPO Public
minimal GRPO implementation from scratch
-
EmergentCommunication Public
Trying to create language from communication between neural nets
MIT License UpdatedMar 14, 2025 -
Sana Public
Forked from NVlabs/SanaSANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Python Apache License 2.0 UpdatedMar 14, 2025 -
nanobrowser Public
Forked from nanobrowser/nanobrowserOpen-source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.
TypeScript Apache License 2.0 UpdatedMar 13, 2025 -
bd3lms Public
Forked from kuleshov-group/bd3lmsBlock Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
Python UpdatedMar 12, 2025 -
gidd Public
Forked from dvruette/giddCode accompanying the paper "Generalized Interpolating Discrete Diffusion"
Python MIT License UpdatedMar 10, 2025 -
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
Python UpdatedMar 7, 2025 -
macOS-use Public
Forked from browser-use/macOS-useMake Mac apps accessible for AI agents
Python MIT License UpdatedMar 5, 2025 -
tilelang Public
Forked from tile-ai/tilelangDomain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
C++ MIT License UpdatedMar 4, 2025 -
native-sparse-attention Public
Forked from fla-org/native-sparse-attention🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
Python MIT License UpdatedMar 4, 2025 -
elimination_game Public
Forked from lechmazur/elimination_gameA multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private conversations, form alliances, and vote to eliminate each other
UpdatedMar 3, 2025 -
GamingAgent Public
Forked from lmgame-org/GamingAgentInteresting CUA environment with conventional games
Python MIT License UpdatedMar 2, 2025 -
VocabularyCurriculum Public
Vocabulary curriculum improves scaling
-
fractalgen Public
Forked from LTH14/fractalgenPyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437
Python MIT License UpdatedFeb 25, 2025 -
slamkit Public
Forked from slp-rl/slamkitSlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day"
Python MIT License UpdatedFeb 25, 2025 -
VLM-R1 Public
Forked from om-ai-lab/VLM-R1Solve Visual Understanding with Reinforced VLMs
Python Apache License 2.0 UpdatedFeb 21, 2025