verifiers Public
Forked from PrimeIntellect-ai/verifiers
Verifiers for LLM Reinforcement Learning
Python MIT License Updated Jun 8, 2025
ART Public
Forked from OpenPipe/ART
Agent Reinforcement Trainer for training multi-turn agents using GRPO
Python Apache License 2.0 Updated Jun 7, 2025
smolagents Public
Forked from huggingface/smolagents
🤗 smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents.
Python Apache License 2.0 Updated Feb 18, 2025
vision-agent Public
Forked from landing-ai/vision-agent
Vision agent
Python Apache License 2.0 Updated Feb 16, 2025
TinyZero Public
Forked from Jiayi-Pan/TinyZero
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Python Apache License 2.0 Updated Feb 1, 2025
search-and-learn Public
Forked from huggingface/search-and-learn
Python Apache License 2.0 Updated Dec 18, 2024
LanguageAgentTreeSearch Public
Forked from lapisrocks/LanguageAgentTreeSearch
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
Python MIT License Updated Jul 30, 2024
tensor Public
Forked from novusnota-forks/EurekaLabsAI-tensor
The Tensor (or Array)
Python Updated Jul 27, 2024
ngram Public
Forked from novusnota-forks/EurekaLabsAI-ngram
The n-gram Language Model
C Updated Jul 24, 2024
llm-foundry Public
Forked from mosaicml/llm-foundry
LLM training code for MosaicML foundation models
Python Apache License 2.0 Updated Mar 4, 2024
composer Public
Forked from mosaicml/composer
Supercharge Your Model Training
Python Apache License 2.0 Updated Mar 4, 2024
unilm Public
Forked from microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Python MIT License Updated Feb 28, 2024
dspy Public
Forked from stanfordnlp/dspy
DSPy: The framework for programming—not prompting—foundation models
Python MIT License Updated Feb 21, 2024
jepa Public
Forked from facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
Python Other Updated Feb 16, 2024
simulai Public
Forked from IBM/simulai
A toolkit with data-driven pipelines for physics-informed machine learning.
Python Apache License 2.0 Updated Feb 13, 2024
rlx Public
Forked from noahfarr/rlx
A reinforcement learning framework based on MLX.
Python MIT License Updated Feb 11, 2024
mergekit Public
Forked from arcee-ai/mergekit
Tools for merging pretrained large language models.
Python GNU Lesser General Public License v3.0 Updated Feb 3, 2024
WikiChat Public
Forked from stanford-oval/WikiChat
WikiChat stops the hallucination of large language models by retrieving data from Wikipedia.
Python Apache License 2.0 Updated Jan 28, 2024
reflexion Public
Forked from noahshinn/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Python MIT License Updated Nov 26, 2023
ray-summit-2023-training Public
Forked from anyscale/ray-summit-2023-training
ecco Public
Forked from jalammar/ecco
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…
Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Sep 1, 2023
llm-course Public
Forked from mlabonne/llm-course
Course with a roadmap and notebooks to get into Large Language Models (LLMs).
Jupyter Notebook Apache License 2.0 Updated Aug 28, 2023