-
SWELancer-Benchmark Public
Forked from openai/SWELancer-BenchmarkThis repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"
Python MIT License UpdatedFeb 18, 2025 -
ReasonFlux Public
Forked from Gen-Verse/ReasonFluxReasonFlux beats o1-preview and DeepSeek-V3 with hierarchical RL and 500 thought templates
Python Apache License 2.0 UpdatedFeb 12, 2025 -
Logic-RL Public
Forked from Unakar/Logic-RLReproduce R1 Zero on Logic Puzzle
Python Apache License 2.0 UpdatedFeb 7, 2025 -
scaling-book Public
Forked from jax-ml/scaling-bookHome for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs
HTML MIT License UpdatedFeb 6, 2025 -
unsloth Public
Forked from unslothai/unslothFinetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Python Apache License 2.0 UpdatedJan 31, 2025 -
RAGEN Public
Forked from mll-lab-nu/RAGENRAGEN is the first open-source reproduction of DeepSeek-R1 for training agentic models via reinforcement learning.
Python Apache License 2.0 UpdatedJan 29, 2025 -
Janus Public
Forked from deepseek-ai/JanusJanus-Series: Unified Multimodal Understanding and Generation Models
Python MIT License UpdatedJan 27, 2025 -
-
-
-
-
minimind Public
Forked from jingyaogong/minimind「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练!
Python Apache License 2.0 UpdatedOct 30, 2024 -
-
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 UpdatedOct 15, 2024 -
pyinstrument Public
Forked from joerick/pyinstrument🚴 Call stack profiler for Python. Shows you why your code is slow!
Python BSD 3-Clause "New" or "Revised" License UpdatedOct 11, 2024 -
LLMs-from-scratch Public
Forked from rasbt/LLMs-from-scratchImplementing a ChatGPT-like LLM in PyTorch from scratch, step by step
Jupyter Notebook Other UpdatedOct 4, 2024 -
Hands-On-Large-Language-Models Public
Forked from HandsOnLLM/Hands-On-Large-Language-ModelsOfficial code repo for the O'Reilly Book - "Hands-On Large Language Models"
Jupyter Notebook UpdatedSep 30, 2024 -
tiny-universe Public
Forked from datawhalechina/tiny-universe《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
Python UpdatedSep 15, 2024 -
WCCommon Public
Forked from david830wu/WCCommonFrequently used small tools and functions
C++ MIT License UpdatedSep 2, 2024 -
nanoGPT Public
Forked from karpathy/nanoGPTThe simplest, fastest repository for training/finetuning medium-sized GPTs.
Python MIT License UpdatedAug 29, 2024 -
-
Time-Series-Library Public
Forked from thuml/Time-Series-LibraryA Library for Advanced Deep Time Series Models.
Python MIT License UpdatedJun 6, 2024 -
cocotb Public
Forked from cocotb/cocotbcocotb, a coroutine based cosimulation library for writing VHDL and Verilog testbenches in Python
Python BSD 3-Clause "New" or "Revised" License UpdatedMay 10, 2024 -
workbench-example-nemotron-finetune Public
Forked from NVIDIA/workbench-example-nemotron-finetuneAn NVIDIA AI Workbench example project for fine-tuning a Nemotron-3 8B model
Jupyter Notebook Apache License 2.0 UpdatedApr 15, 2024 -
DeepSeek-MoE Public
Forked from deepseek-ai/DeepSeek-MoEDeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Python MIT License UpdatedJan 16, 2024 -
vivado-on-silicon-mac Public
Forked from NelsonDane/vivado-on-silicon-macInstalls Vivado on M1/M2 macs
C Creative Commons Zero v1.0 Universal UpdatedNov 29, 2023 -
diffusion-literature-for-robotics Public
Forked from mbreuss/diffusion-literature-for-roboticsSummary of key papers and blogs about diffusion models to learn about the topic. Detailed list of all published diffusion robotics papers.
UpdatedOct 4, 2023 -
Xilinx-FPGA-PCIe-XDMA-Tutorial Public
Forked from WangXuan95/Xilinx-FPGA-PCIe-XDMA-TutorialXilinx FPGA PCIe 保姆级教程 ——基于 PCIe XDMA IP核
-
TD7 Public
Forked from sfujim/TD7Author's PyTorch implementation of TD7 for online and offline RL
Python MIT License UpdatedSep 12, 2023 -
lleaves Public
Forked from siboehm/lleavesCompiler for LightGBM gradient-boosted trees, based on LLVM. Speeds up prediction by ≥10x.
Python MIT License UpdatedJul 5, 2023