Lists (18)
Sort Name ascending (A-Z)
Stars
An implementation of Tiny Recursive Models (TRM)
Hierarchical Reasoning Model Official Release
Join Professor Torchenstein on an electrifying quest to bend PyTorch to your will. Mwahahaha
A powerful framework for building realtime voice AI agents 🤖🎙️📹
The Hugging Face Course on Transformers for Audio
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.
verl: Volcano Engine Reinforcement Learning for LLMs
Official PyTorch implementation for "Large Language Diffusion Models"
Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
Code for the paper "On the Tip of the Tongue: Analyzing Conceptual Representation in Large Language Models with Reverse-Dictionary Probe"
A PyTorch implementation of Perceiver, Perceiver IO and Perceiver AR with PyTorch Lightning scripts for distributed training
Data and tools for generating and inspecting OLMo pre-training data.
A minimal implementation of diffusion models for text generation
Lexical database for ~70k English words with morphological variables
Morfessor is a tool for unsupervised and semi-supervised morphological segmentation
A quick guide (especially) for trending instruction finetuning datasets
Minimal reproduction of DeepSeek R1-Zero
[ICCVW 25] LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning
Run safety benchmarks against AI models and view detailed reports showing how well they performed.
Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ Pytorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unita…
Set of tools to assess and improve LLM security.
Efficient Triton Kernels for LLM Training
🔥 Proxy is a high performance HTTP(S) proxies, SOCKS5 proxies,WEBSOCKET, TCP, UDP proxy server implemented by golang. Now, it supports chain-style proxies,nat forwarding in different lan,TCP/UDP po…
Built for demanding AI workflows, this gateway offers low-latency, provider-agnostic access, ensuring your AI applications run smoothly and quickly.
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.