Skip to content
View iamaziz's full-sized avatar
🎲
🎲

Block or report iamaziz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

llm

40 repositories

Pocket Flow: 100-line LLM framework. Let Agents build Agents!

Python 10,435 1,125 Updated Mar 27, 2026

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 4,099 308 Updated Apr 13, 2026

Unified interface for interacting with various LLMs hundreds of models, caching, fallback mechanisms, and enhanced reliability.

Python 47 5 Updated Apr 15, 2026

Flexible and powerful framework for managing multiple AI agents and handling complex conversations

Python 7,580 713 Updated Apr 17, 2026

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,819 483 Updated Oct 27, 2025

LLM inference in C/C++

C++ 104,851 17,031 Updated Apr 19, 2026

Official inference framework for 1-bit LLMs

Python 38,407 3,458 Updated Mar 10, 2026

Prompts for our Grok chat assistant and the `@grok` bot on X.

Jinja 4,050 438 Updated Nov 17, 2025

A lightweight, powerful framework for multi-agent workflows

Python 22,721 3,585 Updated Apr 19, 2026

Demo of a customer service use case implemented with the OpenAI Agents SDK

Python 5,961 922 Updated Dec 18, 2025

Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture.

Python 813 118 Updated Jul 8, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

10,633 813 Updated Jan 21, 2026

🤗 smolagents: a barebones library for agents that think in code.

Python 26,726 2,494 Updated Apr 17, 2026

Build Real-Time Knowledge Graphs for AI Agents

Python 25,103 2,492 Updated Apr 18, 2026

A Python framework that emulates Grok Heavy functionality using intelligent multi-agent orchestration. Deploy 4 (or more) specialized AI agents in parallel to deliver comprehensive, multi-perspecti…

Python 1,109 184 Updated Jul 16, 2025

Official repository for LTX-Video

Python 10,042 979 Updated Jan 5, 2026

Simple & Scalable Pretraining for Neural Architecture Research

Python 328 34 Updated Mar 31, 2026

An open-source AI agent that lives in your terminal.

TypeScript 23,525 2,231 Updated Apr 19, 2026

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 7,310 550 Updated May 5, 2025

Hierarchical Reasoning Model Official Release

Python 12,384 1,801 Updated Mar 31, 2026

AI memory OS for LLM and Agent systems(moltbot,clawdbot,openclaw), enabling persistent Skill memory for cross-task skill reuse and evolution.

Python 8,444 744 Updated Apr 17, 2026

[ICLR2026] Test-Time Scaling with Reflective Generative Model

Python 302 22 Updated Jan 28, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,017 2,065 Updated Mar 27, 2026

AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is write code that solves each problem, and is faster than existing implementations.

Python 95 14 Updated Mar 12, 2026

Implement a reasoning LLM in PyTorch from scratch, step by step

Jupyter Notebook 4,137 587 Updated Apr 17, 2026

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 941 82 Updated Feb 28, 2026

Run LLMs with MLX

Python 4,851 581 Updated Apr 15, 2026

⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.

Rust 4,019 346 Updated Mar 26, 2026

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

Python 5,062 470 Updated Apr 19, 2026