Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Official implementation for BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation
Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.
Official implementation of BitMamba-2, a scalable 1.58-bit state space model (Mamba-2 + BitNet) trained from scratch on 150B tokens. Includes JAX training code and a high-performance C++ inference engine.
BitNet: Learning-Based Bit-Depth Expansion
Ultra-lightweight C++ inference engine for BitMamba-2 (1.58-bit SSM). Runs 1B models on consumer CPUs at 50+ tok/s using <700MB RAM. No heavy dependencies.
Peer-to-peer distributed AI inference using 1-bit quantized models. CPU-only, 70-82% energy savings, 103+ tokens/sec. Validated on Zen 4 & Zen 5 (+35% cross-gen improvement).
Distily: Language Model Distillation Toolkit and Library
Long-term project about a custom AI architecture. Consists of cutting-edge machine-learning techniques such as Flash Attention, Grouped-Query Attention, ZeRO-Infinity, BitNet, etc.
This is the repo for the MixKABRN Neural Network (Mixture of Kolmogorov-Arnold Bit Retentive Networks): an attempt at first adapting it for training on text, and later adjusting it for other modalities.
RSR-core: A High-Performance Engine for Low-Bit Matrix-Vector Multiplication
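Low-bit matrix-vector kernels like the ones these engines ship exploit a simple fact: when weights are constrained to {-1, 0, +1}, a matvec needs no multiplications at all, only additions and subtractions. A minimal Python sketch of the idea (illustrative only; the function name and layout are hypothetical, not any listed repo's API):

```python
def ternary_matvec(W, x):
    """Multiply a ternary weight matrix W (entries in {-1, 0, +1})
    by a dense vector x using only additions and subtractions."""
    y = []
    for row in W:
        acc = 0.0
        for w, xi in zip(row, x):
            if w == 1:        # +1 weight: add the activation
                acc += xi
            elif w == -1:     # -1 weight: subtract the activation
                acc -= xi
            # a 0 weight contributes nothing and is skipped entirely
        y.append(acc)
    return y

# Example: a 2x3 ternary matrix times a 3-vector
W = [[1, 0, -1],
     [-1, 1, 1]]
x = [2.0, 3.0, 4.0]
print(ternary_matvec(W, x))  # [-2.0, 5.0]
```

Production engines go further, packing ternary weights into a couple of bits each and vectorizing the add/subtract loop, but the arithmetic reduction above is the core of the speed and memory wins.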
Windows-native BitNet and ternary LLM inference with CPU GGUF, GPU runtime, terminal and browser chat, and release zips.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
The JuniorOmega SDK is a sovereign, local-first spatial engineering stack optimized specifically for Apple Silicon. It is designed to bridge the gap between high-density sensor ingestion (LiDAR/TrueDepth) and automated fabrication (G-code/CNC).
🚀 Hybrid RAG: Local Neo4j + BitNet.cpp RAG System and Azure SaaS deployment. Fast vector search, instant Docker deployment via GitHub Container Registry. Complete RAG pipeline with ultra-efficient LLMs for enterprise knowledge management.
BitNet-inspired 1-bit Quantized Transformer for efficient protein function prediction and biological sequence modeling on low-power devices.
BitNet-inspired quantization-aware training and model compiler for running neural networks efficiently on ESP32 devices.
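BitNet b1.58-style quantization, which several of the projects above build on, maps full-precision weights to {-1, 0, +1} by scaling with the mean absolute value (absmean), then rounding and clipping. A rough sketch under that assumption (not any listed repo's actual code):

```python
def absmean_ternary(weights, eps=1e-8):
    """Quantize float weights to {-1, 0, +1} (BitNet b1.58 style):
    divide by the absmean scale, round to the nearest integer,
    then clip to [-1, 1]."""
    gamma = sum(abs(w) for w in weights) / len(weights)  # absmean scale
    q = []
    for w in weights:
        v = round(w / (gamma + eps))        # round to nearest integer
        q.append(max(-1, min(1, v)))        # clip to the ternary range
    return q, gamma  # gamma is reused to rescale outputs at inference

w = [0.9, -0.05, -1.2, 0.4]
q, gamma = absmean_ternary(w)
print(q)  # [1, 0, -1, 1]
```

During quantization-aware training, the rounding step is typically paired with a straight-through estimator so gradients flow through to the latent full-precision weights.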
Modular AI system in which independently trained ternary specialists load on demand from disk, are routed by geometric classification, share a unified lattice knowledge base, and can autonomously grow new specialists, all running on a CPU laptop.