Stars
neural networks don't minimize loss [caution: probably due to batchnorm]
NanoGPT speedrun in JAX. Originally at https://nor-git.pages.dev/modded-nanogpt-jax/
RWKV / RWKV-LM
Forked from BlinkDL/RWKV-LM. RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it combines the best of RNNs and transformers - great performance, fast inference,…
BlinkDL / nanoRWKV
Forked from karpathy/nanoGPT. RWKV in nanoGPT style
Upweighting Easy Samples in Fine-Tuning Mitigates Forgetting
A visual playground for agentic workflows: Iterate over your agents 10x faster
This is the official repository for Inheritune.
Implementation and evaluation of the "Scaling Embedding Layers in Language Models" research paper
Code for Bolmo: Byteifying the Next Generation of Language Models
wolfecameron / nanoMoE
Forked from karpathy/nanoGPT. An extension of the nanoGPT repository for training small MoE models.
From-scratch implementation of a sparse mixture-of-experts language model inspired by Andrej Karpathy's makemore :)
Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)
Official implementation for Text Generation Beyond Discrete Token Sampling
📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
[CVPR2025] Official Implementation of ViStream: Improving Computation Efficiency of Visual Streaming Perception via Law-of-Charge-Conservation Inspired Spiking Neural Network
Data and code for the paper "Quantum Transformer: Accelerating model inference via quantum linear algebra"
The original code for the paper "Towards a Holistic Framework for Multimodal LLM in 3D Brain CT Radiology Report Generation"
Official repository of SpikeZIP-TF (ICML 2024)
NdLinear by Ensemble is a drop-in PyTorch module that shrinks your models with no accuracy loss. It powers the Ensemble Platform—upload any model and get back a smaller, faster version, ready to de…
An experiment that applies Google Research's `ReasoningBank` technique to Small Language Models. This experiment hopes to show that the same gains from the ReasoningBank paper also apply to much …