kashif
  • Berlin, Germany
  • 00:28 (UTC +02:00)
  • X @krasul

Highlights

  • Pro


An open-weights time-series forecasting foundation model from The Forecasting Company.

Python 4 Updated Jun 14, 2026
Jupyter Notebook 115 11 Updated Jun 11, 2026

An LLM post-training framework with vLLM for RL Scaling

Python 241 17 Updated Jun 14, 2026

Model export recipes, Python primitives, and Swift runtime utilities for on-device AI

Swift 902 68 Updated Jun 14, 2026

Triton kernels for dynamic causal short convolutions.

Python 21 1 Updated Jun 4, 2026

Context Parallelism utilities for Training Language Models

Python 1 Updated Jun 8, 2026

Native macOS semantic search over your local files - text, images, audio, video in one vector space, on-device on Apple silicon.

Swift 167 12 Updated Jun 14, 2026

AutoScientists: Self-Organizing Agent Teams for Long-Running Scientific Experimentation

Python 629 98 Updated May 28, 2026

CRANE: Cluster-Reactive Adaptive News Ensemble — A CPU-native sentiment engine that reads news, predicts markets, and adapts to regime shifts without a GPU.

TeX 2 Updated Jun 5, 2026

torch_remat fine-grained activation checkpointing API

Python 12 Updated Jun 8, 2026
Python 11 4 Updated Jun 11, 2026

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,427 156 Updated Jun 14, 2026

Open ABI and FFI for Machine Learning Systems

C++ 412 80 Updated Jun 13, 2026

GEMMs with Metal

Python 21 5 Updated Jun 13, 2026

Convert any Repo into an RL Environment

Python 325 47 Updated Jun 8, 2026

Code for Retrofitting Large Language Models with Dynamic Tokenization.

Python 13 4 Updated Jul 22, 2025
Python 2 Updated May 27, 2026

DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation

Python 224 21 Updated Feb 18, 2026

mKernel: fast multi-node, multi-GPU fused kernels

Cuda 231 22 Updated Jun 8, 2026

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python 515 104 Updated Jun 13, 2026

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 888 253 Updated Jun 14, 2026

26M function-calling model that runs on incredibly small devices

Python 2,605 176 Updated May 16, 2026

Open-source framework for the research and development of foundation models.

Python 1,104 131 Updated Jun 14, 2026

Personal dev tool that exposes a local shell and filesystem to any Model Context Protocol client. Built on the official TypeScript SDK, with a custom WebSocket server transport.

JavaScript 2 Updated Apr 24, 2026

The home of Carbon Genomic Foundation Model 🧬

Python 195 27 Updated Jun 6, 2026

🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships ML models

Python 10,451 1,108 Updated Jun 14, 2026

Simple & Scalable Pretraining for Neural Architecture Research

Python 333 34 Updated Mar 31, 2026