Skip to content
View kholam's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report kholam

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation of papers in 100 lines of code.

Python 2,814 254 Updated Apr 8, 2026

Contexts Optical Compression

Python 23,321 2,153 Updated Jan 27, 2026

Detail code implementation and experimental setting for our paper: Federated Learning on Multilabel Evolving Data Streams

Python 1 Updated Oct 22, 2025

What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?

TypeScript 23,480 1,797 Updated Sep 21, 2025

AIs for nature

Rust 215 7 Updated Jun 23, 2026

Microsoft AI for Good Lab — Biodiversity research hub. Open-source AI models, edge devices, and tools for biodiversity monitoring and conservation. Your source for MegaDetector, SPARROW, PytorchWil…

Python 1,016 293 Updated Jun 4, 2026

Self-hosted AI coding assistant

Rust 33,642 1,756 Updated Mar 2, 2026

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

14,115 797 Updated Jul 30, 2025

An open-source framework for machine learning and other computations on decentralized data.

Python 2,440 604 Updated Jun 23, 2026

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,940 308 Updated Jan 16, 2024

DeepSeek LLM: Let there be answers

Makefile 7,095 1,100 Updated Feb 4, 2024

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,710 1,063 Updated Apr 30, 2026

Analyze computation-communication overlap in V3/R1.

1,170 148 Updated Mar 21, 2025

A lightweight data processing framework built on DuckDB and 3FS.

Python 4,961 445 Updated Mar 5, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,405 1,060 Updated Jun 23, 2026

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,967 327 Updated Jan 14, 2026

An elegant PyTorch deep reinforcement learning library.

Python 10,813 1,320 Updated Apr 3, 2026

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,985 1,055 Updated May 7, 2026

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,306 1,815 Updated Feb 26, 2025

Integrate the DeepSeek API into popular software

37,968 4,163 Updated Feb 23, 2026

s1: Simple test-time scaling

Python 6,656 757 Updated Jun 25, 2025

This repository contains the source code for the Saving 77% of the Parameters in Large Language Models Technical Report

Jupyter Notebook 58 111 Updated Dec 2, 2025

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,449 87 Updated Apr 30, 2026

Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)

TypeScript 5,190 454 Updated May 1, 2026
1 Updated Feb 14, 2025

This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]

Python 738 57 Updated Apr 15, 2026

Machine Learning Engineering Open Book

Python 18,160 1,152 Updated May 18, 2026
Next