Skip to content
View kibitzing's full-sized avatar

Block or report kibitzing

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for the paper "Fishing for Magikarp"

Python 191 16 Updated Jun 12, 2026

A conda-forge distribution.

Shell 9,900 511 Updated Jun 3, 2026

Fully open reproduction of DeepSeek-R1

Python 26,326 2,443 Updated Apr 2, 2026

Survey and paper list on efficiency-guided LLM agents (memory, tool learning, planning).

264 9 Updated Jun 15, 2026

😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond

355 13 Updated Jan 22, 2026

kill that logs papers to Google Sheets — just say "add this paper" or paste an arXiv URL.

JavaScript 2 Updated Mar 26, 2026

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 2,222 189 Updated Aug 26, 2025

Turn complex codebases into clear, navigable architecture diagrams with Claude Code.

TypeScript 1,397 123 Updated Apr 7, 2026

Teams-first Multi-agent orchestration for Claude Code

TypeScript 36,509 3,312 Updated Jun 16, 2026

iPhone/iPad version of lama cleaner

Swift 82 20 Updated May 28, 2026

Converted CoreML Model Zoo.

Swift 1,782 168 Updated Jun 6, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,384 1,051 Updated Jun 4, 2026

Implementation for FP8/INT8 Rollout for RL training without performence drop.

Python 303 23 Updated Nov 7, 2025
Python 2 Updated Mar 21, 2026

NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to fa…

Python 298 53 Updated Jun 16, 2026

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 601 32 Updated Mar 13, 2026

어르신계정 password: 1234 (가족 계정: family1/password123)

TypeScript 1 Updated Feb 28, 2026

Ongoing research training transformer models at scale

Python 16,720 4,086 Updated Jun 16, 2026

Mamba SSM architecture

Python 18,447 1,757 Updated Jun 15, 2026

43 tips for getting the most out of Claude Code, from basics to advanced - includes a custom status line script and Claude Code running itself in a container. Also includes the dx plugin.

HTML 8,804 671 Updated Jun 12, 2026

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

Python 2,928 317 Updated Jun 8, 2026

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 3,090 273 Updated May 26, 2026

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 6,546 379 Updated Jun 16, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 42,897 7,692 Updated Jun 16, 2026

Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)

Python 102 12 Updated Feb 20, 2025
Python 1 Updated Apr 3, 2026

Minimal Claude Code alternative. Single Python file, zero dependencies, ~250 lines.

Python 2,428 236 Updated Jan 14, 2026

<밑바닥부터 시작하는 딥러닝 4>

Jupyter Notebook 64 48 Updated Apr 21, 2025
Next