davendw49

Starred repositories

Xiaomi Miloco

Python 1,939 127 Updated Dec 17, 2025

NVR with realtime local object detection for IP cameras

TypeScript 28,264 2,638 Updated Dec 21, 2025

Xiaomi Camera 360 hacks

27 3 Updated Sep 16, 2024

🏡 Open source home automation that puts local control and privacy first.

Python 83,327 36,232 Updated Dec 21, 2025

Boosting RAG on model and system performance with context reuse

Python 14 1 Updated Dec 19, 2025

AIOS: AI Agent Operating System

Python 4,872 643 Updated Nov 24, 2025

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

C++ 540 50 Updated Sep 13, 2025

An API conversion tool for popular external reinforcement learning environments

Python 197 24 Updated Dec 15, 2025

A curated list of recent progress and resources on Reinforcement Learning for AI Agents.

5 1 Updated Sep 11, 2025

Tool for data extraction and interacting with Lean programmatically.

Python 739 115 Updated Sep 13, 2025

Snapshot is an encoder-decoder transformer that learns to compress context into fixed memories, for more efficient long-context inference.

Python 1 Updated Sep 15, 2025

Yet Another Papers With Code

Python 36 1 Updated Sep 7, 2025

Using Unified Memory on Jetson

Jupyter Notebook 30 6 Updated Mar 21, 2022

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,446 1,996 Updated Nov 1, 2025

A flipped classroom series on understanding LLMs for non-CS/AI students

39 7 Updated Jun 5, 2025

Python 5 1 Updated Jul 25, 2025

A framework for fine-tuning retrieval-augmented generation (RAG) systems.

Python 137 26 Updated Dec 19, 2025

LLM training in simple, raw C/CUDA

Cuda 28,437 3,335 Updated Jun 26, 2025

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

Kotlin 14,555 1,226 Updated Dec 18, 2025

Python 251 47 Updated Dec 20, 2025

Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper

Python 790 51 Updated Aug 15, 2025

TransMLA: Multi-Head Latent Attention Is All You Need (NeurIPS 2025 Spotlight)

Python 418 25 Updated Sep 23, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,077 2,287 Updated Dec 25, 2024

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 5,078 814 Updated May 12, 2025

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

Jupyter Notebook 1,942 127 Updated Dec 18, 2025

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Python 599 75 Updated Sep 11, 2024

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,391 803 Updated Dec 21, 2025