-
F9 Research
- Harpenden, UK
- https://ljubomirj.github.io/
- @ljupc0
- @ljupco.bsky.social
- in/ljubomirjosifovski
Stars
PlunderStruck / opencode
Forked from anomalyco/opencodeA fork of OpenCode for local AI models.
ljubomirj / ds4
Forked from antirez/ds4DeepSeek 4 Flash local inference engine for Metal
Pointer-so / OSWorld
Forked from xlang-ai/OSWorld[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
LLAMA Turboquant implementation with CUDA support
Swival / ds4-m5
Forked from antirez/ds4DeepSeek 4 Flash local inference engine for Metal and CUDA with M5 optimizations.
AtomicBot-ai / Atomic-Chat
Forked from janhq/janLocal AI app and inference engine for agents. Run open-weight LLMs locally — private, 100% offline on your computer.
llama.cpp fork with TurboQuant WHT-rotated KV cache & weight compression + Gemma 4 MTP and Qwen 3.6 NextN speculative decoding (+30-50% throughput).
PrismML-Eng / llama.cpp
Forked from ggml-org/llama.cppLLM inference in C/C++
turbo-tan / llama.cpp-tq3
Forked from ggml-org/llama.cppllama.cpp fork with TQ3_1S/4S CUDA kernels — 3.5-bit WHT quantization achieving Q4s quality at 10% smaller size. Based on RaBitQ-inspired Walsh-Hadamard transform. Enables 27B models on 16GB GPUs w…
ivanfioravanti / mlx-openbench
Forked from groq/openbenchProvider-agnostic, open-source evaluation infrastructure for language models
AI agents running research on single-GPU nanochat training automatically
miolini / autoresearch-macos
Forked from karpathy/autoresearchAI agents running research on single-GPU nanochat training automatically adopted for MacOS
ncdrone / autoresearch-ANE
Forked from karpathy/autoresearchLLM training on Apple's Neural Engine — native Obj-C, private APIs, zero GPU. Dynamic weight pipeline for training without kernel recompilation.
ljubomirj / sglang
Forked from sgl-project/sglangSGLang is a high-performance serving framework for large language models and multimodal models.
ljubomirj / torchmd-net
Forked from torchmd/torchmd-netTraining neural network potentials
lee101 / codex-infinity
Forked from openai/codexinfinite coding agent
just-every / code
Forked from openai/codexEvery Code - push frontier AI to it limits. A fork of the Codex CLI with validation, automation, browser integration, multi-agents, theming, and much more. Orchestrate agents from OpenAI, Claude, G…
The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!
LongDangHoang / HRM_RL_Agent
Forked from sapientinc/HRMHRM Agent repo
shangshang-wang / Tora
Forked from meta-pytorch/torchtuneTora: Torchtune-LoRA for RL
shisa-ai / OpenRLHF
Forked from OpenRLHF/OpenRLHFAn Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
REverse-Engineered Reasoning for Open-Ended Generation
A curated list of awesome platforms, tools, practices and resources that helps run LLMs locally
The official github repo for "Diffusion Language Models are Super Data Learners".
Goekdeniz-Guelmez / mlx-lm
Forked from ml-explore/mlx-lmLLM text generation and fine-tuning with MLX
Codys12 / transformers
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
sail-sg / SkyLadder
Forked from jzhang38/TinyLlamaThe official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling