- Moncton, NB
- in/chris-macleod-44272853
Stars
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)
Aidan Bench attempts to measure <big_model_smell> in LLMs.
Data related to mind uploading project via prompt
This is a TypeScript-based MCP server that implements a simple loom and makes it available for Claude to use.
Machine Learning Engineering Open Book
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
Blazingly 🔥 fast 🚀 memory vulnerabilities, written in 100% safe Rust. 🦀
NVIDIA Linux open GPU with P2P support
Lightweight, standalone C++ inference engine for Google's Gemma models.
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Extremely simple implementation of path patching (aka causal scrubbing) in PyTorch.
Stanford NLP Python library for understanding and improving PyTorch models via interventions
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.
Named tensors with first-class dimensions for PyTorch
Must-read Papers on Textual Adversarial Attack and Defense
sanjeevanahilan / nanoChatGPT
Forked from karpathy/nanoGPT
A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
Hackable and optimized Transformers building blocks, supporting a composable construction.