jbarnes850

🎯

Focusing

Jarrod Barnes jbarnes850

🎯

Focusing

43 followers · 168 following

@Arc-Computer
New York, New York

Achievements

x4 x3

Achievements

x4 x3

Highlights

sglang Public
Forked from sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python Apache License 2.0 Updated Apr 29, 2026
synthetic-pretrain Public

Code release for a small-scale adaptation of self-improving pretraining, interleaved thinking SFT, RLMT, and causal thought-use probes on Qwen3-0.6B.

Python Other Updated Apr 28, 2026
metacognition Public

Measuring whether language models know when to change their mind. d-prime for belief revision across Qwen3.5 0.8B-9B.

Python Updated Apr 20, 2026
autoagent Public
Forked from kevinrgu/autoagent

autonomous harness engineering

Python Updated Apr 3, 2026
surprisal Public

Open-ended scientific discovery via Bayesian surprise. MCTS + Claude/Codex agents + Docker sandboxes.

Python 1 Updated Mar 30, 2026
ai-scientist-training Public

Synthetic Prime/verifiers environment for epistemic experiment selection and belief revision

Python Updated Mar 22, 2026
cohere-aya-analysis Public

Mechanistic interpretability analysis of Cohere Aya Expanse 8B

multilingual-nlp

Python 1 MIT License Updated Feb 23, 2026
test-time-training Public

Code for "Surprisal-Guided Selection: Compute-Optimal Test-Time Strategies for Execution-Grounded Code Generation"

reinforcement-learning test-time-adaptation

Python 1 Other Updated Feb 21, 2026
opensec-env Public

Dual-control RL environment for incident response training with adversarial evidence, OpenEnv-compatible, plus evaluation tooling and datasets.

reinforcement-learning environments ai-security

Python 1 1 Updated Feb 19, 2026
Slime-RLVE Public

Verifiable math/logic environments for slime RL training

Python 2 Apache License 2.0 Updated Jan 16, 2026
Tau2-RL-Pipeline Public

Multi-turn tool-use training pipeline for tau2-bench using slime

Python 5 Apache License 2.0 Updated Jan 16, 2026
slime Public
Forked from THUDM/slime

slime is an LLM post-training framework for RL Scaling.

Python 1 Apache License 2.0 Updated Jan 15, 2026
jbarnes850 Public

Personal bio

MIT License Updated Jan 13, 2026
verl Public
Forked from verl-project/verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 2 Apache License 2.0 Updated Jul 16, 2025
verifiers Public
Forked from PrimeIntellect-ai/verifiers

Verifiers for LLM Reinforcement Learning

Python MIT License Updated Jul 16, 2025
mlx-lm-lora Public
Forked from Goekdeniz-Guelmez/mlx-lm-lora

Train Large Language Models on MLX.

Python Apache License 2.0 Updated Jul 7, 2025
acp-evals Public

ACP Evals is an evaluation framework for multi-agent systems built on the Agent Communication Protocol.

Python 2 Apache License 2.0 Updated Jun 22, 2025
ambient Public

[Open AI Hackathon] A meta-agent wellness platform that auto-generates personalized AI agents in one click

Python 2 Updated Jun 5, 2025
atropos Public
Forked from NousResearch/atropos

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 1 MIT License Updated May 19, 2025
github-webhook-analyzer Public

A multi-agent system that automatically analyzes GitHub repository events (PRs, Issues, Comments) using Orchestra's AI framework.

Python Updated Mar 29, 2025
deephermes-mlx Public

A Python implementation for DeepHermes models on Apple Silicon

machine-learning ai mlx

Python 3 MIT License Updated Mar 20, 2025
cortex-1 Public

NEAR Cortex-1 is a specialized AI model that can reason, understand, and predict crypto market movements by learning from cross-chain data.

Python 3 1 MIT License Updated Mar 5, 2025
shade-agent-template Public
Forked from NearDeFi/shade-agent-twitter

JavaScript Other Updated Mar 2, 2025
docs Public
Forked from defuse-protocol/gitbook-docs

near intents docs

Updated Feb 5, 2025
near-intents-example Public
Forked from referencedev/test-intent

Python 2 MIT License Updated Feb 2, 2025
deepseek-r1-finetune Public

A step by step guide to fine-tuning the DeepSeek R1 Distilled models on Apple Silicon machines.

Python 59 9 MIT License Updated Feb 1, 2025
Agent-R Public
Forked from ByteDance-Seed/Agent-R

Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"

Python Apache License 2.0 Updated Jan 25, 2025
mflux Public
Forked from filipstrand/mflux

A MLX port of FLUX based on the Huggingface Diffusers implementation.

Python MIT License Updated Jan 5, 2025
HunyuanVideo_MLX Public
Forked from gaurav-nelson/HunyuanVideo_MLX

HunyuanVideo ported to run on Native Apple Hardware (requires M1 or better CPU)

Python Other Updated Jan 5, 2025
mlx-omni-server Public
Forked from madroidmaq/mlx-omni-server

MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seaml…

Python Updated Jan 5, 2025

Jarrod Barnes jbarnes850

Achievements

Achievements

Highlights

sglang Public

Uh oh!

synthetic-pretrain Public

Uh oh!

metacognition Public

Uh oh!

autoagent Public

Uh oh!

surprisal Public

Uh oh!

ai-scientist-training Public

Uh oh!

cohere-aya-analysis Public

Uh oh!

test-time-training Public

Uh oh!

opensec-env Public

Uh oh!

Slime-RLVE Public

Uh oh!

Tau2-RL-Pipeline Public

Uh oh!

slime Public

Uh oh!

jbarnes850 Public

Uh oh!

verl Public

Uh oh!

verifiers Public

Uh oh!

mlx-lm-lora Public

Uh oh!

acp-evals Public

Uh oh!

ambient Public

Uh oh!

atropos Public

Uh oh!

github-webhook-analyzer Public

Uh oh!

deephermes-mlx Public

Uh oh!

cortex-1 Public

Uh oh!

shade-agent-template Public

Uh oh!

docs Public

Uh oh!

near-intents-example Public

Uh oh!

deepseek-r1-finetune Public

Uh oh!

Agent-R Public

Uh oh!

mflux Public

Uh oh!

HunyuanVideo_MLX Public

Uh oh!

mlx-omni-server Public

Uh oh!