jbarnes850

🎯

Focusing

Jarrod Barnes jbarnes850

🎯

Focusing

43 followers · 152 following

@Arc-Computer
New York, New York

Achievements

x3 x3

Achievements

x3 x3

Highlights

Lists (1)

Sort

✨ Inspiration

4 repositories

Stars

953 results for source starred repositories

Clear filter

PrimeIntellect-ai / prime

Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrastructure for AI development at scale.

Python 149 31 Updated Feb 4, 2026

abirharrasse / MultilingualCLTs

Python 1 Updated Dec 10, 2025

agno-agi / dash

Self-learning data agent that grounds its answers in 6 layers of context. Inspired by OpenAI's in-house implementation.

Python 1,104 97 Updated Feb 1, 2026

QwenLM / Qwen3-Coder

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.

Python 15,244 1,059 Updated Feb 3, 2026

windows7lover / DTE-DynamicTrainingEngine

Generic building-block toolbox for training neural networks with adaptive and recursive execution. It provides reusable components to control iteration, stopping, and unrolling during training, ena…

Python 23 Updated Feb 4, 2026

dwzhu-pku / PaperBanana

PaperBanana: Automating Academic Illustration For AI Scientists

JavaScript 989 43 Updated Feb 2, 2026

X1AOX1A / Word2World

From Word to World: Can Large Language Models be Implicit Text-based World Models?

Jupyter Notebook 40 5 Updated Dec 25, 2025

modaic-ai / microcode

context-efficient terminal agent

Python 40 4 Updated Feb 1, 2026

allenai / asta-theorizer

Staging area for a public release of Theorizer

HTML 128 13 Updated Jan 27, 2026

shenao-zhang / BARL

Bayes-Adaptive RL for LLM Reasoning

Python 45 9 Updated May 28, 2025

guestrin-lab / deepscholar

build and benchmark deep research

Python 227 24 Updated Jan 29, 2026

allenai / sera-cli

A tool to use the Ai2 Open Coding Agents Soft-Verified Efficient Repository Agents (SERA) model with Claude Code

Python 201 18 Updated Feb 2, 2026

kanishkg / endless-terminals

Python 54 6 Updated Jan 28, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 162,514 25,498 Updated Feb 4, 2026

test-time-training / discover

Python 382 43 Updated Jan 29, 2026

tanweai / wooyun-legacy

wooyun-legacy skill for claude code

1,157 269 Updated Feb 2, 2026

GabrieleGiudic / BARD

Python 4 Updated Dec 8, 2025

gkroiz / agent-interp-envs

A framework for testing and evaluating AI agents across various task domains, designed for misalignment interpretability research.

Python 4 1 Updated Feb 2, 2026

VectifyAI / PageIndex

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

Python 13,183 957 Updated Jan 25, 2026

jbarnes850 / opensec-env

Dual-control RL environment for incident response training with adversarial evidence, OpenEnv-compatible, plus evaluation tooling and datasets.

Python 1 1 Updated Feb 1, 2026

The-LLM-Data-Company / rubric

A Python library for LLM-based evaluation using weighted rubrics.

Python 47 3 Updated Feb 3, 2026

evolving-machines-lab / evolve

Run, deploy and monitor CLI agents in secure cloud sandboxes.

Python 35 1 Updated Feb 3, 2026

always-further / deepfabric

Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline

Python 830 71 Updated Feb 3, 2026

usnistgov / caisi-cyber-evals

Python 11 6 Updated Jan 6, 2026

UKGovernmentBEIS / inspect_ai

Inspect: A framework for large language model evaluations

Python 1,720 387 Updated Feb 4, 2026

sunblaze-ucb / cybergym

CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on real-world vulnerability analysis tasks.

Python 108 22 Updated Jan 13, 2026

Mercor-Intelligence / archipelago

Harness for running and evaluating AI agents against RL environments

Python 92 5 Updated Jan 24, 2026

anthropics / original_performance_takehome

Anthropic's original performance take-home, now open for you to try!

Python 3,307 721 Updated Jan 22, 2026

abundant-ai / SWE-gen

Convert GitHub PRs into Harbor tasks

Python 42 6 Updated Feb 2, 2026

Pi3AI / DreamGym

This is AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).

Python 35 2 Updated Nov 9, 2025