jbarnes850
953 results for source starred repositories

Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrastructure for AI development at scale.

Python 149 31 Updated Feb 4, 2026
Python 1 Updated Dec 10, 2025

Self-learning data agent that grounds its answers in 6 layers of context. Inspired by OpenAI's in-house implementation.

Python 1,104 97 Updated Feb 1, 2026

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.

Python 15,244 1,059 Updated Feb 3, 2026

Generic building-block toolbox for training neural networks with adaptive and recursive execution. It provides reusable components to control iteration, stopping, and unrolling during training, ena…

Python 23 Updated Feb 4, 2026

PaperBanana: Automating Academic Illustration For AI Scientists

JavaScript 989 43 Updated Feb 2, 2026

From Word to World: Can Large Language Models be Implicit Text-based World Models?

Jupyter Notebook 40 5 Updated Dec 25, 2025

context-efficient terminal agent

Python 40 4 Updated Feb 1, 2026

Staging area for a public release of Theorizer

HTML 128 13 Updated Jan 27, 2026

Bayes-Adaptive RL for LLM Reasoning

Python 45 9 Updated May 28, 2025

build and benchmark deep research

Python 227 24 Updated Jan 29, 2026

A tool to use the Ai2 Open Coding Agents Soft-Verified Efficient Repository Agents (SERA) model with Claude Code

Python 201 18 Updated Feb 2, 2026
Python 54 6 Updated Jan 28, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 162,514 25,498 Updated Feb 4, 2026

wooyun-legacy skill for claude code

1,157 269 Updated Feb 2, 2026
Python 4 Updated Dec 8, 2025

A framework for testing and evaluating AI agents across various task domains, designed for misalignment interpretability research.

Python 4 1 Updated Feb 2, 2026

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

Python 13,183 957 Updated Jan 25, 2026

Dual-control RL environment for incident response training with adversarial evidence, OpenEnv-compatible, plus evaluation tooling and datasets.

Python 1 1 Updated Feb 1, 2026

A Python library for LLM-based evaluation using weighted rubrics.

Python 47 3 Updated Feb 3, 2026

Run, deploy and monitor CLI agents in secure cloud sandboxes.

Python 35 1 Updated Feb 3, 2026

Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline

Python 830 71 Updated Feb 3, 2026

Inspect: A framework for large language model evaluations

Python 1,720 387 Updated Feb 4, 2026

CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on real-world vulnerability analysis tasks.

Python 108 22 Updated Jan 13, 2026

Harness for running and evaluating AI agents against RL environments

Python 92 5 Updated Jan 24, 2026

Anthropic's original performance take-home, now open for you to try!

Python 3,307 721 Updated Jan 22, 2026

Convert GitHub PRs into Harbor tasks

Python 42 6 Updated Feb 2, 2026

This is an AI implementation (not official) of the DreamGym framework from the paper "Scaling Agent Learning via Experience Synthesis" (arXiv:2511.03773).

Python 35 2 Updated Nov 9, 2025