Skip to content
View wangray's full-sized avatar

Organizations

@TheMITTech @x64dbg @TechSecCTF

Block or report wangray

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Modern HTTP benchmarking tool

C 40,163 3,029 Updated Dec 30, 2023

A Prometheus exporter for Celery metrics

Jsonnet 540 125 Updated Mar 3, 2026

SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?

Python 327 48 Updated Mar 11, 2026

Unified high-performance Python client for object and file stores.

Python 62 14 Updated Mar 24, 2026

A Zsh theme

Shell 53,504 2,397 Updated Mar 14, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,423 966 Updated Mar 29, 2026

Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.

TypeScript 30,648 2,375 Updated Mar 4, 2026

NVIDIA NCCL Tests for Distributed Training

Shell 139 28 Updated Mar 26, 2026

Module, Model, and Tensor Serialization/Deserialization

Python 297 49 Updated Feb 6, 2026

🦉 Data Versioning and ML Experiments

Python 15,484 1,289 Updated Mar 27, 2026

A user-space file system for interacting with Google Cloud Storage

Go 2,217 485 Updated Mar 29, 2026

CTF Archives: Collection of CTF Challenges.

Python 1,406 190 Updated Mar 29, 2026

[NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation

Python 75 17 Updated Mar 23, 2026

Harbor is a framework for running agent evaluations and creating and using RL environments.

Python 1,153 822 Updated Mar 29, 2026

An extremely fast Python package and project manager, written in Rust.

Rust 82,242 2,872 Updated Mar 29, 2026
Kotlin 3 1 Updated Nov 20, 2025

Inspect: A framework for large language model evaluations

Python 1,856 447 Updated Mar 28, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,029 141 Updated Mar 28, 2026

A guidance language for controlling large language models.

Jupyter Notebook 21,360 1,155 Updated Mar 18, 2026

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 4,938 438 Updated Mar 28, 2026

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,874 2,040 Updated Mar 24, 2026

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python 606 115 Updated Mar 23, 2026

Parses cron schedules to iterate over datetime objects.

Python 525 120 Updated Mar 20, 2026

Fast and memory-efficient exact attention

Python 23,037 2,560 Updated Mar 28, 2026

pure golang library for reading/writing parquet file

Go 1,423 309 Updated Dec 10, 2025

Provider-agnostic, open-source evaluation infrastructure for language models

Python 753 99 Updated Mar 16, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,018 674 Updated Mar 29, 2026

"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files

Go 56,339 4,998 Updated Mar 28, 2026
Next