justinchiu-test

31 repositories

transformers
Public
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Python • Apache License 2.0 • 32k • 0 • 0 • 1 • Updated Sep 25, 2025
obfuscation-experiment
Public
Python • 0 • 0 • 0 • 0 • Updated Sep 19, 2025
comp-coding
Public
Python • 0 • 0 • 0 • 0 • Updated Sep 5, 2025
public_benchmarks_example
Public
Simple examples of how to run public benchmarks with Runloop
Python • 1 • 0 • 0 • 0 • Updated Aug 29, 2025
codex
Public
Lightweight coding agent that runs in your terminal
Rust • Apache License 2.0 • 6.9k • 0 • 0 • 0 • Updated Jul 20, 2025
gemini-cli
Public
An open-source AI agent that brings the power of Gemini directly into your terminal.
TypeScript • Apache License 2.0 • 10k • 0 • 0 • 0 • Updated Jul 20, 2025
librarybench
Public
Python • 0 • 2 • 0 • 0 • Updated Jul 13, 2025
SWE-agent
Public
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Python • MIT License • 1.9k • 0 • 0 • 0 • Updated Jul 13, 2025
litellm
Public
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Python • Other • 5.1k • 0 • 0 • 0 • Updated Jul 2, 2025
nano-diffusion-lm
Public
Python • 0 • 0 • 0 • 0 • Updated Jul 1, 2025
diff-viz
Public
JavaScript • 0 • 0 • 0 • 0 • Updated Jun 23, 2025
OpenHands
Public
🙌 OpenHands: Code Less, Make More
Python • MIT License • 8.1k • 0 • 0 • 0 • Updated Jun 4, 2025
codecontests-repo
Public
Shell • 0 • 0 • 0 • 1 • Updated May 14, 2025
lcb_utils
Public
Python • 0 • 0 • 0 • 0 • Updated May 8, 2025
arithmetic
Public
Python • 0 • 0 • 0 • 0 • Updated Feb 24, 2025
rl-fun
Public
0 • 0 • 0 • 0 • Updated Feb 23, 2025
SWE-bench
Public
[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world GitHub Issues?
Python • MIT License • 720 • 0 • 0 • 0 • Updated Feb 22, 2025
SWE-ReX
Public
Sandboxed code execution for AI agents, locally or on the cloud.
Python • MIT License • 92 • 0 • 0 • 0 • Updated Feb 20, 2025
debugger
Public
Python • 0 • 0 • 0 • 0 • Updated Feb 7, 2025
remote-swebench
Public
Python • 0 • 0 • 0 • 0 • Updated Jan 31, 2025
pragmatic-code-generation
Public
Python • 1 • 0 • 0 • 0 • Updated Dec 4, 2024
llm-self-training
Public
0 • 0 • 1 • 0 • Updated Dec 3, 2024
puzzles
Public
Python • 0 • 0 • 0 • 0 • Updated Nov 11, 2024
search
Public
Rust • 0 • 0 • 0 • 0 • Updated Nov 10, 2024
bcb-modal
Public
Run BigCodeBench on Modal
Python • 0 • 0 • 0 • 0 • Updated Oct 17, 2024
LLMDebugger
Public
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step
Python • Apache License 2.0 • 54 • 0 • 0 • 0 • Updated Oct 17, 2024
EAI4Math
Public
Python • 2 • 0 • 0 • 0 • Updated Oct 12, 2024
swebench-modal
Public
Shell • 0 • 0 • 0 • 0 • Updated Oct 8, 2024
commit0-analysis
Public
Python • 0 • 0 • 0 • 0 • Updated Oct 2, 2024
position-embeddings
Public
TeX • 0 • 0 • 0 • 0 • Updated Sep 3, 2024