Skip to content
View wise-east's full-sized avatar
🔭
Focusing
🔭
Focusing

Highlights

  • Pro

Block or report wise-east

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open-source implementation of AlphaEvolve

Python 4,965 764 Updated Dec 24, 2025

Framework and toolkits for building and evaluating collaborative agents that can work together with humans.

Python 114 17 Updated Dec 4, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,765 2,888 Updated Dec 24, 2025

The best ChatGPT that $100 can buy.

Python 39,202 4,964 Updated Dec 23, 2025

⚽️ Extract, prepare and publish Transfermarkt datasets.

Python 327 77 Updated Jun 14, 2025

(ICML'25 Outstanding) CollabLLM: From Passive Responders to Active Collaborators

Jupyter Notebook 268 28 Updated Sep 25, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,208 31,531 Updated Dec 24, 2025

Solve puzzles. Learn CUDA.

Jupyter Notebook 11,844 910 Updated Sep 1, 2024

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,854 348 Updated Jul 15, 2024

The AI Code Editor

31,914 2,156 Updated Nov 19, 2025

Which questions improve learning most? Utility Estimation of Questions with LLM-based Simulations

Python 3 1 Updated Oct 28, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

9,759 707 Updated Nov 7, 2025

Official Repo for MIME benchmark from ACL 2025 paper "Can Vision Language Models Understand Mimed Actions?"

Python 2 Updated Oct 20, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,216 51 Updated Nov 16, 2024

Mangrove is the backend module of Estuary, a framework for building multimodal real-time Socially Intelligent Agents (SIAs).

Python 13 2 Updated Jul 11, 2025

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 220 17 Updated Jun 13, 2025

LLM101n: Let's build a Storyteller

35,946 1,962 Updated Aug 1, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,227 104 Updated May 8, 2024

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Python 1,083 55 Updated Feb 2, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,432 7,814 Updated Dec 24, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,650 840 Updated Dec 18, 2025

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 934 69 Updated Feb 16, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,853 4,112 Updated Dec 23, 2025
JavaScript 3,836 1,666 Updated Jun 21, 2024

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 4,283 884 Updated Sep 4, 2025
Next