Skip to content
View lwaekfjlk's full-sized avatar
🙇‍♂️
Attention is all I need
🙇‍♂️
Attention is all I need

Highlights

  • Pro

Organizations

@web-arena-x @sotopia-lab @WebPixie @consciousness-lab @ulab-uiuc @hemm-lab

Block or report lwaekfjlk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Solutions of Reinforcement Learning, An Introduction

Jupyter Notebook 2,411 511 Updated Jul 10, 2025

The first continuous diffusion language model that rivals discrete counterparts on standard language modeling benchmarks like LM1B and OpenWebText.

Python 68 2 Updated May 5, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 124,482 20,491 Updated May 18, 2026

Coinjure: A Trading Agent Harness for Prediction Markets

Python 32 2 Updated May 5, 2026

Create, Evaluate, and Connect AI Skills

Python 749 73 Updated May 3, 2026

My Python scripts to make high-quality figures for publications in top AI conferences and journals.

Python 1,972 132 Updated May 11, 2026
Python 5 Updated Apr 22, 2026
Python 3 Updated Jan 31, 2026

LLMRouter: An Open-Source Library for LLM Routing

Python 1,825 172 Updated May 13, 2026

OpenTinker is an RL-as-a-Service infrastructure for foundation models

Python 675 63 Updated Mar 21, 2026
Python 31 4 Updated Jul 24, 2025

A dynamic forecasting benchmark for LLMs

Python 63 10 Updated May 15, 2026
Python 165 16 Updated May 18, 2026

Official implementation of "Continuous Autoregressive Language Models"

Python 807 92 Updated May 7, 2026

[EMNLP 2025] DiagramEval: Evaluating LLM-Generated Diagrams via Graphs

Python 17 Updated Nov 1, 2025

[ICLR 2026] LightMem: Lightweight and Efficient Memory-Augmented Generation

Python 846 78 Updated May 4, 2026

A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning

Python 82 11 Updated Jan 16, 2026

This is the github repo for our LM4SCI@COLM25 paper "The Ram\'{o}n Llull's Thinking Machine for Automated Ideation".

Jupyter Notebook 8 Updated Aug 28, 2025

🌎💪 BrowserGym, a Gym environment for web task automation

Python 1,223 170 Updated Mar 17, 2026

A lightweight platform for realistic social simulations

Python 14 2 Updated Oct 24, 2025

Live evaluation of trading agents

Python 139 14 Updated Feb 17, 2026

Materials for course on language model programming (LMProgramming)

13 1 Updated Sep 14, 2025

Sotopia-RL: Reward Design for Social Intelligence

Python 50 9 Updated Apr 1, 2026

Course resources for the ESSLLI 2024 class on language model programming

9 3 Updated Feb 5, 2025

Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation) using a novel three-stage RL curriculum. Includes the Time-…

Python 72 3 Updated Jun 11, 2025

Tutorial for UIUC NCSA Cluster

Shell 7 Updated Apr 11, 2026

[EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents

Python 135 16 Updated Mar 4, 2026

[ACL 2025 Main] A Python package that evaluates Large Language Models (LLMs) through a novel tree-based approach

Python 4 Updated Jan 5, 2026
Next