Skip to content
View yil384's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.
  • UCSD Picasso Lab
  • 11:19 (UTC -08:00)

Highlights

  • Pro

Block or report yil384

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 379 42 Updated Nov 20, 2025

一颗美丽的圣诞树,由Gemini 3 Pro Preview协作生成,支持手势、鼠标交互,可显示自定义图片及拍立得签名

TypeScript 26 14 Updated Dec 9, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,318 117 Updated Dec 11, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,795 2,898 Updated Dec 25, 2025

🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps )

Python 2,069 218 Updated Dec 23, 2025

SCoRe: Training Language Models to Self-Correct via Reinforcement Learning

Python 14 Updated Jan 24, 2025

JWT login microservice with plugable backends such as OAuth2, Google, Github, htpasswd, osiam, ..

Go 1,928 147 Updated Feb 27, 2021

Implement an advanced backend that efficiently manages JWT-based Access and Refresh Tokens, configures SMTP for sending activation and password reset emails, and enforces single user sessions by lo…

TypeScript 48 3 Updated May 18, 2024

General CNN_Accelerator design.卷积神经网络加速器设计。在PYNQ-Z2 FPGA开发板上实现了卷积池化全连接层等硬件加速计算。

VHDL 84 7 Updated Mar 6, 2025

Open-source implementation of AlphaEvolve

Python 4,971 764 Updated Dec 24, 2025