RManLuo

😀

LOL

Linhao Luo RManLuo

😀

LOL

Research Fellow at Monash University | AI, LLM, Graph

396 followers · 252 following

Monash University
Melbourne
https://rmanluo.github.io/
in/linhao-luo-36b489134
https://scholar.google.com.au/citations?user=RO46HpcAAAAJ&hl=zh-CN

Achievements

x2 x3

Achievements

x2 x3

Highlights

Lists (14)

Sort

Starred repositories

ai-in-pm / Titans---Learning-to-Memorize-at-Test-Time

Titans - Learning to Memorize at Test Time

Python 49 10 Updated Jan 16, 2025

lucidrains / titans-pytorch

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,737 171 Updated Dec 18, 2025

ZHZisZZ / dllm

dLLM: Simple Diffusion Language Modeling

Python 1,447 145 Updated Dec 19, 2025

FalkorDB / FalkorDB

A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).

C 2,609 196 Updated Dec 19, 2025

YangRui2015 / Generalizable-Reward-Model

Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"

Python 43 4 Updated Feb 20, 2025

LeapLabTHU / limit-of-RLVR

repo for paper https://arxiv.org/abs/2504.13837

Python 300 17 Updated Dec 17, 2025

sansan0 / TrendRadar

🎯 告别信息过载，AI 助你看懂新闻资讯热点，简单的舆情监控分析 - 多平台热点聚合+基于 MCP 的AI分析工具。监控35个平台（抖音、知乎、B站、华尔街见闻、财联社等），智能筛选+自动推送+AI对话分析（用自然语言深度挖掘新闻：趋势追踪、情感分析、相似检索等13种工具）。支持企业微信/个人微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 推送，1分钟手机通知，无需…

Python 39,691 20,726 Updated Dec 19, 2025

RubyMetric / chsrc

chsrc 全平台通用换源工具与框架. Change Source everywhere for every software

C 6,533 266 Updated Dec 18, 2025

zowiezhang / Amulet

Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"

Python 14 Updated Mar 18, 2025

Wizardcoast / Linear_Alignment

This repo is reproduction resources for linear alignment paper, still working

Python 17 2 Updated May 19, 2024

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,674 309 Updated Nov 13, 2025

QwenQKing / Prompt-R1

Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning

Python 36 2 Updated Dec 2, 2025

assafdori / bypass-mdm

Bypass MDM Setup for MacOS, up to MacOS Tahoe 26.1

Shell 1,065 275 Updated Sep 16, 2025

SamuelSchmidgall / AgentClinic

Agent benchmark for medical diagnosis

Python 266 47 Updated Dec 31, 2024

AstrBotDevs / AstrBot

✨ Agentic IM ChatBot Infrastructure — 聊天智能体基础设施 ✨ 多消息平台集成（QQ / Telegram / 企微 / 飞书 / 钉钉等），强大易用的插件系统，支持 OpenAI / Gemini / Anthropic / Dify / Coze / 阿里云百炼 / 知识库 / Agent 智能体

Python 14,291 1,110 Updated Dec 19, 2025

HUST-AI-HYZ / MemoryAgentBench

Open source code for Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

Python 183 27 Updated Nov 29, 2025

Mirix-AI / MIRIX

Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…

Python 3,553 331 Updated Dec 18, 2025

ZHZisZZ / modpo

[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization

Python 93 7 Updated Aug 20, 2024

srzer / MOD

Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".

Python 29 4 Updated Oct 30, 2024

ElliottYan / LUFFY

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 393 48 Updated Oct 4, 2025

jiehua1995 / hexo-theme-researcher

A modern, responsive, and professional academic portfolio theme for researchers, built with Tailwind CSS, and DaisyUI.

EJS 25 7 Updated Nov 9, 2025

HugoBlox / hugo-blox-builder

⚡ HugoBlox: Markdown sites in minutes. Academic/resume/lab/portfolio for AI researchers & startups. Premium templates. Deploy to GitHub Pages now in 1-click 👇

HTML 9,257 2,959 Updated Dec 18, 2025

BytedTsinghua-SIA / MemAgent

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 833 59 Updated Jul 31, 2025

ytgui / Search-R3

Reasoning-Reinforced Representation for Search

12 Updated Oct 9, 2025

google-deepmind / meltingpot

A suite of test scenarios for multi-agent reinforcement learning.

Python 771 149 Updated Dec 17, 2025

TencentCloudADP / youtu-embedding

Youtu-Embedding is an industry-leading, general-purpose text representation model developed by Tencent Youtu Lab.

Python 161 15 Updated Nov 14, 2025

snap-stanford / Biomni

Biomni: a general-purpose biomedical AI agent

Python 2,395 403 Updated Dec 15, 2025

TencentCloudADP / youtu-graphrag

Youtu-GraphRAG boosts cost efficiency, inference accuracy, and cross-domain adaptability, pushing the boundaries of performance in complex QA.

Python 977 136 Updated Oct 30, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,293 114 Updated Dec 11, 2025

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,048 643 Updated Dec 19, 2025