-
Monash University
- Melbourne
- https://rmanluo.github.io/
- in/linhao-luo-36b489134
- https://scholar.google.com.au/citations?user=RO46HpcAAAAJ&hl=zh-CN
Highlights
- Pro
Lists (14)
Sort Name ascending (A-Z)
Starred repositories
Titans - Learning to Memorize at Test Time
Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch
A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).
Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"
repo for paper https://arxiv.org/abs/2504.13837
🎯 告别信息过载,AI 助你看懂新闻资讯热点,简单的舆情监控分析 - 多平台热点聚合+基于 MCP 的AI分析工具。监控35个平台(抖音、知乎、B站、华尔街见闻、财联社等),智能筛选+自动推送+AI对话分析(用自然语言深度挖掘新闻:趋势追踪、情感分析、相似检索等13种工具)。支持企业微信/个人微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 推送,1分钟手机通知,无需…
chsrc 全平台通用换源工具与框架. Change Source everywhere for every software
Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"
This repo is reproduction resources for linear alignment paper, still working
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning
Bypass MDM Setup for MacOS, up to MacOS Tahoe 26.1
Agent benchmark for medical diagnosis
✨ Agentic IM ChatBot Infrastructure — 聊天智能体基础设施 ✨ 多消息平台集成(QQ / Telegram / 企微 / 飞书 / 钉钉等),强大易用的插件系统,支持 OpenAI / Gemini / Anthropic / Dify / Coze / 阿里云百炼 / 知识库 / Agent 智能体
Open source code for Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions
Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".
Official Repository of "Learning to Reason under Off-Policy Guidance"
A modern, responsive, and professional academic portfolio theme for researchers, built with Tailwind CSS, and DaisyUI.
⚡ HugoBlox: Markdown sites in minutes. Academic/resume/lab/portfolio for AI researchers & startups. Premium templates. Deploy to GitHub Pages now in 1-click 👇
A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
A suite of test scenarios for multi-agent reinforcement learning.
Youtu-Embedding is an industry-leading, general-purpose text representation model developed by Tencent Youtu Lab.
Biomni: a general-purpose biomedical AI agent
Youtu-GraphRAG boosts cost efficiency, inference accuracy, and cross-domain adaptability, pushing the boundaries of performance in complex QA.
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!