-
CUHK
- Hong Kong
-
22:55
(UTC +08:00) - https://zfkarl.github.io
Stars
Official Repository of Orchestra-o1: Omnimodal Agent Orchestration
PARL (Parallel-Agent Reinforcement Learning) is a training paradigm that teaches models to decompose complex tasks into parallel subtasks and coordinate multiple agents simultaneously.
Open-source SERP API for AI, SEO & automation - Google, Yandex, Baidu, Bing, DuckDuckGo, Ecosia 🎉
🚀 Pre-process, annotate, evaluate, and train your Affect Computing (e.g., Multimodal Emotion Recognition, Sentiment Analysis) datasets ALL within MER-Factory! (LangGraph Based Agent Workflow)
Official repository for the paper “Rethinking Facial Expression Recognition in the Era of Multimodal Large Language Models”
[IJCAI 2025 (Oral)] Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models".
A curated collection of papers, datasets, and resources on Scientific Datasets and Large Language Models (LLMs)
Official repository for the paper “MME-Emotion: A Holistic Evaluation Benchmark for Emotional Intelligence in Multimodal Large Language Models”
📄 适合中文的简历模板收集(LaTeX,HTML/JS and so on)由 @hoochanlon 维护
The Paper Collection of Inductive Reasoning from 2015 to 2025
[ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision
A collection of awesome bio-foundation models, including protein, RNA, DNA, gene, single-cell, and so on.
[ACL 2025] Can We Trust AI Doctors? A Survey of Medical Hallucination in Large Language and Large Vision-Language Models
CellVerse: Do Large Language Models Really Understand Cell Biology?
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Fully open reproduction of DeepSeek-R1
This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Witness the aha moment of VLM with less than $3.
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。