-
Monash University
- Melbourne, Australia
-
21:19
(UTC +11:00) - jianghao.site
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
🌞 CareGPT (关怀GPT)是一个医疗大语言模型,同时它集合了数十个公开可用的医疗微调数据集和开放可用的医疗大语言模型,包含LLM的训练、测评、部署等以促进医疗LLM快速发展。Medical LLM, Open Source Driven for a Healthy Future.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
[Medical_NLP ➟ Awesome-AI4Med] medical-related LLMs, Multimodal systems, Datasets, Benchmarks, and more.
这是一份入门AI/LLM大模型的逐步指南,包含教程和演示代码,带你从API走进本地大模型部署和微调,代码文件会提供Kaggle或Colab在线版本,即便没有显卡也可以进行学习。项目中还开设了一个小型的代码游乐场🎡,你可以尝试在里面实验一些有意思的AI脚本。同时,包含李宏毅 (HUNG-YI LEE)2024生成式人工智能导论课程的完整中文镜像作业。
SPINE: Token-Selective Test-Time Reinforcement Learning with Entropy-Band Regularization
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
Videodl: A lightweight video downloader written in pure python. (轻量级视频下载器,优先高清无水印,支持抖音,小红书,B站,优酷,快手,腾讯视频,梨视频,推特,绿洲,皮皮虾,A站,虎牙,TED,百度贴吧,芒果,微视,微博视频,央视频CCTV,学习强国,全民K歌,新片场,搜狐,知乎,爱奇艺,YouTube,福克斯新闻,咪咕,网易公…
nnMIL: A generalizable multiple instance learning framework for computational pathology
Defeating the Training-Inference Mismatch via FP16
Collection of Unsupervised Learning Methods for Vision-Language Models (VLMs)
[NeurIPS 2025 Datasets & Benchmarks Track] The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models
Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
BiomedParse: A Foundation Model for Joint Segmentation, Detection, and Recognition of Biomedical Objects Across Nine Modalities
很多年前,天涯社区曾出现了不少深受欢迎的帖子,成功地预言了许多形势和事件。这些帖子因此被冠以“天涯神贴”之名。遗憾的是,由于各种原因,天涯论坛目前已经无法打开。幸运的是,有人收集了这些帖子,并将它们整理为一份完整的合集。
[Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving
SicTTA: Single Image Continual Test-Time Adaptation for Medical Image Segmentation
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Official Repository for OTSurv - MICCAI 2025