Starred repositories
A simple Python Pydantic model for Honkai: Star Rail parsed data from the Mihomo API.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience
🔥 Use Cloudflare to build a free OpenAI API proxy, solving network access problems. Supports streaming output.
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Official inference repo for FLUX.1 models
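Python + FastAPI + Playwright + Camoufox middle-layer proxy server, compatible with the OpenAI API and supporting some parameter settings. The project uses web automation to simulate a human user, forwarding requests to the Google AI Studio web page and returning the output in the standard OpenAI format. Spare time is limited, so updates are occasional.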
🟣 LLMs interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
This is the repo for "Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition", CVPR2025.
The official implementation of [CVPR2022] Decoupled Knowledge Distillation https://arxiv.org/abs/2203.08679 and [ICCV2023] DOT: A Distillation-Oriented Trainer https://openaccess.thecvf.com/content…
Official implementation of "MST-Distill: Mixture of Specialized Teachers for Cross-Modal Knowledge Distillation" (ACM MM 2025)
SalHe / allure-python
Forked from allure-framework/allure-python. Allure integrations for Python test frameworks
Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.
Collection of awesome test-time (domain/batch/instance) adaptation methods
Python code for handling the Clotho dataset.
⚡ Dynamically generated stats for your github readmes
An easy way to apply LoRA to CLIP. Implementation of the paper "Low-Rank Few-Shot Adaptation of Vision-Language Models" (CLIP-LoRA) [CVPRW 2024].
[CVPR'25 Oral] LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
OneDrive SDK for Python! https://dev.onedrive.com
Survey on LLM Agents (Published on CoLing 2025)
Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."
A curated list of action recognition and related area resources