This repository contains the official implementation of the NeurIPS 2025 paper "Instance-Level Composed Image Retrieval".

Python 45 Updated Dec 22, 2025

Awesome Unified Multimodal Models

999 31 Updated Aug 17, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 4,303 359 Updated Dec 25, 2025

A lightweight LMM-based Document Parsing Model

Python 6,398 441 Updated Dec 8, 2025

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 8,074 678 Updated Dec 17, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,853 1,087 Updated Dec 26, 2025

macOS on the Microsoft Surface Laptop 3 thanks to Acidanthera's OpenCore bootloader

33 5 Updated Oct 18, 2025

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,805 201 Updated Apr 9, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 51,043 4,238 Updated Dec 24, 2025

Detect and extract table regions from images in various scenarios, and correct perspective and rotation issues.

Python 116 9 Updated Dec 10, 2024

Collects the best currently open-source table recognition models, improves their pre-processing and post-processing, and converts the models to ONNX.

Python 906 77 Updated Aug 3, 2025

Use GitHub Actions to mirror overseas Docker images to a private Alibaba Cloud registry for use by servers in mainland China. Free and easy to use.

2,772 15,531 Updated Aug 21, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,352 332 Updated Feb 27, 2025

An educational resource to help anyone learn deep reinforcement learning.

Python 11,463 2,416 Updated Aug 5, 2024

Chinese translation of the book "Reinforcement Learning: An Introduction", Second Edition

Shell 126 26 Updated Apr 15, 2019

Publish your Home-Assistant Instance using Matter.

TypeScript 1,337 89 Updated Dec 15, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,950 288 Updated May 15, 2025
Jupyter Notebook 3 Updated Sep 3, 2020

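"Mastering Bitcoin", Second Edition, jointly produced by 区块链研习社 and 云天明. The book has been published by China Machine Press under the new title 《精通区块链编程第二版》 ("Mastering Blockchain Programming, Second Edition") and is available on JD.com.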

1,787 486 Updated Apr 14, 2024

Xiaomi Home Integration for Home Assistant

Python 21,166 1,109 Updated Dec 26, 2025

Retrieval and Retrieval-augmented LLMs

Python 11,049 817 Updated Dec 15, 2025

A curated list of awesome LLM/VLM/VLA for Autonomous Driving (LLM4AD) resources (continually updated)

1,601 92 Updated Aug 1, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model with performance approaching GPT-4o.

Python 9,648 750 Updated Sep 22, 2025

Supercharge Your LLM Application Evaluations 🚀

Python 11,869 1,179 Updated Dec 24, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,399 1,457 Updated Nov 28, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,061 1,098 Updated Dec 25, 2025

GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks

Python 116 18 Updated Aug 27, 2024

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 158,257 14,008 Updated Dec 24, 2025

Vuestic Admin is an open-source, ready-to-use admin template suite designed for rapid development, easy maintenance, and high accessibility. Built on Vuestic UI, Vue 3, Vite, Pinia, and Tailwind CS…

Vue 10,935 1,796 Updated Dec 18, 2025

Infrared remote library for Arduino: send and receive infrared signals with multiple protocols

C++ 4,891 1,805 Updated Dec 22, 2025