Skip to content
View HTplex's full-sized avatar

Block or report HTplex

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Simple and reliable way to track your USB devices directly in the macOS menu bar

Swift 587 19 Updated Jun 10, 2026

🥢像老乡鸡🐔那样做饭。已添加2026年发布的《老乡鸡菜品溯源报告 2.0中新出现的菜品。主要部分于2024年完工,非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》,并做归纳、编辑与整理。CookLikeHOC.

Dockerfile 23,602 2,344 Updated May 8, 2026

Task and time-tracking management with calendar integration for Obsidian

TypeScript 1,879 174 Updated Jun 9, 2026

Kimi K2 is the large language model series developed by Moonshot AI team

10,848 851 Updated Jan 21, 2026
Python 344 28 Updated Mar 23, 2026

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,823 267 Updated Nov 12, 2025

Download market data from Yahoo! Finance's API

Python 24,240 3,315 Updated Jun 12, 2026

baseline method for CROCS 2024

Python 10 2 Updated Jan 24, 2024

DECIMER Image Transformer is a deep-learning-based tool designed for automated recognition of chemical structure images. Leveraging transformer architectures, the model converts chemical images int…

Python 374 77 Updated Dec 2, 2025

OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful re…

Python 1,367 129 Updated May 20, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 17,274 1,313 Updated Jun 7, 2026

🔥Awesome Multimodal Large Language Models Paper List

155 5 Updated Mar 12, 2025

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 770 30 Updated Sep 7, 2025

[ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Python 98 4 Updated Sep 14, 2024

A fork to add multimodal model training to open-r1

Python 1,566 72 Updated Feb 8, 2025
Python 50 39 Updated Oct 10, 2023

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 28,968 6,515 Updated Jun 13, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 5,006 375 Updated Apr 6, 2026

Fully open reproduction of DeepSeek-R1

Python 26,302 2,438 Updated Apr 2, 2026

[Interspeech 2024] SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization

Python 62 3 Updated Mar 26, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 59,572 10,272 Updated Nov 12, 2025

aider is AI pair programming in your terminal

Python 46,151 4,579 Updated May 22, 2026

tiny vision language model

Python 9,761 778 Updated Apr 20, 2026

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 11,637 1,720 Updated Apr 20, 2026

A simple Mac app that simulates mouse clicks

Objective-C 615 355 Updated Oct 16, 2022

Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation

Jupyter Notebook 209 16 Updated Jul 29, 2025

A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition

Python 204 19 Updated Jun 11, 2026

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python 1,759 168 Updated Dec 8, 2023

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Python 796 91 Updated Oct 8, 2024

The Best Autofill Since Sliced Bread.

TypeScript 26 16 Updated Feb 11, 2025
Next