HTplex

Follow

Haotian Zhang HTplex

Follow

17 followers · 8 following

htplex.io

Achievements

Achievements

Stars

rafaelSwi / MenuBarUSB

Simple and reliable way to track your USB devices directly in the macOS menu bar

Swift 587 19 Updated Jun 10, 2026

Gar-b-age / CookLikeHOC

🥢像老乡鸡🐔那样做饭。已添加2026年发布的《老乡鸡菜品溯源报告 2.0中新出现的菜品。主要部分于2024年完工，非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》，并做归纳、编辑与整理。CookLikeHOC.

Dockerfile 23,602 2,344 Updated May 8, 2026

callumalpass / tasknotes

Task and time-tracking management with calendar integration for Obsidian

TypeScript 1,879 174 Updated Jun 9, 2026

MoonshotAI / Kimi-K2

Kimi K2 is the large language model series developed by Moonshot AI team

10,848 851 Updated Jan 21, 2026

ML-GSAI / LLaDA-V

Python 344 28 Updated Mar 23, 2026

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,823 267 Updated Nov 12, 2025

ranaroussi / yfinance

Download market data from Yahoo! Finance's API

Python 24,240 3,315 Updated Jun 12, 2026

crocs-ifly-ustc / CROCS-Baseline

baseline method for CROCS 2024

Python 10 2 Updated Jan 24, 2024

Kohulan / DECIMER-Image_Transformer

DECIMER Image Transformer is a deep-learning-based tool designed for automated recognition of chemical structure images. Leveraging transformer architectures, the model converts chemical images int…

Python 374 77 Updated Dec 2, 2025

Topdu / OpenOCR

OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful re…

Python 1,367 129 Updated May 20, 2026

kvcache-ai / ktransformers

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 17,274 1,313 Updated Jun 7, 2026

Wang-Xiaodong1899 / Awesome-Multimodal-Large-Language-Models

🔥Awesome Multimodal Large Language Models Paper List

155 5 Updated Mar 12, 2025

ModalMinds / MM-EUREKA

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 770 30 Updated Sep 7, 2025

OpenGVLab / MMIU

[ICLR2025] MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Python 98 4 Updated Sep 14, 2024

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,566 72 Updated Feb 8, 2025

hhj1897 / face_detection

Python 50 39 Updated Oct 10, 2023

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 28,968 6,515 Updated Jun 13, 2026

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 5,006 375 Updated Apr 6, 2026

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 26,302 2,438 Updated Apr 2, 2026

KAIST-AILab / SyncVSR

[Interspeech 2024] SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization

Python 62 3 Updated Mar 26, 2025

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 59,572 10,272 Updated Nov 12, 2025

Aider-AI / aider

aider is AI pair programming in your terminal

Python 46,151 4,579 Updated May 22, 2026

m87-labs / moondream

tiny vision language model

Python 9,761 778 Updated Apr 20, 2026

NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 11,637 1,720 Updated Apr 20, 2026

inket / Autoclick

A simple Mac app that simulates mouse clicks

Objective-C 615 355 Updated Oct 16, 2022

roudimit / whisper-flamingo

Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation

Jupyter Notebook 209 16 Updated Jul 29, 2025

arvindrajan92 / DTrOCR

A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition

Python 204 19 Updated Jun 11, 2026

MCG-NJU / VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python 1,759 168 Updated Dec 8, 2023

OpenGVLab / VideoMAEv2

[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Python 796 91 Updated Oct 8, 2024

berellevy / job_app_filler

The Best Autofill Since Sliced Bread.

TypeScript 26 16 Updated Feb 11, 2025