Skip to content
View HastyJenny's full-sized avatar

Block or report HastyJenny

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
62 results for source starred repositories
Clear filter

Parsing-free RAG supported by VLMs

Python 910 74 Updated Dec 7, 2025

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 2,500 231 Updated Feb 3, 2026

[EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents

Python 625 49 Updated Jan 11, 2026

Embedding model prioritized towards Multimodal RAG, overall + VisDoc double top1 on MMEB benchmark

Python 34 2 Updated Nov 6, 2025

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 23,175 3,658 Updated Feb 6, 2026

Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++

C++ 5,355 527 Updated Feb 4, 2026
Python 7 1 Updated Dec 31, 2025

Generate text line images for training deep learning OCR models

Python 894 174 Updated Jan 17, 2026

Object365 dataset downloader

Shell 10 Updated Aug 6, 2025

利用 onnxruntime 及 PaddleOCR 提供的模型, 对图片中的文字进行检测与识别.

Python 88 18 Updated Jan 10, 2023

卡证和文档检测和矫正

Python 79 21 Updated Sep 18, 2024

RapidOcr onnxruntime推理 for Android

C++ 100 14 Updated Apr 17, 2025

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 70,328 9,796 Updated Feb 6, 2026

Implementation of layer diffuse inference using refiners

Python 25 1 Updated Apr 25, 2024

Diffusers implementation of LayerDiffuse

Python 5 Updated May 29, 2024

Unofficial implementation of Layer Diffuse in diffusers

Python 27 3 Updated Apr 3, 2024

[WIP] Layer Diffusion for WebUI (via Forge)

Python 4,107 351 Updated Aug 30, 2024

Play ChatGPT and other LLM with Xiaomi AI Speaker

Python 6,773 936 Updated Dec 10, 2025

A TensorFlow.js Graph Model Converter

Python 140 21 Updated Jan 22, 2023

PALLAIDIUM — a generative AI movie studio, seamlessly integrated into the Blender Video Editor (VSE), enabling end-to-end production from script to screen and back.

Python 1,318 119 Updated Jan 27, 2026

AUTOMATIC1111版web UIをまねた、DiffusersベースのStable Diffusion用GUIです(画像生成のみ)

Jupyter Notebook 3 Updated Sep 29, 2024

💯2025年信息系统项目管理师(软考高级)备考资源库。

Rich Text Format 1,136 278 Updated Feb 1, 2026

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,524 482 Updated Aug 7, 2024

Inference Llama 2 in one file of pure C

C 19,154 2,444 Updated Aug 6, 2024

tf-keras code of Face Ear Landmark Detection System (with Multi-Task Learning).

Jupyter Notebook 20 3 Updated Aug 19, 2022

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,789 760 Updated Sep 22, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,082 404 Updated Feb 6, 2026

An Open-source Toolkit for LLM Development

Python 2,804 176 Updated Jan 13, 2025
Next