- Paris
- in/mei-gan-080238167
Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Streaming data platform. Real-time stream processing, low-latency serving, and Iceberg table management.
My own note about Financial Market module at Yale University on Coursera
Open Source AI Platform - AI Chat with advanced features that works with every LLM
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
mei233 / CascadeTabNet
Forked from DevashishPrasad/CascadeTabNetThis repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
Convert PDF to HTML without losing text or format.
C++ implementation of the Brown word clustering algorithm.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Exercises for the XQuery Workshops at XQuery at DH2017
搜集、整理、发布 预训练 中文 词向量/字向量,与 有志之士 共同 促进 中文 自然语言处理 的 发展。
SophonPlus / ChineseAnnotator
Forked from jiesutd/YEDDA中文自然语言处理 (NLP) 标注工具,与 有志之士 共同 促进 中文 自然语言处理 的 发展。
搜集、整理、发布 中文 自然语言处理 语料/数据集,与 有志之士 共同 促进 中文 自然语言处理 的 发展。