-
Zhejiang Univeristy, Tencent
- Beijing
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model
Companion webpage to the book "Mathematics For Machine Learning"
Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
Benchmarking End-to-End Photographed Document Parsing and Translation
OCR model that handles complex tables, forms, handwriting with full layout.
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
LightMem: Lightweight and Efficient Memory-Augmented Generation
One second to read GitHub code with VS Code.
Overview of pipelines related to PDF to Markdown document processing.
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
LLMPerf is a library for validating and benchmarking LLMs
DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL
MoonPalace(月宫)是由 Moonshot AI 月之暗面提供的 API 调试工具。
📰 Must-read papers and blogs on Speculative Decoding ⚡️
Jupyter notebooks testing different OCR models for document parsing (Dolphin, MonkeyOCR, Marker, Nanonets, ...)
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud(通义点金:阿里云金融大模型)