-
NYU
- New York
- https://jason-cs18.github.io/
- https://yanlu.substack.com/
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Easy, fast, and private LLM & VLM inference for every device
Ongoing research training transformer models at scale
The platform for LLM evaluations and AI agent testing
An extremely fast Python package and project manager, written in Rust.
A easy, reliable, fluid template for python packages complete with docs, testing suites, readme's, github workflows, linting and much much more
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
The absolute trainer to light up AI agents.
A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval
End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.
100+ AI Agent & RAG apps you can actually run — clone, customize, ship.
[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
The development and future prospects of large multimodal reasoning models.
slime is an LLM post-training framework for RL Scaling.
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
An open-source AI agent that brings the power of Gemini directly into your terminal.
CycleResearcher: Improving Automated Research via Automated Review
A compilation of the best multi-agent papers
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…
A python library for self-supervised learning on images.
For developers, who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.
A curated list of awesome Multimodal studies.
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.