44 stars written in Jupyter Notebook

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,976 2,218 Updated Jul 29, 2024

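This project covers the knowledge points and code implementations commonly tested in Machine Learning, Deep Learning, and NLP interviews; it is also the essential theoretical foundation for any algorithm engineer.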

Jupyter Notebook 17,290 4,645 Updated Jun 21, 2022

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 17,283 2,944 Updated Oct 21, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,196 1,291 Updated May 23, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 11,681 1,711 Updated Apr 26, 2025

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,413 520 Updated Oct 8, 2025

Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)

Jupyter Notebook 5,126 1,363 Updated Mar 21, 2020

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,694 441 Updated Nov 9, 2025

Acceptance rates for the major AI conferences

Jupyter Notebook 4,661 312 Updated Sep 23, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,850 160 Updated Oct 9, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'

Jupyter Notebook 2,246 100 Updated Oct 29, 2025

Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

Jupyter Notebook 1,968 175 Updated Mar 13, 2024

maximal update parametrization (µP)

Jupyter Notebook 1,621 104 Updated Jul 17, 2024

Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering

Jupyter Notebook 1,363 390 Updated Jun 13, 2020
Jupyter Notebook 1,209 245 Updated Jun 5, 2020

Codebase for Aria - an Open Multimodal Native MoE

Jupyter Notebook 1,079 85 Updated Jan 22, 2025

A collection of notebooks/recipes showcasing use cases of open-source models with Together AI.

Jupyter Notebook 1,071 190 Updated Nov 4, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 901 83 Updated Sep 23, 2025

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Jupyter Notebook 852 56 Updated Jul 20, 2025

[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification

Jupyter Notebook 635 76 Updated Jul 11, 2023

Person Re-ranking (CVPR 2017)

Jupyter Notebook 614 174 Updated Oct 26, 2021

🔦 A PyTorch implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Jupyter Notebook 499 61 Updated Jun 11, 2021

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Jupyter Notebook 483 37 Updated Oct 30, 2023
Jupyter Notebook 466 34 Updated Jul 22, 2024

Action recognition using soft attention based deep recurrent neural networks

Jupyter Notebook 352 158 Updated Oct 30, 2016

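Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to deeply understanding, discussing, and implementing the technologies, principles, and applications related to large models.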

Jupyter Notebook 351 39 Updated Jul 21, 2024

An implementation of our CVPR 2018 work 'Domain Adaptive Faster R-CNN for Object Detection in the Wild'

Jupyter Notebook 349 70 Updated Oct 9, 2019

OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.

Jupyter Notebook 323 6 Updated Jun 1, 2025

Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirection Guidance" (AAAI 2025 Oral)

Jupyter Notebook 197 9 Updated May 9, 2025