liuyang079

LY liuyang079

DL/ML Algorithm engineer

Stars

KellerJordan / Muon

Muon is an optimizer for hidden layers in neural networks

Python 1,969 93 Updated Jul 12, 2025

lilipads / gradient_descent_viz

interactive visualization of 5 popular gradient descent methods with step-by-step illustration and hyperparameter tuning UI

C++ 1,347 156 Updated Aug 4, 2024

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 616 61 Updated Jun 9, 2024

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,259 1,758 Updated Oct 13, 2025

ymcui / Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,942 1,877 Updated Jul 15, 2025

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 20,353 2,114 Updated Nov 5, 2025

dvlab-research / LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,688 290 Updated Aug 14, 2024

mit-han-lab / streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,109 391 Updated Jul 11, 2024

BlinkDL / ChatRWKV

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,508 705 Updated Sep 27, 2025

bigscience-workshop / bigscience

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 1,006 102 Updated Jul 29, 2024

zsc / llama_infer

Inference script for Meta's LLaMA models using Hugging Face wrapper

Python 109 5 Updated Mar 24, 2023

tloen / llama-int8

Forked from meta-llama/llama

Quantized inference code for LLaMA models

Python 1,046 100 Updated Mar 17, 2023

meta-llama / llama

Inference code for Llama models

Python 58,900 9,812 Updated Jan 26, 2025

OFA-Sys / Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Jupyter Notebook 5,596 527 Updated Aug 29, 2025

vdogmcgee / SimCSE-Chinese-Pytorch

SimCSE在中文上的复现，有监督+无监督

Python 279 49 Updated Feb 21, 2025

SeanLee97 / xmnlp

xmnlp：提供中文分词, 词性标注, 命名体识别，情感分析，文本纠错，文本转拼音，文本摘要，偏旁部首，句子表征及文本相似度计算等功能

Python 1,291 189 Updated Nov 12, 2022

zjunlp / DeepKE

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 4,179 730 Updated Jul 19, 2025

CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,502 1,684 Updated Feb 29, 2024

CompVis / stable-diffusion

A latent text-to-image diffusion model

Jupyter Notebook 71,753 10,517 Updated Jun 18, 2024

google-research / google-research

Google Research

Jupyter Notebook 36,664 8,230 Updated Oct 30, 2025

shengyu-meng / dreamfields-3D

Forked from ashawkey/dreamfields-torch

A colab friendly toolkit to generate 3D mesh model / video / nerf instance / multiview images of colourful 3D objects by text and image prompts input, based on dreamfields.

Python 459 37 Updated Oct 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LY liuyang079

Block or report liuyang079