Skip to content
View liuyang079's full-sized avatar

Block or report liuyang079

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Muon is an optimizer for hidden layers in neural networks

Python 1,969 93 Updated Jul 12, 2025

interactive visualization of 5 popular gradient descent methods with step-by-step illustration and hyperparameter tuning UI

C++ 1,347 156 Updated Aug 4, 2024

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 616 61 Updated Jun 9, 2024

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,259 1,758 Updated Oct 13, 2025

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,942 1,877 Updated Jul 15, 2025

Fast and memory-efficient exact attention

Python 20,353 2,114 Updated Nov 5, 2025

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,688 290 Updated Aug 14, 2024

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,109 391 Updated Jul 11, 2024

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,508 705 Updated Sep 27, 2025

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 1,006 102 Updated Jul 29, 2024

Inference script for Meta's LLaMA models using Hugging Face wrapper

Python 109 5 Updated Mar 24, 2023

Quantized inference code for LLaMA models

Python 1,046 100 Updated Mar 17, 2023

Inference code for Llama models

Python 58,900 9,812 Updated Jan 26, 2025

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Jupyter Notebook 5,596 527 Updated Aug 29, 2025

SimCSE在中文上的复现,有监督+无监督

Python 279 49 Updated Feb 21, 2025

xmnlp:提供中文分词, 词性标注, 命名体识别,情感分析,文本纠错,文本转拼音,文本摘要,偏旁部首,句子表征及文本相似度计算等功能

Python 1,291 189 Updated Nov 12, 2022

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 4,179 730 Updated Jul 19, 2025

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 13,502 1,684 Updated Feb 29, 2024

A latent text-to-image diffusion model

Jupyter Notebook 71,753 10,517 Updated Jun 18, 2024

Google Research

Jupyter Notebook 36,664 8,230 Updated Oct 30, 2025

A colab friendly toolkit to generate 3D mesh model / video / nerf instance / multiview images of colourful 3D objects by text and image prompts input, based on dreamfields.

Python 459 37 Updated Oct 3, 2022

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

Python 5,445 509 Updated Oct 25, 2025

Nearest Neighbor Search with Neighborhood Graph and Tree for High-dimensional Data

C++ 1,333 122 Updated Oct 15, 2025

Multi Task Vision and Language

Jupyter Notebook 819 180 Updated Feb 16, 2022

全局指针统一处理嵌套与非嵌套NER的Pytorch实现

Python 403 49 Updated Mar 23, 2023

SpanNER: Named EntityRe-/Recognition as Span Prediction

Python 131 20 Updated May 13, 2022

Unified Structure Generation for Universal Information Extraction

Python 942 101 Updated Jul 30, 2022

EasyTransfer is designed to make the development of transfer learning in NLP applications easier.

Python 860 159 Updated Aug 25, 2022

基于轻量级的albert实现albert+BiLstm+CRF

Python 92 30 Updated May 25, 2023

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,537 249 Updated Apr 24, 2024
Next