Stars
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
Models and examples built with TensorFlow
A latent text-to-image diffusion model
Google Research
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
a vue2.0 minimal admin template
Fast and memory-efficient exact attention
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
High-Resolution Image Synthesis with Latent Diffusion Models
100+ Chinese Word Vectors 上百种预训练中文词向量
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Java Solutions to problems on LintCode/LeetCode
Tensorflow tutorial from basic to hard, 莫烦Python 中文AI教学
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, suc…
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Muon is an optimizer for hidden layers in neural networks
interactive visualization of 5 popular gradient descent methods with step-by-step illustration and hyperparameter tuning UI
Nearest Neighbor Search with Neighborhood Graph and Tree for High-dimensional Data