Stars
Models and examples built with TensorFlow
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Fast and memory-efficient exact attention
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
100+ Chinese Word Vectors 上百种预训练中文词向量
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Tensorflow tutorial from basic to hard, 莫烦Python 中文AI教学
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Muon is an optimizer for hidden layers in neural networks
xmnlp:提供中文分词, 词性标注, 命名体识别,情感分析,文本纠错,文本转拼音,文本摘要,偏旁部首,句子表征及文本相似度计算等功能
tloen / llama-int8
Forked from meta-llama/llamaQuantized inference code for LLaMA models
Unified Structure Generation for Universal Information Extraction
EasyTransfer is designed to make the development of transfer learning in NLP applications easier.
Re-implement Kaiming He's deep residual networks in tensorflow. Can be trained with cifar10.
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
Code for ACL 2021 paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information"
including text classifier, language model, pre_trained model, multi_label classifier, text generator, dialogue. etc
A colab friendly toolkit to generate 3D mesh model / video / nerf instance / multiview images of colourful 3D objects by text and image prompts input, based on dreamfields.
Implementation of ABCNN(Attention-Based Convolutional Neural Network) on Tensorflow