Stars
Hidden Markov Models in Python, with a scikit-learn-like API
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
Resources and Implementations of Generative Adversarial Nets: GAN, DCGAN, WGAN, CGAN, InfoGAN
GPT2 for Chinese chitchat (implements the MMI idea from DialoGPT)
A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…
Make huge neural nets fit in memory
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Code for the paper "Improved Techniques for Training GANs"
Minimalistic large language model 3D-parallelism training
Multi-Task Deep Neural Networks for Natural Language Understanding
Fully open data curation for reasoning models
Dataset of GPT-2 outputs for research in detection, biases, and more
DeepIE: Deep Learning for Information Extraction
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus, and leaderboard
Official Repository of Absolute Zero Reasoner
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
A pure Python interface to the Raspberry Pi camera module
A very simple generative adversarial network (GAN) in PyTorch
🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI
Neural machine translation and sequence learning using TensorFlow
Reference implementations of MLPerf® inference benchmarks