400 stars written in Python

Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.

Python 4,208 361 Updated Oct 19, 2025

Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,201 547 Updated Sep 8, 2025

Learning to Learn in TensorFlow

Python 4,064 606 Updated Jun 29, 2021

An Open-Source Package for Knowledge Embedding (KE)

Python 3,992 991 Updated Jan 10, 2024

A tool for extracting plain text from Wikipedia dumps

Python 3,941 1,005 Updated May 23, 2024

A TensorFlow implementation of CapsNet (Capsule Networks) from the paper "Dynamic Routing Between Capsules"

Python 3,795 1,147 Updated Dec 22, 2018

A highly efficient implementation of Gaussian Processes in PyTorch

Python 3,786 576 Updated Nov 8, 2025

Pretrained language model with 100B parameters

Python 3,754 296 Updated Jul 10, 2023

Release for Improved Denoising Diffusion Probabilistic Models

Python 3,709 527 Updated Jul 18, 2024

Vector (and Scalar) Quantization, in PyTorch

Python 3,671 298 Updated Nov 5, 2025

GLIDE: a diffusion-based text-conditional image synthesis model

Python 3,667 502 Updated Mar 8, 2024

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Python 3,610 531 Updated Oct 16, 2024

A library for transfer learning by reusing parts of TensorFlow models.

Python 3,522 1,651 Updated Jan 17, 2025

StudioGAN is a PyTorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.

Python 3,475 345 Updated Aug 9, 2024

OpenMMLab Self-Supervised Learning Toolbox and Benchmark

Python 3,288 442 Updated Jun 25, 2023

AllenAI's post-training codebase

Python 3,286 454 Updated Nov 9, 2025

Sina Weibo crawler (Scrapy, Redis)

Python 3,278 1,510 Updated Sep 5, 2018

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

Python 3,272 573 Updated Apr 14, 2023

ResNeSt: Split-Attention Networks

Python 3,262 495 Updated Dec 9, 2022

Multifarious Scrapy examples. Spiders for alexa / amazon / douban / douyu / github / linkedin etc.

Python 3,255 1,036 Updated Nov 3, 2023

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy-to-use hardware optimization tools

Python 3,161 602 Updated Oct 31, 2025

Line-by-line profiling for Python

Python 3,144 130 Updated Oct 31, 2025

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Python 3,143 642 Updated Jan 22, 2024

Foundation Architecture for (M)LLMs

Python 3,119 221 Updated Apr 11, 2024

Download lossless FLAC music from the Internet to local storage according to your NetEase Cloud Music playlist.

Python 3,112 542 Updated May 22, 2023

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Python 3,097 526 Updated May 9, 2024

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Python 3,080 478 Updated Jul 29, 2024

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Python 2,922 497 Updated Feb 14, 2023
Python 2,907 336 Updated Nov 6, 2025