Skip to content
View mengshuliu's full-sized avatar

Block or report mengshuliu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profiling data 🚀

90 9 Updated May 7, 2024

A scientific instrument for investigating latent spaces

JavaScript 745 31 Updated Nov 17, 2025

Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

Python 4,229 726 Updated Aug 25, 2025

Jupyter notebooks for the Natural Language Processing with Transformers book

Jupyter Notebook 4,664 1,453 Updated Aug 21, 2024

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

Python 4,786 464 Updated Dec 15, 2025

Search the most relevant emojis given a natural language query

Python 295 31 Updated Jan 1, 2023

A highly consumable list of profanities / bad words with severity ratings, exceptions, and tags.

JavaScript 58 15 Updated Oct 23, 2021

Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

TypeScript 139,834 18,546 Updated Dec 19, 2025

Email response prediction model framework

Jupyter Notebook 3 Updated Mar 3, 2023

A technical report on convolution arithmetic in the context of deep learning

TeX 14,567 2,314 Updated Jun 8, 2023

Tensorflow implementation of Facebook TagSpace

Python 1 Updated Jan 29, 2019

Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…

Python 4,063 654 Updated Apr 2, 2025

An open-source text summarization toolkit for non-experts. EMNLP'2021 Demo

Python 280 33 Updated Sep 30, 2023
Python 1,647 317 Updated Jul 20, 2023

Learning embeddings for classification, retrieval and ranking.

C++ 3,957 527 Updated Dec 4, 2022

This is the collection for the papers and source code for tag/hashtag recommendation

50 7 Updated Sep 23, 2022

Twitter NLP Tools

HTML 889 383 Updated Mar 10, 2023

A meta corpus of social media corpus

42 6 Updated Sep 7, 2024

Ready-to-run Docker images containing Jupyter applications

Python 8,379 3,004 Updated Dec 15, 2025

EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)

Python 438 87 Updated Apr 7, 2023

Python Keyphrase Extraction module

Python 1,587 296 Updated Jul 12, 2023

100 Days of ML Coding

49,042 11,248 Updated Dec 29, 2023

刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.

Markdown 130,983 23,581 Updated Oct 8, 2025
Python 10 Updated Jul 19, 2022

The repository contains code to replicate the experiments in the paper "Robustness and Reliability of Gender Bias Assessment in Word Embeddings: The Role of Base Pairs", by Haiyang Zhang, Alison Sn…

Jupyter Notebook 5 Updated May 3, 2021

Remove problematic gender bias from word embeddings.

Jupyter Notebook 251 90 Updated May 9, 2023

Fast and lightweight header-only C++ library (with Python bindings) for approximate nearest neighbor search

C++ 268 47 Updated Jul 28, 2025

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

C++ 14,099 1,216 Updated Oct 29, 2025
Next