-
Alibaba
- Singapore
- https://sites.google.com/view/ruidan
Stars
Panda项目是于2023年5月启动的开源海外中文大语言模型项目,致力于大模型时代探索整个技术栈,旨在推动中文自然语言处理领域的创新和合作。
LLM Zoo collects information of various open- and close-sourced LLMs
Making large AI models cheaper, faster and more accessible
Named Entity Recognition (LSTM + CRF) - Tensorflow
[ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling
Baseline Models for Argumentative Text Understanding for AI Debater (NLPCC2021)
Pytorch implementation of LSTM/BERT-CRF for named entity recognition
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Empower Sequence Labeling with Task-Aware Neural Language Model | a PyTorch Tutorial to Sequence Labeling
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
The official implementation of ICLR 2020, "Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering".
Project page for "Neural Argument Generation Augmented with Externally Retrieved Evidence"
[EMNLP 2019 Workshop] Exploiting BERT for End-to-End Aspect-based Sentiment Analysis
State-of-the-Art Embeddings, Retrieval, and Reranking
Parallel t-SNE implementation with Python and Torch wrappers.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
100+ Chinese Word Vectors 上百种预训练中文词向量
A curated list of resources for Chinese NLP 中文自然语言处理相关资料
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Tensorflow implementation of contextualized word representations from bi-directional language models
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习