- 中国深圳
Starred repositories
The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Generative Agents: Interactive Simulacra of Human Behavior
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
Curated list of awesome tools, demos, docs for ChatGPT and GPT-3
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%
Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型
f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集
An industrial deep learning framework for high-dimension sparse data
For developers, who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.
lalaboom / RecSys-1
Forked from mJackie/RecSys计算广告/推荐系统/机器学习(Machine Learning)/点击率(CTR)/转化率(CVR)预估/点击率预估
RUCAIBox / Few-Shot-KG2Text
Forked from turboLJY/Few-Shot-KG2TextSource for the ACL 2021 Findings paper "Few-shot Knowledge Graph-to-Text Generation with Pretrained Language Models"
Final Solution for MAG240M dataset of OGB-LSC@KDDCUP2021
Source code and datasets for IJCAI 2019 paper: Relation-Aware Entity Alignment for Heterogeneous Knowledge Graphs.
Easy-to-use CPM for Chinese text generation(基于CPM的中文文本生成)
Move the cursor between multiple displays using a shortcut. (Version 1.2)
DeepIE: Deep Learning for Information Extraction
北京航空航天大学大数据高精尖中心自然语言处理研究团队对信息抽取领域的调研。包括实体识别,关系抽取,属性抽取等子任务,每类子任务分别对学术界和工业界进行调研。
code for ACL 2020 paper: FLAT: Chinese NER Using Flat-Lattice Transformer
Data and code for EMNLP 2020 paper "Logic2Text: High-Fidelity Natural Language Generation from Logical Forms"