Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguation)" (NAACL 2022).

Jupyter Notebook 45 6 Updated Jan 30, 2024

brickee / HarveyNER

A new dataset HarveyNER with fine-grained locations annotated in tweets with strong baseline models using Curriculum Learning.

Python 6 1 Updated Nov 8, 2022

GateNLP / broad_twitter_corpus

The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016)

Jupyter Notebook 68 6 Updated May 12, 2022

hitz-zentroa / GoLLIE

Guideline following Large Language Model for Information Extraction

Python 421 27 Updated Oct 27, 2024

quqxui / Awesome-LLM4IE-Papers

Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)

1,035 61 Updated Nov 18, 2024

DAMO-NLP-SG / Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 3,108 284 Updated Jun 4, 2024

IsakZhang / Generative-ABSA

Python 91 17 Updated Aug 3, 2021

zhangzhenyu13 / llm3s-conatiner

large language model training-3-stages+deployment

Python 49 12 Updated Aug 14, 2023

maitrix-org / PromptAgent

This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgent is a novel automatic prompt optimization method that auton…

Python 344 43 Updated Jul 17, 2025

OpenMatch / Augmentation-Adapted-Retriever

[ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In".

Python 60 5 Updated Jul 12, 2024

AlexTMallen / adaptive-retrieval

Python 189 12 Updated Jul 2, 2025

HIT-SCIR / Chinese-Mixtral-8x7B

中文Mixtral-8x7B（Chinese-Mixtral-8x7B）

Python 656 35 Updated Aug 17, 2024

thu-coai / CritiqueLLM

Python 147 3 Updated Jul 1, 2024

mistralai / mistral-inference

Official inference library for Mistral models

Jupyter Notebook 10,606 1,002 Updated Nov 21, 2025

hkust-nlp / deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 579 33 Updated Dec 9, 2024

VILA-Lab / ATLAS

A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171

Python 979 102 Updated May 28, 2024

sunzeyeah / RLHF

Implementation of Chinese ChatGPT

Python 289 35 Updated Nov 20, 2023

zai-org / ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python 13,746 1,607 Updated Jan 13, 2025

SpongebBob / Finetune-ChatGLM2-6B

ChatGLM2-6B 全参数微调，支持多轮对话的高效微调。

Python 401 41 Updated Aug 17, 2023

EleutherAI / gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Python 8,285 963 Updated Feb 25, 2022

jingcangcang

Lists (13)

embedding

information extract

keyphase

keyphrase

llm

nlp code

prompt

rag

reward model

工具

数据

降维方法

零散知识片段

Stars