Skip to content
View lyutyuh's full-sized avatar

Highlights

  • Pro

Block or report lyutyuh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
36 results for source starred repositories
Clear filter

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理

Python 35,844 10,843 Updated Oct 19, 2025

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。

Jupyter Notebook 17,291 4,645 Updated Jun 21, 2022

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Python 1,505 227 Updated Apr 3, 2024

An extendible and configurable PDF manipulation layer library written in java.

Java 532 67 Updated Oct 6, 2025

Research code for pixel-based encoders of language (PIXEL)

Python 344 37 Updated Jul 15, 2025

Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"

Python 296 28 Updated Oct 27, 2022

This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Micr…

Python 279 95 Updated Mar 16, 2024

Source code for "Packed Levitated Marker for Entity and Relation Extraction"

Python 270 38 Updated May 3, 2023

Course : Introduction to Computer Systems

C 231 53 Updated Jan 9, 2019

Code and Data for ACL 2020 paper "Few-Shot NLG with Pre-Trained Language Model"

Python 190 22 Updated May 23, 2025

Controlled text generation with programmable constraints

Python 156 19 Updated Nov 10, 2025

PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxiv.org/pdf/2210.14698.pdf

Python 107 14 Updated Jan 22, 2024

CUDA kernels for generalized matrix-multiplication in PyTorch

Jupyter Notebook 85 14 Updated Oct 11, 2021

Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric Evaluation of Machine Translation with a Densely Annotated P…

Python 79 10 Updated Sep 21, 2023

Pytorch implementation of Highly Parallel Autoregressive Entity Linking with Discriminative Correction

Python 67 10 Updated May 4, 2022

Generic PyTorch implementation of einsum that supports different semirings

Python 50 7 Updated Jul 17, 2024

Code for ACL '19 paper: Towards Improving Neural Named Entity Recognition with Gazetteers

Python 32 1 Updated Jul 2, 2021

Automatic Generation of Scaffolding Questions for Learning Math, EMNLP 2022

Python 25 2 Updated Jun 30, 2023

A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semantic role labelling, etc.)

Python 21 1 Updated Jul 11, 2022

Code Repository for "Please Mind the Root: Decoding Arborescences for Dependency Parsing" and "On Finding the K-best Non-projective Dependency Trees"

Python 20 5 Updated Dec 12, 2022
Python 20 7 Updated Nov 19, 2023

This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).

Python 17 2 Updated Feb 17, 2025

本工具采用随机算法计算指定文件夹内两两 .docx 文件间的相似性。

Python 15 2 Updated Jun 15, 2020
Python 12 1 Updated Dec 13, 2022

A PyTorch implementation of the CorefQA Model.

Python 10 3 Updated Jun 27, 2020
Python 7 Updated Oct 31, 2024
Jupyter Notebook 7 2 Updated May 11, 2023

This is the repository containing code to replicate the experiments in our EMNLP 2024 paper, "A Probability–Quality Trade-off in Aligned Language Models and its Relation to Sampling Adaptors".

Jupyter Notebook 4 1 Updated Jun 29, 2024
Next