Stars
The Main Page for paper "Bridging the Editing Gap in LLMs: FineEdit for Precise and Targeted Text Modifications" accepted by EMNLP 2025
CodeJudgeBench is a benchmark aimed at evaluating LLM-based judges for coding related tasks.
Official repository of the ICML2025 paper “Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination”
Build Real-Time Knowledge Graphs for AI Agents
The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static to dynamic evaluation"
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
[NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words
Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)
This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals
Scrape data from Quora website: questions related to certain topics, answers given on certain questions and users profile data
Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)
[ICCV2021]"DC-ShadowNet: Single-Image Hard and Soft Shadow Removal Using Unsupervised Domain-Classifier Guided Network", https://arxiv.org/abs/2207.10434
Code Repository for IEEE/ACM Transactions on Audio, Speech, and Language Processing Paper - D-score Framework For Open-domain Automatic Dialogue Evaluation
The Official Repository for the Automatic Dialogue Evaluation Sub-task of DSTC10 Track 5 (Automatic Evaluation and Moderation of Open-domain Dialogue Systems)
Code Repository For ACL2021 Paper - DynaEval: Unifying Turn and Dialogue Level Evaluation
EMI-Group / DenseNAS
Forked from JaminFong/DenseNASDensely Connected Search Space for More Flexible Neural Architecture Search (CVPR2020)
Official implementation of Solving Incremental Optimization Problems via Cooperative Coevolution