Stars
AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a querya…
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, OpenCode, Cursor and beyond.
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
AI agents running research on single-GPU nanochat training automatically
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …
Transformer-related optimization, including BERT and GPT
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).
This repository collects 100 papers related to negative sampling methods.
Awesome Search: all about search (e-commerce, but not only) and its awesomeness
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
A gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini
A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Review).
[IJCV 2022] Bridging Composite and Real: Towards End-to-end Deep Image Matting
Learning embeddings for classification, retrieval and ranking.
A collection of papers on neural-symbolic AI (mainly focused on NLP applications)
This repository contains code for the paper "Decoupling Representation and Classifier for Long-Tailed Recognition", published at ICLR 2020.