Stars
Low-Rank adapter extraction for fine-tuned transformers models
Post-training with Tinker
Code for the paper "A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction"
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
TopoJson files of U.S. zip codes by Metropolitan Statistical Area (MSA) number.
Wrap Gemini CLI, Antigravity, ChatGPT Codex, Claude Code, Qwen Code, iFlow as an OpenAI/Gemini/Claude/Codex compatible API service, allowing you to enjoy the free Gemini 2.5 Pro, GPT 5, Claude, Qwe…
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.
Simple scripts to deploy large language models
[ICLR 2025] Making LLMs More Effective with Hierarchical Mixture of LoRA Experts
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
An opinionated approach to productive development with Claude Code
OpenCE (Open Context Engineering): A community toolkit to implement, evaluate, and combine LLM context strategies (RAG, ACE, Compression). Evolved from the `ACE-open` reproduction.
AI-powered resume tailoring skill for Claude Code
Proof-of-concept implementation of the Agentic Context Engineering (ACE) framework — demonstrating Generator-Reflector-Curator interactions for self-improving LLMs on the HotpotQA dataset.
Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message…
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
Open source alternative to Resend, Sendgrid, Postmark etc.
🚀🎉📚 SaaS Boilerplate built with Next.js + Tailwind CSS + Shadcn UI + TypeScript. ⚡️ Full-stack React application with Auth, Multi-tenancy, Roles & Permissions, i18n, Landing Page, DB, Logging, Testing