Low-Rank adapter extraction for fine-tuned transformers models

Jupyter Notebook 180 9 Updated May 2, 2024

Post-training with Tinker

Python 2,577 245 Updated Dec 19, 2025

JavaScript Style Guide

JavaScript 148,001 26,788 Updated Nov 6, 2025

Code for the paper "A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction"

Python 12 Updated Oct 20, 2023

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,222 1,184 Updated Dec 19, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,146 1,695 Updated Dec 18, 2025

TopoJSON files of U.S. zip codes by Metropolitan Statistical Area (MSA) number.

23 12 Updated Nov 27, 2015

Wrap Gemini CLI, Antigravity, ChatGPT Codex, Claude Code, Qwen Code, iFlow as an OpenAI/Gemini/Claude/Codex compatible API service, allowing you to enjoy the free Gemini 2.5 Pro, GPT 5, Claude, Qwe…

Go 3,050 474 Updated Dec 19, 2025

Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.

TypeScript 1,790 159 Updated Dec 10, 2025

Node.js JavaScript runtime ✨🐢🚀✨

JavaScript 114,822 34,142 Updated Dec 18, 2025

Simple scripts to deploy large language models

Python 7 1 Updated Jan 25, 2025

[ICLR 2025] Making LLMs More Effective with Hierarchical Mixture of LoRA Experts

Python 23 6 Updated Oct 9, 2025

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Python 663 40 Updated Jul 22, 2024

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 3,569 288 Updated May 21, 2025

An opinionated approach to productive development with Claude Code

JavaScript 1,426 225 Updated Dec 17, 2025

OpenCE (Open Context Engineering): A community toolkit to implement, evaluate, and combine LLM context strategies (RAG, ACE, Compression). Evolved from the `ACE-open` reproduction.

Python 332 47 Updated Nov 14, 2025

Claudette is Claude's friend

Jupyter Notebook 306 45 Updated Dec 19, 2025

AI-powered resume tailoring skill for Claude Code

40 4 Updated Nov 6, 2025

Proof-of-concept implementation of the Agentic Context Engineering (ACE) framework — demonstrating Generator-Reflector-Curator interactions for self-improving LLMs on the HotpotQA dataset.

Python 9 5 Updated Oct 25, 2025

Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message…

TypeScript 32,523 6,446 Updated Dec 19, 2025
Jupyter Notebook 44 16 Updated Apr 15, 2025
JavaScript 3 3 Updated Aug 12, 2025

Super power your agents with Langfuse metrics

Ruby 6 Updated Oct 26, 2025
JavaScript 2 Updated Aug 28, 2025

Making folding experiments more accessible.

Python 89 2 Updated Jul 17, 2025

Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI

Jupyter Notebook 12,281 3,591 Updated Dec 18, 2025

Open source alternative to Resend, Sendgrid, Postmark etc.

TypeScript 3,486 270 Updated Dec 18, 2025

🚀🎉📚 SaaS Boilerplate built with Next.js + Tailwind CSS + Shadcn UI + TypeScript. ⚡️ Full-stack React application with Auth, Multi-tenancy, Roles & Permissions, i18n, Landing Page, DB, Logging, Testing

TypeScript 6,599 1,187 Updated Dec 12, 2025