Starred repositories

Local-first memory for coding agents. Decisions, bugs, and context stored as Markdown, indexed locally with FTS5 plus optional semantic search. No RAM overhead at idle, no external servers.

Python 139 18 Updated Mar 26, 2026

Google Suite CLI: Gmail, GCal, GDrive, GContacts.

Go 6,772 515 Updated Mar 13, 2026

Mixing Language Models with Self-Verification and Meta-Verification

Jupyter Notebook 114 10 Updated Dec 12, 2024

Sample app to demonstrate instrumenting Python FastAPI Uvicorn app with Datadog, Elastic, New Relic and OpenTelemetry.

Python 1 Updated Dec 31, 2025

Structured Outputs

Python 13,645 677 Updated Mar 26, 2026

Reasoning Augmented Generation

Python 898 58 Updated Jul 15, 2025

Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)

Python 36 5 Updated Dec 17, 2024

Synthetic data curation for post-training and structured data extraction

Python 1,660 136 Updated Mar 28, 2026

Benchmarking LLMs via Uncertainty Quantification

Python 261 14 Updated Jan 30, 2024

A curated list of awesome approaches to AI model routing

194 23 Updated Mar 24, 2025
Jupyter Notebook 56 19 Updated Jul 31, 2024

Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments).

Python 411 63 Updated Apr 12, 2024

Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation

Python 9 Updated Jun 11, 2024
Jupyter Notebook 105 12 Updated Jun 30, 2024

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

815 54 Updated Apr 5, 2026

Structured outputs for LLMs

Python 12,727 1,007 Updated Apr 9, 2026

Recipes to scale inference-time compute of open models

Python 1,131 130 Updated Apr 2, 2026

Optimizing inference proxy for LLMs

Python 3,413 269 Updated Mar 19, 2026

FrugalGPT: better quality and lower cost for LLM applications

Jupyter Notebook 252 31 Updated Feb 10, 2025

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 2,138 237 Updated Oct 16, 2025

DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems

Python 70 8 Updated Sep 29, 2024

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Python 7,977 574 Updated Jul 11, 2025

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Python 823 56 Updated Jul 15, 2025

A reading list on LLM based Synthetic Data Generation 🔥

1,531 92 Updated Jun 5, 2025

[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.

Python 155 14 Updated Nov 2, 2023
Jupyter Notebook 5 1 Updated Nov 12, 2024

[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark

Python 227 34 Updated Feb 13, 2024
Jupyter Notebook 162 10 Updated Dec 2, 2024