Skip to content
View shuttie's full-sized avatar

Block or report shuttie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
26 stars written in Python
Clear filter

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 59,423 6,066 Updated Feb 4, 2026

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,265 4,016 Updated Jul 17, 2024

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

Python 21,170 3,092 Updated Feb 4, 2026

Go ahead and axolotl questions

Python 11,222 1,248 Updated Feb 3, 2026

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

Python 5,012 226 Updated Feb 4, 2026

MTEB: Massive Text Embedding Benchmark

Python 3,104 551 Updated Feb 3, 2026

dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or on-prem.

Python 2,022 211 Updated Feb 4, 2026

pytest fixture for benchmarking code

Python 1,408 127 Updated Nov 9, 2025

Metric learning and retrieval pipelines, models and zoo.

Python 984 73 Updated Nov 26, 2025

Finetuning Large Language Models on One Consumer GPU in 2 Bits

Python 734 76 Updated May 25, 2024
Python 579 61 Updated Sep 23, 2025

Code for our ECCV 2018 work.

Python 454 92 Updated Sep 15, 2019

high performance in-memory cache

Python 426 11 Updated Nov 21, 2025

Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search

Python 341 69 Updated Oct 7, 2024

Code plagiarism detection tool

Python 321 43 Updated Apr 6, 2024

Full text search that feels like a numpy array

Python 301 13 Updated Feb 1, 2026

Library for automatic retraining and continual learning

Python 297 7 Updated Sep 30, 2024

Pure-Python Server Side Events (SSE) client

Python 227 35 Updated Jan 2, 2026
Python 130 19 Updated Aug 21, 2023

An efficient PyTorch implementation of the evaluation metrics in recommender systems.

Python 28 1 Updated Feb 22, 2023

HSEB: Hybrid Search Engine Benchmark

Python 20 2 Updated Oct 5, 2025

Stable Diffusion inference benchmarks

Python 10 Updated Jun 14, 2024

Experimental code for our paper on informative and diverse sampling of negative examples for dense retrieval

Python 3 1 Updated Mar 10, 2024

ICTIR Paper

Python 2 Updated Jul 15, 2022