- Brisbane, Australia
Starred repositories
🦛 CHONK docs with Chonkie ✨ — The no-nonsense RAG library
Must-read Papers on Knowledge Editing for Large Language Models.
A claw machine made with a Raspberry Pi Pico and cheap replacement parts
Conveniently download files, models, tokenizers from HuggingFace Hub
⚡ Faster similarity search with PDX: A vertical data layout for vectors
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
⚡️ A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍
Tools for various benchmarking scenarios
📦 Command line peer-to-peer data transfer tool based on libp2p.
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
Code repository supporting the paper "Atlas: Few-shot Learning with Retrieval Augmented Language Models" (https://arxiv.org/abs/2208.03299)
Benchmarks of approximate nearest neighbor libraries in Python
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …
Fast and memory-efficient ANN with a subset-search functionality
A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Review).
An open-source NLP research library, built on PyTorch.
A technical report on convolution arithmetic in the context of deep learning