Skip to content
View guenthermi's full-sized avatar

Organizations

@Wikidata @elastic @jina-ai @embeddings-benchmark

Block or report guenthermi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Semantic grep powered by Jina embeddings v5 (MLX on Apple Silicon)

Python 213 6 Updated Mar 13, 2026

Lightweight Plain Text Editor for macOS

Swift 7,910 491 Updated Apr 28, 2026

Model implementation for the contextual embeddings project

Python 47 2 Updated Jun 2, 2025

German dataset for DPR model training

Jupyter Notebook 19 1 Updated Jul 21, 2024

Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

Python 437 28 Updated Mar 26, 2024

Hybrid search engine, combining best features of text and semantic search worlds

Scala 610 15 Updated Jan 6, 2026

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

Python 2,912 317 Updated Apr 18, 2026

[ICLR 2023 Oral] Image as Set of Points

Python 574 43 Updated Apr 26, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,945 2,191 Updated Jul 29, 2024

Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks

Python 609 55 Updated Apr 11, 2023

Towards an open source stack for e-commerce search

Ruby 151 32 Updated Mar 21, 2026

An open source implementation of CLIP.

Python 13,754 1,281 Updated Apr 28, 2026

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptatio…

Python 341 38 Updated Jul 6, 2023

State-of-the-Art Text Embeddings

Python 18,606 2,776 Updated Apr 24, 2026

🎯 Task-oriented embedding tuning for BERT, CLIP, etc.

Python 1,506 66 Updated Mar 11, 2024

Simplify deploying and managing Jina projects on Jina Cloud

Python 298 12 Updated Oct 23, 2023

☁️ Build multimodal AI applications with cloud-native stack

Python 21,876 2,237 Updated Mar 24, 2025

A tool for manually classification of dwtc tables. The result is then being used as a training data set.

Java 2 1 Updated Jul 25, 2023
HTML 1 Updated Mar 9, 2020
JavaScript 1 2 Updated Jul 19, 2018

A collection of free Bootstrap 5 templates.

3,070 992 Updated Jul 25, 2024

A tool to analyse, browse and query Wikidata

TypeScript 85 17 Updated May 13, 2025

Examples showing how to use Wikidata Toolkit as a Maven library in your project

Java 55 23 Updated Sep 10, 2025

Java library to interact with Wikibase

Java 408 113 Updated Apr 27, 2026

This repo contains tutorials on OpenCV-Python library using new cv2 interface

Python 1,270 876 Updated Apr 25, 2021