guenthermi

Organizations

@Wikidata @elastic @jina-ai @embeddings-benchmark

Semantic grep powered by Jina embeddings v5 (MLX on Apple Silicon)

Python 204 6 Updated Mar 13, 2026

Lightweight Plain-Text Editor for macOS

Swift 7,813 483 Updated Apr 15, 2026

Model implementation for the contextual embeddings project

Python 47 2 Updated Jun 2, 2025

German dataset for DPR model training

Jupyter Notebook 19 1 Updated Jul 21, 2024

Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

Python 433 28 Updated Mar 26, 2024

Hybrid search engine, combining best features of text and semantic search worlds

Scala 608 15 Updated Jan 6, 2026

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

Python 2,901 313 Updated Apr 14, 2026

[ICLR 2023 Oral] Image as Set of Points

Python 575 42 Updated Apr 26, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,946 2,191 Updated Jul 29, 2024

Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks

Python 609 55 Updated Apr 11, 2023

Towards an open source stack for e-commerce search

Ruby 151 32 Updated Mar 21, 2026

An open source implementation of CLIP.

Python 13,687 1,276 Updated Apr 6, 2026

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptatio…

Python 341 38 Updated Jul 6, 2023

State-of-the-Art Text Embeddings

Python 18,549 2,769 Updated Apr 14, 2026

🎯 Task-oriented embedding tuning for BERT, CLIP, etc.

Python 1,506 68 Updated Mar 11, 2024

Simplify deploying and managing Jina projects on Jina Cloud

Python 298 12 Updated Oct 23, 2023

☁️ Build multimodal AI applications with cloud-native stack

Python 21,868 2,239 Updated Mar 24, 2025

A tool for manual classification of dwtc tables. The results are then used as a training data set.

Java 2 1 Updated Jul 25, 2023

HTML 1 Updated Mar 9, 2020

JavaScript 1 2 Updated Jul 19, 2018

A collection of free Bootstrap 5 templates.

3,070 993 Updated Jul 25, 2024

A tool to analyse, browse and query Wikidata

TypeScript 85 17 Updated May 13, 2025

Examples showing how to use Wikidata Toolkit as a Maven library in your project

Java 55 23 Updated Sep 10, 2025

Java library to interact with Wikibase

Java 407 113 Updated Mar 30, 2026

This repo contains tutorials on OpenCV-Python library using new cv2 interface

Python 1,271 877 Updated Apr 25, 2021