- Japan
-
08:07
(UTC +09:00) - https://secon.dev/
- https://huggingface.co/hotchpotch
- https://kaggle.com/hotchpotch
- @hotchpotch
-
duckdb-vaporetto Public
DuckDB extension for Japanese full-text search with 🛥Vaporetto / Vaporetto による DuckDB + 日本語全文検索拡張機能
-
sqlite-vaporetto Public
SQLite FTS5 extension for fast Japanese full-text search with 🛥Vaporetto / Vaporetto による高速な日本語全文検索を SQLite FTS5 で実現する拡張機能
-
sentence-transformers Public
Forked from huggingface/sentence-transformersState-of-the-Art Text Embeddings
Python Apache License 2.0 UpdatedApr 28, 2026 -
duckdb-vaporetto-wasm-demo Public
DuckDB + FTS + Vaporetto を用いた Wasm での Web ブラウザ上での日本語全文検索デモ
JavaScript UpdatedApr 25, 2026 -
beko-translate Public
beko-translateは、Apple Silicon Mac向けのCLI翻訳ツールです。PDF見開き翻訳機能も同梱してあり原文・訳文を交互に表示できます。
-
wikipedia-paragraphs Public
Forked from singletongue/wikipedia-paragraphsCleaned Wikipedia paragraphs for natural language processing (NLP)
Python Apache License 2.0 UpdatedFeb 21, 2026 -
PDFMathTranslate-next Public
Forked from PDFMathTranslate-next/PDFMathTranslate-nextPDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker
Python GNU Affero General Public License v3.0 UpdatedJan 15, 2026 -
open_provence Public
✂️ OpenProvence: Open-Source, Efficient, and Robust Context Pruning for Retrieval-Augmented Generation
-
llama_index Public
Forked from run-llama/llama_indexLlamaIndex is the leading framework for building LLM-powered agents over your data.
Python MIT License UpdatedOct 31, 2025 -
fast-bunkai Public
⚡Japanese sentence splitting(日本語文境界判定器), 40–250× faster via a Rust-accelerated Python library with near-perfect API compatibility with megagonlabs/bunkai.
-
JMTEB Public
Forked from sbintuitions/JMTEBThe evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)
Python Creative Commons Attribution Share Alike 4.0 International UpdatedSep 9, 2025 -
JaCWIR Public
JaCWIR: Japanese Casual Web IR - 日本語情報検索評価のための小規模でカジュアルなWebタイトルと概要のデータセット
-
JQaRA Public
JQaRA: Japanese Question Answering with Retrieval Augmentation - 検索拡張(RAG)評価のための日本語Q&Aデータセット
-
yast Public
YAST - Yet Another SPLADE or Sparse Trainer
-
yasem Public
YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings
-
記事タイトルがないものを、自動タイトル
Python MIT License UpdatedApr 21, 2025 -
Educational content scoring and evaluation code using fineweb-2 (Japanese). Includes training and assessment implementations for content rating tasks.
-
sd-16 Public
Forked from mahm/sd-16LangGraph sample code for Software Design article vol.16
Python UpdatedNov 21, 2024 -
JapaneseEmbeddingEval Public
Forked from oshizo/JapaneseEmbeddingEvalJupyter Notebook UpdatedOct 7, 2024 -
FlagEmbedding Public
Forked from FlagOpen/FlagEmbeddingRetrieval and Retrieval-augmented LLMs
Python MIT License UpdatedAug 29, 2024 -
text-embeddings-inference Public
Forked from huggingface/text-embeddings-inferenceA blazing fast inference solution for text embeddings models
Rust Apache License 2.0 UpdatedJun 12, 2024 -
vespa-kuromoji-linguistics Public
Forked from yahoojapan/vespa-kuromoji-linguisticsJava Apache License 2.0 UpdatedApr 3, 2024 -
wikipedia 日本語の文を、各種日本語の embeddings や faiss index へと変換するスクリプト等。
-
youri-7b を SFT で Q&A + RAG形式に特化したフォーマットで学習
-
ranx Public
Forked from AmenRa/ranx⚡️A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion 🐍
Python MIT License UpdatedFeb 21, 2024 -
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedAug 29, 2023 -
ncd_classifier Public
NCD Classifier is a Python library that implements the method proposed in the paper "Low-Resource" Text Classification: A Parameter-Free Classification Method with Compressors".
-
peft Public
Forked from huggingface/peft🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Python Apache License 2.0 UpdatedMay 29, 2023 -
langchain Public
Forked from langchain-ai/langchain⚡ Building applications with LLMs through composability ⚡
Python MIT License UpdatedMay 2, 2023