Highlights
-
electric Public
Forked from electric-sql/electricReal-time sync for Postgres.
Elixir Apache License 2.0 UpdatedAug 1, 2025 -
-
wtpsplit Public
Forked from segment-any-text/wtpsplitToolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
Python MIT License UpdatedJun 23, 2025 -
haystack Public
Forked from deepset-ai/haystackAI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
Python Apache License 2.0 UpdatedMay 21, 2025 -
tantivy-py Public
Forked from quickwit-oss/tantivy-pyPython bindings for Tantivy
Rust MIT License UpdatedApr 12, 2025 -
-
rich Public
Forked from Textualize/richRich is a Python library for rich text and beautiful formatting in the terminal.
Python MIT License UpdatedMar 30, 2025 -
pgvectorscale Public
Forked from timescale/pgvectorscaleA complement to pgvector for high performance, cost efficient vector search on large workloads.
Rust PostgreSQL License UpdatedMar 14, 2025 -
pdftext Public
Forked from datalab-to/pdftextExtract structured text from pdfs quickly
Python Apache License 2.0 UpdatedFeb 26, 2025 -
edgartools Public
Forked from dgunning/edgartoolsNavigate SEC Edgar data in Python
Python MIT License UpdatedFeb 14, 2025 -
fancy-regex Public
Forked from fancy-regex/fancy-regexRust library for regular expressions using "fancy" features like look-around and backreferences
Rust MIT License UpdatedJan 31, 2025 -
surya Public
Forked from datalab-to/suryaOCR, layout analysis, reading order, table recognition in 90+ languages
Python GNU General Public License v3.0 UpdatedJan 25, 2025 -
pyvespa Public
Forked from vespa-engine/pyvespaPython API for https://vespa.ai, the open big data serving engine
Python Apache License 2.0 UpdatedDec 12, 2024 -
vidore-benchmark Public
Forked from illuin-tech/vidore-benchmarkVision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
Python MIT License UpdatedOct 21, 2024 -
-
-
mdm4-splicing Public
Computational analysis of RPL22 alterations and impact on MDM4 splicing
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedAug 3, 2024 -
-
text-embeddings-inference Public
Forked from huggingface/text-embeddings-inferenceA blazing fast inference solution for text embeddings models
Rust Apache License 2.0 UpdatedMay 12, 2024 -
-
many Public
Frequently-used methods for exploratory analysis
-
cancer_data Public
A unified downloader+preprocessor for cancer genomics datasets
-
-
gaoya Public
Forked from serega/gaoyaLocality Sensitive Hashing
Rust MIT License UpdatedMar 1, 2024 -
paradedb Public
Forked from paradedb/paradedbPostgres for Search and Analytics
Rust GNU Affero General Public License v3.0 UpdatedMar 1, 2024 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedFeb 24, 2024 -
schema-infer Public
Forked from triggerdotdev/schema-inferInfers JSON Schemas and Type Definitions from example JSON
TypeScript MIT License UpdatedFeb 6, 2024 -
json-schema-fns Public
Forked from triggerdotdev/json-schema-fnsModern utility library and typescript typings for building JSON Schema documents
TypeScript MIT License UpdatedFeb 6, 2024 -
json-infer-types Public
Forked from triggerdotdev/json-infer-typesInfers the type and format of JSON values
TypeScript MIT License UpdatedFeb 6, 2024