Skip to content
View CreateRandom's full-sized avatar

Block or report CreateRandom

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Configuration-based installation of OpenShift and Cloud Pak for Data/Integration/Watson AIOps/Business Automation on various private and public cloud infrastructure providers. Deployment attempts t…

Jinja 157 80 Updated May 11, 2026

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

Go 46,297 4,085 Updated May 16, 2026

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).

Python 9,987 1,065 Updated May 17, 2026

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 6,867 560 Updated Jul 11, 2024

Multi-tool for semantic search

Python 2,705 158 Updated Aug 27, 2024

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

Python 2,502 238 Updated May 15, 2026

Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...

395 34 Updated Nov 10, 2022

A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow

Python 2,086 100 Updated Dec 15, 2023

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…

Python 10,863 1,036 Updated May 15, 2026

ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.

Python 5,414 615 Updated May 16, 2026

The agent engineering platform.

Python 136,918 22,647 Updated May 17, 2026

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

Rust 10,825 257 Updated May 17, 2026

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …

HTML 14,720 1,234 Updated May 15, 2026

Low-code framework for building custom LLMs, neural networks, and other AI models

Python 11,697 1,219 Updated May 17, 2026

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Python 12,797 2,389 Updated May 17, 2026

An orchestration platform for the development, production, and observation of data assets.

Python 15,524 2,124 Updated May 15, 2026

Tools for detecting wildlife in aerial images using active learning

Python 242 61 Updated Mar 30, 2026

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Python 3,447 261 Updated Oct 18, 2024

Build data pipelines, the easy way 🛠️

TypeScript 4,137 263 Updated Jun 6, 2023

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,938 1,061 Updated May 11, 2026

🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

Python 4,462 570 Updated Jul 29, 2025

🐢 bayesAB: Fast Bayesian Methods for A/B Testing

R 314 42 Updated Jun 25, 2021

9 tools for Goodreads.com, for finding people based on the books they’ve read, finding books popular among the people you follow, following new book reviews, etc

Perl 88 7 Updated Jan 29, 2023

State-of-the-Art Embeddings, Retrieval, and Reranking

Python 18,670 2,789 Updated May 15, 2026

An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP

Java 276 62 Updated Nov 5, 2022

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Python 12,834 2,069 Updated Jan 23, 2024

Benchmarks of approximate nearest neighbor libraries in Python

Python 5,667 897 Updated Jun 10, 2025

A machine learning software for extracting information from scholarly documents

Java 4,868 552 Updated May 14, 2026

Code to accompany ICML 2018 paper

Python 567 129 Updated Sep 23, 2021

Open-source, low-code AutoML platform for Python. PyCaret 4.0: sklearn-native engine + React control plane.

Python 9,790 1,861 Updated May 16, 2026
Next