Skip to content
View mateiz's full-sized avatar

Organizations

@mesos @radlab

Block or report mateiz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Large World Model -- Modeling Text and Video with Millions Context

Python 7,396 560 Updated Oct 19, 2024

A Bulletproof Way to Generate Structured JSON from Language Models

Jupyter Notebook 4,905 186 Updated Feb 24, 2024

A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap.

Python 1,408 87 Updated Feb 7, 2025

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,805 1,147 Updated Jun 30, 2023
Python 1,533 221 Updated Jun 26, 2025

DSPy: The framework for programming—not prompting—language models

Python 32,252 2,628 Updated Feb 18, 2026

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,592 2,000 Updated Feb 18, 2026

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Java 1,151 154 Updated Feb 12, 2026

Sample base images for Databricks Container Services

Jupyter Notebook 206 128 Updated Nov 28, 2025

An open protocol for secure data sharing

Scala 921 216 Updated Feb 14, 2026

Offload IoT computation to local hardware while justifying any network accesses.

Rust 7 2 Updated May 31, 2023

A native Rust library for Delta Lake, with bindings into Python

Rust 3,149 579 Updated Feb 18, 2026

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation

Jupyter Notebook 336 58 Updated Feb 7, 2026

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,778 466 Updated Oct 14, 2025

The library for web and native user interfaces.

JavaScript 243,060 50,594 Updated Feb 18, 2026

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Python 28,229 4,554 Updated Feb 16, 2026

Joblib Apache Spark Backend

Python 249 25 Updated Apr 7, 2025

The Tensor Algebra SuperOptimizer for Deep Learning

C++ 740 93 Updated Jan 26, 2023
Python 392 115 Updated Nov 4, 2022

An open-source toolkit for large-scale genomic analysis

Scala 293 118 Updated Feb 15, 2026

Puffer is a free live TV streaming website and a research study at Stanford using machine learning to improve video streaming

C++ 901 137 Updated Nov 7, 2025

Koalas: pandas API on Apache Spark

Python 3,372 366 Updated Mar 20, 2024

A Python-embedded modeling language for convex optimization problems.

C++ 6,107 1,154 Updated Feb 17, 2026

The Legion Parallel Programming System

C++ 753 152 Updated Dec 17, 2025

GoCD plugins to work with MLFlow as model repository in a CD flow

Java 31 4 Updated Nov 1, 2023

MLflow App Library

Python 77 34 Updated Dec 25, 2018

Intellij Jsonnet Plugin

Java 90 17 Updated Mar 9, 2024

The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

Python 24,212 5,297 Updated Feb 18, 2026

The "Command Line Interactive Controller for Kubernetes"

Rust 1,508 91 Updated Oct 29, 2025

Accelerating network inference over video

Python 436 121 Updated Mar 6, 2020
Next