Skip to content
View mateiz's full-sized avatar

Organizations

@mesos @radlab

Block or report mateiz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Large World Model -- Modeling Text and Video with Millions Context

Python 7,419 558 Updated Oct 19, 2024

A Bulletproof Way to Generate Structured JSON from Language Models

Jupyter Notebook 4,930 184 Updated Feb 24, 2024

A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap.

Python 1,409 89 Updated Feb 7, 2025

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,798 1,139 Updated Jun 30, 2023
Python 1,570 229 Updated Mar 25, 2026

DSPy: The framework for programming—not prompting—language models

Python 35,141 2,980 Updated Jun 18, 2026

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,859 2,116 Updated Jun 18, 2026

Scalable master data management, identity resolution, entity resolution, and deduplication using ML

Java 1,218 168 Updated Jun 13, 2026

Sample base images for Databricks Container Services

Jupyter Notebook 221 129 Updated Jun 8, 2026

An open protocol for secure data sharing

Scala 951 229 Updated Jun 13, 2026

Offload IoT computation to local hardware while justifying any network accesses.

Rust 7 2 Updated May 31, 2023

A native Rust library for Delta Lake, with bindings into Python

Rust 3,243 629 Updated Jun 15, 2026

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation

Jupyter Notebook 342 59 Updated Apr 1, 2026

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,886 467 Updated Oct 14, 2025

The library for web and native user interfaces.

JavaScript 245,985 51,074 Updated Jun 18, 2026

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Python 28,646 4,605 Updated Jun 14, 2026

Joblib Apache Spark Backend

Python 250 24 Updated Mar 24, 2026

The Tensor Algebra SuperOptimizer for Deep Learning

C++ 743 93 Updated Jan 26, 2023
Python 394 115 Updated Nov 4, 2022

An open-source toolkit for large-scale genomic analysis

Scala 303 117 Updated Jun 7, 2026

Puffer is a free live TV streaming website and a research study at Stanford using machine learning to improve video streaming

C++ 913 141 Updated Nov 7, 2025

Koalas: pandas API on Apache Spark

Python 3,374 371 Updated Mar 20, 2024

A Python-embedded modeling language for convex optimization problems.

C++ 6,258 1,183 Updated Jun 18, 2026

The Legion Parallel Programming System

C++ 759 154 Updated Mar 28, 2026

GoCD plugins to work with MLFlow as model repository in a CD flow

Java 31 4 Updated Nov 1, 2023

MLflow App Library

Python 80 36 Updated Dec 25, 2018

Intellij Jsonnet Plugin

Java 91 17 Updated Mar 9, 2024

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while control…

Python 26,610 5,864 Updated Jun 18, 2026

The "Command Line Interactive Controller for Kubernetes"

Rust 1,509 92 Updated Mar 27, 2026

Accelerating network inference over video

Python 437 120 Updated Mar 6, 2020
Next