462 results for source starred repositories

DSPy: The framework for programming—not prompting—language models

Python 29,864 2,381 Updated Nov 6, 2025

R Toolkit for Databricks

R 72 15 Updated Sep 19, 2025

Accelerates migrations to Databricks by automating key migration activities

Python 114 78 Updated Nov 7, 2025
Python 58 17 Updated Sep 18, 2025

Databricks framework to validate Data Quality of PySpark DataFrames

Python 331 69 Updated Nov 7, 2025

Python scraper based on AI

Python 21,724 1,880 Updated Nov 6, 2025

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 179,542 46,109 Updated Nov 7, 2025

🚀 The open-source, multi-tenant, self-building knowledge graph

Rust 1,363 104 Updated Nov 7, 2025

This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.

Python 615 230 Updated Sep 10, 2025

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Python 9,458 732 Updated Jun 7, 2025

Democratizing Internet-scale financial data.

Jupyter Notebook 1,271 236 Updated Jul 1, 2024

Facilitates simple large-scale processing of HLS medical images, documents, and zip files. Includes OHIF Viewer, 2 segmentation models, and interactive learning.

JavaScript 44 28 Updated Nov 6, 2025

A Swiss-Army-knife for your Data Intelligence platform administration.

Python 129 17 Updated Apr 30, 2025

Spark-based library that helps construct and query knowledge graphs from unstructured and structured data

Scala 98 10 Updated Sep 2, 2023

Demos to implement your Databricks Lakehouse

HTML 379 143 Updated Nov 5, 2025

An open source ML system for the end-to-end data science lifecycle

Java 1,068 501 Updated Nov 1, 2025

An excerpt from our financial valuation model of Tesla

1,185 148 Updated Jun 12, 2024

🧱 Databricks CLI eXtensions, aka dbx, is a CLI tool for development and advanced Databricks workflows management.

Python 453 127 Updated Apr 22, 2025

bamboolib - a GUI for pandas DataFrames

Jupyter Notebook 951 94 Updated Feb 20, 2024

Databricks Terraform Provider

Go 555 469 Updated Nov 7, 2025

Geospatial clustering at massive scale

Scala 105 19 Updated Jul 11, 2024

Capture deep metrics on one or all assets within a Databricks workspace

Scala 231 69 Updated Nov 20, 2024

Demo of using Nutter for testing of Databricks notebooks in the CI/CD pipeline

Python 153 127 Updated Aug 14, 2024

Complete end-to-end sample of doing DevOps with Azure Databricks

Shell 69 109 Updated Feb 2, 2022

An experimental tool to synchronize source Databricks deployment with a target Databricks deployment.

Python 47 14 Updated Jan 21, 2024

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, sum, count, etc.), AS OF joins, downsampling, and interpolation

Jupyter Notebook 332 58 Updated Oct 18, 2025

An open-source toolkit for large-scale genomic analysis

Scala 291 117 Updated Nov 2, 2025

A native Rust library for Delta Lake, with bindings into Python

Rust 3,021 542 Updated Nov 5, 2025