Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Apache Superset is a Data Visualization and Data Exploration Platform
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
Free and Open Source, Distributed, RESTful Search Engine
Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
A command line tool and library for transferring data with URL syntax, supporting DICT, FILE, FTP, FTPS, GOPHER, GOPHERS, HTTP, HTTPS, IMAP, IMAPS, LDAP, LDAPS, MQTT, POP3, POP3S, RTMP, RTMPS, RTSP…
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Apache Spark - A unified analytics engine for large-scale data processing
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
🪄 Create rich visualizations with AI
Python script to load the CNPJ files of public data from the Receita Federal into SQLite format
A computer algebra system written in pure Python
An open protocol enabling communication and interoperability between opaque agentic applications.
Cartopy - a cartographic python library with matplotlib support
Kepler.gl is a powerful open source geospatial analysis tool for large-scale data sets.
Backend of Brasil.IO (for the code of the data collection scripts, see the link on each dataset's page)
A game theoretic approach to explain the output of any machine learning model.