- São Paulo
Stars
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Free and Open Source, Distributed, RESTful Search Engine
Apache Superset is a Data Visualization and Data Exploration Platform
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Python Data Science Handbook: full text in Jupyter Notebooks
LlamaIndex is the leading framework for building LLM-powered agents over your data.
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache Spark - A unified analytics engine for large-scale data processing
A command line tool and library for transferring data with URL syntax, supporting DICT, FILE, FTP, FTPS, GOPHER, GOPHERS, HTTP, HTTPS, IMAP, IMAPS, LDAP, LDAPS, MQTT, POP3, POP3S, RTMP, RTMPS, RTSP…
Google Research
A game theoretic approach to explain the output of any machine learning model.
SQL powered operating system instrumentation, monitoring, and analytics.
An open protocol enabling communication and interoperability between opaque agentic applications.
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
🪄 Create rich visualizations with AI
A computer algebra system written in pure Python
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Kepler.gl is a powerful open source geospatial analysis tool for large-scale data sets.
Vamos transformar o Brasil em uma API?
Fully local web research and report writing assistant