Skip to content
View camscottie's full-sized avatar

Block or report camscottie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs

Scala 1,118 256 Updated Dec 6, 2025

Introduction to Machine Learning Systems

JavaScript 11,059 1,241 Updated Dec 21, 2025

Refine high-quality datasets and visual AI models

Python 10,165 693 Updated Dec 21, 2025

Label Studio is a multi-type data labeling and annotation tool with standardized output format

TypeScript 25,900 3,269 Updated Dec 21, 2025

Open source audio annotation tool for humans

TypeScript 1,124 138 Updated Dec 14, 2025

Audio Annotation Tool for ML development

TypeScript 79 15 Updated Dec 21, 2025

Transform your Obsidian into a powerful video note-taking tool. 🖇️🗂️⏯️

818 74 Updated Dec 2, 2025

Sample FastAPI application to showcase how to leverage Databricks services.

Python 6 2 Updated Jul 1, 2025

This repository implements a production-grade foundation for agentic governance that transforms policy and legislation into executable, accountable Decision Functions. It represents an open standar…

Python 6 Updated Nov 24, 2025

Make dbt great again! Extend dbt with plugins, local docs and custom adapters — fast, safe, and developer-friendly

Python 267 15 Updated Dec 10, 2025

A native Rust library for Delta Lake, with bindings into Python

Rust 3,077 558 Updated Dec 21, 2025

A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

Python 17,959 840 Updated Dec 21, 2025

A neovim plugin to run lines/blocs of code (independently of the rest of the file), supporting multiples languages

Rust 1,655 48 Updated Dec 9, 2025

🖼️ Bringing images to Neovim.

Lua 1,795 85 Updated Sep 7, 2025

A neovim plugin for interactively running code with the jupyter kernel. Fork of magma-nvim with improvements in image rendering, performance, and more

Python 1,053 60 Updated Nov 5, 2025

A fully-featured batteries-included Neovim distribution for the world of Data Science. Prepared to run code and interact with Jupyter Notebooks without ever leaving your terminal.

Lua 107 1 Updated Jun 29, 2024

Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.

Python 141 27 Updated Sep 2, 2025

QuackIR is an IR toolkit built on DuckDB

Python 13 1 Updated Nov 6, 2025

A real-time reddit data streaming pipeline for sentiment analysis of various subreddits

HCL 139 19 Updated Aug 23, 2023

Light-weight, browser-based ROLAP pivot tables on top of DuckDB-WASM

JavaScript 532 27 Updated Dec 9, 2025

Awesome Data Engineering

20 2 Updated Feb 3, 2025

Always know what to expect from your data.

Python 11,013 1,653 Updated Dec 19, 2025

A collection of handy CLI tools to convert CSV and JSON to Apache Arrow and Parquet

Rust 198 15 Updated Dec 1, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 52,391 5,608 Updated Dec 19, 2025

The universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with ease.

TypeScript 2,123 347 Updated Apr 15, 2025

Web Speech API

Bikeshed 176 38 Updated Aug 27, 2025

The Web MIDI API, developed by the W3C Audio WG

HTML 336 51 Updated Dec 9, 2025

The Web Audio API v1.0, developed by the W3C Audio WG

Bikeshed 1,097 172 Updated Dec 15, 2025
Next