- Toronto, Canada
- thedataquarry.com
- @tech_optimist
- in/prrao87
-
cocoindex Public
Forked from cocoindex-io/cocoindex
Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!
Rust Apache License 2.0 Updated Mar 30, 2026 -
duckdb-web Public
Forked from duckdb/duckdb-web
🐤 DuckDB website and documentation
HTML MIT License Updated Mar 23, 2026 -
fine-grained-sentiment Public
A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.
-
lance Public
Forked from lance-format/lance
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…
Rust Apache License 2.0 Updated Mar 10, 2026 -
graph-benchmark Public
Graph benchmarks for Kuzu, Ladybug and lance-graph
-
graph-benchmark-ldbc Public
Graph benchmarks for Kuzu, Ladybug and lance-graph on LDBC SNB dataset
-
lancedb-study Public
Comparing LanceDB and Elasticsearch for full-text search and vector search performance
-
hub-docs Public
Forked from huggingface/hub-docs
Docs of the Hugging Face Hub
Handlebars Apache License 2.0 Updated Feb 2, 2026 -
DSRs Public
Forked from krypticmouse/DSRs
A DSPy rewrite to (not port) Rust
-
datasets Public
Forked from huggingface/datasets
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
Python Apache License 2.0 Updated Jan 22, 2026 -
huggingface.js Public
Forked from huggingface/huggingface.js
Use Hugging Face with JavaScript
TypeScript MIT License Updated Jan 17, 2026 -
dspy Public
Forked from stanfordnlp/dspy
DSPy: The framework for programming, not prompting, language models
-
ray Public
Forked from ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Python Apache License 2.0 Updated Dec 3, 2025 -
lance-namespace Public
Forked from lance-format/lance-namespace
Lance Namespace is an open specification on top of the storage-based Lance table and file format to standardize access to a collection of Lance tables
Java Apache License 2.0 Updated Nov 25, 2025 -
dspy-graph-rag Public
Experiments with structured outputs and Graph RAG in DSPy
-
ossinsight Public
Forked from pingcap/ossinsight
Analysis, Comparison, Trends, Rankings of Open Source Software, you can also get insight from more than 7 billion with natural language (powered by OpenAI). Follow us on Twitter: https://twitter.co…
TypeScript Apache License 2.0 Updated Aug 8, 2025 -
llama_index Public
Forked from run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
Python MIT License Updated Aug 7, 2025 -
unitycatalog Public
Forked from unitycatalog/unitycatalog
Open, Multi-modal Catalog for Data & AI
Python Apache License 2.0 Updated May 13, 2025 -
yfiles-jupyter-graphs-for-kuzu Public
Forked from yWorks/yfiles-jupyter-graphs-for-kuzu
The open-source adapter for working with Kuzu databases and cypher queries in jupyter notebooks leveraging the yFiles Graphs for Jupyter plugin.
Python MIT License Updated Apr 11, 2025 -
learn Public
Forked from marimo-team/learn
📚 A curated collection of marimo notebooks for education.
-
kuzudb-study Public
Benchmark study on Kuzu, an embedded graph database, on an artificial social network dataset
-
awesome-wasm Public
Forked from mbasso/awesome-wasm
😎 Curated list of awesome things regarding the WebAssembly (wasm) ecosystem.
-
build-with-baml Public
Example projects using BAML, a DSL that helps generate structured outputs from LLMs
-
baml Public
Forked from BoundaryML/baml
BAML is a language that helps you get structured data from LLMs, with the best DX possible. Works with all languages. Check out the promptfiddle.com playground
TypeScript Apache License 2.0 Updated Mar 8, 2025 -
pdf2image Public
Forked from BoundaryML/test
Repo to test PDF2Image conversion for use in BAML
Python Updated Mar 8, 2025