Stars
A fast, scalable, and intuitive Python package for sequence analysis.
Knwler is a lightweight, single-file Python tool that extracts structured knowledge graphs from documents using AI. Feed it a PDF or text file and receive a richly connected network of entities, re…
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
OctoTools: An agentic framework with extensible tools for complex reasoning
G6VP is an online visual analysis tool for graphs and a low-code platform for building graph applications.
Seamlessly integrate LLMs into scikit-learn.
mRMR (minimum-Redundancy-Maximum-Relevance) for automatic feature selection at scale.
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing…
TensorFlow GNN is a library to build Graph Neural Networks on the TensorFlow platform.
Graph+Semantics: Import/Export RDF from Neo4j. SHACL validation, model mapping, and more. If you like it, please ★ ⇧
Artifacts intended to support the Ray Developer Community: SIGs, RFC overviews, and governance. We're very glad you're here! ✨
The Code Examples and Notebooks for The Practitioner's Guide to Graph Data
General-purpose dimensionality reduction and manifold learning tool based on Variational Autoencoder, implemented in TensorFlow.
Inductive relation prediction by subgraph reasoning, ICML'20
🏅 KG Inductive Link Prediction Challenge (ILPC) 2022
"Inductive Entity Representations from Text via Link Prediction" @ The Web Conference 2021
A game theoretic approach to explain the output of any machine learning model.
Like cURL, but for gRPC: Command-line tool for interacting with gRPC servers
"The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.
Code snippets for solving common big data problems on various platforms. Inspired by Rosetta Code.
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, etc. It also comes with Hadoop support built in.
A Scala API for Apache Beam and Google Cloud Dataflow.
A framework for generating complex and realistic datasets for use in evaluating causal inference methods.
Natural language processing support for Pandas dataframes.
Literature references for “Designing Data-Intensive Applications”