-
NVIDIA
- Netherlands
- @marcromeyn
Lists (32)
Sort Name ascending (A-Z)
Build
Computer Vision
Cookbooks
custom-trainer
Dagster
Data-infra
Docs
Finance
Information Retrieval
Jax
Large Language Models
Large scale ML
LLM Eval
LLM Rapids
LLM + Tabular
MCP
meshx
ML
ML Executor
ML-Infra
NeMo
NeMo Agent
PKM
Python
Pytorch
Recsys
RL
Rust
Scripts
Shell
Tensorflow
Vscode
Starred repositories
SQL databases in Python, designed for simplicity, compatibility, and robustness.
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.
A Powerful Spider(Web Crawler) System in Python.
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
A curated list of awesome commands, files, and workflows for Claude Code
Train transformer language models with reinforcement learning.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & other integrations)
PyGWalker: Turn your dataframe into an interactive UI for visual analysis
verl: Volcano Engine Reinforcement Learning for LLMs
FauxPilot - an open-source alternative to GitHub Copilot server
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Postgres CLI with autocompletion and syntax highlighting
the first library to let you embed a developer agent in your own app!
An open-source NLP research library, built on PyTorch.
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.
A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.
Hackable and optimized Transformers building blocks, supporting a composable construction.
TensorFlow-based neural network library
The power of Claude Code / GeminiCLI / CodexCLI + [Gemini / OpenAI / OpenRouter / Azure / Grok / Ollama / Custom Model / All Of The Above] working as one.