-
NVIDIA
- Netherlands
- @marcromeyn
Lists (32)
Sort Name ascending (A-Z)
Build
Computer Vision
Cookbooks
custom-trainer
Dagster
Data-infra
Docs
Finance
Information Retrieval
Jax
Large Language Models
Large scale ML
LLM Eval
LLM Rapids
LLM + Tabular
MCP
meshx
ML
ML Executor
ML-Infra
NeMo
NeMo Agent
PKM
Python
Pytorch
Recsys
RL
Rust
Scripts
Shell
Tensorflow
Vscode
Starred repositories
A library for efficient similarity search and clustering of dense vectors.
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
CUDA Templates and Python DSLs for High-Performance Linear Algebra
Fast inference engine for Transformer models
Learning embeddings for classification, retrieval and ranking.
A composable and fully extensible C++ execution engine library for data management systems.
Portfolio Optimization and Quantitative Strategic Asset Allocation in Python
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI …
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.
Collective communications library with various primitives for multi-machine training.
ThunderGBM: Fast GBDTs and Random Forests on GPUs
A multi-model machine learning feature embedding database
NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process communication and coordination overheads by allowing programmer…
Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also …
Minimalist code necessary for using porting C++ functions/classes using OpenCV's "Mat" type in functions argument lists directly (w/o explicit conversions) to python.