- Netherlands
-
17:05
(UTC +01:00) - https://nl.linkedin.com/in/jeroensteggink
Lists (32)
Sort Name ascending (A-Z)
AI Agents
Annotations
Apache Pulsar
Apache Spark
Big Data
Big DataBusiness Rules
Crawler
Data Management
Data quality
Data Science
DevOps
GraphQL
Graphs
Kubernetes
LLM
Machine Learning
Messaging
Messaging / RPC
ML - Images
MLOps
Music
NLP
Natural Language ProcessingOpenShift
Programming
Reverse engineering
Search
Security
UI
Video
Vision Language Models
Web
Windows
- All languages
- ANTLR
- Bicep
- C
- C#
- C++
- CMake
- CSS
- ChucK
- Clojure
- Cuda
- Cypher
- Dart
- Dockerfile
- Elixir
- Erlang
- FreeMarker
- GDScript
- GLSL
- Gherkin
- Go
- Groovy
- HCL
- HTML
- Java
- JavaScript
- Jinja
- Julia
- Jupyter Notebook
- Kotlin
- Lua
- MATLAB
- MDX
- MLIR
- Makefile
- Markdown
- Max
- Mojo
- Mustache
- Nim
- Nix
- OCaml
- Open Policy Agent
- PDDL
- PHP
- Perl
- PowerShell
- Prolog
- Python
- R
- Racket
- Rich Text Format
- Roff
- Ruby
- Rust
- Scala
- Scheme
- Shell
- Smarty
- Svelte
- TeX
- TypeScript
- Vue
- XSLT
- Zig
Starred repositories
Apache Spark - A unified analytics engine for large-scale data processing
💾 Database Tools incl. ORM, Migrations and Admin UI (Postgres, MySQL & MongoDB) [deprecated]
CMAK is a tool for managing Apache Kafka clusters
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
The leader in Customer Data Infrastructure
State of the Art Natural Language Processing
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
A Scala API for Apache Beam and Google Cloud Dataflow.
A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Streaming MapReduce with Scalding and Storm
Feathr – A scalable, unified data and AI engineering platform for enterprise
Html Content / Article Extractor in Scala - open sourced from Gravity Labs
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
🤖 A bot that helps you keep your projects up-to-date
Free Elasticsearch security plugin and Kibana security plugin: super-easy Kibana multi-tenancy, Encryption, Authentication, Authorization, Auditing
Chronon is a data platform for serving for AI/ML applications.
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
The software used to extract structured data from Wikipedia
An open protocol for secure data sharing
Essential Spark extensions and helper methods ✨😲
DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text.
Data Lineage Tracking And Visualization Solution
Hybrid search engine, combining best features of text and semantic search worlds
Qubole Sparklens tool for performance tuning Apache Spark
FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a succinct language for creating relational factor graphs, estima…