- DFKI GmbH
- Bremen, Germany
- dfki.de/robotics
Stars
Robot Vulnerability Database. An archive of robot vulnerabilities and bugs.
nw_wrld is an event-driven sequencer for triggering visuals using web technologies. It enables users to scale up audiovisual compositions for prototyping, demos, exhibitions, and live performances.…
A generative world for general-purpose robotics & embodied AI learning.
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM …
Famous Vision Language Models and Their Architectures
Open-source and strong foundation image recognition models.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
Semantic Segmentation of Images and Point Clouds for Traversability Estimation
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
IFSeg: Image-free Semantic Segmentation via Vision-Language Model (CVPR 2023)
Collection of AWESOME vision-language models for vision tasks
[IROS 2024] [ICML 2024 Workshop Differentiable Almost Everything] MonoForce: Learnable Image-conditioned Physics Engine
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Journal of Field Robotics (JFR) 2026: Survey Paper about Autonomous Ground Robot System in Unstructured Off-Road Environments
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
Train transformer language models with reinforcement learning.
Official inference repo for FLUX.1 models
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
A command line toolkit to generate maps, point clouds, 3D models and DEMs from drone, balloon or kite images. 📷
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
StyleGAN2-ADA - Official PyTorch implementation
Dockerfile for Velodyne VLP-16 and VLP-32 in ROS 2
Official PyTorch implementation of StyleGAN3
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Export blender camera animations to Deforum Diffusion notebook format.