Stars
GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Fara-7B: An Efficient Agentic Model for Computer Use
GELab: GUI Exploration Lab. One of the best GUI agent solutions in the galaxy, built by the StepFun-GELab team and powered by Step’s research capabilities.
A powerful Python library for creating and managing isolated desktop environments using Docker containers.
openvla / openvla
Forked from TRI-ML/prismatic-vlms. OpenVLA: An open-source vision-language-action model for robotic manipulation.
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
Official PyTorch implementation for "Large Language Diffusion Models"
ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation
Witness the aha moment of a VLM for less than $3.
Native Multimodal Models are World Learners
A frontier collection and survey of vision-language model papers and models, hosted as a GitHub repository. Continuously updated.
verl: Volcano Engine Reinforcement Learning for LLMs
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks.
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
A high-throughput and memory-efficient inference and serving engine for LLMs
Tips and resources to prepare for Behavioral interviews.
12 Lessons to Get Started Building AI Agents
Implementation of Phenaki Video, which uses MaskGIT to produce text-guided videos of up to 2 minutes in length, in PyTorch.
LBM: Latent Bridge Matching for Fast Image-to-Image Translation ✨ (ICCV 2025 Highlight)
Reference PyTorch implementation and models for DINOv3
ModelTC / Wan2.2-Lightning
Forked from Wan-Video/Wan2.2. Wan2.2-Lightning: Speed up the Wan2.2 model with distillation.