wolfworld6

7 followers · 14 following

codex_demo Public

examples

Updated Feb 9, 2026
unsloth Public
Forked from unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python Apache License 2.0 Updated Dec 23, 2025
SAM3_LoRA Public
Forked from Sompote/SAM3_LoRA

Finetune SAM3 with LoRA — optimized for images. A simple setup for training SAM3 on image datasets. Video finetuning is not yet supported but planned for future releases.

Python Updated Dec 18, 2025
DINOV3-YOLOV12 Public
Forked from Sompote/DINOV3-YOLOV12

Use DINOv3’s powerful, self-supervised visual features + YOLOv12’s blazing-fast detection, all in one repo. Whether you have only a few hundred labeled images or a medium-sized dataset, DINOV3-YOLO…

Python GNU Affero General Public License v3.0 Updated Nov 27, 2025
textvqa_grounding_task_qwen2.5-vl-ft Public
Forked from 828Tina/textvqa_grounding_task_qwen2.5-vl-ft

Jupyter Notebook Updated May 20, 2025
VL-Rethinker Public
Forked from TIGER-AI-Lab/VL-Rethinker

The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"

Python Apache License 2.0 Updated Apr 29, 2025
deepseek-r1-vision Public
Forked from sungatetop/deepseek-r1-vision

A method to make a VLM think like R1

Python Updated Feb 20, 2025
Video-RAG-master Public
Forked from Leon1207/Video-RAG-master

This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python Updated Jan 15, 2025
qwen2vl_data_processingv3 Public
Forked from zew013/qwen2vl_data_processingv3

Jupyter Notebook Updated Dec 15, 2024
Awaker Public
Forked from MetabrainAGI/Awaker2.5-VL

Python Apache License 2.0 Updated Nov 24, 2024
LLaVA-o1 Public
Forked from PKU-YuanGroup/LLaVA-CoT

Apache License 2.0 Updated Nov 19, 2024
VisRAG Public
Forked from OpenBMB/VisRAG

Parsing-free RAG supported by VLMs

Python Apache License 2.0 Updated Nov 4, 2024
Qwen2-vl-sft Public
Forked from digbangbang/Qwen2-vl-sft

This repository contains a project I completed during my internship at Meituan. Specifically, it performs SFT on Qwen2-vl, uses internal company data, and fine-tunes Qwen2-vl for downstream tasks (…

Python Apache License 2.0 Updated Sep 20, 2024
PrimeVul Public
Forked from DLVulDet/PrimeVul

Repository for PrimeVul Vulnerability Detection Dataset

Python MIT License Updated Sep 7, 2024
Vista Public
Forked from OpenDriveLab/Vista

A Generalizable World Model for Autonomous Driving

Python Apache License 2.0 Updated Sep 4, 2024
AL-Ref-SAM2 Public
Forked from appletea233/AL-Ref-SAM2

AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

Python MIT License Updated Sep 4, 2024
dify-with-qwen-vl Public
Forked from soulteary/dify-with-qwen-vl

Video understanding: Qwen video multimodal model & Dify

Python Apache License 2.0 Updated Sep 2, 2024
llm2vec Public
Forked from McGill-NLP/llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python MIT License Updated Aug 30, 2024
AIcity2024-track3 Public

Python 5 2 Updated May 11, 2024
Multi-LLM-Agent Public
Forked from X-PLUG/Multi-LLM-Agent

Python Updated Apr 23, 2024
snag_release Public
Forked from fmu2/snag_release

Official Implementation of SnAG (CVPR 2024)

Python Updated Apr 22, 2024
keras-llm-robot Public
Forked from smalltong02/keras-llm-robot

A web UI project for learning about large language models. This project includes features such as chat, quantization, fine-tuning, prompt engineering templates, and multimodality.

Python Apache License 2.0 Updated Jan 23, 2024
llama Public
Forked from meta-llama/llama

Inference code for LLaMA models

Python 1 Other Updated Jan 4, 2024
ego4d_asl Public
Forked from JonnyS1226/ego4d_asl

code for Ego4D Workshop@CVPR 2023 - 1st in MQ & 2nd in NLQ challenge

Python Updated Dec 19, 2023
NExT-Chat Public
Forked from NExT-ChatV/NExT-Chat

The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".

Python Apache License 2.0 Updated Dec 19, 2023
MIC Public
Forked from HaozheZhao/MIC

MMICL, a state-of-the-art VLM with in-context learning ability, from ICL, PKU

Python Updated Dec 18, 2023
AdaTAD Public
Forked from sming256/AdaTAD

The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames

1 Updated Dec 9, 2023
dot Public
Forked from 16lemoing/dot

Updated Dec 8, 2023
ONE-PEACE Public
Forked from OFA-Sys/ONE-PEACE

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Python Apache License 2.0 Updated Dec 5, 2023
Video-LLaVA Public
Forked from PKU-YuanGroup/Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python Apache License 2.0 Updated Nov 26, 2023
