Skip to content
View oztrkoguz's full-sized avatar

Block or report oztrkoguz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

VoiceHub: A Unified Inference Interface for TTS Models

Python 55 2 Updated Oct 6, 2025

An AI-powered tool for summarizing YouTube videos by generating scene descriptions, translating them, and creating subtitled videos with text-to-speech narration

Python 40 2 Updated Aug 2, 2025

My ComfyUI workflows collection

15 Updated Jan 18, 2025

A course on aligning smol models.

Jupyter Notebook 1 Updated Dec 11, 2024

A course on aligning smol models.

Jupyter Notebook 6,433 2,285 Updated Oct 1, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 18,677 1,266 Updated Oct 8, 2025

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 12,827 1,338 Updated Oct 9, 2025

Custom nodes for using fal API.

Python 157 30 Updated Oct 3, 2025

"MGPT-Langchain-ChatBot-Multi-Functionality-Ollama" is a versatile chatbot framework that uses LangChain to build a context-aware chatbot in Python. This project integrates with Ollama, enabling mu…

Python 4 Updated Sep 5, 2024

This project aims to compare different Retrieval-Augmented Generation (RAG) frameworks in terms of speed and performance.

Python 14 1 Updated Jul 28, 2024

Agentic components of the Llama Stack APIs

4,276 634 Updated Aug 5, 2025

Image Upscaler with Tile Controlnet Fully Integrated in Huggingface Diffusers

Python 17 3 Updated Jul 17, 2024

Dream Interpreter inside ComfyUI

JavaScript 81 13 Updated Jul 31, 2024

Famous Vision Language Models and Their Architectures

Markdown 1,034 53 Updated Feb 24, 2025

It automatically describes images in PDF files and generates questions from these descriptions. With its advanced RAG structure, it directs these questions directly to PDF text content, providing c…

Python 12 Updated Jun 29, 2024

Agent Framework For Fintech

Python 7,658 712 Updated Oct 9, 2025

This project is an automated research and summarization tool that allows users to conduct research on a specific question and summarize the information found and present it as a blog post.

Python 12 1 Updated Jun 3, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 90,522 10,126 Updated Oct 9, 2025

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 2 Updated May 14, 2024

This project offers a user-friendly interface that allows users to easily create stories and enrich them with visuals. It supports creativity with story creation and visualisation features.

Python 31 3 Updated Apr 7, 2025

PuLID native implementation for ComfyUI

Python 892 61 Updated Apr 14, 2025

Image identification with Kosmos2 model, drawing and cutting bbox with object detection

Python 16 1 Updated Jul 25, 2024

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

Python 531 53 Updated Feb 13, 2025

🙌 OpenHands: Code Less, Make More

Python 64,052 7,748 Updated Oct 9, 2025

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 5,005 621 Updated Jul 2, 2024