- Eskisehir, Türkiye
- https://dolphinium.github.io/portfolio
Stars
Get your documents ready for gen AI
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Faster Whisper transcription with CTranslate2
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
The ultimate training toolkit for finetuning diffusion models
Machine Learning Resources, Practice and Research
Real-time transcription with OpenAI Whisper.
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer" (ICCV 2025)
Turkish BERT/DistilBERT, ELECTRA, ConvBERT and T5 models
The Maxar Open Data STAC Catalog in CSV, GeoJSON, and MosaicJSON formats
This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data warehouse. The pipeline leverages a combination of tools and serv…
The 1st VHR SAR-Optical benchmark dataset for detecting earthquake-damaged buildings.
Turkish text normalization tools for ASR (Automatic Speech Recognition) benchmarking and evaluation.
An open-source Python toolkit designed to streamline the development and enhancement of ASR systems.
Sampling-Balance based Multi-stage Network (SB-MSN) for aerial image object detection