Skip to content
View selfishout's full-sized avatar

Highlights

  • Pro

Block or report selfishout

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
selfishout/README.md

Hi, I'm Ali Torabi πŸ‘‹

πŸ”¬ Computer Vision & Deep Learning Engineer

Building production-ready Computer Vision and Vision-Language Models using PyTorch and state-of-the-art transformers.

πŸ’‘ Passionate about bridging the gap between research and real-world applications


πŸš€ Featured Projects

Vision Transformer Classification

Complete ViT implementation from scratch with attention visualization

CLIP Image Search Engine

Semantic search with natural language queries using OpenAI CLIP

Image Captioning with VLMs

Automatic captioning with BLIP, BLIP-2, and GIT models

YOLO Object Detection

Real-time detection with YOLOv8 for 80+ object classes


πŸ’» Tech Stack

Languages & Frameworks

Python PyTorch TensorFlow OpenCV

Specialized Libraries

Hugging Face CLIP YOLO Gradio

Tools & Platforms

Git Docker Jupyter VS Code


🎯 Expertise


Vision Transformers
ViT, CLIP, TrOCR, Swin

Vision-Language
BLIP, BLIP-2, GIT, LLaVA

Object Detection
YOLO, Mask R-CNN, Detectron2

Segmentation
DeepLab, U-Net, SAM

Face Recognition
FaceNet, ArcFace, MTCNN

OCR
TrOCR, EasyOCR, Tesseract

Image Enhancement
Super-Resolution, Style Transfer

Zero-Shot Learning
CLIP, Open-vocabulary models

πŸ“Š GitHub Statistics

GitHub Stats GitHub Streak

Top Languages

Trophies


πŸ† Project Highlights

  • 🎯 12 Production-Ready Projects in Computer Vision
  • πŸ“ 6,400+ Lines of well-documented code
  • 🌟 State-of-the-Art implementations
  • πŸ”§ Modern Architectures: Transformers, CNNs, Vision-Language Models
  • πŸ“š Comprehensive Documentation with examples and demos
  • 🎨 Interactive Web Interfaces using Gradio
  • ⚑ GPU-Accelerated implementations
  • πŸ§ͺ Research-to-Production pipeline

πŸ“ˆ Current Focus

  • πŸ”¬ Exploring multi-modal foundation models
  • πŸš€ Optimizing inference speed for production deployment
  • πŸ“– Studying latest research in Vision Transformers
  • 🀝 Contributing to open-source CV projects
  • πŸ“ Writing technical blog posts on Medium
  • πŸŽ“ Preparing video tutorials on YouTube

🌐 Connect With Me


πŸ“š Recent Activity


πŸ’‘ Fun Facts

  • 🎯 Published 12 CV projects in 1 day
  • πŸ”₯ Specializing in Vision Transformers and Multi-modal AI
  • πŸ“– Always learning the latest research papers
  • 🎨 Love building interactive demos for models
  • 🌍 Open to collaboration on CV projects

Profile Views

⭐️ From selfishout - Building the future of Computer Vision, one commit at a time

Popular repositories Loading

  1. xai-cv-benchmark xai-cv-benchmark Public

    Python 1

  2. XAIMethods XAIMethods Public

    Python 1

  3. agricultural-dataset-combination agricultural-dataset-combination Public

    A comprehensive project for combining multiple agricultural datasets into a unified format suitable for Weakly Supervised Semantic Segmentation (WSSS) applications

    Python 1

  4. vision-transformer-classification vision-transformer-classification Public

    Complete PyTorch implementation of Vision Transformer for image classification

    Python 1

  5. clip-image-search clip-image-search Public

    Semantic image search engine using OpenAI CLIP

    Python 1

  6. image-captioning-vlm image-captioning-vlm Public

    Automatic image caption generation with BLIP and BLIP-2

    Python 1