Arhosseini77

Alireza Hosseini Arhosseini77

32 followers · 48 following

Achievements

Organizations

Lists (8)

Sort

AUDIO

Starred repositories

worldbench / awesome-ai-auto-research

🔥 A Survey on AI Auto-Research

HTML 369 30 Updated May 19, 2026

catswe / flash-attention-residuals

Triton kernels and PyTorch ops for Block Attention Residuals (AttnRes)

Python 82 6 Updated May 29, 2026

agentskills / agentskills

Specification and documentation for Agent Skills

Python 20,499 1,283 Updated May 20, 2026

simranjeet97 / Awsome_AI_Agents

This is end to end course on AI Agents and Agentic AI with 15+ AI Agent Projects with real time use cases and industry expertise.

Jupyter Notebook 184 74 Updated Apr 17, 2026

msu-video-group / NTIRE26_Saliency_Prediction

CVPR-NTIRE 2026 Challenge on Video Saliency Prediction

Python 16 Updated Mar 20, 2026

slopus / happy

Mobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured

TypeScript 21,922 1,831 Updated Jun 10, 2026

nextlevelbuilder / ui-ux-pro-max-skill

An AI SKILL that provide design intelligence for building professional UI/UX multiple platforms

Python 91,721 9,578 Updated Apr 3, 2026

knightnemo / Awesome-World-Models

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

3,011 125 Updated Jun 12, 2026

apple / ml-sharp

Sharp Monocular View Synthesis in Less Than a Second

Python 8,537 619 Updated Dec 19, 2025

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,529 319 Updated May 26, 2026

ramintoosi / grep-rust-codecrafters

Regular expressions (or Regexes) are patterns used to match character combinations in strings. In this challenge, I learned to build a Regex engine from scratch by recreating grep, a CLI tool for r…

Rust 2 Updated Dec 22, 2025

AliRezaBeigy / google-meet-keybinding

Control Google Meet with customizable keyboard shortcuts. Toggle mic, camera, and navigate meetings from anywhere with global hotkeys.

JavaScript 2 Updated Nov 26, 2025

black-forest-labs / flux2

Official inference repo for FLUX.2 models

Python 2,401 169 Updated Mar 12, 2026

mbzuai-oryx / KITAB-Bench

[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding

Python 71 4 Updated May 24, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 23,288 2,151 Updated Jan 27, 2026

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 55,039 7,504 Updated May 5, 2026

K-Hooshanfar / shop_assistant_torob

Python 1 Updated Sep 26, 2025

HL-hanlin / Bifrost-1

Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)

Python 47 3 Updated Nov 24, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 19,379 1,787 Updated Jan 30, 2026

ruipeterpan / marconi

Artifact for "Marconi: Prefix Caching for the Era of Hybrid LLMs" [MLSys '25 Outstanding Paper Award, Honorable Mention]

Python 61 7 Updated Mar 5, 2025

google-research / dreamer

Dream to Control: Learning Behaviors by Latent Imagination

Python 728 83 Updated Jul 14, 2020

zhang9302002 / ThinkingWithVideos

The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"

Python 99 1 Updated Oct 15, 2025

OpenBMB / MiniCPM-V

A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

Python 25,629 2,007 Updated Jun 4, 2026

cjeen / LoRAEdit

Forked from tdrussell/diffusion-pipe

We achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additional reference conditions.

Python 334 23 Updated Jun 2, 2026

Alireza Hosseini Arhosseini77

Organizations

Lists (8)

AUDIO

NLP

Novel Idea

Saliency

Signals

TOOLS

Tuturials

VISION

Starred repositories

Deep learning