faisalshahbaz

Follow

💭

Anything you can do, AI can do better.

Faisal Shahbaz faisalshahbaz

💭

Anything you can do, AI can do better.

Follow

Machine Learning Engineer | LLM | VLM | Google Certified Tensorflow Developer

39 followers · 28 following

Karachi, Pakistan
in/faisalshahbaz
https://stackoverflow.com/users/4668751/faisal-shahbaz
https://medium.com/@faisalshahbaz

Achievements

Achievements

Stars

NVIDIA / personaplex

PersonaPlex code.

Python 4,722 683 Updated Jan 24, 2026

ekwek1 / soprano

Soprano: Instant, Ultra-Realistic Text-to-Speech

Python 1,161 107 Updated Jan 15, 2026

SpatialVision / Orient-Anything-V2

Orient Anything V2, NeurIPS 2025 Spotlight

Python 196 9 Updated Jan 19, 2026

langwatch / better-agents

Standards for building agents, better

TypeScript 1,462 155 Updated Jan 13, 2026

NJU-3DV / SpatialVID

SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

Python 494 16 Updated Feb 4, 2026

byjlw / video-analyzer

Analyze videos using LLMs, Computer Vision and Automatic Speech Recognition

Python 1,264 178 Updated Apr 23, 2025

bigai-nlco / VideoLLaMB

[ICCV 2025] Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges

Python 83 2 Updated Feb 27, 2025

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.

3,063 138 Updated Dec 20, 2025

saidwivedi / InteractVLM

[CVPR 2025] InteractVLM: 3D Interaction Reasoning from 2D Foundational Models

Python 123 8 Updated Dec 12, 2025

YunghuiHsu / deepstream-yolo-pose

Use Deepstream python API to extract the model output tensor and customize the post-processing of YOLO-Pose

Python 65 14 Updated Sep 1, 2023

bharath5673 / Deepstream

yolov3, yolo12, dino, segmenations, face, pose, keypoints on deepstream

Jupyter Notebook 116 27 Updated Dec 7, 2025

zhouyuchong / yolov5-deepstream-python

yolov5-deepstream-python

Python 10 3 Updated May 6, 2022

u5e5t / yolov8-onnx-deepstream-python

yolov8的车辆检测模型deepstream-python部署

Python 19 2 Updated Jun 30, 2023

elvislos / deepstream-yolov8-python-app

Python 1 Updated Jun 9, 2025

triple-mu / YOLOv8-TensorRT

YOLOv8 using TensorRT accelerate !

C++ 1,735 293 Updated Apr 30, 2025

levipereira / deepstream-yolov9

Implementation of Nvidia DeepStream 7 with YOLOv9 Models.

Python 15 2 Updated Jun 22, 2024

levipereira / deepstream-yolo-e2e

Implementation of End-to-End YOLO Models for DeepStream

Python 72 9 Updated Dec 29, 2025

neural-maze / ava-whatsapp-agent-course

Meet Ava, the WhatsApp Agent

Python 1,626 416 Updated Oct 20, 2025

aishwaryanr / awesome-generative-ai-guide

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

HTML 24,413 5,233 Updated Feb 5, 2026

samwit / smolagents_examples

This is a repo for a number of examples using the smolagents framework from Hugging Face.

Python 164 40 Updated Jan 8, 2025

microsoft / ai-agents-for-beginners

12 Lessons to Get Started Building AI Agents

Jupyter Notebook 50,058 17,508 Updated Feb 2, 2026

predibase / lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 3,713 311 Updated May 21, 2025

Mintplex-Labs / anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 54,221 5,835 Updated Feb 4, 2026

facebookresearch / wmar

Official implementation of the paper "Watermarking Autoregressive Image Generation" (NeurIPS'25)

Jupyter Notebook 56 8 Updated Sep 19, 2025

kmAyush / Football-Video-Analyser

Computer vision project to predict football game detail from a single camera video clip

Jupyter Notebook 8 1 Updated Jul 26, 2024

anthropics / prompt-eng-interactive-tutorial

Anthropic's Interactive Prompt Engineering Tutorial

Jupyter Notebook 29,635 2,909 Updated Jul 11, 2024

AgentDeskAI / browser-tools-mcp

Monitor browser logs directly from Cursor and other MCP compatible IDEs.

JavaScript 7,049 516 Updated Mar 26, 2025

llmgenai / LLMInterviewQuestions

This repository contains LLM (Large language model) interview question asked in top companies like Google, Nvidia , Meta , Microsoft & fortune 500 companies.

1,652 359 Updated Feb 12, 2025

soub4i / kdebug-mcp

KDebug is a Kubernetes debugging tool that allows you to interact with your Kubernetes clusters through LLMs. It uses the Model Control Protocol (MCP) to enable AI to execute Kubernetes commands on…

Go 16 Updated Apr 27, 2025

transformerlab / transformerlab-app

The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.

Python 4,779 500 Updated Feb 5, 2026