Stars
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM …
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
SoTA LLM for converting natural language questions to SQL queries
Build production-ready LLM applications and advanced agents using Python, LangChain, and LangGraph. This is the companion repository for the book on generative AI with LangChain.
Resource, examples & tutorials for multimodal AI, RAG and agents using vector search and LLMs
Finetuning InstructLLaMA with portuguese data
LLM Workshop by Sourab Mangrulkar
Open source library developed under python to estimate the 2D and 3D pose of people present on a video stream thourgh deep networks of convolutions.
Implemented a car plate recognition algorithm based on CTC loss