mutimodal

Star

Here are 9 public repositories matching this topic...

video-db / videodb-node

Star

VideoDB Nodejs SDK

node database video rag mutimodal llm

Updated Jan 23, 2025
TypeScript

johnnyhank / MIRA-Multimodal-Intelligent-Robotic-Assistant

Star

基于Qwen Agent框架，融合JAKA机械臂、视觉检测、语音识别与合成、MCP数据库的多模态大模型

mcp yolo orangepi vlm mutimodal llm edge-tts function-calling qwen qwen-vl-max qwen-agent

Updated May 26, 2025
Python

明康慧醫(MKTY)——基於LLM與多模態人工智能的健康管理與輔助診療系統設計與實現。（明康慧醫智慧醫療系統）該項目已用於齊魯工業大學（山東省科學院）計算機學部2025年畢業設計。項目作者：杜宇 @duyu09，電子郵箱：qluduyu09@163.com [Source code of Design and Implementation of MINH KHỎE TUỆ Y - A Health Management and Assisted Diagnosis System Based on LLM and Multimodal Artificial Intelligence. (Minh Khoe Tue Y Smart Healthcare System)]

mysql python emr bootstrap flask distributed-systems time-series rabbitmq pytorch medical clip graduation-project vue3 element-plus mutimodal llm ai-medical mkty duyu09

Updated Aug 19, 2025
Vue

rekkles2 / Gaze-CIFAR-10

Star

Gaze-Guided Learning: Avoiding Shortcut Bias in Visual Classification

computer-vision deep-learning vr dataset eye-tracking image-classification eyes htc-vive papers gaze-tracking gaze eyetracking mutimodal

Updated Apr 15, 2025
Python

dwain-barnes / llama3.2-vision-ocr-streamlit

Star

"A private, local OCR solution using Meta's Llama 3.2 Vision model with a Streamlit interface. Processes images entirely offline, supporting formats like JPEG, PNG, and BMP.

open-source ocr streamlit mutimodal llm meta-ai ollama llama-3-2-vision local-ocr