mutimodal

Star

Here are 8 public repositories matching this topic...

video-db / videodb-node

Star

VideoDB Nodejs SDK

node database video rag mutimodal llm

Updated Jan 23, 2025
TypeScript

rekkles2 / Gaze-CIFAR-10

Star

Gaze-Guided Learning: Avoiding Shortcut Bias in Visual Classification

computer-vision image-classification gaze-tracking mutimodal

Updated Apr 15, 2025
Python

dwain-barnes / llama3.2-vision-ocr-streamlit

Star

"A private, local OCR solution using Meta's Llama 3.2 Vision model with a Streamlit interface. Processes images entirely offline, supporting formats like JPEG, PNG, and BMP.

open-source ocr streamlit mutimodal llm meta-ai ollama llama-3-2-vision local-ocr

Updated Nov 21, 2024
Python

Tommy-s-Online-Courses / Multimodality

Star

多模态系列课程资料

multimodality multimodal-learning mutimodal

Updated Jul 21, 2024
Rich Text Format

kingabzpro / Gemini-2-Pro-Chat

Star

Gemini 2 Pro app for Image, Audio, and Document understanding + Code Execution.

google gradio gemini-api mutimodal

Updated Feb 9, 2025
Python

anusha-chebolu / multimodal-rag

Star

A multimodal RAG application using Qwen 2.5 VL, ColPali, and QdrantDB for text and image-based retrieval.

rag mutimodal qdrant-vector-database colpali qwen2-vl

Updated Mar 20, 2025
Jupyter Notebook

johnnyhank / MIRA-Multimodal-Intelligent-Robotic-Assistant

Star

基于Qwen Agent框架，融合JAKA机械臂、视觉检测、语音识别与合成、MCP数据库的多模态大模型

mcp yolo orangepi vlm mutimodal llm edge-tts function-calling qwen qwen-vl-max qwen-agent

Updated May 26, 2025
Python

ashutoshkr45 / QD-RetNet

Star

QD-RetNet: Efficient Retinal Disease Classification via Quantized Knowledge Distillation [MIUA-2025]

knowledge-distillation quantization-aware-training retinal-disease-detection mutimodal

Updated May 26, 2025
Python

Improve this page

Add a description, image, and links to the mutimodal topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mutimodal topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mutimodal

Here are 8 public repositories matching this topic...

video-db / videodb-node

rekkles2 / Gaze-CIFAR-10

dwain-barnes / llama3.2-vision-ocr-streamlit

Tommy-s-Online-Courses / Multimodality

kingabzpro / Gemini-2-Pro-Chat

anusha-chebolu / multimodal-rag

johnnyhank / MIRA-Multimodal-Intelligent-Robotic-Assistant

ashutoshkr45 / QD-RetNet

Improve this page

Add this topic to your repo

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier! Saves Data!