Start
multimodal-embedding-generator
multimodal-embedding-generator - Skill Dossier
multimodal-embedding-generator

multimodal-embedding-generator

Generate cross-modal embeddings with CLIP, SigLIP, and ImageBind for text-image-audio search. Activate on: multimodal search, text-to-image search, cross-modal embeddings, CLIP embeddings, visual search. NOT for: text-only embeddings (ai-engineer), image classification (computer-vision-pipeline).

Uncategorized

Allowed Tools

ReadWriteEditBash(python:*pip:*npm:*npx:*)

Share this skill

Skills use the open SKILL.md standard — the same file works across all platforms.

Install all 544 skills as a plugin
claude plugin marketplace add curiositech/windags-skills claude plugin install windags-skills

Claude activates multimodal-embedding-generator automatically when your task matches its description.

View on GitHub
"Use multimodal-embedding-generator to help me build a feature system"
"I need expert help with generate cross-modal embeddings with clip, siglip,..."