model-serving-api-builder - Skill Dossier

Deploy ML models as production APIs with vLLM, TGI, ONNX Runtime, batching, autoscaling, and GPU optimization. Activate on: model serving, deploy LLM, vLLM setup, inference API, GPU serving. NOT for: model training (ai-engineer), prompt engineering (prompt-engineer).

Uncategorized

Allowed Tools

Read, Write, Edit, Bash(python:*, pip:*, npm:*, npx:*)

Skills use the open SKILL.md standard — the same file works across all platforms.

Install all 544 skills as a plugin
claude plugin marketplace add curiositech/windags-skills
claude plugin install windags-skills

Claude activates model-serving-api-builder automatically when your task matches its description.

"Use model-serving-api-builder to help me build a feature system"
"I need expert help deploying ML models as production APIs with vLLM, TGI..."