Challenges in building Agentic AI Apps
•Dependency on hosted models
•Quotas based on service tiers
•Unpredictable costs
•Additional setup / infrastructure to build and test Agentic
apps
AI Grocery Shopper
Meal Planner
Agent
Inventory
Agent
Budget
Agent
Shopper
Agent
Generate meal
plans and
shopping list
based on
preferences and
dietary
restrictions
Compare
inventory items
and build revise
shopping list
Optimize items
based on budget
Optimize items
based on
categories
/demo
Docker Model Runner
Docker Model Runner Architecture
Docker Model Runner Design
Docker Model Runner Endpoints
•From Containers -http://model-runner.docker.internal/
•From host processes - http://localhost:12434/
•Create model - POST /models/create
•List models - GET /models
•Get a model - GET /models/{namespace}/{name}
•Delete local model - DELETE /models/{namespace}/{name}
OpenAIEndpoints
•List Models - GET /engines/llama.cpp/v1/models
•Retrieve models - GET /engines/llama.cpp/v1/models/{namespace}/{name}
•List chat completions - POST /engines/llama.cpp/v1/chat/completions
•Create completions - POST /engines/llama.cpp/v1/completions
•Createembeddings-POST /engines/llama.cpp/v1/embeddings
Summary -Docker Model Runner
Docker Model Runner General Availability | LinkedIn
Resources
•Docker Docks - Model Runner
•Docker Docs - Use AI models in Compose
•Docker Model Runner Design and What’s Next
•Docker Model Runner General Availability Annoncement
Nilesh Gule
ARCHITECT | MICROSOFT MVP
“Code with Passion and
Strive for Excellence”
nileshgule
@nileshgule Nilesh Gule
NileshGule
www.handsonarchitect.com
https://www.youtube.com/@nilesh-gule