⚡ GMI GPU Cost Optimizer Agent
Agentic AI with autonomous tool use
Kimi K2.5 → Tools → Reasoning → Recommendation
Deploy a Llama 70B model for a chatbot, 50 QPS, $15k/month budget
Compare serverless vs dedicated for a 7B model with 5 QPS, 8 hours/day
Cheapest way to serve a 405B research model, 8 hours a day?
Plan scaling from 10 QPS to 200 QPS for a 34B code assistant
Send ⚡