Macha

How to Pick the Right AI Model for Your Agent on Macha

Macha Team

Written by

Macha Team

Last edited June 12, 2026

Zendesk Expert Reviewed

Verified

Not every workflow needs the same AI model — and picking the right one per agent is the single biggest lever you have over both quality and cost. A simple triage agent and a complex multi-step resolver have very different needs, and Macha lets you choose a different model for each. Here's how to choose well.

How to Pick the Right AI Model for Your Agent on Macha

Watch the 60-second guide

Where you set the model

The model is a per-agent setting. Open an agent and pick its model from the top-right of the configuration screen. Because it's per agent, you can run your high-volume triage agent on a cheap, fast model and reserve a stronger model for the one agent that genuinely needs deep reasoning — instead of paying premium rates across the board.

Set each agent’s model in its configuration — it shows the model and its per-message credit cost (here, GPT-5 at 3 credits / message).
Set each agent’s model in its configuration — it shows the model and its per-message credit cost (here, GPT-5 at 3 credits / message).

The three things that vary between models

When you compare models, you're really trading off three things:

  • Quality — how well it handles nuance, long instructions, and tricky reasoning.
  • Speed — how fast it responds (matters for live chat and high volume).
  • Cost — its credits-per-response rate (0.5 to 9, depending on the model).

The available models and their rates:

ModelCredits / responseGood for
GPT-5.4 Mini (default)1Most support work — triage, drafts, summaries, field updates
GPT-53Complex, multi-step agents with long instructions
GPT-5.45The hardest reasoning where quality is paramount
Claude Sonnet 4.5 / 49Premium reasoning and writing quality
Llama 3.3 70B1A fast, capable open option
Llama 3.1 8B / Mixtral 8×7B0.5Simple, high-volume tasks where cheapest wins

How to choose: match the model to the task

A simple rule covers most cases:

  • Routine support work (classify, tag, summarize, draft a reply, update a field) → GPT-5.4 Mini (1 credit). It's fast, affordable, and handles these well. For a high-volume operation, make it your default.
  • Complex agents — long, detailed instructions with many steps and rules to remember → GPT-5 (3 credits). Strong quality at a reasonable cost; it's the right step up when Mini starts missing nuance.
  • The hardest reasoning or highest-stakes writing → a premium model (GPT-5.4 or Claude Sonnet). Use these sparingly, on the one or two agents that truly need them.
  • Massive-volume, very simple tasks → the cheapest options (Llama 3.1 8B / Mixtral at 0.5 credits) can cut cost further, if quality holds.

A practical way to decide

Don't agonize over it up front — let the test run tell you:

  1. Start on GPT-5.4 Mini. It's the right answer most of the time.
  2. Test the agent against real tickets. If the quality is good, you're done — keep the cheap model.
  3. If it misses nuance — misreads intent, fumbles long instructions — step up to GPT-5 and test again.
  4. Only go premium if GPT-5 still isn't enough for that specific agent.

This "start cheap, step up only if needed" approach gets you the lowest cost that still does the job — per agent.

A note on speed

For anything customer-facing in real time — a website chatbot, live chat — response speed matters as much as quality. The faster models (Mini, Llama) feel snappier to a waiting visitor; a heavier model can be worth the wait for a back-of-house agent doing complex analysis where nobody's watching the clock.

A model cheat-sheet by agent type

If you want a quick starting point instead of deciding from scratch:

AgentStart withWhy
Ticket triage / taggingGPT-5.4 Mini (1)Simple classification, high volume
SummarizerGPT-5.4 Mini (1)Straightforward, runs a lot
WISMO / order lookupGPT-5.4 Mini (1)Fetch and format, not deep reasoning
Refund / policy agentGPT-5 (3)Judgment and rules to follow carefully
Complex multi-step resolverGPT-5 (3), premium if neededLong instructions, real nuance
Live website chatbotGPT-5.4 Mini / Llama (fast)Speed matters for a waiting visitor

The pattern: start everything on Mini, and promote only the agents that prove they need more. A test run will tell you which ones those are.

Frequently asked questions

Where do I set an agent's model? On the agent's configuration screen, top-right. It's a per-agent setting.

Which model should most agents use? GPT-5.4 Mini (1 credit) — it handles the bulk of support workflows well.

When should I use a stronger model? When an agent has long, complex instructions or needs deeper reasoning — step up to GPT-5, and only go premium if that's still not enough.

Can different agents use different models? Yes — that's the point. Cheap-and-fast for high-volume agents, stronger for the few that need it.

How does the model affect cost? It sets the credits-per-response rate (0.5 to 9). See Macha credits explained.

The bottom line

Pick the model per agent, match it to the task, and start cheap: GPT-5.4 Mini for everyday support, GPT-5 for complex agents, premium only where it's truly needed. Let the test run prove whether you can stay on the cheaper model — and you'll get the right quality at the lowest cost, agent by agent.

Try it: build an agent, pick a model, and test it on real tickets. 7-day free trial, no credit card required. Start free.

Zendesk
5.0 on Zendesk Marketplace

Loved by support teams worldwide

See what support teams are saying about Macha AI.

The application seems excellent to me! We are still testing, and we need support for some details and they were extremely efficient too!

Daniela Costa

Daniela Costa

Head of Support, Seabra

Macha has been a great addition to our support toolkit. It generates clear, well-organized responses that fit naturally into our workflow. One feature we particularly appreciate is its ability to automatically reply in the same language as the ticket.

Marius F

Marius F

Support Head, Zentana

We've been using Macha for a little while now and it's been really great addition so far! It's powerful, convenient, and makes getting work done a lot easier for our agents.

Alexander Wedén

Alexander Wedén

Head of Support

Support team is very helpful and responsive. Really enjoy how lightweight this is within Zendesk itself vs other more intrusive tools.

Cathleen Wright

Cathleen Wright

Zendesk Admin, Cortex IO

So far it's pretty good! Our queries are a little nuanced, so we can't always use it, but it's got enough utility for us. It can even incorporate our bilingual country with greetings in a second language.

Jae Oliver

Jae Oliver

Head of Support, Wise

Really enjoying using Macha, it has made a noticeable difference to our support team in a short amount of time. I really like the ticket summary feature, saves us a lot of time.

Harry Jackson

Harry Jackson

Head of Support, Crumb

Macha AI is a great addition to my workspace! It's powerful, convenient, and it really makes productivity so much easier for our agents!

Dave G

Dave G

Head of Support, Cyber Power Systems

Very impressed! AI integration for Zendesk has certainly come a long way and Macha seems to set the standard for now. This will for sure save lot of time in our support team.

Pauli Juel

Pauli Juel

Head of CS, Dokument24

Macha has been working great for us so far! The auto-responses are accurate and our resolution time has dropped significantly.

Lana T

Lana T

Zendesk Admin, Swotzy

Macha AI is a great addition. The knowledge base feature means our agents always have the right answers at their fingertips.

Mischa Wolf

Mischa Wolf

Head of Support, Topi

We're enjoying this integration so far. It's made our support team more efficient and our customers get faster responses.

Paula G

Paula G

Head of Customer Support, Xly Studio

The team enjoys using it. It saves considerable time on common questions and the integration options are excellent.

Kilian Leister

Kilian Leister

Support Head, Didriksons

Ready to supercharge your team with AI?

Get started in minutes. Connect your tools, configure your agents, and let AI handle the rest.