Question 1

Why use a weighted scorecard instead of just picking the vendor everyone likes?

Accepted Answer

Because unstructured vendor decisions are dominated by whoever demos last and whoever argues loudest. A weighted matrix forces the team to agree on what matters before scoring anything, which moves the argument from vendors to priorities — a far more productive fight. It also leaves a written record: when someone asks in a year why you chose vendor B, the scorecard answers in thirty seconds instead of a meeting.

Question 2

How should I set the criterion weights?

Accepted Answer

Set them before you score, ideally before you see any demos, and set them as a team. Start from the defaults here — capability heaviest, then price, latency, compliance and support — and adjust for your context: a healthcare company should pull compliance up sharply, a latency-sensitive product should promote latency. The tool normalizes whatever you enter to 100%, so think in relative importance rather than precise percentages. If two stakeholders disagree on a weight, that disagreement is the real decision; surface it now.

Question 3

What does the sensitivity note tell me?

Accepted Answer

Whether your verdict is robust or fragile. The tool checks how the winner changes as each criterion's weight shifts, and flags the smallest weight change that would flip the result. A winner that survives large weight swings is a safe decision; a winner that flips when one weight moves a few points means the top vendors are effectively tied, and you should decide on something the matrix does not capture — ecosystem, contract terms, or a paid pilot.

Question 4

How do I score fairly on a 1-5 scale?

Accepted Answer

Anchor the scale before anyone scores: 1 means fails your requirement, 3 means meets it, 5 means clearly exceeds it — and write down what 'meets' means per criterion. Score from evidence where possible: published pricing for price, your own benchmark runs for latency, the vendor's compliance documentation for compliance. Have evaluators score independently before comparing, because group scoring converges on the first opinion voiced rather than the best one.

Question 5

Should price be a scored criterion or a separate negotiation?

Accepted Answer

Both, in sequence. Score list price in the matrix so cost discipline shapes the shortlist, but remember the number you eventually pay is negotiated, especially at team and enterprise scale. A useful pattern: use the scorecard to get to two finalists, then negotiate both in parallel — vendors price differently when they know a scored competitor is one column away. And measure your actual usage first; per-seat list prices mean little until you know your real volume.

Criterion	Anthropic	OpenAI	Google
Capability (35%)	5	4	4
Price (25%)	3	3	4
Latency (15%)	4	4	5
Compliance (15%)	4	4	4
Support (10%)	4	3	3
Weighted total	4.10	3.65	4.05

AI Vendor Scorecard

How it works

Frequently asked questions

Why use a weighted scorecard instead of just picking the vendor everyone likes?

How should I set the criterion weights?

What does the sensitivity note tell me?

How do I score fairly on a 1-5 scale?

Should price be a scored criterion or a separate negotiation?

Related tools

Model Capability Picker

AI Readiness Assessment

SLA Uptime Calculator

AI Model Pricing Comparison