A recall notice goes out in nine languages. Eight are clean. The ninth swaps a dosage figure, and now a regulator in that market wants answers. That is the kind of failure a translation vendor can hand you, and it never shows up in a sales demo.

By the time you are comparing vendors, you have read the case studies and sat through the pitch. The hard part is telling two polished suppliers apart. That is the job of a translation vendor scorecard. It turns a gut call into a documented, weighted decision you can defend to finance, legal, and your security team without booking a second meeting.

This guide hands you the full framework. Three pillars, fifteen criteria, a scoring method, and the red flags that should stop a deal cold. It is built for the people who sign off: procurement, legal ops, and compliance.

What a Translation Vendor Scorecard Actually Does

A scorecard converts impressions into numbers you can line up, side by side. Every vendor gets rated on the same criteria. Each criterion carries a weight that reflects how much it matters to you. Those weighted scores roll up into one total that ranks on your shortlist.

The number is not the point. The discipline is.

When every evaluator answers the same questions and backs each answer with evidence, the charismatic account manager stops winning on charm alone. So does the vendor whose demo you happened to see last week. And when a stakeholder asks six months from now why you picked who you picked, you point to the scorecard instead of your memory of a good meeting.

For a bottom-of-funnel buyer, this is the final gate. A strong contender either proves it can carry out your regulated, high-volume work, or it quietly drops out of the running. That is the point. Better to learn it now than during a product launch.

The Three Pillars

Most evaluation templates obsess over linguistic quality and wave a hand at everything else. For a compliance or procurement team, that is backwards. A vendor can write flawless French and still email your draft contracts as plain attachments. Pretty does not mean safe.

A real translation vendor scorecard weighs three things:

Quality: Does the work read correctly and consistently, in your voice, and can they prove it with process and data?

Security and compliance: Are your files protected, and do the certifications match what your regulators expect?

Scalability: Can they absorb a launch, an audit, or fifteen new languages without renegotiating from zero?

Here is how each one breaks down.

translation vendor scorecard framework infographic

Pillar One: Quality

Quality is the pillar everyone assumes they understand. One clean sample proves that a vendor can do good work once. It tells you nothing about doing it across 4,000 pages and a dozen linguists.

Score these five things

Who does the work: Ask for native-speaking, professionally qualified translators with real experience in your field. A generalist translating a clinical trial protocol is a quiet liability. You want names and credentials, not a headcount.

The review process: Serious vendors run translation, then editing, then proofreading, with a second qualified linguist checking the first. The shorthand is TEP. Ask one blunt question. Is that revision standard, or an upsell? ISO 17100 certification is a strong signal, because the standard requires a second-linguist review and traceable processes rather than good intentions.

How they measure errors: Mature vendors grade their own output against a framework like MQM or DQF, sorting issues by type and severity. Ask for a recent quality report. If nobody can explain how they score accuracy, they are managing it on feel.

Terminology control: For anything ongoing, they should keep a translation memory, a glossary, and a style guide tied to your account. This is what stops “policyholder” from turning into three different words across one project. It also lowers your cost over time, since repeated content gets reused instead of re-translated.

What happens when you disagree: You will eventually. A clear path to raise and resolve a linguistic dispute is what separates a partner from an order-taker. The range of specialization on a provider’s translation services page is a fair proxy for whether they can even staff your subject matter experts to begin with.

Here is the part most buyers get wrong. Skip the free sample. Pay for a small pilot on your own content, then have your in-country reviewers grade it blind against these criteria. A sample from the vendor hand-picked only tells you what they want you to see.

Pillar Two: Security and Compliance

For legal operations, this is usually the pillar that decides it. Translation vendors handle contracts under negotiation, patient records, financial filings, and litigation material. A weak link here is not a typo. It is a breach notification with your name on it.

Score these five things

Meaningful Certifications:

Look for ISO 27001 for information security management and a SOC 2 report covering the Trust Services Criteria. These are not badges you buy on a website. They reflect audited controls for access, risk, and incident response. A vendor that runs a public Trust Center where you can read the certifications is telling you that scrutiny is welcome. One that promises to email a PDF “on request” is telling you something too.

Regulatory fit: Match their profile to your obligations. Health data pulls in HIPAA. Personal data on EU residents pulls in GDPR, including how cross-border transfers are handled under standard contractual clauses. Government work may demand specific status. Get the alignment in writing, not in a friendly call.

How files move and where they rest: You want encryption in transit and at rest, controlled file transfer instead of open email, and a straight answer on data residency if your jurisdiction limits where information can sit. “We use a shared drive” is not a security policy.

Who can see your documents: Confidential projects should be visible only to vetted linguists under NDA, with role-based access and an audit log. Ask how freelancers get screened, and how fast access is revoked when someone rolls off the project.

Retention and deletion: Ask how long they keep your content after delivery, and whether they will delete it on request. For privileged material, open-ended retention is a liability sitting on someone else’s server, indefinitely.

The signal that matters most here is transparency. A vendor that publishes its security posture and lets you verify it beats one that asks for your trust on faith. Every time.

Pillar Three: Scalability

Scalability is the pillar that keeps you from outgrowing your vendor the moment it starts to matter. The supplier that nails a 5,000-word pilot and then chokes on a multi-market launch will cost you more than the discount that won the deal.

Score these five things

Coverage and depth: Confirm the languages you need today and the ones your roadmap implies. Then ask about depth. How many qualified linguists can they field per language? One resignation should never stall your account.

Surge handling: Regulated volume arrives in waves, not a steady drip. Ask what happens when you drop a high-volume request across eight languages with a tight deadline. A good answer includes rush options that do not set fire to quality on the way out.

Technology and integration: Look past email. You want APIs, content connectors, or a managed portal, so translation fits into your existing systems instead of living in an inbox. Machine translation with human review, used well, expands throughput while keeping a person accountable for what ships. Used badly, it ships fluent nonsense at scale. Ask which one you are buying.

Account continuity: As volume grows, you need a named contact, a documented process, and a plan for when staff change. Ask what governance looks like at your size. Reporting cadence, escalation path, and who picks up the phone at 6pm on a Friday.

Pricing that behaves: Understand how cost moves as volume climbs. Translation memory reuse, volume tiers, and a predictable rate card all point to a partner built to grow with you, not one that bills you harder for succeeding.

To pressure-test all of this, ask for references at your scale and throw a realistic surge scenario at the vendor during evaluation. Then watch how they answer it in real time, not in a polished follow-up.

How to Weigh and Score It

The framework only works if the math reflects your priorities. Four steps:

  1. Weigh the pillars: Split 100 points across the three based on your risk. A litigation-heavy legal team might land on security 40, quality 35, and scalability 25.

A fast-scaling product company might flip security and scalability. Decide on purpose. Do not let the default decide for you.

  1. Set a scale: Rate every criterion 1 to 5. Define what each number means before anyone scores, so a 4 means the same thing to every evaluator.
  2. Demand evidence: Every score points to something real. A certificate, a pilot result, a reference call, and a signed policy. A score with no evidence behind it is just an opinion of wearing a number.
  3. Roll it up: Multiply each score by its weight, sum within each pillar, then total the pillars. The highest total leads. But always read the breakdown first. A strong overall score that hides a failing security pillar is a deal you walk away from.

A Worked Example

Take two vendors. Vendor A scores 4.6 on quality, 3.1 on security, and 4.4 on scalability. Vendor B scores 4.2, 4.8, and 4.0. Weight security at 40, quality at 35, and scalability at 25, and Vendor B wins clearly, because your heaviest-weighted pillar is exactly where A is weakest. Strip the weighting out and A’s shinier quality number might have won a gut-feel call, walking a real security gap straight through your front door.

weighted scoring chart

Red Flags That Should Pause a Deal

Some findings override a strong score. Treat these as stop signs. A vendor that cannot name the standards it works to. One that refuses a paid pilot. One that sends files over unencrypted email, runs no second-linguist review, keeps your data forever with no deletion path, or staffs a single linguist per language with no backup. Any one of these earns a hard conversation before you go a step further. Score be damned.

Turning the Scorecard Into a Decision

Build the scorecard before you talk to vendors. Let the criteria shape your questions, not the reverse. Have at least two people score independently, then settle the gaps with evidence instead of seniority. Keep the finished scorecards on file. A documented, weighted evaluation is exactly the record an auditor wants to see, and exactly what protects you if the choice ever gets questioned.

A good translation vendor scorecard does more than crown a winner. It forces you to name what you actually need, surfaces risk while you can still act on it, and gives procurement, legal, and compliance one shared language for the decision. You stop buying translation on trust. You start buying it on evidence.

Ready to run a partner through it? Put Day Translations against every line of this scorecard, and see how the certifications, the security posture, and the capacity hold up under your own assessment.