Introducing Proactive Agents.
Learn more
Glossary

Model card

A model card is a short, structured document that describes the intended use cases, performance characteristics, limitations, and ethical considerations of a specific AI model, intended to help practitioners make informed decisions about whether and how to deploy it.

The concept was introduced by Google researchers in 2019 to address a recurring problem: AI models were being evaluated and deployed without a common framework for disclosing what they were designed to do, where they had been tested, and where they were known to fail. In customer service environments where AI agents handle sensitive interactions at scale, model cards provide the transparency layer that procurement, compliance, and engineering teams need to assess fitness for purpose before deployment.

How a model card works

A model card is typically a short document, often a few pages, organized into standardized sections. Common components include:

  • Model overview: The model's architecture, training objective, version number, and the organization responsible for it.
  • Intended use: The tasks and deployment contexts the model was designed for, and explicit out-of-scope uses the authors caution against.
  • Performance metrics: Evaluation results broken down by relevant subgroups, not just aggregate scores. A model that performs well on average may underperform on specific demographic groups, languages, or conversation types.
  • Limitations and trade-offs: Known failure modes, sensitivity to prompt phrasing, and edge cases where the model behaves unexpectedly.
  • Training data: A high-level description of the data sources and any known biases they may introduce, relevant to responsible AI assessments.

Why a model card matters for customer experience

In customer service, model selection decisions affect millions of customer interactions. A model that is well-suited to English-language e-commerce inquiries may perform poorly on multilingual conversations or technical support queries, and without a model card, that gap is invisible until it surfaces in production metrics like customer satisfaction score (CSAT) or escalation rate. Model cards give operators a way to match model capabilities to deployment context before committing to integration work.

A meaningful limitation of model cards is that they reflect performance at the time the card was written. A model that has since been updated, retrained, or exposed to a different input distribution may behave differently from what the card documents. Treating model cards as static guarantees rather than point-in-time disclosures is a common source of misplaced confidence, and it reinforces why continuous monitoring via model drift detection remains essential even after a model passes procurement review.

Model cards and AI governance

Regulators and enterprise buyers increasingly require documented evidence of AI system characteristics as a condition of procurement or compliance certification. Model cards address this requirement directly by providing a standardized, shareable disclosure. The Hugging Face model card specification, widely adopted across the open-source AI community, offers a practical template that organizations can adapt to their own governance requirements. Teams building formal AI governance programs should integrate model card review into their evaluation process and link findings to broader AI compliance documentation to create a complete audit trail from model selection through deployment.

For a deeper dive, download Decagon's guide to agentic AI for customer experience.

Deliver the concierge experiences your customers deserve

Get a demo