backicon

How Do You Measure Facilities Management Performance Beyond SLA Compliance?

Published on :

May 28, 2026

by

Anisha Bhattacharjee


In Facilities Management, performance has been measured through activity for decades. Tickets closed. PPMs completed. SLAs met. MTTR reduced. The CMMS, CAFM, IWMS, and most BMS dashboards are activity recorders by design. They capture what was done, when it was done, and whether the workflow complied with the contract.

They do not measure whether the work improved the condition of the facility. That gap is what an outcome-led FM governance layer closes: a System of Decisions, Xempla's AI-native governance layer for CMMS, CAFM, and BMS environments.

Performance is not the activity itself. Performance is what the activity delivered. The moment an asset owner asks why reactive maintenance OPEX keeps rising despite high SLA compliance, or why the same HVAC chiller fails repeatedly on a fully PPM-compliant schedule, the activity record runs out of answers. A different measurement layer has to take over.


Two Things to Establish First

Before the framework, two ideas the rest of this page depends on.

Outcome-led measurement sits above SLAs, it does not replace them. SLAs remain necessary contractual KPIs between asset owners and FM providers. Outcome KPIs answer a different question: is the facility in the condition it is supposed to be in?

Not all AI in FM is the same, and the difference decides what can be measured. AI added to FM treats intelligence as a feature: a predictive alert or anomaly flag sitting beside a CMMS. The system of record is still the CMMS, and the measurement model underneath does not change. AI-native FM treats intelligence as the operating layer. AI is what produces the measurement itself, reading across CMMS, CAFM, IWMS, and BMS environments, structuring the maintenance decision before execution, and verifying the outcome afterward. AI added to FM gives you a smarter dashboard. AI-native FM gives you a governance layer. In this context, governance means the ability to explain, justify, verify, and continuously improve maintenance decisions over time. Only the second can answer all four measurement layers below.


The Four Layers of FM Performance Measurement

FM performance is not a single KPI. It is a four-layer question, and most Facilities Management teams today can only answer the first layer consistently.

The four layers move from activity to governance. The first asks whether the work happened at all, the kind of question SLA and PPM reporting already answers well. The second asks whether the work actually improved the facility, which most FM operations can only infer. The third asks whether it was the right work to do in the first place, which traditional systems cannot judge. The fourth asks whether the decision behind the work can be proven, which is what auditors and asset owners increasingly demand. The table below summarises where traditional FM stops on each, and where an AI-native governance layer continues.

Layer The Question Traditional FM AI-Native FM Governance
Layer 1 Did the work happen? Yes, via SLA, PPM completion, MTTR Yes, same systems of record
Layer 2 Did the work actually work? Partial, inferred from repeat work orders Yes, via fault recurrence, MTBF trend, Central Assurance Score
Layer 3 Was it the right work? No, dependent on one-off technician judgement Yes, via a continuous triage-and-learning loop, surfaced over time in the PPM-to-reactive ratio
Layer 4 Can we prove the decision trail? No, decisions live in inboxes and tribal knowledge Yes, via decision traceability and verify-loop closure

Layer 1 is the floor. Layer 4 is where governance begins. Each layer below follows the same structure: the question it asks, the gap in traditional FM, and how the System of Decisions closes it.


Layer 1: Did the work happen?

The question. Was the ticket closed, the PPM completed, the SLA met, the response made on time?

The gap. There is no gap here. This is the layer Facilities Management has solved. SLA compliance, PPM completion rate, work-order volume, and MTTR all confirm that maintenance activity occurred against schedule, and the CMMS and CAFM are highly effective systems of record for it. The real problem is narrower: for many FM operations, this is the only layer being measured. A ticket closed within SLA confirms workflow execution, not whether the asset became more reliable afterward.

How the System of Decisions closes it. The governance layer reads the same SLA, PPM, and MTTR data the CMMS already produces, but treats it as the input to the three layers above rather than the final word on performance. It builds on this layer rather than replacing it.


Layer 2: Did the work actually work?

The question. Did the intervention resolve the fault and improve the reliability of the asset?

The gap. Traditional FM can only infer this. A work order closed within SLA on a recurring fault is recorded identically to one closed on a one-time issue, because each ticket is an isolated event. A technician may reset a failing HVAC unit and close every work order within SLA while the same fault returns every two weeks. The CMMS shows compliance. It cannot show that the asset is trapped in reactive behaviour.

How the System of Decisions closes it. The governance layer measures the reliability outcomes a CMMS cannot, distilling them into a single Central Assurance Score. The key shift is in how a signal like MTBF is read. A dashboard shows MTBF as a static number; the governance layer reads it as a trend that responds to decisions, tracking whether MTBF lengthens after a maintenance approach changes, and linking the decision to the reliability result. Fault recurrence is read the same way. These signals feed the layer's headline outcome KPI, the Central Assurance Score.

📊

Central Assurance Score

Xempla's Central Assurance Score continuously evaluates the operational health of your facilities and locations. Rather than flooding FM teams with raw metrics like fault recurrence and MTBF movement, it distils complex FM signals into a single, actionable score, viewable at two levels of granularity. You can view the Central Assurance Score at the individual facility level (asset by asset), or at the wider location level (site-wide).

The facility-level view shows whether a specific site's assets are operating as they should; the site-wide view shows whether the location as a whole is in the condition it is supposed to be in.


Layer 3: Was it the right work?

The question. Was this the right intervention to choose, or just the one that closed the ticket?

The gap. Traditional FM makes each decision in isolation. A technician assesses the fault in front of them, acts on judgement, and closes the work order. There is no triage across competing signals, no memory of what worked last time, and no check on whether the symptom or the actual problem was addressed. A reactive replacement may resolve a fault that a condition-based intervention weeks earlier would have prevented at a fraction of the cost, but nothing in the CMMS evaluates whether the right call was made. "Right" is whatever was decided that day, and the reasoning disappears once the ticket closes.

How the System of Decisions closes it. The governance layer judges the decision, not just the task, by running a continuous loop rather than assessing each work order in isolation. It takes in signals as they arrive, triages them against each other, and isolates the underlying problem rather than the surface symptom. It works against a reference point for how the asset should be operating, comparing that ideal condition with the current state to recognise when something is wrong, and it learns from every verified outcome so the next decision is better informed than the last. The result is that "right" stops being whatever a technician decided that day and becomes a structured decision the layer can justify and improve on. Whether those decisions are genuinely the right ones shows up over time in the Central Assurance Score and in the PPM-to-reactive ratio, as planned work begins to displace reactive work.


Layer 4: Can we prove the decision trail?

The question. Why was this action taken, on what evidence, and did the outcome confirm it was right?

The gap. Almost no traditional FM operation can answer this, yet it is exactly what asset owners, auditors, and ESG reporting frameworks now ask for. The CMMS records the work order, not the decision behind it: the evidence, the rationale, and the verified result. Without that chain, FM remains a service-delivery function rather than a governed operation.

How the System of Decisions closes it. The governance layer logs every maintenance decision as evidenced, traceable, and outcome-verified, which is what turns FM into a governed operation. Supporting governance KPIs at this layer include decision traceability rate, the proportion of decisions with a complete evidence chain, and verify-loop closure rate, the proportion of interventions whose outcomes have been validated and fed back into future maintenance logic. Together they let the organization explain why a decision was made, what evidence supported it, and whether the outcome improved reliability afterward. This is the layer where governance becomes measurable.


Where the Governance Layer Sits

The System of Decisions is an AI-native layer that sits above existing FM systems of record, reading data from them, structuring maintenance decisions before execution, and verifying outcomes after intervention.

It does not replace the systems of record. The CMMS remains the workflow system, the CAFM the coordination system, and the BMS the building telemetry system. The governance layer adds the reasoning, reliability analysis, and outcome verification those systems cannot produce independently. The Central Assurance Score, viewable at both facility and site-wide level, is one of its primary outputs, updated continuously because the layer runs continuously rather than at audit intervals.

You can see how this works in practice, across real asset portfolios and facilities management teams, in the Xempla case studies: Xempla case studies.


FAQ

How do you measure Facilities Management performance beyond SLA compliance?

Measure across four layers rather than one. Layer 1 confirms the work happened through SLA compliance, PPM completion, and MTTR. Layer 2 confirms the intervention improved reliability through fault recurrence rate, MTBF trend, and the Central Assurance Score. Layer 3 confirms it was the right intervention through a continuous triage-and-learning loop rather than one-off judgement. Layer 4 verifies that the decision is traceable and governed through decision traceability and verify-loop closure.

What is a Central Assurance Score?

Xempla's Central Assurance Score continuously evaluates the operational health of your facilities and locations, distilling complex FM signals like fault recurrence and MTBF movement into a single, actionable score rather than raw metrics. You can view the same score at two levels: the individual facility level (asset by asset), or the wider location level (site-wide).

What is the difference between SLA compliance and a Central Assurance Score?

SLA compliance confirms that a ticket was responded to and closed within an agreed window. A Central Assurance Score confirms whether the maintenance work improved the operational health of the facility itself. SLA is an activity KPI. The Central Assurance Score is an outcome KPI.

What is the difference between AI in FM and AI-native FM?

AI in FM adds intelligence as a feature, such as a predictive alert beside a CMMS, without changing the underlying measurement model. AI-native FM uses intelligence as the operating layer that produces the measurement itself, reading across systems, structuring decisions before execution, and verifying outcomes afterward.

Heading 1

Heading 2

Heading 3

Heading 4

Heading 5
Heading 6

Paragraph

Block quote

Ordered list

  1. Item 1
  2. Item 2
  3. Item 3

Unordered list

  • Item A
  • Item B
  • Item C

Bold text

Emphasis

Superscript

Subscript