Glossary · Clinical

Oxford Knee / Hip Score

The Oxford Hip Score (OHS) and Oxford Knee Score (OKS) are 12-item patient-reported outcome measures (PROMs) that quantify pain and functional disability over the preceding four weeks, producing a 0–48 composite score where higher values indicate better function. Both instruments are required by CMS quality programs for elective hip and knee arthroplasty episodes.

Verified May 8, 2026 · 9 sources ↓

Drawn from InnovationNIHGanapathihipkneesurgeonMedrxivArthroplastyjournal

Definition

Source · Editorial summary grounded in 9 cited references ↓

Each Oxford score presents patients with 12 questions covering pain severity and functional activities relevant to the affected joint—such as walking distance, ability to perform personal hygiene tasks, and difficulty with transportation. Each item is rated on a five-point scale scored 0–4 (best to worst outcome per item under the current recommended system), yielding a total range of 0–48. A score of 0–19 indicates severe arthritis or post-operative problems, 20–29 moderate-to-severe, 30–39 mild-to-moderate, and 40–48 satisfactory joint function. The original 1996 publication by Dawson et al. used a 1–5 reverse scoring convention (total 12–60, higher = worse); Oxford University Innovation later standardized the 0–4 per-item convention to eliminate confusion arising from clinicians who had been inverting or adapting the legacy scale.

Both instruments were developed and are copyright-managed by Oxford University Innovation Limited. Any clinical or commercial deployment—including electronic or digital administration—requires a licence. Publicly funded healthcare organizations and non-commercially funded academic users may obtain that licence free of fee. CMS has incorporated the OHS and OKS into its quality reporting framework for total joint arthroplasty, and participating providers must administer them in a manner consistent with the validated paper-and-pen version to preserve measurement equivalence.

A key psychometric caveat: co-morbidities affecting adjacent joints (e.g., lumbar spine pathology influencing knee or hip perception) can inflate scores and reduce joint specificity. Minimally important change (MIC) thresholds—typically cited around 5 points for both instruments—vary meaningfully by baseline score category, a finding confirmed in large NHS datasets exceeding 387,000 records. Clinicians and coders should therefore treat MIC benchmarks as baseline-dependent estimates rather than fixed cut-points when assessing whether a patient has achieved a clinically meaningful response.

Why it matters

CMS quality programs tie reimbursement and public performance reporting to PROM collection for elective total hip and knee arthroplasty (e.g., the Elective Outpatient Procedure measure set). Failure to collect, score, or submit OHS/OKS data in the required format can result in measure non-compliance, downward payment adjustments, or exclusion from value-based care programs. On the clinical side, an incorrect scoring convention—such as applying the legacy 12–60 scale when a registry or EHR expects 0–48—produces systematically reversed or out-of-range data that renders the outcome meaningless for audit, revision-risk stratification, or shared decision-making. Documentation that captures a specific pre-operative and post-operative score, rather than a narrative impression, is what permits the delta calculation on which MIC determinations depend.

Common mistakes

Where people most often go wrong with this concept.

Source · Editorial brief grounded in cited references ↓

  • Applying the legacy 1–5 per-item scoring (total 12–60, higher = worse) when the receiving registry or CMS submission expects the current 0–4 per-item convention (total 0–48, higher = better), which inverts the clinical interpretation.
  • Treating a single published MIC threshold (e.g., 5 points) as universally applicable regardless of the patient's baseline score—evidence from NHS datasets shows MIC varies substantially by pre-operative severity band.
  • Administering the OHS to evaluate hip function in a patient with significant ipsilateral lumbar spine or contralateral knee pathology without flagging co-morbidity interference, which compromises score specificity.
  • Deploying a digital or EHR-embedded version of the OHS/OKS without first obtaining a licence from Oxford University Innovation and confirming eCOA migration equivalence, which may invalidate the measurement properties and create IP liability.
  • Confusing the OHS with the Harris Hip Score (HHS) or HOOS in documentation or code mapping—these are distinct instruments with different psychometric properties and separate measure IDs in CMS quality frameworks.
  • Recording only the total composite score without retaining item-level responses, which prevents identification of the individual functional domains (e.g., 'knee giving way,' 'overall pain') that research identifies as the strongest predictors of revision risk.

Related codes

Codes commonly involved when this concept appears in practice.

Frequently asked questions

Source · Generated from the editorial pipeline, verified against 9 cited references ↓

01What is the correct scoring range for the Oxford Hip and Knee scores today?
Under the current Oxford University Innovation–recommended convention, each of the 12 items is scored 0–4 (4 = best outcome), giving a total range of 0–48. Higher scores indicate better joint function. The original 1996 publication used a 1–5 per-item scale totaling 12–60 where higher scores meant worse function; that legacy system has been officially superseded to prevent scoring errors.
02Are the OHS and OKS required by CMS, and is there a fee to use them?
CMS has incorporated both scores into its quality reporting programs for elective total joint arthroplasty. Oxford University Innovation owns the copyright but grants licences free of fee to publicly funded healthcare providers and non-commercially funded academic users. Providers must apply for a licence before deployment, including for any electronic or EHR-integrated version.
03What is the minimally important change (MIC) for these scores, and how should it be applied?
The MIC is commonly cited at approximately 5 points for both the OHS and OKS, but large NHS registry data (n > 387,000) demonstrate that MIC varies by the patient's baseline score. Clinicians should treat any single published MIC threshold as a rough guide rather than a fixed benchmark, and interpret change relative to where the patient started.
04Can the OHS be used to assess non-surgical treatments, not just arthroplasty?
Yes. Although the OHS was developed primarily to evaluate total hip arthroplasty outcomes, it has been applied in studies of physical therapy, joint supplements, and anti-inflammatory medications. Its responsiveness to change has been shown to exceed that of generic measures like the SF-36 and disease-specific measures like the WOMAC in surgical contexts, though cross-setting validity should be confirmed before non-surgical use.
05Does co-morbidity in adjacent joints affect score accuracy?
Yes. Patients with lumbar spine pathology, contralateral knee disease, or other lower-extremity conditions often report difficulty separating pain sources, which can inflate OHS or OKS scores and reduce joint specificity. Clinicians should document relevant co-morbidities alongside the score to contextualize results for audit or revision-risk stratification.
06Which individual OKS items best predict the need for revision surgery?
Research published in the Journal of Arthroplasty identified 'overall pain,' 'limping when walking,' and 'knee giving way' as the strongest item-level predictors of subsequent revision within the OKS. This underscores the value of retaining item-level data rather than storing only the composite total.

Sources & references

Editorial content was developed using the following public sources. Last verified May 8, 2026.

  1. 01
    innovation.ox.ac.uk
    https://innovation.ox.ac.uk/clinical-outcomes/information-for-cms-members/
  2. 02
    pmc.ncbi.nlm.nih.gov
    https://pmc.ncbi.nlm.nih.gov/articles/PMC1283979/
  3. 03
    ganapathihipkneesurgeon.com
    https://ganapathihipkneesurgeon.com/wp-content/uploads/2016/07/Guidance-for-using-Oxford-Hip-Score.pdf
  4. 04
    medrxiv.org
    https://www.medrxiv.org/content/10.1101/2025.07.29.25332390.full
  5. 05
    arthroplastyjournal.org
    https://www.arthroplastyjournal.org/article/S0883-5403(23)00218-8/fulltext
  6. 06
    sralab.org
    https://www.sralab.org/rehabilitation-measures/oxford-hip-score
  7. 07Dawson J, Fitzpatrick R, Carr A, Murray D. Questionnaire on the perceptions of patients about total hip replacement surgery. J Bone Joint Surg Br. 1996;78:185-190.
  8. 08Murray D, Fitzpatrick R, et al. The use of the Oxford hip and knee scores. J Bone Joint Surg Br. 2007;89(8):1010-1014.
  9. 09Pynsent PB, Adams DJ, Disney SP. The Oxford hip and knee outcome questionnaires for arthroplasty. J Bone Joint Surg Br. 2005;87:241-248.

Mira AI Scribe

When documenting a visit involving pre- or post-operative Oxford Hip Score or Oxford Knee Score administration, capture: (1) which instrument was used (OHS vs. OKS), (2) the total composite score under the current 0–48 convention, (3) the time point (pre-operative baseline, 6-month, 12-month post-op, or other), and (4) any co-morbidities that may have interfered with joint-specific scoring (e.g., ipsilateral lumbar stenosis, contralateral knee arthritis). Flag if the score was collected digitally and confirm licence-equivalent eCOA administration. For CMS quality program submissions, verify the measure ID and submission window align with the correct scoring convention. If the composite score falls below 20, document explicitly that the score indicates severe joint dysfunction, as this supports medical necessity for arthroplasty or revision. If post-operative score change is less than the MIC (~5 points), note the patient's baseline score category before characterizing the result as a non-response, since MIC thresholds are baseline-dependent. Do not substitute the OHS for the OKS or vice versa in documentation—they are separate instruments with separate measure identifiers.

See Mira's approach

Related terms

Ready?

Ready to transform your orthopedic practice?

See how orthopedic practices are running documentation, billing, and operations on a single voice-first platform.

Get started for free