Using EBM to Improve: Hypotheses and Experiments

Level: Advanced Module: Evidence-Based Management 3 min read Lesson 46 of 52

Overview

  • What you’ll learn: How to formulate improvement hypotheses, design experiments, define success criteria, interpret results, and create a culture of experimentation.
  • Prerequisites: Lessons 40–45 (all EBM concepts).
  • Estimated reading time: 15 minutes

Introduction

EBM without experimentation is just “evidence-based talking.” The real power of EBM comes from actually running experiments, measuring outcomes, and learning from results. This lesson teaches you how to go from “we should try something” to “we tested X, learned Y, and will do Z next.”

The Hypothesis Template

Every improvement starts with a hypothesis. Use this template:

We believe that [action/change]
will result in [expected outcome]
as measured by [specific metric]
We will know we succeeded when [success threshold]

Example: “We believe that adding WIP limits to our Kanban board will result in shorter cycle times, as measured by average cycle time per item. We will know we succeeded when average cycle time decreases by 20% over 3 Sprints.”

Designing Good Experiments

A good experiment is:

  • Small: Implement the minimum change needed to test the hypothesis. Don’t change five things at once — you won’t know which one caused the result.
  • Time-bound: Define how long the experiment will run (usually 1–3 Sprints).
  • Measurable: You must be able to measure the outcome. If you can’t measure it, you can’t learn from it.
  • Reversible: If the experiment fails, you should be able to revert without significant damage.

Interpreting Results

After the experiment period, inspect the data:

  • Confirmed: The metric improved as hypothesized. Consider making the change permanent and run a new experiment.
  • Disproved: The metric did not improve or got worse. This is not a failure — it is learning. Roll back the change and try a different approach.
  • Inconclusive: Not enough data, too many confounding variables, or the change was too small to measure. Extend the experiment or redesign it.

Creating a Culture of Experimentation

The hardest part of EBM is not the framework — it is the culture. Experimentation requires:

  • Safety to fail: If failed experiments are punished, nobody will experiment. Celebrate learning, not just success.
  • Data literacy: People must understand what the data means and doesn’t mean.
  • Patience: Results take time. One Sprint is rarely enough to see the impact of a process change.
  • Discipline: Run one experiment at a time (or at least control for variables). Don’t change everything simultaneously.

Connecting Experiments to Goals

Every experiment should connect to your goal hierarchy: Tactical Goal (Sprint) → Intermediate Goal (quarter) → Strategic Goal (organization). If you cannot explain how an experiment connects to a strategic goal, it may not be worth running.

Key Takeaways

  • Use the hypothesis template: “We believe [action] will result in [outcome] as measured by [metric].”
  • Design experiments that are small, time-bound, measurable, and reversible.
  • Disproved hypotheses are learning, not failure.
  • Create safety to fail, data literacy, and discipline for experimentation.
  • Connect every experiment to your goal hierarchy.
本課中文版

概述

沒有實驗的 EBM 只是「基於證據的說說而已」。EBM 的真正力量來自實際執行實驗、量測結果、從中學習。

假設模板

我們相信 [行動/改變]
會導致 [預期結果]
用以下方式量測 [具體指標]
當以下情況發生時我們知道成功了 [成功門檻]

設計好的實驗

  • 小的:實施最小的改變來測試假設。
  • 有時限的:定義實驗要跑多久。
  • 可量測的:你必須能量測結果。
  • 可逆的:如果實驗失敗,你應該能回復。

解讀結果

  • 確認:指標如假設改善了。考慮永久化改變。
  • 推翻:指標沒改善或惡化。這不是失敗,是學習。
  • 不確定:數據不足。延長或重新設計實驗。

創造實驗文化

實驗需要:失敗的安全感、數據素養、耐心和紀律。

重點整理

  • 使用假設模板。
  • 設計小的、有時限的、可量測的、可逆的實驗。
  • 被推翻的假設是學習,不是失敗。
  • 將每個實驗連結到目標層級。
日本語版

概要

実験のないEBMは「エビデンスベースの雑談」に過ぎない。EBMの真の力は、実際に実験を行い、結果を測定し、学ぶことにある。

仮説テンプレート

我々は信じる:[行動/変更]が
以下の結果をもたらす:[期待される結果]
以下で測定:[具体的なメトリクス]
以下の場合に成功と判断:[成功閾値]

良い実験の設計

  • 小さい:仮説をテストするための最小限の変更。
  • 期限付き:実験期間を定義する。
  • 測定可能:結果を測定できなければならない。
  • 可逆的:失敗した場合に元に戻せる。

結果の解釈

  • 確認:メトリクスが仮説通りに改善。恒久化を検討。
  • 否定:改善しなかったまたは悪化。失敗ではなく学習。
  • 不確定:データ不足。実験を延長または再設計。

重要ポイント

  • 仮説テンプレートを使う。
  • 小さく、期限付きで、測定可能で、可逆的な実験を設計。
  • 否定された仮説は失敗ではなく学習。
  • すべての実験を目標階層に接続する。

You Missed