Your battery covers the intoxicant domains well but is blind to fatigue's signature — erratic lapses in sustained attention. This lesson designs a vigilance micro-task that fits your under-2-minute phone budget, and shows the two or three things you must not get wrong.
The gap we're closing
From Lesson 2: sleep loss shows up as wake-state instability — lapses
and rising reaction-time variability, not uniform slowing. From Lesson 1: none of your three tasks sustains
attention long enough or monotonously enough to surface that. This is the highest-value addition for a
"detect any impairment" goal, because fatigue is probably your most common real-world case.
Recall first
The Reliable Change Index judges today's score against...
RCI = (today − baseline) ÷ baseline variability. That's why the variability from a vigilance task is so valuable — it feeds both the fatigue signal and the decision rule.
What the PVT actually is (stripped to essentials)
The Psychomotor Vigilance Task is deceptively simple: a stimulus
appears at random intervals; you respond as fast as you can; it records your reaction time
over and over. That's it. Its power comes from three properties you must preserve if you shrink it:
Random inter-stimulus interval (ISI). The gap between stimuli varies unpredictably (classically 2–10 s). Randomness is non-negotiable — it stops users from anticipating and building a rhythm, which is what forces genuine sustained attention.
No learning curve. There's nothing to strategize, so scores barely improve with practice[1] — the property your richer tasks lack (Lesson 3).
No ceiling. Reaction time always has headroom; you can't "max it out," so impairment always has room to show.
The metrics — this is where the sensitivity lives
Don't just log mean RT. The evidence on what best discriminates sleep loss[1] points to:
Response speedmean(1/RT) — the reciprocal of reaction time. More normally distributed and more sensitive across the whole range than raw mean RT; it's the recommended primary metric.[1]
VariabilitySD or CoV of RT — the direct readout of wake-state instability. Often the earliest sign.
Lapsescount(RT ≥ 500 ms) — momentary attention failures. In a very short task few accumulate, so pair with "minor lapses" (≥ 355 ms) to keep signal.
False startsRT < 100 ms or pre-stimulus — responding before you could have seen the stimulus. Flags guessing/rhythm-tapping; count them separately.
The budget problem — and an honest answer
The full PVT is 10 minutes. The validated brief version (PVT-B) is
3 minutes and keeps much of the sensitivity to sleep loss.[2] You want your
whole battery under ~2 minutes, so a vigilance slice of ~45–60 s. That's shorter than anything
formally validated — which is fine if you treat the shortening as a hypothesis to test, not a given.
At least one study found a 3-minute version already diverging from the 10-minute reference under some
conditions.[3] The shorter you cut, the more you must validate.
How to buy sensitivity back in a 60-second task
Shorter means fewer trials, so lean on the metrics that don't need many: response speed (1/RT) and
variability extract signal from every single trial, whereas lapse counts need time to
accumulate. Use a tight random ISI (e.g., 1–4 s) to pack in ~15–20 trials, and make the primary readout
mean(1/RT) + CoV rather than lapse count.
A concrete starting spec
A defensible v1 to prototype and then validate — the full version lives in the Vigilance Micro-Task Spec reference card:
Duration
~60 s (fits budget; validate against a 3-min PVT-B)
Stimulus
One salient target appears after a random gap; tap ASAP
ISI
1–4 s, uniform random — never fixed
Trials
~15–20 (as many as fit)
Primary
mean(1/RT) and CoV of RT
Secondary
lapses ≥500 ms, minor lapses ≥355 ms, false starts
Feed into
the per-user baseline + RCI from Lesson 3
Gamify carefully — the core is sacred
You're building "games," and that instinct is good for engagement — but the vigilance task's validity lives
in exactly the properties gamification tends to destroy. The rule: decorate around the core, never
alter the core.
Keep: random ISI, simple immediate response, RT as the measurement, boring-by-design so there's nothing to master.
Safe to add: a theme/skin, per-trial RT feedback ("312 ms!"), a gentle progress bar.
Avoid: anything that rewards strategy or lets the player learn a rhythm — a "beat your combo" mechanic, predictable timing, or a difficulty ceiling that makes failure impossible. Each of those quietly reintroduces the practice and ceiling effects from Lesson 3.
Before you trust it — validate
Three checks, in order:
Concurrent validity: do scores on your 60-s version correlate with a 3-min PVT-B in the same sitting?
Test–retest reliability: do un-impaired users score consistently day to day (aim ICC > 0.8)? Without this you can't separate impairment from noise.[2]
Sensitivity to the real thing: does it drop after real sleep loss (e.g., a rested vs. sleep-restricted comparison)? That's the only test that matters in the end.
Check yourself
Why must the inter-stimulus interval be random?
Predictable timing lets people tap on rhythm without truly attending — which destroys the task's whole purpose and invites false starts.
In a 60-second vigilance task, the most sensitive metric is usually...
1/RT and variability extract signal from every trial; lapse counts need many trials to accumulate, which a 60-s task can't provide.
To gamify safely, the thing you must protect is...
Skin and feedback are fine; anything that adds strategy, rhythm, or a ceiling reintroduces practice and ceiling effects and breaks validity.
Your single win
You can now spec a vigilance micro-task that closes your fatigue blind spot inside a 60-second budget:
random ISI, dead-simple response, score on 1/RT and variability, decorate but never alter the core, and
validate the short version against a longer one. Combined with Lessons 1–3, you now have the full loop —
the right domains, the fingerprints, a trustworthy decision rule, and the one task you were missing.
I'm your teacher — ask me anything. Want help picking the exact ISI range, deciding
how many sessions before the baseline is trusted, or designing the rested-vs-tired validation study? Bring it
to the chat.
References
[1] Basner & Dinges, Maximizing Sensitivity of the PVT to Sleep Loss, SLEEP (2011).
[2] Basner, Mollicone & Dinges, Validity and Sensitivity of a Brief PVT (PVT-B) (2011).
[3] The 3-Minute PVT Demonstrates Inadequate Convergent Validity…, Frontiers in Neuroscience (2022).