PHQ-2 & GAD-2: The Ultra-Brief Depression and Anxiety Screening Pair
The PHQ-2 and GAD-2 are ultra-brief, 2-item self-report screening tools designed for use in primary care, emergency, and other time-pressured settings. The PHQ-2 screens for depression using the two cardinal symptoms (low mood and loss of interest); the GAD-2 screens for anxiety using the two cardinal symptoms (feeling nervous and being unable to stop or control worrying). Both produce a 0-6 score, and a score of 3 or higher is the standard cutoff that triggers escalation to the longer PHQ-9 (depression) or GAD-7 (anxiety) for fuller assessment. Both are public-domain instruments developed by the same research team that built the PHQ-9 and GAD-7 (Spitzer, Kroenke, Williams, Löwe). The PHQ-2 and GAD-2 are screening tools, not diagnoses: a positive screen means a clinician should follow up, not that you have depression or an anxiety disorder.
What are the PHQ-2 and GAD-2?
The PHQ-2 (Patient Health Questionnaire-2) and GAD-2 (Generalized Anxiety Disorder-2) are the ultra-brief 2-item versions of the longer PHQ-9 and GAD-7 depression and anxiety screeners developed by Drs. Robert Spitzer, Kurt Kroenke, Janet Williams, and Bernd Löwe under a research grant from Pfizer. The PHQ-2 was first published by Kroenke and colleagues in 2003 as the first two items of the PHQ-9, and the GAD-2 was published by Kroenke and colleagues in 2007 as the first two items of the GAD-7. Both are designed as 'first-step' screens that quickly identify people who may benefit from a more comprehensive assessment. They are public-domain instruments offered free of charge, available in many languages, and recommended by the U.S. Preventive Services Task Force for routine primary-care screening of depression (Grade B) and anxiety (Grade B) in adults.
What the PHQ-2 and GAD-2 measure
Each ultra-brief screener measures the two cardinal symptoms that anchor the longer parent instrument's full assessment. The PHQ-2 covers little interest or pleasure in doing things (anhedonia) and feeling down, depressed, or hopeless — these are the two symptoms that the DSM uses as the gateway criteria for a major depressive episode. The GAD-2 covers feeling nervous, anxious, or on edge and not being able to stop or control worrying — these are the two symptoms most strongly associated with generalized anxiety disorder. Both screeners use a recall window of the past two weeks. Because each instrument covers only the gateway symptoms, neither is meant to provide a full picture on its own; their job is to surface people who should receive the longer PHQ-9 or GAD-7 (or both).
How the PHQ-2 and GAD-2 are administered
Each instrument is a 2-item self-report screener that takes well under a minute to complete. Each item asks 'Over the last two weeks, how often have you been bothered by [symptom]?' with four response options scored 0 (not at all), 1 (several days), 2 (more than half the days), and 3 (nearly every day). The total score for each screener is the sum of its two items, with a range from 0 to 6. The PHQ-2 and GAD-2 are commonly administered together in primary care as a paired ultra-brief screen and can be completed on paper, on a screen, or verbally during a routine visit. Many electronic health records prompt clinicians to administer these screeners at routine appointments and automatically escalate to the PHQ-9 or GAD-7 when the threshold is met.
Who uses the PHQ-2 and GAD-2
The PHQ-2 and GAD-2 are used routinely in primary care, urgent care, emergency departments, integrated behavioral health programs, and hospital pre-admission screening. The U.S. Preventive Services Task Force recommends routine adult screening for depression (Grade B) and for anxiety disorders in adults 64 and younger (Grade B) when adequate follow-up systems exist, and brief instruments like the PHQ-2 and GAD-2 are commonly used to meet that recommendation efficiently. Outside primary care, both are used in research, public health surveys, and community mental-health initiatives. They are particularly valuable when time is scarce — for example, in pre-visit triage, on a tablet in a waiting room, or as part of an annual wellness check that already has many other components.
PHQ-2 and GAD-2 are first-step screens, not diagnoses
Both the PHQ-2 and the GAD-2 are designed as first-step screens, not standalone diagnostic tools. A positive screen (≥3 on either instrument) is a signal that the longer PHQ-9 or GAD-7 should be administered, and that a clinician should follow up with a more comprehensive assessment. A negative screen does not rule out depression, anxiety, or other mental-health conditions — particularly if the person is presenting with specific concerns or has known risk factors. The ultra-brief design trades sensitivity for brevity and is meant to be the start of a conversation, not the end of one. If you are in crisis after reviewing your results, call or text 988 (U.S. Suicide & Crisis Lifeline) for immediate support.
How to score and interpret your results
Both ultra-brief screens use the same simple math. Each item is rated 0 to 3 based on how often the symptom has bothered you over the past two weeks, and the two items are summed to give a total between 0 and 6. The same scale applies whether you are completing the PHQ-2 for depression or the GAD-2 for anxiety — one reason the pair is often delivered together as a single ultra-brief checkpoint.
The cutoff is the same for both. A score of 3 or higher on either tool is a positive screen and a signal to administer the longer parent questionnaire — the PHQ-9 or the GAD-7. A score below 3 is a negative screen, but that result does not rule out depression, anxiety, or any other mental-health condition; it only means today’s ultra-brief check did not flag a concern.
Side-by-side scoring at a glance
The table below shows how the two screens line up. Both share an identical response scale, an identical score range, and an identical positive-screen cutoff — only the symptoms they ask about differ.
| Element | PHQ-2 (depression) | GAD-2 (anxiety) |
|---|---|---|
| Number of items | 2 | 2 |
| Recall window | Past 2 weeks | Past 2 weeks |
| Per-item score | 0-3 | 0-3 |
| Total score range | 0-6 | 0-6 |
| Positive-screen cutoff | ≥3 | ≥3 |
| Step-2 escalation | PHQ-9 | GAD-7 |
The cutoff of 3 is not arbitrary. For the PHQ-2, likelihood-ratio and receiver-operating-characteristic analysis in the Kroenke 2003 validation study identified 3 as the optimal cutpoint for screening. The GAD-2 settled on the same threshold for the same reason — it best balances catching cases (sensitivity) against falsely flagging people who are not depressed or anxious (specificity). Either way, a 3 means escalate, not diagnose.
The 4 items: PHQ-2 + GAD-2 verbatim
All four items share the same stem and the same answer choices. Each instrument opens with the prompt: “Over the last 2 weeks, how often have you been bothered by the following problems?”. You then rate each item on the four-point frequency scale below.
| Response | Score |
|---|---|
| Not at all | 0 |
| Several days | 1 |
| More than half the days | 2 |
| Nearly every day | 3 |
The two PHQ-2 items (depression):
- Little interest or pleasure in doing things
- Feeling down, depressed or hopeless
The two GAD-2 items (anxiety):
- Feeling nervous, anxious or on edge
- Not being able to stop or control worrying
Each pair is intentionally narrow. The PHQ-2 covers the two cardinal DSM-5 symptoms of major depression: anhedonia (loss of interest in things you usually enjoy) and depressed mood. These are the symptoms that anchor the formal diagnostic criteria of “most of the day, nearly every day, for at least two weeks”. The GAD-2 covers the two symptoms most strongly associated with generalized anxiety disorder: a baseline feeling of nervousness or being on edge, and worry that you cannot turn off. Because each pair is the first two items of the longer PHQ-9 and GAD-7, scoring positive on the ultra-brief version essentially means the longer version is already partly completed.
The escalation rule: when a positive screen triggers PHQ-9 or GAD-7
The PHQ-2 and GAD-2 are designed as Step 1 in a two-step stepped-screening model. A positive ultra-brief score is a trigger to administer the longer parent instrument for a fuller picture — it is not, by itself, a reason to start treatment.
The escalation rule on the depression side is explicit. The University of Washington’s primary-care reference states that “patients who screen positive should be further evaluated with the PHQ-9, other diagnostic instruments, or direct interview to determine whether they meet criteria for a depressive disorder”. The U.S. Preventive Services Task Force endorses this stepped approach by recommending depression screening with “PHQ in various forms” and requiring that anyone who screens positive be “appropriately diagnosed and treated with evidence-based care or referred”. In practice, a PHQ-2 of 3 or more becomes a PHQ-9 administration at the same visit or the next one.
What the Step-2 instruments add
On the anxiety side, the escalation logic is the same. The Sapra 2020 primary-care review describes the GAD-2 as having been “proposed as an essential first step for screening generalized anxiety disorder” and frames it explicitly as “the ultra-quick version” intended to trigger fuller assessment. The USPSTF anxiety recommendation reinforces the pattern: GAD-2 at cutoff ≥3 is a validated brief screener, and GAD-7 at cutoff ≥10 is the diagnostic follow-up. A positive GAD-2 therefore escalates to a GAD-7 administration, which adds five more items covering restlessness, irritability, trouble relaxing, anticipatory fear, and difficulty concentrating.
The longer instruments do two things the ultra-brief versions cannot. They provide severity grading — for the GAD-7, scores of 5, 10, and 15 mark mild, moderate, and severe anxiety — and they sharpen accuracy by adding items that distinguish clinical anxiety or depression from short-lived stress reactions. Skipping the Step-2 instrument after a positive ultra-brief screen is the most common implementation failure of the model.
How accurate are the ultra-brief screens?
For 4-item tools that take under a minute to complete, both screens are surprisingly good — but neither is a substitute for a clinical interview. The numbers below come from the original validation studies and the 2023 USPSTF systematic review.
| Instrument | Study | Population (N) | Cutoff | Sensitivity | Specificity |
|---|---|---|---|---|---|
| PHQ-2 | Kroenke 2003 | 6,000 primary-care + OB/GYN patients (criterion subsample 580) | ≥3 | 83% | 92% |
| PHQ-2 | UW mirror (Kroenke 2003) | Same | ≥3 | 82.9% | 90.0% |
| GAD-2 | Kroenke 2007 | 965 primary-care patients, 15 U.S. clinics | ≥3 | AUC 0.80-0.91 across 4 anxiety disorders | — |
| GAD-2 | Sapra 2020 / UW (Kroenke 2007) | Same | ≥3 | 86% (GAD) | 83% (GAD) |
| GAD-2 | USPSTF 2023 | Pooled review | ≥3 | 0.81 | 0.86 |
| GAD-7 (Step 2) | USPSTF 2023 | Pooled review | ≥10 | 0.79 | 0.89 |
What those numbers mean in practice
Sensitivity is the screen’s ability to catch true cases — an 83% sensitivity for the PHQ-2 means roughly 17 of every 100 people with major depression will be missed at a single screening. Specificity is the screen’s ability to clear people who do not have the condition — 92% specificity means roughly 8 of every 100 people without depression will be flagged for a follow-up they do not need. The GAD-2 trades a little specificity (83-86%) for a little extra sensitivity (81-86%) compared with the PHQ-2, but the two screens land in the same overall accuracy band.
The GAD-2 also retains useful screening accuracy for anxiety disorders other than GAD. At cutoff ≥3 it reaches 76% sensitivity and 81% specificity for panic disorder, 70% and 81% for social anxiety disorder, and 59% and 81% for PTSD. It is not a substitute for disorder-specific instruments, but it is a reasonable single trigger for evaluating multiple anxiety presentations. A screen this short will miss some cases — which is exactly why it must be paired with clinical follow-up and never used as a standalone diagnosis.
Why use the ultra-brief instead of the full PHQ-9 or GAD-7?
The honest answer is time. A 4-item paired screen takes well under a minute; a combined PHQ-9 plus GAD-7 takes several minutes plus scoring. In a primary-care visit that has to cover a chronic condition, a vaccination question, a lab review, and a referral, the ultra-brief screen is often the only mental-health checkpoint that fits.
When the ultra-brief is the right tool
The PHQ-2 and GAD-2 are well suited to settings where the goal is case-finding at scale rather than detailed assessment:
- Routine annual visits where mental health is one of many items on the agenda
- Pre-visit triage on a tablet in the waiting room, so positives can be flagged before the clinician walks in
- Emergency-department or urgent-care intake where time per patient is minimal
- Hospital pre-admission screening as part of a broader history
- Population surveys and research where a longer instrument would crater response rates
- Stepped-care protocols that explicitly hand off Step-1 positives to Step-2 assessment
The USPSTF Grade B recommendations for depression and for anxiety screening in adults are part of why these ultra-brief tools are so widely embedded. Both 2023 recommendations name brief PHQ and GAD instruments and require adequate follow-up systems. The Sapra 2020 review reinforces the rationale: the GAD-2 is “quick to administer, enhancing time efficiency” while keeping “good sensitivity and specificity for the diagnosis of the most common anxiety disorders encountered in primary care”.
When the longer version is the right tool
The full PHQ-9 and GAD-7 earn their extra items in two situations. First, when a positive ultra-brief screen needs diagnostic confirmation and severity grading — the longer tools include items on sleep, appetite, concentration, restlessness, and (for the PHQ-9) thoughts of self-harm that change clinical urgency. Second, when the goal is tracking treatment response over time, the longer scales have more measurement resolution than a 0-6 ultra-brief total can provide. As detection tools at first contact, the ultra-brief versions are excellent; as ongoing monitoring instruments, the full versions are better.
What to do with your results and when to seek help
The PHQ-2 and GAD-2 are screens, not diagnoses — and they are Step 1, not even a full screen. Whatever your score, the right next move depends on the result, your history, and how you have been feeling beyond the two questions each instrument asks.
A positive score (3 or higher on either screen) is a signal to talk with a clinician and complete the longer parent instrument. A primary-care visit is the most common path; the longer PHQ-9 or GAD-7 will add the items needed to grade severity and inform next steps. A positive screen does not mean you have depression or an anxiety disorder. It means the symptoms you are experiencing rise above the threshold where a fuller assessment is warranted.
A negative score (0, 1, or 2 on both screens) does not rule out depression, anxiety, or other mental-health conditions, particularly if you have specific concerns or known risk factors. If something feels wrong even with a negative screen, that signal is worth raising with a clinician on its own.
Contact a clinician promptly if any of the following apply
- You scored 3 or higher on either screen and are not already in care for that condition
- You have had depressed mood or loss of interest most of the day, nearly every day, for at least two weeks
- You have a personal or family history of depression, anxiety, or bipolar disorder
- You are pregnant or postpartum and have noticed mood or anxiety changes (USPSTF specifically includes perinatal screening)
- You have a chronic medical condition known to raise depression risk
- Your sleep, appetite, work, or relationships have changed in ways you cannot explain
If you are in crisis
If you are having thoughts of suicide or self-harm, get help now. Call or text the 988 Suicide & Crisis Lifeline at 988, or chat at 988lifeline.org. In life-threatening situations, call 911. The 988 line is free, confidential, and available 24 hours a day in the United States; you do not need to be in immediate danger to use it.
Frequently asked questions
What is a passing or positive PHQ-2 or GAD-2 score?
There is no “passing” score — these are screens, not pass/fail tests. The standard cutoff is 3 or higher on either instrument: that is a positive screen and a signal to complete the longer PHQ-9 or GAD-7 and talk with a clinician. A score of 0-2 is a negative screen, which does not rule out depression or anxiety.
What’s the difference between PHQ-2 and PHQ-9? Or GAD-2 versus GAD-7?
The PHQ-2 is the first two items of the PHQ-9; the GAD-2 is the first two items of the GAD-7. The ultra-brief versions are Step 1 case-finders that take under a minute; the longer versions add items needed for severity grading and diagnostic confirmation. A positive ultra-brief screen normally triggers the longer parent instrument as Step 2.
Are the PHQ-2 and GAD-2 free to use? Do I need permission?
Yes, they are free, and no special permission is required. According to the official PHQ screeners portal, “all PHQ, GAD-7 screeners and translations are downloadable from this website and no permission is required to reproduce, translate, display or distribute them”. The instruments are public-domain and have been translated into many languages.
Is there a PDF of the PHQ-2 or GAD-2?
Official PDF versions of every PHQ and GAD instrument are hosted on the PHQ screeners portal maintained by Pfizer Inc., where they can be downloaded without permission for clinical, research, or educational use. The verbatim items used by clinicians match the wording reproduced above.
Can I take the PHQ-2 and GAD-2 myself at home?
Both are self-report instruments and can be completed by an adult without a clinician present. Self-administration is useful as a personal check-in or as preparation for a visit, but a positive result should be reviewed with a clinician — the screens are designed to trigger a conversation, not replace one.
What CPT or billing code applies?
The PHQ screeners portal does not publish CPT or billing codes; coding depends on the clinical context (screening versus diagnostic), the payer, and the country. A clinician or medical biller is the right source for the specific code that applies to a given visit. The instruments themselves remain royalty-free regardless of how a visit is billed.
Do the PHQ-2 and GAD-2 diagnose depression or anxiety?
No. Both are screening tools, not diagnostic tools. A positive screen identifies people who should receive a longer assessment (PHQ-9 or GAD-7) and a clinical evaluation; only a qualified clinician — using diagnostic criteria, history, and judgment — can make a diagnosis.