The test-retest reliability of the consensus best-estimate process was evaluated approximately 6 months after the final patient follow-up interview. Thirty previously reviewed patients were randomly selected, stratified by depression severity, and reevaluated by the panel. Reliability for the three-level outcome of major, subthreshold, or no depression was excellent (weighted kappa= 0.89, 95% CI=0.77–1.00).