Depression causes mood disorders with noticeable problems in day-to-day activities. Current methods of assessing depression depend almost entirely on clinical interviews or questionnaires. They lack systematic and efficient ways of incorporating behavioral observations that are strong indicators of a psychological disorder. To help clinicians effectively and efficiently diagnose depression severity, automated systems, using objective and quantifiable data for depression assessment, are being developed. This paper presents a framework toward estimating a clinical depression-specific score, namely the Beck Depression Inventory-II (BDI-II) score, based on the analysis of facial expressions features. To extract facial dynamic features, we propose a novel dynamic feature descriptor denoted as median robust local binary patterns from three orthogonal planes (MRLBP-TOP), which can capture both the microstructure and macrostructure of facial appearance and dynamics. To aggregate the MRLBP-TOP over an image sequence, we propose a variant to the Fisher vector (FV) encoding scheme, denoted as the Dirichlet process FV (DPFV). DPFV adopts Dirichlet process Gaussian mixture models (DPGMM) to automatically learn the number of GMM mixtures and model parameters. Experimental results on the AVEC2013 and AVEC2014 depression databases have demonstrated the effectiveness of the proposed method.

Original languageEnglish
Article number8501575
Pages (from-to)1476-1486
Number of pages11
JournalIEEE Transactions on Multimedia
Volume21
Issue number6
Early online date2018
DOIs
Publication statusPublished - Jun 2019

    Research areas

  • Depression, Dirichlet process Fisher vector (DPFV), dynamic feature descriptor, median robust local binary patterns from three orthogonal planes (MRLBP-TOP), nonverbal behaviors

ID: 40029195