Skip to main content

Demographic and clinical characteristics associated with anxiety and depressive symptom outcomes in users of a digital mental health intervention incorporating a relational agent

Abstract

Background

Digital mental health interventions (DMHIs) may reduce treatment access issues for those experiencing depressive and/or anxiety symptoms. DMHIs that incorporate relational agents may offer unique ways to engage and respond to users and to potentially help reduce provider burden. This study tested Woebot for Mood & Anxiety (W-MA-02), a DMHI that employs Woebot, a relational agent that incorporates elements of several evidence-based psychotherapies, among those with baseline clinical levels of depressive or anxiety symptoms. Changes in self-reported depressive and anxiety symptoms over 8 weeks were measured, along with the association between each of these outcomes and demographic and clinical characteristics.

Methods

This exploratory, single-arm, 8-week study of 256 adults yielded non-mutually exclusive subsamples with either clinical levels of depressive or anxiety symptoms at baseline. Week 8 Patient Health Questionnaire-8 (PHQ-8) changes were measured in the depressive subsample (PHQ-8 ≥ 10). Week 8 Generalized Anxiety Disorder-7 (GAD-7) changes were measured in the anxiety subsample (GAD-7 ≥ 10). Demographic and clinical characteristics were examined in association with symptom changes via bivariate and multiple regression models adjusted for W-MA-02 utilization. Characteristics included age, sex at birth, race/ethnicity, marital status, education, sexual orientation, employment status, health insurance, baseline levels of depressive and anxiety symptoms, and concurrent psychotherapeutic or psychotropic medication treatments during the study.

Results

Both the depressive and anxiety subsamples were predominantly female, educated, non-Hispanic white, and averaged 38 and 37 years of age, respectively. The depressive subsample had significant reductions in depressive symptoms at Week 8 (mean change =—7.28, SD = 5.91, Cohen’s d = -1.23, p < 0.01); the anxiety subsample had significant reductions in anxiety symptoms at Week 8 (mean change = -7.45, SD = 5.99, Cohen’s d = -1.24, p < 0.01). No significant associations were found between sex at birth, age, employment status, educational background and Week 8 symptom changes. Significant associations between depressive and anxiety symptom outcomes and sexual orientation, marital status, concurrent mental health treatment, and baseline symptom severity were found.

Conclusions

The present study suggests early promise for W-MA-02 as an intervention for depression and/or anxiety symptoms. Although exploratory in nature, this study revealed potential user characteristics associated with outcomes that can be investigated in future studies.

Trial Registration

This study was retrospectively registered on ClinicalTrials.gov (#NCT05672745) on January 5th, 2023.

Peer Review reports

Background

Mental health issues have consistently contributed to the worldwide disease burden [1]. This held true before but was exacerbated by the COVID-19 pandemic, given its associated increase of 25% in depressive and anxiety disorders [2]. Stressing already limited mental health treatment access and provider capacity, the increased prevalence has left even more individuals without needed care [3, 4]. In order to mitigate such prolific access and capacity issues, people experiencing symptoms of depression and anxiety may seek interventions with alternative delivery modalities [5]. Indeed, the heightened need for treatment access has led to an increase in the use and study of technology-delivered mental health interventions [6]. Such interventions, known as digital mental health interventions (DMHIs), offer self-guided (no human support), guided (human support specific to the DMHI offered via telephone, video, or text), or blended (a combination of DMHI plus human support) approaches to deliver treatment modules and skill-building interventions [7]. DMHIs have included therapeutic techniques from one or more evidence-based psychotherapies, such as cognitive behavioral therapy (CBT) [7]. In aggregate, DMHIs have been shown to produce at least moderate improvements in depressive and anxiety symptoms [8,9,10].

Among DMHIs, there has been increased attention on and proliferation of mental health chatbots [11, 12] also known as conversational agents [13, 14]. Conversational agent-delivered interventions are often available on a smartphone application (app), enabling users access at any time for support and to address immediate problems contextually in real time. Like the broader DMHI category, conversational agent-delivered interventions may be founded in evidence-based psychotherapeutic approaches such as CBT, Acceptance and Commitment Therapy (ACT), and Dialectical Behavior Therapy (DBT) [7, 15,16,17]. In addition to thoughtful conversational design, some conversational agents are bolstered by artificial intelligence and/or natural language processing (NLP) [18], which may enable response by delivery of therapeutic intervention and psychoeducational content. Conversational agent-delivered interventions may be particularly useful for those who wish to receive help without speaking to a clinician due to stigma or cost concerns [13]. Recent meta-analyses of conversational agent delivered DMHIs indicate that they are generally viewed favorably by users and may enhance access to mental health care and improve mental health outcomes [11, 13, 19].

When conversational agents are built to form an alliance with users over time, they may be referred to as relational agents [20, 21]. Some data have demonstrated that relational agents are able to form a working therapeutic alliance with users [22], which has been linked to improved mental health outcomes [23]. These capabilities, coupled with their immediate availability in someone’s moment of need, offer the potential for relational agents to extend the current menu of mental health services available to those in need. Unfortunately, little is known about the characteristics of users who may benefit from relational agent-delivered DMHI [11].

The broader, non-relational agent-specific DMHI literature provides some clues about what types of user demographic or clinical characteristics might be associated with outcomes among relational agent users, but much of the evidence has been inconclusive. DMHI studies of depression and anxiety have reported inconsistent findings in outcomes by characteristics such as gender, age, marital status, or education [24,25,26,27,28,29]. Although there is preliminary evidence of the feasibility and acceptability of DMHI interventions for racial/ethnic, LGBTQ (lesbian, gay, bisexual, transgender, queer or questioning), and other marginalized or underserved groups, these demographic and self-identity characteristics have been underexplored as predictors of outcomes in the DMHI literature [30, 31]. Individual DMHI studies addressing racial and ethnic minorities have shown promise in reducing depressive symptoms, but large scale prospective studies are lacking [31].

With regard to clinical characteristics, non-relational agent-specific DMHI studies have generally found that users with higher severity levels of anxiety and depression tend to achieve greater improvement in these symptoms than those with lesser severity [8, 10, 32], likely an artifact that these users have more room for improvement [8]. Surprisingly, the association between concurrent mental health treatment (i.e., psychotherapy and/or psychotropic medications) and DMHI-related changes in depression and anxiety symptoms has seen little investigation. One study of health care workers during the COVID-19 pandemic found that exposure to a mobile phone–based mental health intervention did not significantly affect Depression Anxiety Stress Scale-21 (DASS-21) scores in the overall sample, but a subgroup analysis of those receiving psychotherapy or psychotropic medications found significant improvements in this group [33]. While some data exist on demographic and clinical characteristic associations with mental health outcomes among the broad category of DMHIs, the authors could find no such data in the literature specific to relational or conversational agent-delivered DMHIs. Thus, further exploration of putative demographic and clinical characteristics among users of a relational agent-delivered DMHI is needed to illuminate optimal uses as well as perhaps tailor interventions to the unique needs of individuals experiencing anxiety and depressive symptoms.

This present study evaluated Woebot for Mood and Anxiety (W-MA-02), an intervention delivered via Woebot, a relational agent that utilizes thoughtful conversational design and some NLP to incorporate elements of  CBT, Interpersonal Psychotherapy (IPT), and DBT into a text-based interface on a smartphone app. Feasibility, acceptability, and efficacy studies using Woebot-based interventions have shown promise in young adults reporting depressive symptoms [34], adults with substance abuse issues [35], adolescents with depressive and/or anxiety symptoms [36], and adult women with postpartum mood concerns [37, 38]. The present study sought to address gaps in the literature specific to a relational agent-delivered DMHI by first measuring depressive or anxiety symptom changes after 8 weeks of W-MA-02 use among participants with self-reported clinical levels of depressive or anxiety symptoms at baseline, respectively. In addition, the study explored associations between participant demographic and clinical characteristics and depressive and anxiety symptom outcomes. The research questions in this exploratory study included the following:

  1. 1.

    Do participants with clinically elevated depressive symptoms at baseline have significant decreases in symptoms at Week 8 of a relational agent-delivered DMHI?

  2. 2.

    Do participants with clinically elevated anxiety symptoms at baseline have significant decreases in symptoms at Week 8 of a relational agent-delivered DMHI?

  3. 3.

    Does the amount of depressive and anxiety symptom reduction at Week 8 of a relational agent-delivered DMHI differ by demographic characteristics, clinical characteristics (i.e., baseline anxiety and depression symptom severity), and involvement in concurrent mental health treatment?

Methods

Overall study description

This exploratory study was non-randomized, single-armed, and open-labeled. Study procedures were approved by the WIRB-Copernicus IRB Group (WCG) institutional review board on January 20th, 2022. All participants provided informed consent. Data was collected from W-MA-02 participants between May 11th, 2022 and July 20th, 2022. The study was retrospectively registered on ClinicalTrials.gov (#NCT05672745) on January 5th, 2023.

Recruitment

A convenience sample was recruited via social media advertisements, such as Facebook and Instagram, containing an embedded hyperlink to the study’s electronic consent form. Upon signing the consent form, study staff emailed participants a link to complete the self-report screening (assessing inclusion and exclusion criteria) and baseline survey questions. The inclusion criteria were: (1) 18 years of age or older, (2) residence in the United States (U.S.), (3) reported English literacy, (4) availability to complete study activities, and (5) ownership or regular access to a smartphone with WiFi and sufficient data access for the duration of the study. The exclusion criteria were: (1) current suicidal ideation with a plan or intent to act or a suicide attempt within the past 12 months, (2) a reported lifetime diagnosis of a psychotic disorder (including schizophrenia or schizoaffective disorder) or bipolar disorder, or (3) previous use of a Woebot application.

Participants who met inclusion and exclusion criteria and who completed the baseline survey assessments within 7 days of receipt were immediately provided with a link to access and download W-MA-02. They were instructed to complete registration within 3 days using the study’s referral code and the same credentials entered at consent. Those who failed to register within 3 days became ineligible for the study and thus were not enrolled. Participants who did not meet eligibility criteria were thanked for their interest, given a list of national crisis resources, and exited from the study.

Study procedures

Enrolled participants entered the 8-week intervention period, where they were invited to engage with W-MA-02 daily. Participants completed survey assessments that were delivered outside of the application via email invitation at Day 3, Week 4 (mid-intervention), and Week 8 (end of intervention; EOI). Upon the completion of the final survey, participants exited the study and their access to W-MA-02 was discontinued. Participants received a maximum of $100 in Amazon gift cards for completing study assessments ($20, $30, and $50 for Day 3, Week 4, and Week 8, respectively; payments were not linked to app usage). This study reports data collected at baseline and Week 8 (EOI). Figure 1 describes the participant flow through the study.

Fig. 1
figure 1

Participant Flow Through Study. EOI = end of intervention. EOS = end of study

W-MA-02 intervention

W-MA-02 is intended for users seeking help with mood management and tracking; it is not mental health disorder-specific. The application is accessible as an iOS or Android smartphone application and hosts the relational agent Woebot to guide users through psychotherapeutic content delivered via text-based messages. Using proprietary natural language processing, the user experience centers around goal-oriented conversations based on self-reported real-time needs. These conversations are tailored to help the user develop emotion regulation skills in the context of problems experienced in their everyday lives. W-MA-02 is grounded in scientifically validated psychotherapies (e.g., CBT, IPT, DBT) and helps users address mood monitoring and management, as well as utilizes tools such as progress reflection, gratitude journaling, and mindfulness practice. Previous studies of various Woebot applications, including the W-MA-02 intervention, have provided evidence of feasibility, acceptability and efficacy [34, 35, 38, 39], as well as demonstrated its ability to form a working alliance with users [22]. Sample screenshots from W-MA-02 are shown in Fig. 2.

Fig. 2
figure 2

Screenshots of Mood Monitoring within W-MA-02

Ethical considerations

W-MA-02 is not a crisis service. However, it is equipped with an NLP algorithm that detects patterns in user free-text input that may suggest potential concerning language. Upon detection, W-MA-02 asks the user to confirm if they are in crisis. For all confirmations, W-MA-02 provides a list of resources that includes emergency contact phone numbers and suicide crisis hotline information. W-MA-02 then offers tools to assist with addressing upsetting emotions and thoughts in the moment. The list of resources is also immediately available at any time to the user either from the settings menu or by entering “SOS” in free text. Potential concerning language detection triggers and confirmations were not actively monitored by staff during the study.

Due to the low-risk profile of this study, adverse events (AEs) were assessed by spontaneous participant reports communicated outside of W-MA-02 and followed a predetermined safety protocol. In the event of a spontaneously reported AE to any study staff, the Principal Investigator would be immediately alerted with all available clinical and other details of the event. Based on this initial report, appropriate steps would be taken to gather further information as needed and address the AE. During this study, no such AEs were reported.

Assessments

Participants were emailed links to survey assessments through the Castor Electronic Data Capture (EDC) [40] platform, with a 7-day window to complete the baseline and Week 8 surveys reported herein. Completion of the baseline survey was considered Day 1 of the intervention period. We tracked the assessment completion rate, defined as completion of both the PHQ-8 and GAD-7 at baseline and Week 8.

Demographic battery

The baseline survey assessed demographic characteristics, including age, race/ethnicity, marital status, education, sexual orientation, employment status, and health insurance. Gender identity, while collected, did not yield enough individuals who identified outside of the most common categories (Man or Woman) to allow for inclusion in the models. Instead, we included sex assigned at birth.

Clinical characteristics

Concurrent mental health treatment

Concurrent treatment was defined as mental health treatment at any point in the study. At baseline, participants were asked to indicate if they were currently seeing a therapist for mental health concerns and/or if they were taking medications for a psychiatric condition. At the Week 8 assessment, participants were asked if psychotropic medications and/or psychotherapy were continued or if either was started during the course of the 8-week study intervention.

Patient Health Questionnaire (PHQ-8)

The PHQ-8 is a widely used 8-item self-report measure, with demonstrated reliability and sensitivity to clinical change. It is used to screen for depression, measure symptom severity, and assess outcome [41]. Respondents are asked to consider symptoms over the past two weeks and to indicate frequency of symptoms from not at all (0) to nearly every day (3). The total sum scores range from 0–24, with the following severity cutoffs—minimal (0–4), mild (5–9), moderate (10–14), moderately severe (15–19), and severe (20–24). A score of 10 or greater is considered an acceptable clinical cutoff for current depression [41]. PHQ-8 scores at baseline and Week 8 were analyzed in this study.

Generalized Anxiety Disorder (GAD-7)

The GAD-7 is a brief self-report tool to assess the frequency and severity of anxious thoughts and behaviors [42]. Respondents are asked to indicate frequency of symptoms from not at all (0) to nearly every day (3) over the previous two weeks. Total sum scores range from 0–21, with the following severity cutoffs—minimal (0–4), mild (5–9), moderate (10–14), and severe (15–21). A score of 10 or greater is considered a reasonable clinical cutoff for generalized anxiety disorder [42].

W-MA-02 utilization

Over the course of the study, we tracked participant W-MA-02 log-ins. Completion of the.

intervention was defined as logging in to W-MA-02 during at least 50% (i.e., at least 4 of 8) of the study weeks (yes vs. no).

Statistical analysis

Each set of analyses was performed on the two analytic subsamples: study participants with PHQ-8 ≥ 10 at baseline and those with GAD-7 ≥ 10 at baseline. Participants with clinically elevated symptoms of both depressive and anxiety symptoms were included in both subsamples (although multiple regression models adjusted for the presence of the other symptoms in analyses). Continuous demographic and clinical variables were summarized by means and standard deviations; discrete and/or ordinal variables were summarized by reporting the sample size and proportion of participants in each category. Response was defined as having a 50% reduction in PHQ-8 measured depressive symptoms among those with PHQ-8 ≥ 10 at baseline and having a 50% reduction in GAD-7 measured anxiety symptoms among those with GAD-7 ≥ 10 at baseline.

To address the first research question, a paired t-test was applied to the baseline and Week 8 PHQ-8 scores for the subgroup defined as having a baseline PHQ-8 ≥ 10. Similarly, for those with elevated anxiety symptoms at baseline (GAD-7 ≥ 10), a paired t-test was applied to the baseline and Week 8 GAD-7 scores to address the second research question. For both instances, the tests were intent-to-treat and two-sided, with alpha = 0.05.

To assess the third research question, sets of bivariate and multiple linear regression models were separately fit to the subgroups with baseline PHQ-8 ≥ 10 and baseline GAD-7 ≥ 10. Each regression model included change in depressive or anxiety symptom scores as outcomes, respectively, and each demographic and clinical characteristic as independent variables. First, bivariate regressions were run between each outcome and each demographic and clinical characteristic, separately. Then, a multiple regression model that included all demographic and clinical characteristics as well as an indicator of utilization to adjust for W-MA-02 use (dichotomous variable of having any W-MA-02 use in at least 50%, or 4 of 8, weeks of the study) were run and, using Akakie’s Information Criterion (AIC) for model selection, rerun until the final model was attained. Final models also removed all variables with Variance Inflation Factors greater than 4 to account for multicollinearity. Each model was run on the ITT population with no accommodations for missing data (i.e., those with missing data were dropped from individual bivariate or multiple regression models in each instance). Sensitivity analyses were performed with the PP population, which included participants who used the app in at least 4 of 8 study weeks and who completed the end of study (EOS) PHQ-8 and GAD-7 assessments; the same modeling approach used for the intention to treat (ITT) population was applied to the PP population. The level of significance for all final models was 0.05. All analyses were performed with the R statistical software program [43].

Results

Sample

As detailed in the CONSORT diagram (Fig. 3), 1105 potential participants were screened. Of these, 485 were excluded for not meeting eligibility criteria. After screening and baseline assessments were conducted, an additional 358 participants with unauthorized accounts (e.g., duplicate registrants) were withdrawn from the study and excluded from the analyses. Of the original 262 enrollees, an additional six unauthorized registrants began to complete study procedures but were subsequently removed from the analytic sample because they were initially missed as a result of a clerical error. Thus, the final analytic sample consisted of 256 participants. Of these 256 participants, 245 completed the Day 3 survey, 236 completed the Week 4 surveys, and 234 completed the Week 8 surveys.

Fig. 3
figure 3

CONSORT diagram. Although Day 3 and Week 4 survey data were collected as part of the parent study, they were not part of the analyses included in the present report. Six participants were removed after enrollment due to a clerical error; they should have been removed prior to enrollment

Sample characteristics

The analytic subsamples included 111 participants with elevated baseline levels of depressive symptoms and 107 participants with elevated baseline levels of anxiety symptoms at baseline as defined by a PHQ-8 score ≥ 10 and/or GAD-7 score ≥ 10, respectively. The sample size for the combined (elevated on PHQ-8 or GAD-7) groups comprised 139 participants; some participants with elevated symptoms of both depressive and anxiety symptoms were in both analytic subsamples (n = 79). Both the depressive symptom and anxiety symptom samples were predominantly female, educated, and employed, identified as non-Hispanic white and heterosexual, and had private health insurance (Table 1) with mean ages of 38 and 37 years, respectively. Among those with clinical levels of depressive symptoms, 50% had symptoms falling in the moderately severe or severe range (PHQ-8 score ≥ 15) and 49% reported concurrent mental health treatment during the study. Among those with clinical levels of anxiety symptoms, 44% had symptoms falling in the severe range (GAD-7 score ≥ 15) and 53% reported concurrent mental health treatment during the study.

Table 1 Baseline sociodemographic and clinical characteristics, adherence, and satisfaction among participants with clinical levels of baseline depressive or anxiety symptoms

Per Protocol (PP) characteristics

The assessment completion rate at Week 8 was 91%. The PP population examined in sensitivity analyses comprised 177 (69%) of the 256 participants in the analytic sample after removing 67 (26%) who used W-MA-02 for at most 3 of 8 weeks and removing 22 (9%) whose EOS PHQ-8 and GAD-7 assessments at Week 8 were fully missing (no item scores were completed for each measure); the removals were not mutually exclusive (n = 10 met both criteria). Sixty-four percent (N = 71/111) of the elevated depressive symptoms subgroup met PP and 59% (N = 63/107) of the elevated anxiety symptoms subgroup met these criteria.

Adverse events

No study participant reported an AE or serious adverse event (SAE) during the study. Over the course of the study, 127,773 messages were sent by the 256 participants. A subset of 71 participants provided text inputs containing language suggestive of a potential crisis. Of these participants, four confirmed being in a crisis situation and were automatically provided resources and support as described in the Ethical Considerations section of the Methods.

Depressive symptoms

In the subgroup with clinically elevated depressive symptoms, PHQ-8 scores significantly decreased over the 8-week study period (mean change -7.28, SD = 5.91, Cohen’s d = -1.23, p < 0.001); 43% responded to the intervention. In bivariate regression models, the amount of symptom decline did not significantly differ by age, educational level, employment, or baseline level of anxiety symptoms but did significantly differ across several characteristics tested (Table 2). Non-Hispanic Blacks, those who were single, and those with severe levels of baseline depressive symptoms demonstrated significantly greater amounts of decline in depressive symptoms than Non-Hispanic Whites, those who were married/partnered/cohabiting, and those with moderate levels of baseline depressive symptoms, respectively. Females, participants identifying as sexual minorities, those with private or government-based health insurance, those who used the program on at least 4 of the 8 weeks, and those in concurrent mental health treatment demonstrated significantly lower amounts of decline in depressive symptoms than males, participants identifying as heterosexuals, those with no health insurance or who declined to answer the health insurance question, those who used the program on less than 4 of the 8 weeks, and those not in concurrent mental health treatment, respectively. After removing race/ethnicity, insurance, and employment from consideration because of multicollinearity, the final multiple regression model indicated significant differences in the amount of decrease in depressive symptoms at Week 8 by sexual orientation, marital status, baseline depressive symptom severity category, and concurrent mental health treatment (Table 3) in the same directions as each bivariate model described above (in addition, divorced individuals had greater amounts of depressive symptom declines as compared with married/partnered, cohabiting). Although a few bivariate analyses had varying levels of significance across ITT and PP analyses (Supplemental Table 1), the multiple regression model largely confirmed the full sample findings (Supplemental Table 2).

Table 2 Unadjusted bivariate linear regression models of characteristics associated with change scores from baseline to Week 8 in depressive symptoms among those with clinically elevated levels of baseline depressive symptoms: PHQ-8 ≥ 10
Table 3 Adjusted linear regression model (Multiple Regression) of change scores from baseline to Week 8 in depressive symptoms among those with clinically elevated levels of baseline depressive symptoms: PHQ-8 ≥ 10

Anxiety symptoms

Average GAD-7 scores for each subgroup of interest significantly decreased over the course of the study. In the subgroup with clinically elevated anxiety symptoms at baseline, GAD-7 scores significantly decreased over the 8-week study period (mean change -7.45, SD = 5.99, Cohen’s d = -1.24, p < 0.001); 58% responded to the intervention. The amount of symptom decline did not significantly differ by use of W-MA-02 at least 4 of 8 weeks, age, sex at birth, educational level, or employment, but did significantly differ across several characteristics tested (Table 4). Non-Hispanic Blacks, those who were single, those with severe levels of baseline depressive symptoms, and those with severe levels of anxiety symptoms demonstrated greater amounts of decline in anxiety symptoms than Non-Hispanic Whites, those who were married/partnered/cohabiting, those with minimal levels of baseline depressive symptoms, and those with moderate levels of baseline anxiety symptoms, respectively. Participants identifying as sexual minorities, those with private or government-based health insurance, and those in concurrent mental health treatment demonstrated lower amounts of decline in anxiety symptoms than heterosexuals, those with no health insurance or who declined to answer the health insurance question, and those without concurrent mental health treatment, respectively. PP analyses largely confirmed the full sample findings, with the exception of concurrent mental health treatment reaching marginal significance (p = 0.05, Supplementary Table 3).

Table 4 Unadjusted bivariate linear regression models of change scores from baseline to Week 8 in anxiety symptoms among those with clinically elevated levels of baseline anxiety symptoms: GAD-7 ≥ 10

After removing race/ethnicity, employment, and insurance status from consideration due to multicollinearity, the final multiple regression model indicated significant differences in the amount of decrease in anxiety symptoms at Week 8 by sexual orientation, marital status, baseline anxiety symptom severity category, and concurrent mental health treatment (Table 5) in the same direction as those previously described in each bivariate regression model. PP analyses largely confirmed the full sample findings (Supplemental Tables 3 and 4).

Table 5 Adjusted Linear Regression Model (Multiple Regression) of change scores from baseline to Week 8 in anxiety symptoms among those with clinically elevated levels of baseline anxiety symptoms: GAD-7 ≥ 10

Discussion

In this single-arm exploratory trial of W-MA-02, an intervention employing an NLP-supported relational agent, the objective was to explore the magnitude of the reduction in self-reported depressive (PHQ-8) and anxiety (GAD-7) symptoms between baseline and Week 8 EOI as well as the associative relationships between demographic and clinical characteristics and each outcome, among those that had elevated scores on these measures at baseline. On average, study participants with elevated depressive symptoms at baseline experienced significant declines in self-reported depressive symptoms across the intervention period; study participants with elevated anxiety symptoms at baseline experienced significant declines in self-reported anxiety symptoms across the intervention period. In addition, analyses revealed significant associations between depressive and anxiety symptom changes and demographic and clinical characteristics in both bivariate and, in many cases, multiple regression models. The high assessment completion and W-MA-02 utilization rates are notable given that attrition remains a perennial problem in mobile health app studies [44], with one recent systematic review of depression DMHI app RCTs suggesting that dropout rates may range from 26 to 48%, even when participant reimbursement was included in the analysis [45].

In the absence of a control group, it is difficult to draw conclusions about the efficacy of W-MA-02, especially given evidence that approximately one-third of untreated major depression or generalized anxiety disorder cases may spontaneously remit within 6 months [46,47,48]. However, in both the clinically elevated depressive and anxiety groups, we noted significant decreases in self-reported symptoms at Week 8. The mean level of change was greater than seven points on both change in PHQ-8 among those with clinically elevated depressive symptoms at baseline as well as change in GAD-7 among those with clinically elevated anxiety symptoms at baseline. This is noteworthy given that these outcomes were two and three points greater, respectively, than the typically accepted change score cutoff of five and four points to define a clinically significant response on the PHQ-8 and GAD-7 measures, respectively [49, 50].

Our adjusted analyses did not find significant associations between sex at birth, age, educational background, employment, and clinical outcomes, which is consistent with prior psychotherapeutic and DMHI meta-analyses and systematic reviews [8, 51,52,53]. However, several studies of internet-based CBT (iCBT) have found better depression outcomes for females [26, 54] and greater probability of non-response in anxiety disorders and depression for males [55]. Our results may support the finding that age and sex at birth may be more important to the epidemiology of depression than to prognosis with an intervention [53].

The associations that were found in adjusted analyses between significant improvements in depression and anxiety symptoms and sexual orientation, concurrent mental health treatment, marital status, and baseline symptom severity suggest that these may be demographic predictors of intervention response to DMHIs employing relational agents. However, a series of purpose-designed follow-up studies to test each of these putative predictors of clinical outcome in a randomized clinical trial (RCT) design is needed before any definitive conclusions can be made. Given that LGBTQ individuals have demonstrated higher rates of depression and anxiety [56] that are further compounded by access barriers and stigma [31], DMHIs may present an unique opportunity to provide safe and accessible support to this group. LGBTQ individuals are also more likely to seek health information with digital resources than their non-LGBTQ counterparts [57] and a high proportion of people accessing digital text-based support services identify as belonging to sexual and gender minorities [58]. Thus, further studies among this group should be a research priority. Finally, our study noted significant differences in the association of concurrent care involvement with changes in anxiety and depression symptoms, but this concurrent care was received independently and therefore no details on its characteristics (e.g., description of intervention, frequency, fidelity to psychotherapeutic treatment model, modality of delivery, dose, and route of psychopharmacological agents) were available. Future studies should test if this association holds in a study using a prospective, RCT design as well as in a true blended care model that tightly integrates the relational-agent-delivered DMHI with human support.

Further, historical findings regarding the link between marital status and psychotherapy outcomes have been decidedly mixed [59], but in this study we found that single or divorced/separated/widowed individuals in the elevated depression group and single individuals in the elevated anxiety group experienced more improved outcomes compared to their married/partnered/cohabiting peers in adjusted analyses. These results are somewhat consistent with a study of an online cognitive behavior intervention compared to a waiting list control that demonstrated greater improvements in depression outcomes at 4 months among separated, widowed, or divorced individuals as compared to married participants [60]. Due to the importance of social support in mental health treatment [51], a more detailed assessment of factors such as relationship quality may better explain the relationship between marital status and clinical outcomes [51].

Finally, we found that greater baseline depression and anxiety symptom severity was associated with greater symptom improvements in the elevated depression and anxiety groups, respectively. It is notable that 50% of the elevated depression group reported either moderately severe or severe levels of baseline symptoms and 44% of the elevated anxiety groups reported severe symptoms. DMHIs have often been considered as interventions for mild to moderate symptoms [61], but these findings may suggest promise for populations with a broader range of severity. While it is true that those with elevated symptoms have more room to improve [8], we nonetheless believe that this finding suggests that there may be clinical utility for programs like this beyond mild to moderate severity levels with adequate safety checks in place.

Strengths and limitations

The present study adds to the extant mental health outcomes literature by extending the investigation of important demographic and clinical characteristics in DMHI interventions to one delivered specifically via a relational agent. To our knowledge, this is the first study that has been expressly designed for this purpose. The recruitment of a demographically diverse naturalistic sample allowed a broad assessment of characteristics that have been traditionally underexplored in digital health studies, such as racial/ethnic membership, sexual orientation, employment status, and health insurance status. There has been increasing attention to the lack of diversity in marginalized and underserved participants involved in DMHI studies [31], and this study notably recruited much higher proportions of sexual minorities [62] and non-Hispanic Black [63] individuals than are found in the U.S. population. Despite a generally demographically diverse sample, it should be noted that this sample was about 75% female sex at birth, a proportion that is not unusual for health behavior studies [64]. Greater recruitment based on gender identity and participation of males would improve sample diversity in future studies to allow for greater generalization of findings.

Several limitations must be noted. First, the study was not an RCT and did not include a control group, so there may have been additional confounders or other factors affecting the efficacy of the intervention. For example, researchers have identified user expectancies related to digital device use (a.k.a.“digital placebo effect”) that may affect mental health outcomes [65]. Subgroup differences, some of which had small sample sizes so limited power to detect such differences, observed in the present study are preliminary and should ideally be tested in RCTs with larger sample sizes and a priori power calculations. Second, because the intervention was delivered over an 8-week period, the longer-term durability of any effects is not known. Changes that did occur might be explained by measurement reactivity, as small effects have been observed in health behavior change [66] and digital health studies [67]. Symptom improvements may also be explained in part by spontaneous remission [46, 47, 68] or other participant characteristics that were not assessed as part of the study. Third, symptoms and concurrent treatments were assessed via self-report and there was no independent confirmation of participants' clinical presentation or therapeutic involvements. Although about half of the sample reported receiving concurrent treatment (consistent with national findings that only about half of those in need receive mental health care [69]), we do not know the nature, duration, or intensity of psychotherapies or psychotropic medications received or if participants engaged in other interventions (i.e., not delivered by a therapist or receipt of medication) In addition, although we collected information about both psychotherapy and psychotropic medications separately, the sample size was not sufficient to make comparisons between these modalities and so they were pooled for analysis. Future studies would ideally collect detailed information about the psychotherapies being received, as well as full details about concurrent psychotropic medications. Fourth, studies of online interventions can be affected by certain types of data quality issues (e.g. fraudulent participants, distraction, inattention, or intoxication during use) that may benefit from inclusion of quality checks in future experimental design [70, 71]. Fifth, although demographically diverse for a naturalistic sample, the study population was nonetheless majority college or advanced degree holders and employed. Additional studies targeting the mental health needs of those with lower levels of educational attainment and/or who may be facing under- or unemployment are needed. Finally, our definition of “per protocol” as using W-MA-02 in at least 4 of 8 study weeks and completing EOS depressive and anxiety symptom assessments may not have been granular enough to account fully for achieved outcomes or to reveal unique patterns of disengagement. Exploring alternative engagement metrics would enhance our understanding of relational agent usage and the relationship of outcomes to user characteristics.

Conclusions

The present study suggests early promise for a DMHI that employs an easily scalable NLP-based relational agent incorporating elements of CBT, DBT, and IPT to a demographically diverse naturalistic sample experiencing self-reported symptoms of depression and/or anxiety. Although exploratory in nature, this study reveals intriguing areas for further research that can help optimize the unique role DMHIs that include relational agents can play in filling a mental health service gap that has only widened in the wake of the COVID-19 pandemic.

Availability of data and materials

The datasets underlying the results of this study are confidential and include proprietary commercial information. Requests can be submitted to the corresponding author and will be subject to review and approval.

Abbreviations

ACT:

Acceptance and commitment therapy

AE/SAE:

Adverse and serious adverse events

CBT:

Cognitive behavioral therapy

DASS-21:

Depression Anxiety Stress Scale-21

DBT:

Dialectical Behavior Therapy

DMHI:

Digital Mental Health Intervention

EDC:

Electronic Data Capture

EOI:

End of intervention

EOS:

End of study

GAD-7:

Generalized Anxiety Disorder

iCBT:

Internet-based cognitive behavior therapy

IPT:

Interpersonal Psychotherapy

ITT:

Intention to treat

LGBTQ:

Lesbian, gay, bisexual, transgender, queer or questioning

MBCT:

Mindfulness-based cognitive therapy

NLP:

Natural language processing

PHQ-8:

Patient Health Questionnaire

PP:

Per protocol

RCT:

Randomized controlled trials

U.S.:

United States

References

  1. GBD 2019 Mental Disorders Collaborators. Global, regional, and national burden of 12 mental disorders in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Psychiatry. 2022;9(2):137–50. https://0-doi-org.brum.beds.ac.uk/10.1016/S2215-0366(21)00395-3.

    Article  PubMed Central  Google Scholar 

  2. COVID-19 Mental Disorders Collaborators. Global prevalence and burden of depressive and anxiety disorders in 204 countries and territories in 2020 due to the COVID-19 pandemic. Lancet. 2021;398(10312):1700–12. https://0-doi-org.brum.beds.ac.uk/10.1016/S0140-6736(21)02143-7.

    Article  Google Scholar 

  3. Evans AC, Bufka LF. The critical need for a population health approach: addressing the nation’s behavioral health during the COVID-19 pandemic and beyond. Prev Chronic Dis. 2020;17:E79. https://0-doi-org.brum.beds.ac.uk/10.5888/pcd17.200261.

    Article  PubMed  PubMed Central  Google Scholar 

  4. World Health Organization. The impact of COVID-19 on mental, neurological and substance use services: results of rapid assessment. Geneva: World Health Organization; 2020. Available from: https://www.who.int/publications-detail-redirect/978924012455. Cited 14 Dec 2022.

  5. Philippe TJ, Sikder N, Jackson A, Koblanski ME, Liow E, Pilarinos A, et al. Digital health interventions for delivery of mental health care: systematic and comprehensive meta-review. JMIR Ment Health. 2022;9(5):e35159. https://0-doi-org.brum.beds.ac.uk/10.2196/35159.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Sorkin DH, Janio EA, Eikey EV, Schneider M, Davis K, Schueller SM, et al. Rise in use of digital mental health tools and technologies in the United States during the COVID-19 pandemic: survey study. J Med Internet Res. 2021;23(4):e26994. https://0-doi-org.brum.beds.ac.uk/10.2196/26994.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Lattie EG, Stiles-Shields C, Graham AK. An overview of and recommendations for more accessible digital mental health services. Nat Rev Psychol. 2022;1:87–100. https://0-doi-org.brum.beds.ac.uk/10.1038/s44159-021-00003-1.

    Article  Google Scholar 

  8. Moshe I, Terhorst Y, Philippi P, Domhardt M, Cuijpers P, Cristea I, et al. Digital interventions for the treatment of depression: a meta-analytic review. Psychol Bull. 2021;147(8):749–86. https://0-doi-org.brum.beds.ac.uk/10.1037/bul0000334.

    Article  PubMed  Google Scholar 

  9. Pauley D, Cuijpers P, Papola D, Miguel C, Karyotaki E. Two decades of digital interventions for anxiety disorders: a systematic review and meta-analysis of treatment effectiveness. Psychol Med. 2021;53:1–13. https://0-doi-org.brum.beds.ac.uk/10.1017/S0033291721001999.

    Article  Google Scholar 

  10. Serrano-Ripoll MJ, Zamanillo-Campos R, Fiol-DeRoque MA, Castro A, Ricci-Cabello I. Impact of smartphone app–based psychological interventions for reducing depressive symptoms in people with depression: systematic literature review and meta-analysis of randomized controlled trials. JMIR MHealth UHealth. 2022;10(1):e29621. https://0-doi-org.brum.beds.ac.uk/10.2196/29621.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Boucher EM, Harake NR, Ward HE, Stoeckl SE, Vargas J, Minkel J, et al. Artificially intelligent chatbots in digital mental health interventions: a review. Expert Rev Med Devices. 2021;18(sup1):37–49. https://0-doi-org.brum.beds.ac.uk/10.1080/17434440.2021.2013200.

    Article  CAS  PubMed  Google Scholar 

  12. Parmar P, Ryu J, Pandya S, Sedoc J, Agarwal S. Health-focused conversational agents in person-centered care: a review of apps. NPJ Digit Med. 2022;5(1):21. https://0-doi-org.brum.beds.ac.uk/10.1038/s41746-022-00560-6.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Vaidyam AN, Wisniewski H, Halamka JD, Kashavan MS, Torous JB. Chatbots and conversational agents in mental health: a review of the psychiatric landscape. Can J Psychiatr. 2019;64(7):456–64. https://0-doi-org.brum.beds.ac.uk/10.1177/0706743719828977.

    Article  Google Scholar 

  14. Gaffney H, Mansell W, Tai S. Conversational agents in the treatment of mental health problems: mixed-method systematic review. JMIR Ment Health. 2019;6(10):e14166. https://0-doi-org.brum.beds.ac.uk/10.2196/14166.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Beatty C, Malik T, Meheli S, Sinha C. Evaluating the therapeutic alliance with a free-text CBT conversational agent (Wysa): a mixed-methods study. Front Digit Health. 2022;4:847991. https://0-doi-org.brum.beds.ac.uk/10.3389/fdgth.2022.847991.

    Article  PubMed  PubMed Central  Google Scholar 

  16. Schroeder J, Wilkes C, Rowan K, Toledo A, Paradiso A, Czerwinski M, et al. Pocket skills: a conversational mobile web app to support dialectical behavioral therapy. In: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. New York, NY, USA: 2018. https://0-doi-org.brum.beds.ac.uk/10.1145/3173574.3173972

  17. Fulmer R, Joerin A, Gentile B, Lakerink L, Rauws M. Using psychological artificial intelligence (Tess) to relieve symptoms of depression and anxiety: randomized controlled trial. JMIR Ment Health. 2018;5(4):e64. https://0-doi-org.brum.beds.ac.uk/10.2196/mental.9782.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Denecke K, Abd-Alrazaq A, Househ M. Artificial intelligence for chatbots in mental health: opportunities and challenges. In: Househ M, Borycki E, Kushniruk A, editors. Multiple Perspectives on Artificial Intelligence in Healthcare. Cham: Springer International Publishing; 2021. p. 115–28. https://0-doi-org.brum.beds.ac.uk/10.1007/978-3-030-67303-1_10.

  19. Abd-Alrazaq AA, Rababeh A, Alajlani M, Bewick BM, Househ M. Effectiveness and safety of using chatbots to improve mental health: systematic review and meta-analysis. J Med Internet Res. 2020;22(7):e16021. https://0-doi-org.brum.beds.ac.uk/10.2196/16021.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Bickmore T, Gruber A. Relational agents in clinical psychiatry. Harv Rev Psychiatry. 2010;18(2):119–30. https://0-doi-org.brum.beds.ac.uk/10.3109/10673221003707538.

    Article  PubMed  Google Scholar 

  21. Chaudhry BM, Islam A. A mobile application-based relational agent as a health professional for COVID-19 patients: design, approach, and implications. Int J Environ Res Public Health. 2022;19(21):13794. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph192113794.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Darcy A, Daniels J, Salinger D, Wicks P, Robinson A. Evidence of human-level bonds established with a digital conversational agent: cross-sectional, retrospective observational study. JMIR Form Res. 2021;5(5):e27868. https://0-doi-org.brum.beds.ac.uk/10.2196/27868.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Wampold BE. How important are the common factors in psychotherapy? An update World Psychiatry. 2015;14(3):270–7. https://0-doi-org.brum.beds.ac.uk/10.1002/wps.20238.

    Article  PubMed  Google Scholar 

  24. Hedman E, Andersson E, Lekander M, Ljótsson B. Predictors in internet-delivered cognitive behavior therapy and behavioral stress management for severe health anxiety. Behav Res Ther. 2015;64:49–55. https://0-doi-org.brum.beds.ac.uk/10.1016/j.brat.2014.11.009.

    Article  PubMed  Google Scholar 

  25. El Alaoui S, Ljótsson B, Hedman E, Kaldo V, Andersson E, Rück C, et al. Predictors of symptomatic change and adherence in internet-based cognitive behaviour therapy for social anxiety disorder in routine psychiatric care. PLoS ONE. 2015;10(4):e0124258. https://0-doi-org.brum.beds.ac.uk/10.1371/journal.pone.0124258.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Donker T, Batterham PJ, Warmerdam L, Bennett K, Bennett A, Cuijpers P, et al. Predictors and moderators of response to internet-delivered interpersonal psychotherapy and cognitive behavior therapy for depression. J Affect Disord. 2013;151(1):343–51. https://0-doi-org.brum.beds.ac.uk/10.1016/j.jad.2013.06.020.

    Article  CAS  PubMed  Google Scholar 

  27. Catarino A, Bateup S, Tablan V, Innes K, Freer S, Richards A, et al. Demographic and clinical predictors of response to internet-enabled cognitive–behavioural therapy for depression and anxiety. BJPsych Open. 2018;4(5):411–8. https://0-doi-org.brum.beds.ac.uk/10.1192/bjo.2018.57.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Edmonds M, Hadjistavropoulos HD, Schneider LH, Dear BF, Titov N. Who benefits most from therapist-assisted internet-delivered cognitive behaviour therapy in clinical practice? Predictors of symptom change and dropout. J Anxiety Disord. 2018;54:24–32. https://0-doi-org.brum.beds.ac.uk/10.1016/j.janxdis.2018.01.003.

    Article  CAS  PubMed  Google Scholar 

  29. Warmerdam L, Van Straten A, Twisk J, Cuijpers P. Predicting outcome of internet-based treatment for depressive symptoms. Psychother Res. 2013;23(5):559–67. https://0-doi-org.brum.beds.ac.uk/10.1080/10503307.2013.807377.

    Article  PubMed  Google Scholar 

  30. Anderson-Lewis C, Darville G, Mercado RE, Howell S, Di Maggio S. mHealth technology use and implications in historically underserved and minority populations in the United States: systematic literature review. JMIR MHealth UHealth. 2018;6(6):e128. https://0-doi-org.brum.beds.ac.uk/10.2196/mhealth.8383.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Schueller SM, Hunter JF, Figueroa C, Aguilera A. Use of digital mental health for marginalized and underserved populations. Curr Treat Options Psychiatry. 2019;6(3):243–55. https://0-doi-org.brum.beds.ac.uk/10.1007/s40501-019-00181-z.

    Article  Google Scholar 

  32. Mohr DC, Kwasny MJ, Meyerhoff J, Graham AK, Lattie EG. The effect of depression and anxiety symptom severity on clinical outcomes and app use in digital mental health treatments: meta-regression of three trials. Behav Res Ther. 2021;147:103972. https://0-doi-org.brum.beds.ac.uk/10.1016/j.brat.2021.103972.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Fiol-DeRoque MA, Serrano-Ripoll MJ, Jiménez R, Zamanillo-Campos R, Yáñez-Juan AM, Bennasar-Veny M, et al. A mobile phone–based intervention to reduce mental health problems in health care workers during the COVID-19 pandemic (PsyCovidApp): randomized controlled trial. JMIR MHealth UHealth. 2021;9(5):e27039. https://0-doi-org.brum.beds.ac.uk/10.2196/27039.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Fitzpatrick KK, Darcy A, Vierhile M. Delivering cognitive behavior therapy to young adults with symptoms of depression and anxiety using a fully automated conversational agent (Woebot): a randomized controlled trial. JMIR Ment Health. 2017;4(2):e19. https://0-doi-org.brum.beds.ac.uk/10.2196/mental.7785.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Prochaska JJ, Vogel EA, Chieng A, Kendra M, Baiocchi M, Pajarito S, et al. A therapeutic relational agent for reducing problematic substance use (Woebot): development and usability study. J Med Internet Res. 2021;23(3):e24850. https://0-doi-org.brum.beds.ac.uk/10.2196/24850.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Nicol G, Wang R, Graham S, Dodd S, Garbutt J. Chatbot-delivered cognitive behavioral therapy in adolescents with depression and anxiety during the COVID-19 pandemic: feasibility and acceptability study. JMIR Form Res. 2022;6(11):e40242. https://0-doi-org.brum.beds.ac.uk/10.2196/40242.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Darcy A, Beaudette A, Chiauzzi E, Daniels J, Goodwin K, Mariano TY, et al. Anatomy of a Woebot® (WB001): agent guided CBT for women with postpartum depression. Expert Rev Med Devices. 2022;19(4):287–301. https://0-doi-org.brum.beds.ac.uk/10.1080/17434440.2022.2075726.

    Article  CAS  PubMed  Google Scholar 

  38. Suharwardy S, Ramachandran M, Leonard S, Gunaseelan A, Robinson A, Darcy A, et al. Effect of an automated conversational agent on postpartum mental health: a randomized, controlled trial. Poster presented at: Society for Maternal-Fetal Medicine's 40th Annual Pregnancy Meeting; 2020 Feb 3–8; Grapevine, TX.

  39. Prochaska JJ, Vogel EA, Chieng A, Baiocchi M, Maglalang DD, Pajarito S, et al. A randomized controlled trial of a therapeutic relational agent for reducing substance misuse during the COVID-19 pandemic. Drug Alcohol Depend. 2021;227:108986. https://0-doi-org.brum.beds.ac.uk/10.1016/j.drugalcdep.2021.108986.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Castor EDC. Castor Electronic Data Capture. 2019. Available from: https://castoredc.com.

  41. Kroenke K, Strine TW, Spitzer RL, Williams JBW, Berry JT, Mokdad AH. The PHQ-8 as a measure of current depression in the general population. J Affect Disord. 2009;114(1–3):163–73. https://0-doi-org.brum.beds.ac.uk/10.1016/j.jad.2008.06.026.

    Article  PubMed  Google Scholar 

  42. Spitzer RL, Kroenke K, Williams JBW, Löwe B. A brief measure for assessing generalized anxiety disorder: the GAD-7. Arch Intern Med. 2006;166(10):1092–7. https://0-doi-org.brum.beds.ac.uk/10.1001/archinte.166.10.1092.

    Article  PubMed  Google Scholar 

  43. R Core Team. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2021. Available from: https://www.R-project.org/.

  44. Amagai S, Pila S, Kaat AJ, Nowinski CJ, Gershon RC. Challenges in participant engagement and retention using mobile health apps: literature review. J Med Internet Res. 2022;24(4): e35120. https://0-doi-org.brum.beds.ac.uk/10.2196/35120.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Torous J, Lipschitz J, Ng M, Firth J. Dropout rates in clinical trials of smartphone apps for depressive symptoms: a systematic review and meta-analysis. J Affect Disord. 2020;263:413–9. https://0-doi-org.brum.beds.ac.uk/10.1016/j.jad.2019.11.167.

    Article  PubMed  Google Scholar 

  46. Whiteford HA, Harris MG, McKeon G, Baxter A, Pennell C, Barendregt JJ, et al. Estimating remission from untreated major depression: a systematic review and meta-analysis. Psychol Med. 2013;43(8):1569–85. https://0-doi-org.brum.beds.ac.uk/10.1017/S0033291712001717.

    Article  CAS  PubMed  Google Scholar 

  47. Wittchen HU, Zhao S, Kessler RC, Eaton WW. DSM-III-R generalized anxiety disorder in the National Comorbidity Survey. Arch Gen Psychiatry. 1994;51(5):355–64. https://0-doi-org.brum.beds.ac.uk/10.1001/archpsyc.1994.03950050015002.

    Article  CAS  PubMed  Google Scholar 

  48. Wittchen HU, Hoyer J. Generalized anxiety disorder: nature and course. J Clin Psychiatry. 2001;62(Suppl 11):15–9 (discussion 20-1).

    PubMed  Google Scholar 

  49. Kroenke K, Spitzer RL. The PHQ-9: a new depression diagnostic and severity measure. Psychiatr Ann. 2002;32(9):509–15. https://0-doi-org.brum.beds.ac.uk/10.3928/0048-5713-20020901-06.

    Article  Google Scholar 

  50. Toussaint A, Hüsing P, Gumz A, Wingenfeld K, Härter M, Schramm E, Löwe B. Sensitivity to change and minimal clinically important difference of the 7-item Generalized Anxiety Disorder Questionnaire (GAD-7). J Affect Disord. 2020;265:395–401.

    Article  PubMed  Google Scholar 

  51. Tanguay-Sela M, Rollins C, Perez T, Qiang V, Golden G, Tunteng JF, et al. A systematic meta-review of patient-level predictors of psychological therapy outcome in major depressive disorder. J Affect Disord. 2022;317:307–18. https://0-doi-org.brum.beds.ac.uk/10.1016/j.jad.2022.08.041.

    Article  PubMed  Google Scholar 

  52. Amati F, Banks C, Greenfield G, Green J. Predictors of outcomes for patients with common mental health disorders receiving psychological therapies in community settings: a systematic review. J Public Health (Oxf). 2018;40(3):e375–87. https://0-doi-org.brum.beds.ac.uk/10.1093/pubmed/fdx168.

    Article  CAS  PubMed  Google Scholar 

  53. Buckman JEJ, Saunders R, Stott J, Arundell LL, O’Driscoll C, Davies MR, et al. Role of age, gender and marital status in prognosis for adults with depression: an individual patient data meta-analysis. Epidemiol Psychiatr Sci. 2021;30:e42. https://0-doi-org.brum.beds.ac.uk/10.1017/S2045796021000342.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. Spek V, Nyklícek I, Cuijpers P, Pop V. Predictors of outcome of group and internet-based cognitive behavior therapy. J Affect Disord. 2008;105(1–3):137–45. https://0-doi-org.brum.beds.ac.uk/10.1016/j.jad.2007.05.001.

    Article  PubMed  Google Scholar 

  55. Rozental A, Andersson G, Carlbring P. In the absence of effects: an individual patient data meta-analysis of non-response and its predictors in internet-based cognitive behavior therapy. Front Psychol. 2019;10:589. https://0-doi-org.brum.beds.ac.uk/10.3389/fpsyg.2019.00589.

    Article  PubMed  PubMed Central  Google Scholar 

  56. Plöderl M, Tremblay P. Mental health of sexual minorities. A systematic review. Int Rev Psychiatry. 2015;27(5):367–85. https://0-doi-org.brum.beds.ac.uk/10.3109/09540261.2015.1083949.

    Article  PubMed  Google Scholar 

  57. Bowen D, Jabson J, Kamen C. mHealth: an avenue for promoting health among sexual and gender minority populations? Mhealth. 2016;2(9):36. https://0-doi-org.brum.beds.ac.uk/10.21037/mhealth.2016.09.01.

    Article  PubMed  PubMed Central  Google Scholar 

  58. Pisani AR, Gould MS, Gallo C, Ertefaie A, Kelberman C, Harrington D, et al. Individuals who text crisis text line: key characteristics and opportunities for suicide prevention. Suicide Life Threat Behav. 2022;52(3):567–82. https://0-doi-org.brum.beds.ac.uk/10.1111/sltb.12872.

    Article  PubMed  PubMed Central  Google Scholar 

  59. Chen CK, Nehrig N, Chou LJ, McGowan R, Guyton AF, Mustafiz F, et al. Patient extratherapeutic interpersonal problems and response to psychotherapy for depression. Am J Psychother. 2019;72(4):101–22. https://0-doi-org.brum.beds.ac.uk/10.1176/appi.psychotherapy.20190005.

    Article  PubMed  Google Scholar 

  60. Button KS, Wiles NJ, Lewis G, Peters TJ, Kessler D. Factors associated with differential response to online cognitive behavioural therapy. Soc Psychiatry Psychiatr Epidemiol. 2012;47(5):827–33. https://0-doi-org.brum.beds.ac.uk/10.1007/s00127-011-0389-1.

    Article  PubMed  Google Scholar 

  61. Torous J, Bucci S, Bell IH, Kessing LV, Faurholt-Jepsen M, Whelan P, et al. The growing field of digital psychiatry: current evidence and the future of apps, social media, chatbots, and virtual reality. World Psychiatry. 2021;20(3):318–35. https://0-doi-org.brum.beds.ac.uk/10.1002/wps.20883.

    Article  PubMed  PubMed Central  Google Scholar 

  62. Jones JM. LGBT Identification in U.S. ticks up to 7.1%. 2022. Available from: https://news.gallup.com/poll/389792/lgbt-identification-ticks-up.aspx. Cited 14 Dec 2022.

  63. United States Census Bureau. 2020 Census illuminates racial and ethnic composition of the country. 2021. Available from: https://www.census.gov/library/stories/2021/08/improved-race-ethnicity-measures-reveal-united-states-population-much-more-multiracial.html. Cited 14 Dec 2022.

  64. Ryan J, Lopian L, Le B, Edney S, Van Kessel G, Plotnikoff R, et al. It’s not raining men: a mixed-methods study investigating methods of improving male recruitment to health behaviour research. BMC Public Health. 2019;19:814. https://0-doi-org.brum.beds.ac.uk/10.1186/s12889-019-7087-4.

    Article  PubMed  PubMed Central  Google Scholar 

  65. Torous J, Firth J. The digital placebo effect: mobile mental health meets clinical psychiatry. Lancet Psychiatry. 2016;3(2):100–2. https://0-doi-org.brum.beds.ac.uk/10.1016/S2215-0366(15)00565-9.

    Article  PubMed  Google Scholar 

  66. Rodrigues AM, O’Brien N, French DP, Glidewell L, Sniehotta FF. The question–behavior effect: genuine effect or spurious phenomenon? A systematic review of randomized controlled trials with meta-analyses. Health Psychol. 2015;34:61–78. https://0-doi-org.brum.beds.ac.uk/10.1037/hea0000104.

    Article  PubMed  Google Scholar 

  67. König LM, Allmeta A, Christlein N, Van Emmenis M, Sutton S. A systematic review and meta-analysis of studies of reactivity to digital in-the-moment measurement of health behaviour. Health Psychol Rev. 2022;16(4):551–75. https://0-doi-org.brum.beds.ac.uk/10.1080/17437199.2022.2047096.

    Article  PubMed  Google Scholar 

  68. Hengartner MP. Is there a genuine placebo effect in acute depression treatments? A reassessment of regression to the mean and spontaneous remission. BMJ Evid Based Med. 2020;25(2):46–8. https://0-doi-org.brum.beds.ac.uk/10.1136/bmjebm-2019-111161.

    Article  PubMed  Google Scholar 

  69. Substance Abuse and Mental Health Services Administration. Highlights for the 2020 National Survey on Drug Use and Health. 2021. Available from: https://www.samhsa.gov/data/sites/default/files/2021-10/2020_NSDUH_Highlights.pdf. Cited 11 Jan 2023.

  70. Lee CT, Palacios J, Richards D, Hanlon AK, Lynch K, Harty S, Claus N, Swords L, O’Keane V, Stephan KE, Gillan CM. The precision in psychiatry (PIP) study: testing an internet-based methodology for accelerating research in treatment prediction and personalisation. BMC Psychiatry. 2023;23(1):25. https://0-doi-org.brum.beds.ac.uk/10.1186/s12888-022-04462-5.PMID:36627607;PMCID:PMC9832676.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  71. Glazer JV, MacDonnell K, Frederick C, Ingersoll K, Ritterband LM. Liar! Liar! Identifying eligibility fraud by applicants in digital health research. Internet Interv. 2021;25:100401.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We would like to thank Robbert Zusterzeel, M.D., Ph.D., MPH, Alison Darcy, Ph.D., and Megan Flom, Ph.D. for their helpful comments, as well as Adam Platt, M.Sc. for his assistance in preparing the manuscript and figures.

Funding

This study was funded by Woebot Health.

Author information

Authors and Affiliations

Authors

Contributions

EC, AW, TYM, SP, AR, AKQ, and VFH contributed to the conceptualization, study design, implementation, and manuscript preparation of this study. AW and VFH reviewed the study design and provided the statistical analysis plan. AW conducted the statistical analysis. All authors reviewed, edited, and approved the final manuscript.

Corresponding author

Correspondence to Valerie Forman-Hoffman.

Ethics declarations

Ethics approval and consent to participate

This study was conducted in accordance with the principles of the Declaration of Helsinki. This study and all study procedures were approved by the WIRB-Copernicus IRB Group (WCG) institutional review board [Protocol #20216751] on January 20, 2022. All participants provided informed consent.

Consent for publication

Not applicable.

Competing interests

Authors EC, AW, TYM, SP, AR, AKQ, and VFH were employees of Woebot Health at the time of submission. TYM is an employee of the US Department of Veterans Affairs. The views expressed in this article are those of the authors and do not necessarily reflect the position or policy of the Department of Veterans Affairs or the United States government.

DIsclaimer

Woebot for Mood & Anxiety (W-MA-02) is an investigational medical device. It has not been evaluated, cleared, or approved by the FDA.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chiauzzi, E., Williams, A., Mariano, T.Y. et al. Demographic and clinical characteristics associated with anxiety and depressive symptom outcomes in users of a digital mental health intervention incorporating a relational agent. BMC Psychiatry 24, 79 (2024). https://0-doi-org.brum.beds.ac.uk/10.1186/s12888-024-05532-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s12888-024-05532-6

Keywords