Enhancing Depression Detection in Social Media via Advanced User Profiling and Fine-Grained Age Groups

Sonia Shahzadi1, Sanjiv Kumar1, Harsh Mehta1
1Indian Institute of Technology (IIT) Delhi
DOI: https://doi.org/10.71448/bcds2341-3
Published: 30/04/2023
Cite this article as: Sonia Shahzadi, Sanjiv Kumar, Harsh Mehta. Enhancing Depression Detection in Social Media via Advanced User Profiling and Fine-Grained Age Groups. Bulletin of Computer and Data Sciences, Volume 4 Issue 1. Page: 24-37.

Abstract

Depression remains a severe global mental health crisis, affecting over 264 million people worldwide. Recent research has demonstrated that linguistic patterns in social media posts can serve as reliable markers for depression detection. Titla-Tlatelpa et al. [1] introduced a profile-based sentiment-aware approach that specialized classifiers according to users’ demographic traits and incorporated sentiment polarity through a novel Bag of Polarities (BoP) representation. However, their work relied on simple lexicon-based methods for profiling, which limited its potential. This paper addresses these limitations by proposing an enhanced profiling system that leverages state-of-the-art author profiling models and introduces fine-grained age categorization. Our approach replaces lexicon-based profiling with transformer-based models for more accurate gender and age prediction, and introduces a multi-class age classification system (teen, young adult, adult, middle-aged, senior) instead of binary young/senior categorization. Experimental results on Reddit and Twitter datasets show significant improvements, with up to 9.9% enhancement in F1-score on the Reddit dataset compared to the original approach, while maintaining interpretability. Our work demonstrates that accurate demographic profiling is crucial for the profile-based paradigm in mental health assessment.

Keywords: depression detection, social media text analysis, author profiling, transformer-based demographic prediction

Abstract

Depression remains a severe global mental health crisis, affecting over 264 million people worldwide. Recent research has demonstrated that linguistic patterns in social media posts can serve as reliable markers for depression detection. Titla-Tlatelpa et al. [1] introduced a profile-based sentiment-aware approach that specialized classifiers according to users’ demographic traits and incorporated sentiment polarity through a novel Bag of Polarities (BoP) representation. However, their work relied on simple lexicon-based methods for profiling, which limited its potential. This paper addresses these limitations by proposing an enhanced profiling system that leverages state-of-the-art author profiling models and introduces fine-grained age categorization. Our approach replaces lexicon-based profiling with transformer-based models for more accurate gender and age prediction, and introduces a multi-class age classification system (teen, young adult, adult, middle-aged, senior) instead of binary young/senior categorization. Experimental results on Reddit and Twitter datasets show significant improvements, with up to 9.9% enhancement in F1-score on the Reddit dataset compared to the original approach, while maintaining interpretability. Our work demonstrates that accurate demographic profiling is crucial for the profile-based paradigm in mental health assessment.

Keywords: depression detection, social media text analysis, author profiling, transformer-based demographic prediction
Sonia Shahzadi
Indian Institute of Technology (IIT) Delhi
Sanjiv Kumar
Indian Institute of Technology (IIT) Delhi
Harsh Mehta
Indian Institute of Technology (IIT) Delhi

DOI

Cite this article as:

Sonia Shahzadi, Sanjiv Kumar, Harsh Mehta. Enhancing Depression Detection in Social Media via Advanced User Profiling and Fine-Grained Age Groups. Bulletin of Computer and Data Sciences, Volume 4 Issue 1. Page: 24-37.

Publication history

Copyright © 2023 Sonia Shahzadi, Sanjiv Kumar, Harsh Mehta. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Browse Advance Search