Detecting Mental Distress Through User's Social Media Activity
ISSN No:-2456-2165
Abstract:- Users of social networking sites can reach interested friends and express their thoughts, feelings, and sentiments through ideas, photographs, and videos. This opens the door to studying online information for user emotions and feelings in order to gain a better understanding of their emotions and attitudes when utilizing these online platforms.

Depression may be dangerous to one's health, particularly if it is recurring and of moderate or severe degree. It can make the individual suffer a lot and perform poorly at work, at school, and at home. Suicide is a possibility when depression is severe. It is one of the leading causes of death among those between 15 and 29 years of age. The facts state that around 700,000 people take their own lives each year.

Machine learning algorithms and Natural Language Processing will be employed in the proposed problem statement to detect whether a person is going through mental distress. The main aim is to discover the commonality within tweets that can help in identifying whether an individual is on the edge of mental distress, so that there is no delay in reaching out and helping the person who is suffering.

I. INTRODUCTION

Depression, being one of the most common mental illnesses, impacts about 300 million individuals throughout the world. Early identification is crucial for prompt action, which can help to prevent the illness from worsening. As per WHO statistics, around 280 million people in the whole world are suffering from mental distress. In many nations, depression is still underdiagnosed and untreated, resulting in negative self-perception and, in the worst-case scenario, suicide [1]. The need to detect mental distress in individuals is urgent. Hence, this project is aimed at detecting whether a person is depressed by analyzing their social network posts and tweets.

People have begun to express their experiences and struggles with mental health illnesses via online forums, microblogs, and tweets as the Internet has grown in popularity. This online activity has prompted many researchers to develop new types of prospective health-care solutions and approaches for early depression detection systems. They attempted to improve performance by employing several Natural Language Processing (NLP) methodologies and text categorization methods [2].

In the present era, data from numerous businesses, social media, organizations, banking, and so on has become largely unstructured. It includes hidden patterns that, when studied thoroughly, can open up new areas of research and development. However, reading all this vital material and coming to a decision is not an easy task. Text mining, opinion mining, text recognition, and so on all come into play here.

Natural Language Processing (NLP)

It is the practice of using software to recognize and process natural language such as speech and text automatically.

The following are the two primary components of NLP:
Natural Language Generation (NLG): The use of artificial intelligence (AI) programming to generate written or spoken narratives from a data collection is known as natural language generation. NLG incorporates computational linguistics, NLP, and NLU, as well as human-to-machine and machine-to-human interaction.
Natural Language Understanding (NLU): It primarily entails the following two major tasks:
Mapping the provided natural language input to meaningful representations.
Recognizing different patterns in natural language.

NLP Pipeline

The aim is to break the problem down into small chunks and then use machine learning to address each one individually. Intricate tasks can then be performed by chaining together numerous machine learning models that feed into each other.

The steps to build an NLP pipeline are:
Sentence Segmentation: This is the first stage in creating a natural language processing pipeline. The content is broken up into discrete sentences.
Word Tokenization: Once the text has been split into sentences, each sentence is broken down into words (tokens) so that the semantic meaning of each word can be defined and understood independently.
Parts of Speech Predictions for Each Token: This step finds the part of speech of each word now that the words have been converted into tokens.
Text Lemmatization: The major goal of this stage is to figure out each word's basic form so that it can be recognized whether different sentences are talking about the same entity. This technique, known as lemmatization, determines the most fundamental form, or lemma, of each word in the sentence.
Identifying Stop Words: Before undertaking any statistical analysis, it is important to filter out terms called stop words.
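The pipeline steps described above can be illustrated with a minimal sketch in plain Python. It is not the implementation used in this work. The stop-word set, the lemma lookup table, the sample text, and the function names (split_sentences, tokenize, lemmatize, preprocess) are all assumptions made for illustration. Part-of-speech prediction is omitted because it requires a trained tagger, and the dictionary lookup only stands in for a real lemmatizer.

```python
import re

# Tiny illustrative resources (assumptions): a real system would use a full
# stop-word list and a dictionary-backed lemmatizer from an NLP library.
STOP_WORDS = {"a", "an", "the", "i", "am", "is", "are", "and", "so", "to", "of", "my"}
LEMMAS = {"feeling": "feel", "felt": "feel", "days": "day",
          "friends": "friend", "left": "leave"}


def split_sentences(text):
    """Sentence segmentation: split after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Word tokenization: lowercase the sentence and keep runs of letters/apostrophes."""
    return re.findall(r"[a-z']+", sentence.lower())


def lemmatize(token):
    """Toy lemmatization: look the token up in a small table, else keep it unchanged."""
    return LEMMAS.get(token, token)


def preprocess(text):
    """Chain the steps: segment, tokenize, lemmatize, then drop stop words."""
    result = []
    for sentence in split_sentences(text):
        lemmas = [lemmatize(tok) for tok in tokenize(sentence)]
        result.append([tok for tok in lemmas if tok not in STOP_WORDS])
    return result


sample = "I am feeling so tired. My friends left days ago!"
processed = preprocess(sample)
print(processed)
```

Each function's output feeds the next, which mirrors the idea of chaining small stages into one pipeline. In practice each stage would likely be replaced by the corresponding component of an established NLP library.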
Hoyun Song, Jinseon You, Jin-Woo Chung, and Jong C. Park [15] showed that FAN considers only four features, which are not sufficient on their own to detect depression. They used a Feature Attention Network (FAN), Multilayer Perceptron (MLP), GloVe, GRU (one of the Recurrent Neural Network (RNN) variants), L2 regularization, the Adam optimizer, and Convolutional Neural Networks (CNN-E, CNN-R). It was observed that FAN outperforms all the models except the CNN-R model and shows a similar F1-score to the baseline methodologies. In 2020, Zhenpeng Chen, Yanbin Cao, and Huihan Yao presented work on the DeepMoji and SEntiMoji models [16]. SEntiMoji was beneficial for tasks that mainly depend on emotion identification. The methods used to construct various datasets can differ, so performance should be analysed rationally; this was the main objective behind their research.

Some of the challenges with current models:
There is a need to find more features that relate to human behavior and help in the detection of depression.
The n-gram and tf-idf based features did not perform as expected over the dataset.
There are several improvements to be made for better optimization.
The use of word embeddings proved to be a disadvantage, and the main issue with the CNN model is the large increase in training time.
Fine-grained emotion analysis can be done for the purpose of anxiety detection.
There is a requirement to work on the ethical aspects and terms in order to extend this form of study (i.e. depression detection).
There is a need to build a smart AI system that can analyze the symptoms from tweets accurately. The lack of a perfectly accurate model is a big disadvantage.