Skip to main content

A comparative study of machine learning algorithms for predicting domestic violence vulnerability in Liberian women

Abstract

Domestic violence against women is a prevalent in Liberia, with nearly half of women reporting physical violence. However, research on the biosocial factors contributing to this issue remains limited. This study aims to predict women’s vulnerability to domestic violence using a machine learning approach, leveraging data from the Liberian Demographic and Health Survey (LDHS) conducted in 2019–2020.

We employed seven machine learning algorithms to achieve this goal, including ANN, KNN, RF, DT, XGBoost, LightGBM, and CatBoost. Our analysis revealed that the LightGBM and RF models achieved the highest accuracy in predicting women’s vulnerability to domestic violence in Liberia, with 81% and 82% accuracy rates, respectively. One of the key features identified across multiple algorithms was the number of people who had experienced emotional violence.

These findings offer important insights into the underlying characteristics and risk factors associated with domestic violence against women in Liberia. By utilizing machine learning techniques, we can better predict and understand this complex issue, ultimately contributing to the development of more effective prevention and intervention strategies.

Highlights

Chi-square association was used to determine the significant factors for Domestic violence (DV) in Liberia by using LDHS 2019–20 data.

LightGBM and RF performed the best of all seven machine learning models, with the highest accuracy and F1 score.

Victims of emotional violence were the most important feature of almost every model.

Peer Review reports

Introduction

Domestic violence, also known as Intimate partner violence (IPV), is the threat or use of physical, psychological, or emotional abuse towards a partner [1]. It is frequently manifested by gender-specific violence driven by sexism and gender inequity [2].Domestic violence is simply characterized as a man’s assault on a woman [3]. Women are experiencing unprecedented levels of violence in modern society. It is widely acknowledged as a serious violation of human rights and a global health concern [4].

Domestic violence is being put into a global context through things like feminist theories of violence, studies of cyberviolence, and works on privacy. Several researches suggest how a culture where males predominate causes violence against women and restricts women’s bodies in public areas, sexual expression, overall appearance, careers, and spare time. Cyberviolence against women also includes threats of rape, the sharing of private information, and the spreading of photos of a noose that have been changed. Also, domestic violence is linked to not having enough privacy at home [5]. The rates of domestic violence have been increased during the COVID-19 [6]. WHO research shows that the rate of violence between people who live together varies a lot from country to country, from 15% in Japan to 71% in rural Ethiopia [7]. Lifetime rates of emotional domestic abuse in Ethiopia were found to be 51.7%, while rates of physical and sexual assault were found to be between 19.2 and 59.0% [4]. According to a recent global study, businesses can have a significant impact on reducing domestic violence. Nine per cent or more of workers have experienced sexual harassment on the job.

Nevertheless, among all forms of domestic violence, emotional abuse was by far the most prevalent [8]. Characteristics such as maternal education, housing, an unplanned pregnancy, partner substance usage, and a lack of prenatal care were associated with a higher risk of experiencing domestic violence [9]. Not only that, but being a housewife, having fewer children, marrying at a younger age, having a shorter marriage, and having a husband who is unemployed are all connected [8]. In a Pakistani study, poverty, in-law influence, second marriage, stepchildren, forced personal connections, the husband’s irresponsibility and drug use, and children with disabilities were all named as risk factors for domestic violence [10, 11]. Victims of intimate partner abuse had worse mental health, including depression, PTSD, and anxiety, than non-victims [12]. Undesirable physical and mental health effects, especially traumatic brain damage, chronic pain, insomnia, pelvic pain, depression, and post-traumatic stress disorder, may arise from IPV [13,14,15,16].

The high prevalence of sexual and other forms of gender-based violence in armed conflict was once considered an unfortunate but unavoidable side effect of war [17]. Violence and rape are hard to prove everywhere, but especially in Liberia, which has been in conflict from 1989 to 2003 and has seen damage to its infrastructure and social fabric [18, 19]. Two hundred seventy thousand people died from violence or illnesses that went untreated because they couldn’t get to a hospital or get medicine. Over700,000 people escaped Liberia, and more than 1.4 million were internally displaced [20]. In Montserrado and Nimba, more than half of women had been hurt by their partners in ways that were not sexual, and 20% had been raped outside of marriage [20].Nearly half (45%) of individuals in post-conflict Liberia reported physical violence from non-partners [21]. Domestic violence has disempowered women in West Point, Liberia by causing low self-esteem, dependence, low skills, low self-confidence, trauma, stigma, and job loss [22]. Intimate partner violence has a devastating effect on women’s health, both psychologically and physiologically [23]. Furthermore, women’s independence in Liberia has been hampered by domestic abuse. Home abuse adversely affects children’s cognitive growth and mothers’ mental and emotional well-being. Many initiatives have been launched to guarantee women’s safety and promote their independence. There is a lack of quality biosocial studies examining the topic of domestic violence in Liberia.

This study aims to present the recent prevalence of domestic violence, the associated factors, and also to predict domestic violence using machine learning. The ANN, KNN, RF, DT, XGBoost, LightGBM, and CatBoost algorithms.

However, there has been no investigation of domestic violence in Liberia utilizing ANN, KNN, RF, DT, XGBoost, LightGBM, and CatBoost algorithms with the most current data from the Liberian Demographic and Health Survey (LDHS). We will focus on using machine learning algorithms rather than deep learning because of the structured and limited datasets, and also to avoid overfitting. In our study, we will run six different machine learning (ML) algorithms and compare how well they can predict things to find the best predictive model for our study. This study differs from others in the way features are selected and the accuracy of 10-fold cross-validation. The classifiers will also help policymakers find features of domestic violence early on so they can stop it and help Liberian women and future generations have a better world.

Review of related works

There are few related works of domestic violence in Liberia, but we found some recent research who used machine learning algorithms in similar work. A research evaluated the high-effect factors of citizens’ happiness, but it had constraints regarding data, algorithm selection, processing overhead, and real-time detection. The dataset utilized is undiversified, which makes machine learning models challenging to generalize. Furthermore, the algorithms used (SVM-RBF and IB-KNN) may not be optimal, and other methods should be investigated. Additionally, ethical concerns like censorship and privacy in automated systems must be addressed. Finally, practicality and effectiveness must be assessed through real-world installation and evaluation in email servers, social media platforms, and news websites [24].

Another study used machine learning algorithms to detect unipolar and bipolar depression detection on actigraphic registration of motor activity. However, the dataset’s uneven class distribution needs to improve the analysis. This imbalance, particularly in Bipolar II patients, might distort results and impair the model’s real-world applicability. Strategies such as oversampling or synthetic data production should be investigated to solve this issue. Furthermore, there is no investigation of the interpretability and explainability of depression categorization models in the study. Understanding how these models make predictions is critical, especially when trust and transparency are essential in a therapeutic setting [25]. To address these limitations and advance the field of machine learning in depression detection, future research should consider strategies for class imbalance mitigation, more extensive and diverse datasets, improved model interpretability, validation in clinical settings, and comparison of dimensionality reduction techniques.

In 2022, a study suggested a directed network link prediction approach based on path extension similarity to increase the forecast accuracy of network node potential edges. The study mentions the use of numerous real data sets for accuracy verification and robustness analysis. However, how far the proposed strategy may be applied to different networks or datasets is still being determined. ML researchers frequently look for approaches that generalize well across domains and datasets. The research focuses on link prediction in directed networks using a particular approach (path extension similarity). The proposed method may only be applied to some sorts of ML issues, which is a limitation [26].

A researched proposed novel ensemble and robust anomaly detection method based on collaborative representation-based detector. The focused pixels utilized to estimate the background data are drawn randomly from the image. A critical disadvantage in this study is the unpredictability induced by using randomly picked pixels to estimate background data. Furthermore, while the publication claims that the approach is less sensitive to outliers, a thorough assessment of its robustness is required. This should include assessing its performance in various outlier scenarios involving various types and degrees of anomaly to better understand its strengths and limitations [27].

A study on influencer marketing sheds light on the influence of fake followers on perceptions of influencing power. However, it has numerous shortcomings that highlight topics for further research in machine learning (ML) applications in influencer marketing. While the study analyzes the impact of false followers, it does not directly investigate the application of machine-learning approaches to detect or quantify fraudulent followers. Given the constantly developing strategies employed by influencers to disguise themselves, future studies could dive into the creation and implementation of ML models for more precise and automatic identification of false followers [28].

An article that used the multi-modal fusion in visual question answering highlights the role of attention mechanisms in Visual Question Answering (VQA) but does not detail ML testing for these mechanisms. Future research should build standardized testing procedures and datasets for evaluating attention-based models in cross-modal retrieval scenarios to ensure their dependability in practical applications [29]. An article in 2023 recently used an improved multi-label method to classify emotions for short texts. The study focuses on improving classification accuracy and speed but needs to extensively address the challenges related to noisy or ambiguous data in real-world Twitter conversations. The study’s experiments were also conducted on a Twitter corpus, which may not fully represent the diversity of short text data found on other social media platforms or digital environments [29].

A research combined the multi-layer semantic representation network with the deep fusion matching network to overcome the restrictions of merely examining a sentence representation module or a reasoning model. It does not, however, provide explicit solutions for dealing with the mutual limitations identified between these modules. While the joint optimization model improves recognition accuracy, the paper notes there is still potential for improvement. ML testing should investigate sophisticated optimization strategies and novel model architectures to obtain even higher reasoning accuracy, potentially pushing the bounds of natural language reasoning capabilities. [30]. In 2023, a study recognized extended dialogue emotion using commonsense knowledge graph guidance; however, the work relies on external commonsense knowledge from the ATOMIC atlas and does not examine potential limits or biases in this knowledge source. External knowledge’s correctness and completeness can impact emotion detection quality; therefore, any limits in the ATOMIC dataset should be considered [31].In our research, we embark on a comprehensive exploration of machine learning (ML) algorithms, employing six distinct models to discern their efficacy in predictive modeling. Our objective is to determine the most proficient predictive model for our study, which focuses on forecasting domestic violence among Liberian women. To achieve this, we harness the power of Artificial Neural Networks (ANN), K-Nearest Neighbors (KNN), Random Forest (RF), Decision Trees (DT), XGBoost, LightGBM, and CatBoost algorithms.

It is worth noting that, to the best of our knowledge, there has been a paucity of research endeavors that have utilized these advanced ML models for predicting domestic violence. This study endeavors to contribute novelty to this field by applying cutting-edge AI models to address critical women’s health issues, particularly in the context of domestic violence within underdeveloped countries.

Our study distinguishes itself from prior research in two pivotal aspects. Firstly, we employ a unique approach to feature selection, offering a novel perspective on identifying crucial indicators of domestic violence. Secondly, we rigorously assess our models’ performance through a rigorous 10-fold cross-validation process, enhancing the reliability and robustness of our findings.

The outcomes of our research hold significant potential for policymakers and advocacy groups. By identifying early indicators of domestic violence, our classifiers can play a pivotal role in proactively addressing and mitigating this pressing issue. Ultimately, our work aspires to contribute to a brighter future for Liberian women and subsequent generations, fostering a safer and more equitable world.

Materials and methods

Data collection

The investigation utilized data from the 2019-20 Liberia Demographic and Health Survey (LDHS) [32], which was conducted by the Liberia Institute of Statistics and Geo-Information Services (LISGIS) with permission from the Ministry of Health (MOH). This survey is the fifth in a series of demographic and health surveys conducted in Liberia.

Study design and settings

The 2019-20 LDHS sample frame was created based on the Liberia Institute of Statistics and Geo-Information Services (LISGIS)’s 2008 NPHC. Liberia is divided into 15 counties, into five zones with three counties each, and further organized into clans. Each clan was partitioned into enumeration areas (EAs) during the 2008 NPHC. The census frame shows that each EA has an average of 100 households.

The LDHS 2019-20 utilized a two-stage cluster design. First, EA sample sites or clusters were selected. The EAs within each sample stratum were selected based on their size, resulting in a total of 325 cluster groupings. Second, households were selected from each cluster. The households were listed during the listing, and on average, 129 households were located in each cluster. From the listed households, 30 were randomly selected, resulting in a total sample size of 9,745 households. This sample is representative at the national, urban, and rural levels and covers all five regions, including the 15 counties [33]. After excluding type errors, missing, and unnecessary values, the remaining sample size was 1,907, which provides representative data for important metrics in all 15 counties.

Measures

(1) Dependent variable

Ever having witnessed domestic violence was the dependent variable in our study. Two categories, “Yes” (experienced domestic abuse) and “No” (did not experience domestic violence), are used to classify the dependent variable. One indicates “Yes”, whereas zero indicates “No”.

(2) Independent variable

As an independent variable, we used the sociodemographic information of the respondents. Region (North Central, North Western, South Central, South Eastern A, and South Eastern B), Residence (Urban, Rural), Educational Level (No Education, Primary, Secondary, Higher), Body Mass Index (BMI), Wealth Index (Poor, Middle, Rich), Religion (Christian, Muslim, Traditional Religion, No Religion), Partner’s Education Level, Partner’s Occupation (Did not work, Public or private sector, Sales & services, Agricultural Sector, Skilled & Unskilled Manual), Respondent’s Occupation, Partner’s Age (11–20, 21–30, 31–40, 41–50, 51–60, 61–70, > 71), Lack of Independence (Yes, No), Victims of Emotional Violence (Yes, No), and Partner’s drinking habit (Yes, No) were among the sociodemographic factors. To classify BMI, the BMI standard scale was applied [34].

Data splitting and model building

We used 80% of the dataset for training and 20% for testing. Using the training data, we fitted a variety of models. To ensure robustness, we tuned the model to find the ideal hyperparameters using grid search and 10-fold cross-validation. After creating the ultimate best model, the fitting process was repeated on the training set. Finally, we evaluated the model’s performance using an independent test set, which gave us helpful information on the predictability of our method for classifying domestic violence.

Workflow chart for predicting DV

Figure 1 shows the workflow of the Machine learning classifiers for predicting the vulnerability of women to domestic violence in Liberia.

Fig. 1
figure 1

Workflow chart of domestic violence vulnerability in Liberian women

Predictive model generation

We have applied seven distinct machine learning models to assess the accuracy of each model for our dataset to forecast the incidence of domestic violence in Liberia. Only some studies generate their methodology using machine learning models [35,36,37,38]. Brief descriptions of the following models are provided below:

Artificial neural network (ANN)

The phrase “artificial neural network” refers to a subfield of artificial intelligence influenced by biology and fashioned after the brain. Typically, an artificial neural network is a computer network modeled after the biological neural networks that comprise the structure of the human brain. Similar to how neurons in the real brain are interconnected, artificial neural networks likewise contain interconnected neurons that multiple network levels. These neurons are referred to as nodes [38]. ANNs are used in various applications, including image and speech recognition, natural language processing, and predictive analytics.

Another criticalaspect of ANN model is its architecture, which includes the number of layers, the number of neurons per layer, and the activation functions used. In an ANN, the input is sent from the neurons in one layer to the neurons in the next layer after the bias and weight vectors have modified it. In the neurons of the hidden layer, information is processed, and the signal is modified using an activation function before being transmitted to the outer layer. Table 1 shows the hyperparameter values of the ANN model.

Table 1 Control parameters for ANN model for predicting DV

K-nearest neighbor (KNN)

K-nearest neighbor is a machine learning approach considered straightforward [39]. A categorization system based on similarities can be used to map the prediction of the domestic violence problem. A collection of vectors is created by mapping the test data and the historical observed data. The N-dimensional representation of each vector correlates to a feature. For classification tasks, KNN takes the majority class among the K closest neighbors as the predicted class for the new data point. For regression tasks, KNN takes the average (or weighted average) of the K closest neighbors as the predicted value for the new data point. Then, a similarity measure such as Euclidean distance is calculated to make a choice. A description of KNN is given in this section. When using KNN, which is regarded as lazy learning, no prior model or function is built; instead, the closest K records from the training data set that are most similar to the test are produced (i.e., query records). The class label is then decided by a majority vote among the chosen K records, and it is subsequently applied to the query record.

The following is how the prediction of domestic violence is calculated using KNN:

  1. 1.

    We calculated the K closest neighbors.

  2. 2.

    Calculated distance between the query record and the training samples.

  3. 3.

    Organize all training records by distance values.

  4. 4.

    Assign the prediction value for the query record to the class labels of the K neighbors with the most votes.

Table 2 represents the values of parameters for developing KNN model.

Table 2 Control parameters for KNN model for predicting DV

Random Forest (RF)

Random Forest (RF) is a machine learning algorithm that belongs to the ensemble methods family. Ensemble methods combine multiple base models to improve the overall performance of the prediction task. Random Forest is a supervised learning algorithm that can be used for both regression and classification problems. American academic Leo Breiman of the University of California, Berkeley initially devised the random forest algorithm in 2001. It uses supervised data mining techniques. It uses supervised data mining techniques [40].

Table 3 shows the parameter values of RF model.

Table 3 Control parameters for RF model for predicting DV

Decision tree (DT)

A well-liked supervised machine learning technique called Decision Tree (DT) is utilized for both classification and regression problems. The primary purpose of a DT is to build a tree-like model of decisions and potential outcomes using a collection of input features. Recursively dividing the data into progressively smaller sections depending on the values of the input characteristics creates the tree. A decision tree is a grid or tool used to enable choices among several alternatives, such as event outcomes, resource use, costs, and their application [41]. It is one method of displaying an algorithm. Decision tree applications are frequently utilized in decision research and analysis. Here we used Decision tree Classification. In general, it is employed for prediction purposes. It painstakingly analyses the data contained in vast facts to find useful patterns and relationships. In this work, a decision tree classifier from the Scikit-learn Python package is used [42].

The control parameter values of DT model are shown in Table 4.

Table 4 Parameters for DT model

Xtreme gradient boosting (XGBoost)

It is an implementation of gradient boosting, an ensemble learning method combining multiple weak models to form a robust model. The ensemble machine learning method XGBoost (Extreme Gradient Boosting) uses a gradient boosting framework, which is decision tree based. Only the first derivative information is used in conventional boosting tree models. The residual of the previous n-1 trees is employed while training the nth tree, making dispersed training challenging. XGBoost expands the loss function in a second-order Taylor manner and can automatically utilize the CPU’s multithreading for parallel processing. Additionally, XGBoost employs a number of techniques to prevent overfitting [43].

Some of the key features of XGBoost include:

  1. 1.

    Regularization: To avoid overfitting, XGBoost offers a number of regularisation approaches, including L1 and L2 regularisation.

  2. 2.

    Cross-validation: To identify the ideal number of boosting rounds, XGBoost includes cross-validation built in.

  3. 3.

    Handling missing values: The dataset’s missing values can be handled by XGBoost since it automatically learns how to manage them during training.

  4. 4.

    Processing in parallel: XGBoost is incredibly quick and scalable because it can parallelize the calculation of trees during training.

Table 5 shows the parameter values of XGBoost model.

Table 5 Control parameters for XGBoost model for predicting DV

Light gradient boosting (LightGBM)

Microsoft created the open-source GBDT (gradient boosting decision tree) algorithm, LightGBM. The parallel voting decision tree approach employs the histogram-based technique to speed training, reduce memory usage, and combine advanced network connectivity to maximize parallel learning. Each iteration involves splitting the training data into different machines, doing local voting to choose the top k attributes, and global voting to receive the top 2k attributes. To identify the leaf with the most significant splitter gain, LightGBM employs a leaf-wise approach (see Fig. 2). Because of its precision, effectiveness, and stability, GBDT is now frequently employed [44].

Key characteristics of LightGBM include:

  1. 1.

    Faster training speed: Training is completed more quickly, it can handle big datasets with millions of instances and characteristics.

  2. 2.

    Less memory usage: LightGBM employs a cutting-edge method called Gradient-based One-Side Sampling (GOSS) to save memory.

  3. 3.

    More precision: LightGBM employs exclusive feature bundling (EFB) to increase accuracy. EFB aggregates characteristics with comparable values into a single feature.

  4. 4.

    Tunable parameters: Varioushyperparameters, including the learning rate, the number of leaves, and the maximum depth, are offered by LightGBM and may be adjusted to enhance performance.

Fig. 2
figure 2

The tree generating technique in LightGBM [44]

Table 6 shows the values of control parameter of LightGBM model.

Table 6 Parameters for LightGBM model

Categorical boosting (CatBoost)

One of the most recent boosting algorithms is CatBoost (Categorical Boosting). Although it functions similarly to the Gradient and XGboost methods and is also intended to work with categorical information, this approach has advanced features that make it more dependable, quick, and accurate. The following are the benefits of CatBoost over other GBDT algorithms:

First off, categorical features are well handled by this method. With the appropriate average label value, categorical characteristics can be replaced by the conventional GBDT algorithm. The decision tree’s average label value will serve as the node-splitting criterion. CatBoost, in addition, incorporates various category traits. All categorical characteristics and combinations in the current tree are combined with all categorical features in the dataset using a greedy method by CatBoost. Thirdly, the gradient bias can be eliminated with CatBoost. In GBDT, a weak learner is generated in each iteration, and each learner is trained using the gradient of the previous learner. The output is the sum of the classified results from all of the learners [45].

The control parameter values of CatBoost model shown in Table 7.

Table 7 Parameters for CatBoost model for predicting DV

Results

The dependent variable was shown as a bar chart in Fig. 3, which showed that 55.74% of the participants had experienced domestic violence while 44.26% had not.

Fig. 3
figure 3

Bar chart of dependent variables (Domestic Violence)

Table 8 shows how often domestic violence happens and the selected covariates’ background characteristics. Women in the South-Eastern B region had a greater likelihood of experiencing domestic abuse (51.4%) than women in other regions. Women had a higher risk of experiencing domestic violence if they came from a Christian family (44.2%), lived in a rural area (44.5%), received only primary education (50.2%), belonged to a rich family (45.7%), or if their husband or partner had only primary education (47.8%). Women between the ages of 21 and 30 also had a higher risk of experiencing domestic violence (52.2%). 48.0% of women who worked in sales and services; 51.1% of women who lacked independence; 78.1% of women who experienced emotional violence; and 78.3% of women whose husbands or partners had a drinking problem were at increased risk (59.0%). It was revealed that there was a significant link between the chosen covariates and the incidence of domestic violence against women (P < 0.001).

Table 8 Association between selected features with domestic violence in Liberia

We want to measure the accuracy, precision, recall, and F1-score of seven machine learning algorithms on our data set. The prediction accuracy of the ANN, KNN, RF, DT, XGBoost, LightGBM, and CatBoost algorithms for our data set was 81%, 78%, 81%, 82%, 81%, and 82%, respectively, as shown in Table 9. In this instance, the DT and CatBoost algorithms produced the highest accuracy, precision, recall, and F1-score results. However, the KNN method had the lowest accuracy, precision, recall, and F1-score values. Overall, the DT method performed better than the other six algorithms for our prediction data set.

Table 9 Evaluation of the model’s performance
Fig. 4
figure 4

Confusion matrix of the ML models

Figure 4 shows the normalized confusion matrix for each of the following algorithms: ANN, KNN, RF, DT, XGBoost, LightGBM, and CatBoost. Figure 4 shows a confusion matrix that can help make this prediction result clearer. Figure 4 shows that the percentage true positive value in the confusion matrix of the LightGBM model is 76%. This means that it labels 76% of true positive values as true positive values. On the other hand, the number of true negative scores is 87%, which means that this model labels true negative classes as true negative 87% of the time. LightGBM gives the highest true positive value compared to other algorithms.

The relevance of a feature describes whether certain data elements are more practical or significant than others. It can help to comprehend the solution better, and feature selection can occasionally result in model enhancement when the selected feature is implemented [46]. Feature importance is essentially a strategy that allocates a score to input features based on utility. They are adept at predicting a variable of interest. We determined the feature’s importance for RF, DT, XGBoost, LightGBM, and CatBoost. Figures 5, 6, 7, 8 and 9 illustrate the significant features of selected algorithms independently.

RF

Fig. 5
figure 5

Feature importance of Random Forest (RF) model

Fig. 6
figure 6

Feature importance of Decision Tree (DT) model

Fig. 7
figure 7

Feature importance of XGBoost model

Fig. 8
figure 8

Feature importance of LightGBM model

Fig. 9
figure 9

Feature importance of CatBoost model

The receiver operating characteristic curve, or ROC curve, is a graph that shows how well binary classifiers can diagnose problems. This study’s main goal was to find domestic violence cases and evaluate the results using seven different algorithms. Figure 10 demonstrates that the RF method has the best performance compared to the other machine learning algorithms.

Fig. 10
figure 10

ROC curve of the ML models

Discussion

Domestic violence is a serious issue affecting millions of women worldwide. In recent years, machine learning (ML) techniques have been increasingly utilized to identify and predict contributing factors to domestic violence. Our study is one of the major types of research that has used ML to predict domestic violence in Liberia. Previous studies by Amusa et al. (2022) [47], Petering et al. (2018) [48], and others have also employed ML strategies to predict domestic violence [49, 50].

Deep learning (DL) models, notably RNNs, have been the most popular computational strategy in ML in recent years, but they require vast amounts of data to function well [52]. DL also uses many layers of algorithms, each of which interprets data differently [51]. Our investigation was limited by lacking a big dataset for Liberian women’s domestic violence vulnerability prediction. While, traditional ML models are less likely to overfit, making them more resilient and effective with smaller datasets. Additionally, ML models like random forests (RF), decision trees (DT), and k-nearest neighbors (KNN) are easier to read and reveal feature importance. Interpretability is essential for research transparency and policymaker and stakeholder action for domestic violence prediction. Therefore, we used ML classifiers to analyze nationally representative survey data in Liberia. The model identified six factors that were significant among thirteen variables: educational level, partner’s employment, partner’s age, lack of independence, victims of emotional abuse, and partner’s drinking habit. This allowed us to automate the process of domestic violence prevention without requiring significant human effort.

Our findings reveal that 55.74% of Liberians have been victims of domestic violence at some point in their lives. However, women may be unable to take essential steps to address this situation. According to the World Health Organization (WHO), more than 50% of women in Bangladesh, Ethiopia, Peru, and Tanzania are highly exposed to domestic violence, which reaches an alarming 71% in Ethiopia [52]. Additionally, in South Africa and India, the respective percentages of domestic violence are 21.9% and 45.37% [47, 53].

In our investigation, most algorithms without KNN achieved the highest accuracy and recall ratings, ranging between 81% and 82%. In particular, ANN, DT, and CatBoost demonstrated the best accuracy compared to other research done in Bangladesh, while the RF model was the most accurate [47, 54]. Another study also implemented the RF model to predict domestic violence due to its ease of handling predicting errors and the provision of instructive visualizations [55, 56].

Our study identified several significant risk factors for domestic violence. Education level was a good predictor of domestic violence, consistent with previous studies [11, 54, 57]. In particular, illiterate women may be unaware of their rights, leading to a greater reliance on their partners and an increased likelihood of domestic abuse. Our study also revealed the partner’s drinking habit to be a risk factor for domestic violence, as the spouse may become disoriented due to alcohol consumption. A study conducted in Pakistan similarly found drinking to be a risk factor for domestic violence [10].

Furthermore, our study found that the partner’s age and occupation are also significant factors associated with domestic violence. A study conducted in Uganda supports these findings [58]. Women’s independence is a crucial factor in gender equality, which is one of the primary aims of sustainable development. Our findings suggest that a lack of independence is one of the most significant determinants of domestic violence, indicating that gender equality has not yet been achieved [59].

Limitations

Our study utilized the LDHS 2019 dataset, known for its authenticity and adherence to DHS program requirements. However, since the survey was conducted a few years ago, the current values may have changed, which is a potential limitation. Additionally, since the study is cross-sectional, causal inferences and temporal trends could not be analyzed. Moreover, the analysis was limited to a specific set of variables due to the unavailability of DHS datasets. Nonetheless, our study provides valuable insights into the domestic violence features using machine learning algorithms, which have not been explored in depth before.

Conclusion

In conclusion, our study utilized machine learning techniques to identify and predict contributing variables to domestic violence in Liberia. The results demonstrated that machine learning classifiers, such as CatBoost and DT, performed well in predicting domestic violence. However, there were some inconsistencies in the findings, indicating that researchers should consider hybrid machine learning approaches for superior results. We suggest that future studies combine data from DHS to enhance the efficacy of machine learning models. We also recognize the challenges of tuning hyper parameters to produce accurate results and determining relevant features. We recommend that policymakers and the government focus on the characteristics identified in our study, such as education level, partner’s employment, partner’s age, lack of independence, victims of emotional abuse, and partner’s drinking habit, to prevent domestic violence and promote gender equality.

Data Availability

The data that support the findings of this study are openly available in The DHS Program at https://dhsprogram.com/data/available-datasets.cfm, reference country Liberia. All essential source code for this study will be accessible on Kaggle at the following link : https://www.kaggle.com/code/tingku/predicting-domestic-violence-in-liberia.

References

  1. Flury M, Nyberg E, Riecher-Rössler A. Domestic Violence against women: definitions, epidemiology, risk factors and consequences. Swiss Med Wkly. 2010;140(NOVEMBER):23–7. https://doi.org/10.4414/SMW.2010.13099.

    Article  Google Scholar 

  2. Carol Hagemann W. (1992). Strategies against violence in gender relations. Centaurus-Verlag-Ges. https://katalog.ub.uni-heidelberg.de/cgi-bin/titel.cgi?katkey=29141491.

  3. Alejo K. Long-Term Physical and Mental Health Effects of Domestic Violence. Res J Justice Stud Forensic Sci Sci. 2014;2:5. https://doi.org/10.31979/THEMIS.2014.0205.

    Article  Google Scholar 

  4. Semahegn A, Mengistie B. Domestic Violence against women and associated factors in Ethiopia; systematic review. Reproductive Health. 2015;12(1):1–12. https://doi.org/10.1186/S12978-015-0072-1/TABLES/1.

    Article  Google Scholar 

  5. Brenda Smith CW. (2005). Domestic Violence at the Margins (S. Natalie J., Ed.). Rutgers University Press. https://www.rutgersuniversitypress.org/domestic-violence-at-the-margins/9780813535708.

  6. Kourti A et al. “Domestic Violence During the COVID-19 Pandemic: A Systematic Review,” Trauma, Violence, & Abuse, vol. 24, no. 2, pp. 719–745, Apr. 2023, https://doi.org/10.1177/15248380211038690.

  7. World Health Organization. (2014). Strengthening the role of the health system addressing violence against women.

  8. Shah SH, Rajani K, Kataria L, Trivedi A, Patel S, Mehta K. Perception and prevalence of Domestic Violence in the study population. Industrial Psychiatry Journal. 2012;21(2):137. https://doi.org/10.4103/0972-6748.119624.

    Article  PubMed  PubMed Central  Google Scholar 

  9. WHO. (2011). Domestic violence and its associated factors in Iran: according to World Health Organization model. WHO. http://eprints.qums.ac.ir/5436/.

  10. Hussain H, Hussain S, Zahra S, Hussain T. Prevalence and risk factors of Domestic Violence and its impacts on women’s mental health in Gilgit-Baltistan, Pakistan. Pakistan J Med Sci. 2020;36(4):627–31. https://doi.org/10.12669/PJMS.36.4.1530.

    Article  Google Scholar 

  11. George J, Nair D, Premkumar N, Saravanan N, Chinnakali P, Roy G. The prevalence of Domestic Violence and its associated factors among married women in a rural area of Puducherry, South India. J Family Med Prim Care. 2016;5(3):672. https://doi.org/10.4103/2249-4863.197309.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Lagdon S, Armour C, Stringer M. Adult experience of mental health outcomes as a result of intimate partner Violence victimisation: a systematic review. Eur J Psychotraumatology. 2014. https://doi.org/10.3402/EJPT.V5.24794. 5.

    Article  Google Scholar 

  13. Campbell J, Jones AS, Dienemann J, Kub J, Schollenberger J, O’Campo P, Gielen AC, Wynne C. Intimate partner Violence and physical health consequences. Arch Intern Med. 2002;162(10):1157–63. https://doi.org/10.1001/ARCHINTE.162.10.1157.

    Article  PubMed  Google Scholar 

  14. Dillon G, Hussain R, Loxton D, Rahman S. (2013). Mental and Physical Health and Intimate Partner Violence against Women: A Review of the Literature. International Journal of Family Medicine, 2013, 1–15. https://doi.org/10.1155/2013/313909.

  15. Swiss S, Jennings PJ, Aryee Gv, Brown GH, Jappah-Samukai RM, Kamara MS, Schaack RDH, Turay-Kanneh RS. Violence against women during the Liberian civil conflict. JAMA. 1998;279(8):625–9. https://doi.org/10.1001/JAMA.279.8.625.

    Article  CAS  PubMed  Google Scholar 

  16. Walker N, Beek K, Chen H, Shang J, Stevenson S, Williams K, Herzog H, Ahmed J, Cullen P. The experiences of Persistent Pain among Women with a history of intimate Partner Violence: a systematic review. Trauma Violence and Abuse. 2022;23(2):490–505. https://doi.org/10.1177/1524838020957989.

    Article  Google Scholar 

  17. Tol WA, Stavrou V, Greene MC, Mergenthaler C, Garcia-Moreno C, van Ommeren M. Mental health and psychosocial support interventions for survivors of sexual and gender-based Violence during armed conflict: a systematic review. World Psychiatry. 2013;12(2):179. https://doi.org/10.1002/wps.20054.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Panter-Brick C. Conflict, Violence, and health: setting a new interdisciplinary agenda. Soc Sci Med. 2010;70(1):1–6. https://doi.org/10.1016/J.SOCSCIMED.2009.10.022.

    Article  PubMed  Google Scholar 

  19. Stark L, Ager A. A systematic review of prevalence studies of gender-based Violence in complex emergencies. Trauma Violence & Abuse. 2011;12(3):127–34. https://doi.org/10.1177/1524838011404252.

    Article  Google Scholar 

  20. Stark L, Warner A, Lehmann H, Boothby N, Ager A. Measuring the incidence and reporting of Violence against women and girls in Liberia using the neighborhood method. Confl Health. 2013;7(1):1–9. https://doi.org/10.1186/1752-1505-7-20/FIGURES/1.

    Article  Google Scholar 

  21. Kelly J, Colantuoni E, Robinson C, Decker MR. From political to personal Violence: links between conflict and non-partner physical Violence in post-conflict Liberia. Glob Public Health. 2019;14(12):1639–52. https://doi.org/10.1080/17441692.2019.1650949.

    Article  PubMed  Google Scholar 

  22. Wilson, Sophia. Domestic Violence and women’s dis-empowerment in west point township-Monrovia, Liberia. Kampala International University; 2018. http://hdl.handle.net/20.500.12306/1256.

  23. Sungho D, Kumar PN, Dayal A, Dolo D, Dorleleay W, Higgins W, Holzinger A, Martens T, Vasilov C, Liberia I, Kamara J. (2022). Reducing intimate Partner Violence: evidence from a multifaceted female empowerment program in Urban Liberia *.

  24. IEEE Transactions on Computational Social Systems. https://doi.org/10.1109/TCSS.2023.3246181.

  25. Zakariah M, Alotaibi YA. Unipolar and bipolar depression detection and classification based on Actigraphic Registration of Motor Activity Using Machine Learning and uniform manifold approximation and projection methods. Diagnostics. 2023;13(14):2323. https://doi.org/10.3390/diagnostics13142323.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Guo F, Zhou W, Lu Q, Zhang C. Path extension similarity link prediction method based on matrix algebra in directed networks. Comput Commun. 2022;187:83–92. https://doi.org/10.1016/J.COMCOM.2022.02.002.

    Article  Google Scholar 

  27. Wang S, Hu X, Sun J, Liu J. Hyperspectral anomaly detection using ensemble and robust collaborative representation. Inf Sci. 2023;624:748–60. https://doi.org/10.1016/j.ins.2022.12.096.

    Article  Google Scholar 

  28. Zhou L, Jin F, Wu B, Chen Z, Wang CL. Do fake followers mitigate influencers’ perceived influencing power on social media platforms? The mere number effect and boundary conditions. J Bus Res. 2023;158:113589. https://doi.org/10.1016/j.jbusres.2022.113589.

    Article  Google Scholar 

  29. Lu, S., Liu, M., Yin, L., Yin, Z., Liu, X., Zheng, W.,... Kong, X. (2023). The multi-modal fusion in visual question answering: a review of attention mechanisms. PeerJ Computer Science, 9, e1400. https://doi.org/10.1016/10.7717/peerj-cs.1400.

  30. Liu X, Shi T, Zhou G, Liu M, Yin Z, Yin L,…, Zheng W (2023) Emotion classification for short texts: an improved multi-label method. Humanit Social Sci Commun 10(1):306. https://doi.org/10.1057/s41599-023-01816-6

    Article  Google Scholar 

  31. Zheng W, Yin L. Characterization inference based on joint-optimization of multi-layer semantics and deep fusion matching network. PeerJ Comput Sci. 2022. https://doi.org/10.7717/peerj-cs.908.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Nie W, Bao Y, Zhao Y, Liu A. Long dialogue emotion detection based on Commonsense Knowledge Graph Guidance. IEEE Trans Multimedia. 2023. https://doi.org/10.1109/TMM.2023.3267295.

    Article  Google Scholar 

  33. Liberia Demographic and Health Survey 2019-20. Monrovia, Liberia and Liberia Institute of Statistics and Geo-Information Services (LISGIS), Ministry of Health [Liberia], and ICF, Rockville. Maryland, USA: Liberia Institute of Statistics and Geo-Information Services (LISGIS), Ministry of Health, and ICF.

  34. Liberia. (2019). The DHS Program. DHS. https://dhsprogram.com/publications/publication-FR362-DHS-Final-Reports.cfm.

  35. Nuttall FQ. Body Mass Index: obesity, BMI, and Health: a critical review. Nutr Today. 2015;50(3):117–28. https://doi.org/10.1097/NT.0000000000000092.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Behesht Abad AR, Mousavi S, Mohamadian N, Wood DA, Ghorbani H, Davoodi S, Alvar MA, Shahbazi K. Hybrid machine learning algorithms to predict condensate viscosity in the near wellbore regions of gas condensate reservoirs. J Nat Gas Sci Eng. 2021;95:104210. https://doi.org/10.1016/J.JNGSE.2021.104210.

    Article  Google Scholar 

  37. Behesht Abad AR, Tehrani PS, Naveshki M, Ghorbani H, Mohamadian N, Davoodi S, Aghdam SK, ye, Moghadasi J, Saberi H. Predicting oil flow rate through orifice plate with robust machine learning algorithms. Flow Meas Instrum. 2021a;81:102047. https://doi.org/10.1016/J.FLOWMEASINST.2021.102047.

    Article  Google Scholar 

  38. Hazbeh O, Ahmadi Alvar M, Khezerloo-ye Aghdam S, Ghorbani H, Mohamadian N, Moghadasi J. Hybrid computing models to predict oil formation volume factor using multilayer perceptron algorithm. J Petroleum Min Eng. 2021;23(1):17–30. https://doi.org/10.21608/JPME.2021.52149.1062.

    Article  Google Scholar 

  39. Md N, Alam Khan M Saef Ullah Miah, Md. Shahjalal, Sarwar TB. Md. Shahariar Rokon, “Predicting Young Imposter Syndrome Using Ensemble Learning”, Complexity, vol. 2022, Article ID 8306473, 10 pages, 2022. https://doi.org/10.1155/2022/8306473.

  40. Aha DW, Kibler D, Albert MK, Quinian JR. Instance-based learning algorithms. Mach Learn 1991. 1991;6:1(1):37–66. https://doi.org/10.1007/BF00153759.

    Article  Google Scholar 

  41. Behesht Abad AR, Tehrani PS, Naveshki M, Ghorbani H, Mohamadian N, Davoodi S, Aghdam SK, ye, Moghadasi J, Saberi H. Predicting oil flow rate through orifice plate with robust machine learning algorithms. Flow Meas Instrum. 2021b;81:102047. https://doi.org/10.1016/J.FLOWMEASINST.2021.102047.

    Article  Google Scholar 

  42. Demšar J. Statistical Comparisons of Classifiers over multiple data sets. J Mach Learn Res. 2006;7:1–30.

    Google Scholar 

  43. Aguilar-Chinea RM, Rodriguez IC, Expósito C, Melian-Batista B, Moreno-Vega JM. Using a decision tree algorithm to predict the robustness of a transshipment schedule. Procedia Comput Sci. 2019;149:529–36. https://doi.org/10.1016/J.PROCS.2019.01.172.

    Article  Google Scholar 

  44. Li W, Yin Y, Quan X, Zhang H. Gene expression value prediction based on XGBoost Algorithm. Front Genet. 2019;10:1077. https://doi.org/10.3389/FGENE.2019.01077/BIBTEX.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Ju Y, Sun G, Chen Q, Zhang M, Zhu H, Rehman MU. “A Model Combining Convolutional Neural Network and LightGBM Algorithm for Ultra-Short-Term Wind Power Forecasting,“ in IEEE Access, vol. 7, pp. 28309–28318, 2019. https://doi.org/10.1109/ACCESS.2019.2901920.

  46. Friedman JH. Stochastic gradient boosting. Comput Stat Data Anal. 2002;38(4):367–78. https://doi.org/10.1016/S0167-9473(01)00065-2.

    Article  Google Scholar 

  47. Rajabi M, Beheshtian S, Davoodi S, Ghorbani H, Mohamadian N, Radwan AE, Alvar MA. Novel hybrid machine learning optimizer algorithms to prediction of fracture density by petrophysical data. J Petroleum Explor Prod Technol. 2021b;11(12):4375–97. https://doi.org/10.1007/S13202-021-01321-Z/FIGURES/13.

    Article  Google Scholar 

  48. Amusa LB, Bengesai Av, Khan HTA. Predicting the vulnerability of women to intimate Partner Violence in South Africa: evidence from Tree-based machine learning techniques. J Interpers Violence. 2022;37(7–8):NP5228–45. https://doi.org/10.1177/0886260520960110.

    Article  PubMed  Google Scholar 

  49. Petering R, Um MY, Fard NA, Tavabi N, Kumari R, Gilani SN. Artificial intelligence to predict intimate partner Violence perpetration. Artificial Intelligence and Social Work; 2018. p. 195.

  50. Choubineh A, Ghorbani H, Wood DA, Moosavi R, Khalafi S, E., Sadatshojaei E. Improved predictions of wellhead choke liquid critical-flow rates: modelling based on hybrid neural network training learning based optimization. Fuel. 2017;207:547–60. https://doi.org/10.1016/J.FUEL.2017.06.131.

    Article  CAS  Google Scholar 

  51. Ghorbani H, Moghadasi J, Wood DA, Alzubaidi L, Zhang J, Humaidi AJ et al. Review of deep learning: concepts, CNN architectures, challenges, applications, future directions. J Big Data 8, 53 (2021). https://doi.org/10.1186/s40537-021-00444-8.

  52. Alzubaidi L, Zhang J, Humaidi AJ et al (2021) Review of deep learning: concepts, CNN architectures, challenges, applications, future directions. J Big Data 8:53. https://doi.org/10.1186/s40537-021-00444-8

    Article  PubMed  PubMed Central  Google Scholar 

  53. Najafabadi MM, Villanustre F, Khoshgoftaar TM, Seliya N, Wald R, Muharemagic E. Deep learning applications and challenges in big data analytics. J Big Data. 2015;2(1):1.

    Article  Google Scholar 

  54. Overcoming Violence. (2011). Domestic Violence Facts. UNIFEM. http://www.overcomingviolence.org/en/resources/campaigns/women-against-violence/week-6-stories-from-around-the-/domestic-violence-facts.html.

  55. Singh BP, Singh KK, Singh N. Couple Interaction and Predicting vulnerability to Domestic Violence in Uttar Pradesh, India. J Interpers Violence. 2014;29(12):2304–24. https://doi.org/10.1177/0886260513518432.

    Article  PubMed  Google Scholar 

  56. Hossain MM, Asadullah M, Rahaman A, Miah MS, Hasan MZ, Paul T, Hossain MA. Prediction on Domestic Violence in Bangladesh during the COVID-19 outbreak using machine learning methods. Appl Syst Innov 2021. 2021;4(4):77. https://doi.org/10.3390/ASI4040077.

    Article  Google Scholar 

  57. Berk RA, Sorenson SB, Barnes G. Forecasting Domestic Violence: a Machine Learning Approach to help inform arraignment decisions. J Empir Legal Stud. 2016;13(1):94–115. https://doi.org/10.1111/JELS.12098.

    Article  Google Scholar 

  58. Debarchana G. Predicting vulnerability of Indian women to Domestic Violence incidents. Esearch and Practice in Social Sciences; 2007.

  59. Mishra A, Patne S, Tiwari R, Srivastava DK, Gour N, Bansal M. A cross-sectional study to find out the prevalence of different types of Domestic Violence in Gwalior City and to identify the various risk and protective factors for Domestic Violence. Indian J Community Medicine: Official Publication Indian Association Prev Social Med. 2014;39(1):21–5. https://doi.org/10.4103/0970-0218.126348.

    Article  Google Scholar 

  60. Tumwesigye NM, Kyomuhendo GB, Greenfield TK, Wanyenze RK. Problem drinking and physical intimate partner Violence against women: evidence from a national survey in Uganda. BMC Public Health. 2012;12(1):399. https://doi.org/10.1186/1471-2458-12-399.

    Article  PubMed  PubMed Central  Google Scholar 

  61. Rahman M, Hoque MA, Makinoda S. Intimate Partner Violence Against women: is women empowerment a reducing factor? A study from a National Bangladeshi Sample. J Family Violence. 2011;26(5):411–20. https://doi.org/10.1007/s10896-011-9375-3.

    Article  Google Scholar 

Download references

Acknowledgements

We would like to express our gratitude to the Demographic and Health Survey (DHS) program for granting us access to the data used in this analysis. The DHS program provides valuable data that enables researchers to investigate important public health issues and make evidence-based decisions. We acknowledge the contributions of the DHS program and the participants who generously provided their time and information for this study.

Funding

The authors did not receive any funding.

Author information

Authors and Affiliations

Authors

Contributions

Conceptualization: MNAK; Methodology: MNAK and RR; Software: RR; Validation: MNAK,RR and SSS; Formal analysis: RR and SSS; Investigation: MNAK; Data curation: SSS and RR; Writing - original draft preparation: MNAK, RR, SSS, ZIK and MAR ; Writing - review and editing: MNAK and RR; Supervision: MNAK. All authors approved the final manuscript.

Corresponding author

Correspondence to Md. Nafiul Alam Khan.

Ethics declarations

Ethics approval and consent to participate

Not Applicable.

Consent for publication

Not Applicable.

Competing interests

The authors declare no competing interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rahman, R., Khan, M.N.A., Sara, S.S. et al. A comparative study of machine learning algorithms for predicting domestic violence vulnerability in Liberian women. BMC Women's Health 23, 542 (2023). https://doi.org/10.1186/s12905-023-02701-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12905-023-02701-9

Keywords