Bayesian approach to structural equation models for ordered categorical and dichotomous data

Structural equation modeling (SEM) is a statistical methodology that is commonly used to study the relationships between manifest variables and latent variables. In analysing ordered categorical and dichotomous data, the basic assumption in SEM that the variables come from a continuous normal distri...

Bibliographic Details
Main Author: Thanoon, Y. Thanoon
Format: Thesis
Language: English
Published: 2017
Subjects: QA Mathematics
Online Access: http://eprints.utm.my/81645/1/ThanoonYThanoonPFS2017.pdf
author Thanoon, Y. Thanoon
collection ePrints
description Structural equation modeling (SEM) is a statistical methodology commonly used to study the relationships between manifest variables and latent variables. In analysing ordered categorical and dichotomous data, the basic SEM assumption that the variables come from a continuous normal distribution is clearly violated. A rigorous analysis that takes into account the discrete nature of the variables is therefore necessary. A better approach to such discrete data is to treat them as observations from a hidden continuous normal distribution with a threshold specification. Censored and truncated normal distributions, each in interval, right and left forms, the latter with known parameters, are used to handle ordered categorical and dichotomous data in Bayesian non-linear SEMs. The truncated normal distribution is used to handle non-normal data (ordered categorical and dichotomous) in the covariates of the structural model. Two types of thresholds (equally and unequally spaced) are used in this research. The Bayesian approach (Gibbs sampling) is applied to estimate the parameters. The latent variables are treated as missing data and imputed, via data augmentation, as part of the Markov chain Monte Carlo (MCMC) simulation that yields the full posterior distribution. Examples using simulated data, a case study and the bootstrapping method are presented to illustrate these methods. In addition to Bayesian estimates, this research provides standard error (SE) estimates, highest posterior density (HPD) intervals and a goodness-of-fit measure, the Deviance Information Criterion (DIC), to compare the proposed methods.
In terms of parameter estimation and goodness-of-fit statistics, the results with a censored normal distribution are better than those with a truncated normal distribution, for both equally and unequally spaced thresholds. Furthermore, the results with unequally spaced thresholds are lower than those with equally spaced thresholds for the interval censored and truncated normal distributions, and likewise for the left censored and truncated normal distributions. The results with equally spaced thresholds are lower than those with unequally spaced thresholds for the right censored and truncated normal distributions. In other cases, the results of the bootstrapping method are better than the real-data results in terms of SE and DIC. The convergence results show that dichotomous data need more iterations to converge than ordered categorical data.
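The threshold specification and the data augmentation step described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the thesis: the function names `categorize` and `sample_latent`, the threshold values, and the unit variance are all assumptions chosen for illustration. The sketch maps a hidden continuous value to an ordered category, and then draws the hidden value back from a normal distribution truncated to the interval that the observed category implies, which is the kind of imputation a Gibbs sampler would repeat at every iteration.

```python
import math
import random
from statistics import NormalDist


def categorize(y_star, thresholds):
    """Map a latent continuous value to an ordered category 0..K-1.

    `thresholds` holds the K-1 interior cut points in increasing order
    (illustrative values, not those used in the thesis).
    """
    return sum(y_star > t for t in thresholds)


def sample_latent(category, mean, thresholds, rng, sd=1.0):
    """Draw y* from N(mean, sd^2) truncated to the interval of `category`.

    Uses inverse-CDF sampling; the small clamp guards against passing
    exactly 0 or 1 to inv_cdf and makes this a sketch, not a tail-accurate
    sampler.
    """
    cuts = [-math.inf, *thresholds, math.inf]
    lower, upper = cuts[category], cuts[category + 1]
    dist = NormalDist(mean, sd)
    u = rng.uniform(dist.cdf(lower), dist.cdf(upper))
    u = min(max(u, 1e-12), 1.0 - 1e-12)
    # Keep the draw inside the interval even after clamping u.
    return min(max(dist.inv_cdf(u), lower), upper)


# Hypothetical threshold sets: equally spaced, unequally spaced, dichotomous.
equal_thresholds = [-1.0, 0.0, 1.0]
unequal_thresholds = [-1.5, -0.2, 0.4]
dichotomous_thresholds = [0.0]

rng = random.Random(0)
observed = 2  # an observed ordered category
imputed = sample_latent(observed, mean=0.3, thresholds=equal_thresholds, rng=rng)
```

In a full sampler, `mean` would come from the current measurement-model parameters and latent variables, and the imputed values would feed the conditional updates of those parameters; both parts are omitted here.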
first_indexed 2024-03-05T20:26:37Z
format Thesis
id utm.eprints-81645
institution Universiti Teknologi Malaysia - ePrints
language English
last_indexed 2024-03-05T20:26:37Z
publishDate 2017
record_format dspace
spelling utm.eprints-81645 2019-09-10T01:53:04Z http://eprints.utm.my/81645/ Bayesian approach to structural equation models for ordered categorical and dichotomous data Thanoon, Y. Thanoon QA Mathematics 2017 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/81645/1/ThanoonYThanoonPFS2017.pdf Thanoon, Y. Thanoon (2017) Bayesian approach to structural equation models for ordered categorical and dichotomous data. PhD thesis, Universiti Teknologi Malaysia. http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:126128
title Bayesian approach to structural equation models for ordered categorical and dichotomous data
topic QA Mathematics
url http://eprints.utm.my/81645/1/ThanoonYThanoonPFS2017.pdf