1. The Core Framework: Nonlinear Dynamic Latent Class SEM (NDLC-SEM)
The forecasting of critical states in the SAM study required innovative techniques to capture the complex, multi-level nature of student dropout.The methodological approach relies on the Nonlinear Dynamic Latent Class Structural Equation Model (NDLC-SEM) — a flexible Bayesian framework integrated with a modified Forward Filtering Backward Sampling (FFBS) algorithm for real-time prediction.
1.1 Key Capabilities
The NDLC-SEM framework combines the capabilities of several dynamic models to simultaneously address four essential data structures for modeling the longitudinal dropout process using Intensive Longitudinal Data (ILD). It models:- Inter-Individual Differences (Traits): Stable characteristics (e.g., cognitive abilities).
- Intra-Individual Changes (States): Changeable psychological states (e.g., affective and motivational states).
- Unobserved Heterogeneity of Trajectories: Captured via time-varying latent classes following a hidden Markov process.
- In this study, these classes represent the discrete latent variable :
- : Intention to stay
- : Intention to quit
- In this study, these classes represent the discrete latent variable :
- Time-Dependent Nonlinearities: Latent within-person variables predict transition probabilities with nonlinear effects.
2. Model Specification
The model specification outlines how observed variables relate to latent constructs (measurement models) and how these latent constructs evolve and interact across levels (structural models).2.1 Measurement Models
Within-Level (States – ): Seventeen observed variables () operationalize seven continuous latent within-factors (), representing affective/cognitive states such as stress, fear of failure, and affect balance. All observed variables were centered and oriented so that higher values indicate stronger intention to quit. The within-level measurement model is consistent across latent states: Where:- = (17 × 1) vector of observed variables
- = (7 × 1) vector of latent state factors
- = vector of residuals
Between-Level (Traits – ): A single latent construct — cognitive ability (IQ) — was modeled using three CFT-3 test items measured at baseline. Where:
- = (3 × 1) vector of observed indicators
- = latent IQ factor
- = uncorrelated residuals
2.2 Structural Dynamics (Within-Level)
The within-level dynamics were modeled using a first-order autoregressive process (AR(1)), specific to the discrete latent class : Where:- = latent states at previous time
- = class-specific intercept vector
- = diagonal matrix of AR(1) effects
- = innovation term
2.3 Structural Models (Between- and Cross-Level Interactions)
The stable latent trait (IQ, ) influences both the intercepts and autoregressive dynamics of the latent state processes. Intercept Function: AR Coefficients with Cross-Level Moderation: Here, allows cognitive ability (IQ) to moderate motivational and self-regulatory state dynamics over time.2.4 Markov Switching Model (Transition Probabilities)
The discrete latent state evolves according to a hidden Markov process, governed by transition probabilities derived via a logit link function. Probability of Staying in (No Intention to Quit): Logit Function Definition: Return Probability (P₁₂):The transition from intention to quit () to intention to stay () is assumed rare and slow: This aligns with the Rubicon model, positing that individuals rarely revert once a quitting intention is formed.
3. Identification of Latent Discrete States
Identifying the latent states () as “intention to drop out” involved a confirmatory modeling strategy:- Imposed Constraints:
Persons in state constrained to show higher scores on all seven negative affect scales. - Predictive Link:
Transition probabilities were regressed on affective scales and their interactions with IQ. - Temporal Restrictions:
Transitions back to restricted to low probability. - Partial Observation:
Dropouts observed during the semester were coded directly as (manifest dropout).
4. Forecasting Implementation: Forward Filtering Backward Sampling (FFBS)
Forecasting dynamic latent states is achieved through the Forward Filtering Backward Sampling (FFBS) algorithm — a Bayesian sequential estimation method adapted for hidden Markov models (see West & Harrison, 1997).4.1 Key Features of the FFBS Adaptation
- Integrates seamlessly with the Gibbs sampler (forecasting within estimation loop).
- Handles latent time-dependent predictors () driving state transitions.
- Produces real-time posterior forecasts for the latent dropout intention states.
4.2 Core Algorithmic Steps
Step 1 — Reformulation (Aspect i.)
Reformulate NDLC-SEM into a Dynamic Linear Model (DLM) framework:- Observation Equation:
- System Equation:
Step 2 — Define Strata (Aspect ii.)
Define four strata for every consecutive time pair , covering all possible transitions between “stay” and “quit” states.Step 3 — Continuous State Prediction (Aspect iii.)
For each stratum, sample latent factor scores, producing four forecast draws:Step 4 — Marginal Predictive Density (Aspect iv.)
Compute the mixture of predictive densities across the four strata: This produces the overall forecast distribution of the continuous latent variable.4.3 Posterior Updating and Smoothing
After observing , priors are updated and the joint posterior over model combinations is computed: The smoothed posterior for each latent state is then:4.4 H-Steps-Ahead Forecasting
For future predictions ():- Use last observed posteriors ().
- Recursively define expected values and covariance matrices .
- Generate the -step-ahead forecast distribution of latent factors .
Summary:
The integrated NDLC-SEM + FFBS framework enables dynamic, real-time prediction of latent dropout intentions, bridging stable cognitive traits and fluctuating emotional-motivational states.
It provides a powerful approach to modeling nonlinear psychological dynamics in longitudinal educational data.