Why does TVEM matter?
As data collection technology such as smart phones and pedometers create richer and denser datasets, TVEM will allow researchers to answer new questions and to answer existing questions with greater nuance than was possible just a few years ago. Problem behaviors (e.g., substance use) and their associations with predictors (e.g., craving, mood) change over time, and TVEM helps us understand these changes. This applies to smoking, obesity, substance use, HIV disease course, and any other area of behavioral science that collects ecological momentary assessment (EMA) data or other forms of ILD.
What are ILD and how many data points do I need?
There is no hard, fast rule for exactly what constitutes ILD. Generally, ILD are defined as data with more than 20 or 40 measurements over time. But in truth, it is not the number of observations that matter, it is the relationships you are trying to model and whether you have enough data points to measure the change. As a simple example, consider the graph below. Imagine this represents someone’s craving for food on a scale of 1 to 5 over the course of two days.
The more dynamic and complex the relationship we are trying to model, the more valuable it is to have intensive longitudinal data. So the number of observations matters only in that you have enough measurement occasions to accurately determine the shape of the curve. If the shape of the curve above is accurate, then adding 50 more data points will not give you more information, though it will increase your certainty of the curve’s shape. NOTE: data spacing is also important. Large gaps in the measurements can decrease a TVEM’s usefulness.
If I have enough data points, how many subjects do I need?
If the macro does not provide a p-value, how do I publish my results?
In many journals in the behavioral sciences, researchers report a p-value to express whether an association is significant. However, a time-varying effect is an irregularly shaped curve, made up of an infinite number of points. This means that an overall p-value would not meaningfully demonstrate the association that was modeled. The entire curve is needed to show at what times the association is or is not statistically significant and to show how the association changes.
For a time-invariant parameter, significance can be expressed through either a p-value or a confidence interval (as seen in the figure on the right, where the red dot is the estimate, and the blue dots are the confidence intervals). In order to show variation over time, the p-value is not useful but a dynamic confidence interval is. So, while journals typically print p-values, a confidence interval provides exactly the same information. TVEM results present an estimated coefficient and 95% confidence intervals for the entire curve which can be used to determine statistical significance at any point on the curve. That is why the TVEM macro expresses results graphically.