$ f(s) = E[T|X_s=1] - E[T|X_s=0] $ (*).
This is the difference between the expected time between admission and discharge/death, $T$, given infected at time $s$ and given not infected at time $s$.
This means that those individuals who are not infected could be so in the future. This is a snap shot of the case-control split in the sample at time $s$, so it doesn't tell us about what will happen after this. Those in state 0 at time $s$ may become infected for some time in the future or they may not pass go and jump straight to the sink state.
Intuitively, if we think about this in a latent time/counterfactual way then when patients are still in state 0, even if they do jump to state 1 (infected) later, they still would have jumped to the sink at a time after that.
By setting up like this we include the holding time in state 0 prior to either an infection or death/discharge as a non-infection length of stay time, and so not biasing the infection LOS times by not accounting for the two-way causality.
A weighting game
Eqn (*) is averaged over all $s$ to give an expected excess length of stay. The question is how to choose the weightings. It is suggested to weight the days when there are more infections more heavily or when there are more jumps out of state 0, regardless of whether to infection of death/discharge. By weighting in this way more emphasis is placed on the excess LOS on days when more happens, That is to say that when there are larger changes in the state populations and risks set then the difference in LOS is more influential on the estimate. Intuitively, this makes sense since otherwise we will count days when there is little or not change in the system.For example, the times when transitions from 0 to 1 occur are the times before which the jumping individuals and those that remain in state 0 have been in state 0 together. That is, they have the same history (filtration) up to that time, say $s$ and then diverge at that time. So a comparison on the LOS between these two groups is a comparison accounting for the uninfected time too i.e. time-dependent. Conversely, the times at which transitions from 0 to 2 occur are those individuals that do not have an associated other group who transition from 0 to 1. So at this time $s$ we're are cleaning-up the sample to remove individuals that aren't helpful in the comparison between the infected and non-infected groups.
This rational is done probabilistically over the continuous variable $s$, rather than at discrete time points used above. This approximation could be useful for checking though.
The excess LOS is a weighted mean estimate of the separate LOS for each $s$. If we think about this as a sample size problem, we would place more weight on the larger samples and less on the smaller sample sizes. In essence, this is really placing emphasis on the points that contain more information. In the LOS context this would correspond to placing more weight on the times at which there has just been a transition to state 1, the infected state. Obviously, the infected individuals are most likely to be in this state at the beginning of their holding time.
Beyersmann also includes the times at which there has been a transition from 0 to 2, the death/discharge state. This is a removal of individuals that no longer contribute to the case-control comparison. To me this is a less obvious thing to use in the weights.
If we think about it for a countable set of uniformly spaced $s$, then the proportion of the interval an interval $[0,T]$ comprising of admission time and the proportion comprising on infection time will determine the influence of $T$ on the non-infected and infected LOS respectively.
Now, it we position the times $s$ non-uniformly, so that they are closer to the times when there are more transitions and further from the times where there are fewer transitions then we will pick-up more of the detail and fidelity of the process.
As the days progress the espected LOS for both infected and not infected will obviously increase. But the probability of having left state 0 will increase as the population continues to diminishes and be absorbed in state 2, sincehe survival function is monotonically decreasing.
No comments:
Post a Comment