Data representations and -analyses of binary diary data in pursuit of stratifying children based on common childhood illnesses
Research output: Contribution to journal › Journal article › Research › peer-review
Standard
Data representations and -analyses of binary diary data in pursuit of stratifying children based on common childhood illnesses. / de Rooi, Johan; Nørgaard, Sarah K.; Rasmussen, Morten A.; Bønnelykke, Klaus; Bisgaard, Hans; Smilde, Age K.
In: PLOS ONE, Vol. 13, No. 11, e0207177, 29.11.2018, p. 1-15.Research output: Contribution to journal › Journal article › Research › peer-review
Harvard
APA
Vancouver
Author
Bibtex
}
RIS
TY - JOUR
T1 - Data representations and -analyses of binary diary data in pursuit of stratifying children based on common childhood illnesses
AU - de Rooi, Johan
AU - Nørgaard, Sarah K.
AU - Rasmussen, Morten A.
AU - Bønnelykke, Klaus
AU - Bisgaard, Hans
AU - Smilde, Age K.
PY - 2018/11/29
Y1 - 2018/11/29
N2 - In this article we analyse diary reports concerning childhood symptoms of illness, these data are part of a larger study with other types of measurements on childhood asthma. The children are followed for three years and the diaries are updated, by the parents, on a daily basis. Here we focus on the methodological implications of analysing such data. We investigate two ways of representing the data and explore which tools are applicable given both representations. The first representation relies on proper alignment and point by point comparison of the signals. The second approach takes into account combinations of symptoms on a day by day basis and boils down to the analysis of counts. In the present case both methods are well applicable. However, more generally, when symptom episodes are occurring more at random locations in time, a point by point comparison becomes less applicable and shape based approaches will fail to come up with satisfactory results. In such cases, pattern based methods will be of much greater use. The pattern based representation focuses on reoccurring patterns and ignores ordering in time. With this representation we stratify the data on the level of years, so that possibly yearly differences can still be detected.
AB - In this article we analyse diary reports concerning childhood symptoms of illness, these data are part of a larger study with other types of measurements on childhood asthma. The children are followed for three years and the diaries are updated, by the parents, on a daily basis. Here we focus on the methodological implications of analysing such data. We investigate two ways of representing the data and explore which tools are applicable given both representations. The first representation relies on proper alignment and point by point comparison of the signals. The second approach takes into account combinations of symptoms on a day by day basis and boils down to the analysis of counts. In the present case both methods are well applicable. However, more generally, when symptom episodes are occurring more at random locations in time, a point by point comparison becomes less applicable and shape based approaches will fail to come up with satisfactory results. In such cases, pattern based methods will be of much greater use. The pattern based representation focuses on reoccurring patterns and ignores ordering in time. With this representation we stratify the data on the level of years, so that possibly yearly differences can still be detected.
U2 - 10.1371/journal.pone.0207177
DO - 10.1371/journal.pone.0207177
M3 - Journal article
VL - 13
SP - 1
EP - 15
JO - PLoS ONE
JF - PLoS ONE
SN - 1932-6203
IS - 11
M1 - e0207177
ER -
ID: 209946849