Introduction
Methods
Study Design
Search Strategy
Key concept | Search terms used |
---|---|
Population: autism spectrum disorder | "autism spectrum" or "autism" or "autistic spectrum disorder" or "autistic disorder" or "child development" or "development disorder" |
Population: newborn | "newborn" or "infant" or "toddler" or "neonate" |
Intervention: early screening | "early screening" or "early detection" or "early diagnosis" or "screening" or "assess" or "test" |
Outcome: cry | "cry" or "cries" or "crying" or "atypical cry" |
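The search terms above can be assembled into a single Boolean query. The sketch below is illustrative only: it assumes that synonyms within a concept are joined with OR (as the table shows) and that the four concepts are combined with AND, which the source does not state explicitly. The function name `build_query` is hypothetical.

```python
# Concept blocks copied from the search-strategy table.
CONCEPTS = {
    "Population: autism spectrum disorder": [
        "autism spectrum", "autism", "autistic spectrum disorder",
        "autistic disorder", "child development", "development disorder",
    ],
    "Population: newborn": ["newborn", "infant", "toddler", "neonate"],
    "Intervention: early screening": [
        "early screening", "early detection", "early diagnosis",
        "screening", "assess", "test",
    ],
    "Outcome: cry": ["cry", "cries", "crying", "atypical cry"],
}


def build_query(concepts):
    """Join synonyms with OR inside each concept; join concepts with AND (assumed)."""
    blocks = []
    for terms in concepts.values():
        quoted = " OR ".join(f'"{term}"' for term in terms)
        blocks.append(f"({quoted})")
    return " AND ".join(blocks)


query = build_query(CONCEPTS)
print(query)
```

Database-specific syntax (field tags, truncation, controlled vocabulary) would still need to be added for each search engine.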
Inclusion and Exclusion Criteria
- Reported quantitative measures of F0 (e.g., mean and standard deviation) for both AutI/EL and TD/DL groups.
- Included infants aged between 0 and 18 months (to capture a critical developmental period before a reliable diagnosis is made and to ensure sufficient data for robust statistical analysis while maintaining clinical relevance).
- Provided data sufficient to calculate standardized mean differences (SMD) and standard errors.
- Studies that did not report F0 metrics or failed to provide data enabling the calculation of effect sizes were excluded.
- Gray literature (research and materials not formally published in traditional peer-reviewed journals, including reports, theses, government documents, white papers, and other non-commercial publications) and unpublished works were not eligible for inclusion. Strictly qualitative research, book chapters, and review articles were also excluded.
Screening Process
Quality Assessment
Data Extraction
Research article | Groups, gender & recording age | Age and test for diagnosis of ASD | Audio features | Type of data analysis | Statistical analysis findings | Machine learning findings | Number of cry episodes (CEs) & cry types | Limitations | Quality assessment |
---|---|---|---|---|---|---|---|---|---|
(Esposito & Venuti, 2009) | Groups: AutI (n = 10) TD (n = 10) Gender: AutI (5 m/5 f) TD (5 m/5 f) Recording Age: 12 months | Age: 38.4 months Diagnosis: - DSM-IV - ADOS-2 - Griffiths Mental Development Scale | - Duration - Waveform modulation - Dysphonation | Statistical analysis Test: - ANOVA - Tukey HSD post hoc tests | In AutI: - Longer hyperphonation cry periods (M = 0.14, SE = 0.04, p ≤ 0.05) - Shorter aspiration/expiration (less waveform modulation and more dysphonation) (M = –0.24, SE = 0.05, p ≤ 0.05) - Shorter pauses, more arousal (M = –0.08, SE = 0.05, p ≤ 0.05) - No differences in moans | N/A | CEs: 32 Cry types: Feeding, changing diaper and startled | - Sample size - Heterogeneity in AutI traits - Low quality of audio recording in amateur settings - Home videos | Fair |
(Esposito & Venuti, 2010a) | Groups: AutI (n = 10) TD (n = 10) Gender: AutI (5 m/5 f) TD (5 m/5 f) Recording Age: 5 and 18 months | Age: 35.2 months Diagnosis: - DSM-IV - ADOS-2 - Griffiths Mental Development Scale | - F0 | Statistical analysis Test: - GLM - Tukey HSD post hoc tests | In AutI: - No change of F0 trajectory - Higher F0 at 5 and 18 months (M = 40.35, SE = 15.82, p ≤ 0.05) | N/A | CEs: 160 Cry types: Non-elicited cries during infant daily routine | - Sample size - Heterogeneity in AutI traits - Low quality of audio recording in amateur settings - Home videos | Good |
(Esposito & Venuti, 2010b) | Groups: AutI (n = 14) TD (n = 14) Gender: AutI (7 m/7 f) TD (7 m/7 f) Recording Age: 18 months | Age: 36.7 months Diagnosis: - DSM-IV - ADOS-2 - Griffiths Mental Development Scale | - F0 | Statistical analysis Test: - GLM - ANOVA - Tukey HSD post hoc tests | In AutI: - Higher F0 (M = 65.44, SE = 19.85, p ≤ 0.05) - No differences in cry episode length - No significant differences among feeding, diaper-changing, and pain cries | N/A | CEs: 140 Cry types: Feeding, changing diaper and pain | - Sample size - Heterogeneity in AutI traits - Low quality of audio recording in amateur settings - Home videos | Fair |
(Khozaei et al., 2020) | Groups: AutI (n = 31) TD (n = 31) Gender: AutI (24 m/7 f) TD (24 m/7 f) Recording Age: 18–53 months | Age: - 35.6 months (AutI) - 30.8 months (TD) Diagnosis: - DSM-5 - GARS-2 questionnaire | - F0 - MFCC - Other voice quality features | Machine Learning Analysis | N/A | RBF-SVM: sensitivity, specificity, and precision were 85.71%, 100%, and 92.85% for boys and 71.42%, 100%, and 85.71% for girls, respectively | CEs: 367 Cry types: uncomfortable, unwilling, sleepy, scared, thirsty, complaining | - Sample size for training (10 AutI / 10 TD) - Unbalanced dataset - Tested by gender | Fair |
(Moffitt et al., 2022) | Groups: AutI (n = 61) Gender: Not reported Recording Age: 24–66 months | Age > 24 months Diagnosis: - DSM-5 - ADOS-2 - CSS -RRB -SA | - F0 - proportion of time crying | Statistical analysis Test: Correlation (Regression) | In AutI: - F0 predicted by ADOS-2, RRB and CSS (β = 0.40, t (63) = 3.83, p < 0.001) | N/A | CEs: Not reported Cry types: Not reported | - No TD group | Fair |
Research article | Groups & age (months) | Age & test for diagnosis of ASD | Audio features | Type of data analysis | Statistical analysis findings | Machine learning findings | Number of cry episodes (CEs) & cry type | Limitations | Quality assessment |
---|---|---|---|---|---|---|---|---|---|
(Orlandi et al., 2012) | Groups: EL (n = 7) DL (n = 17) Gender: Not reported Recording Age: 10 days, 6 weeks and 12 weeks | Age: Not reported Diagnosis: Not reported | - Number of voiced and unvoiced segments - Cry Duration - F0 - F1 - F2 - Melody | Statistical Descriptive Analysis (mean, std) | In EL: - Lower F0 at 10 days (EL: f0mean = 401.0, std = 108.2; TD: f0mean = 428.2, std = 145.9), 6 weeks (EL: f0mean = 417.7, std = 89.9; TD: f0mean = 447.9, std = 166.8), and 12 weeks (EL: f0mean = 398.0, std = 93.3; TD: f0mean = 477.2, std = 188.7) - Higher F1 and F2 - Fewer and shorter cries - Similar melody patterns in hunger and boring cries In DL: - Large differences in the melody between hunger and boring cries | N/A | CEs: - 74 (10 days) - 75 (6 weeks) - 80 (12 weeks) Cry types: hunger and boring cry | - Sample size - Not reporting diagnosis information | Fair |
(Sheinkopf et al., 2012) | Groups: EL (n = 21) DL (n = 18) Gender: EL (6 m/15f) DL (8 m/10f) Recording Age: 6 months | Age: 36 months Diagnosis: - DSM-IV - ADOS - MSEL - MacArthur | - Phonation - Cry Utterance duration - Average amplitude (loudness) - Amplitude variation (range) - F0 - F0 variation - F1 and F2 - Hyperphonation | Statistical Analysis t-test | In EL: - Higher F0 (t (10) = 2.82, P = 0.018) and F0 variation trend in pain related cries (t (10) = 2.14, P = 0.058) - Smaller amplitude trend (t (10) = 2.02, P = 0.070) - No differences in phonation | N/A | CEs: Not reported Cry types: pain and non-pain cries | -Small sample size - Difficulties distinguishing the causes of crying - Problems in F1 and F2 estimation - Manual analysis for CEs extraction | Fair |
(Esposito et al., 2014) | Groups: EL (n = 13) DL (n = 14) Gender: EL (9 m/4f) DL (7 m/7f) Recording Age: 15 months | Age: 36 months Diagnosis: - DSM-IV-TR - ADOS | Time domain: - Cry duration Frequency domain: - F0 mean - F0 max - F0 variation (range) | Statistical Analysis ANCOVA | In EL: - Higher F0 (F = 16.82, p < 0.01) and F0 max (F = 6.12, p < 0.05) - Shorter duration (F = 7.62, p < 0.05) - No significant differences in F0 variation | N/A | CEs: 159 Cry types: Cries elicited using the SSP, a standard infant attachment assessment | - Small sample size | Good |
(Unwin et al., 2017) | Groups: EL (n = 22) DL (n = 27) Gender: EL (18 m/4f) DL (12 m/15f) Recording Age: 12 months | Age: 12 months Diagnosis: - MSEL - ADEC Age: 24 months Diagnosis: - MSEL - ADOS-G | - F0 - Cry Duration - Amplitude - Formants (F1, F2) | Statistical Analysis (ANOVA, Pearson) | In EL: - Lower F0 trend (F = 0.01, p = 0.92) - A trend in shorter cry duration (F = 4.43, p = 0.04) correlating with more severe ADOS-G (p = 0.08) score and poorer performance on MSEL receptive language (p = 0.07) | N/A | CEs: 146 Cry types: Parent-reported cause of infant distress (e.g. hunger, fatigue, frustration) | - Sample size - Multiple comparison testing -Limited reporting on infant positioning | Good |
(Santos et al., 2013) | Groups: EL (n = 23) DL (n = 20) Gender: EL (15 m/8f) DL (13 m/7f) Recording Age: 18 months | Age: 36 months Diagnosis: - ADOS-2 - ADI-R - Medical History | - F0 - Formants - Harmonics - Energy - CPP - HNR - Jitter - Shimmer - Voiced ratio | Machine Learning Analysis | N/A | PNN: 97.7% SVM: 79.1% | CEs: 2187 Cry types: Not reported | - Sample size - No follow up information | Fair |
(Manigault et al., 2023) | Groups: EL (n = 363) Gender: EL (202 m/161f) Recording Age: < 1 month | Age: 24 months Diagnosis: - Bayley III - M-CHAT - CBCL | - Energy - F0 - Formants - Cry utterances - Signal quality | Machine Learning and Correlation Analysis | Estimates of models trained using acoustic cry characteristics were associated with clinical and developmental assessments at 2 years | Random Forest with 56 acoustic characteristics per utterance. Energy, F0, formants, and signal quality were the variables most used by the models | CEs: 14,701 Cry types: Elicited cries during routine caregiving in the NICU, such as diaper change | - Improve cry collection - Lacking specific intervention strategies - No accuracy reported | Good |
- The mean and standard deviation of F0 for AutI/EL and TD/DL groups.
- Sample sizes for both AutI/EL and TD/DL groups.
- Study-level characteristics, such as the infant age range (0–18 months), methods used to elicit crying, and the conditions under which recordings were made.
Research article | AutI/EL n | AutI/EL F0 (mean ± std) | TD/DL n | TD/DL F0 (mean ± std) |
---|---|---|---|---|
(Esposito & Venuti, 2010b) | 14 | 530.56 ± 91.30 | 14 | 465.12 ± 48.63 |
(Esposito & Venuti, 2010a) | 10 | 535.33 ± 86.16 | 10 | 456.33 ± 59.74 |
(Orlandi et al., 2012) | 7 | 398 ± 93.30 | 17 | 477.2 ± 188.70 |
(Sheinkopf et al., 2012) | 21 | 504.87 ± 57.42 | 18 | 420.28 ± 40.32 |
(Esposito et al., 2014) | 13 | 370.54 ± 30.48 | 14 | 329.96 ± 26.84 |
(Unwin et al., 2017) | 22 | 420.91 ± 65.48 | 27 | 444.11 ± 88.65 |
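The group means, standard deviations, and sample sizes in the table above are enough to compute a standardized mean difference and its standard error for each study. The sketch below uses Hedges' g (Cohen's d with a small-sample correction); the source only says "SMD", so the choice of estimator and the function name `hedges_g` are assumptions, and the results may differ slightly from the values the authors used.

```python
import math

# (label, n AutI/EL, mean, sd, n TD/DL, mean, sd), copied from the table.
STUDIES = [
    ("Esposito & Venuti, 2010b", 14, 530.56, 91.30, 14, 465.12, 48.63),
    ("Esposito & Venuti, 2010a", 10, 535.33, 86.16, 10, 456.33, 59.74),
    ("Orlandi et al., 2012", 7, 398.00, 93.30, 17, 477.20, 188.70),
    ("Sheinkopf et al., 2012", 21, 504.87, 57.42, 18, 420.28, 40.32),
    ("Esposito et al., 2014", 13, 370.54, 30.48, 14, 329.96, 26.84),
    ("Unwin et al., 2017", 22, 420.91, 65.48, 27, 444.11, 88.65),
]


def hedges_g(n1, m1, sd1, n2, m2, sd2):
    """Return (g, SE) for group 1 minus group 2 (assumed estimator: Hedges' g)."""
    df = n1 + n2 - 2
    pooled_sd = math.sqrt(((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / df)
    d = (m1 - m2) / pooled_sd                      # Cohen's d
    j = 1 - 3 / (4 * df - 1)                       # small-sample correction
    var_d = (n1 + n2) / (n1 * n2) + d**2 / (2 * (n1 + n2))
    return j * d, j * math.sqrt(var_d)


results = {label: hedges_g(*row) for label, *row in STUDIES}
for label, (g, se) in results.items():
    print(f"{label}: g = {g:.3f}, SE = {se:.3f}")
```

A positive g means higher F0 in the AutI/EL group; under this sketch, four studies come out positive and two (Orlandi et al., 2012; Unwin et al., 2017) negative.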
Statistical Analysis
Random-effects model (k = 6)
Term | SMD | SE | Z | p | CI lower bound | CI upper bound |
---|---|---|---|---|---|---|
Intercept | 0.685 | 0.358 | 1.91 | 0.114 | –0.235 | 1.604 |
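The reported confidence bounds can be checked against the pooled SMD and its standard error. The sketch below assumes a 95% interval based on Student's t with k − 1 = 5 degrees of freedom; this is an inference from the reported numbers, not something the source states, and the critical value is hard-coded to avoid a dependency.

```python
# Values copied from the random-effects table.
smd, se, k = 0.685, 0.358, 6

# Two-sided 95% critical values: Student's t with k - 1 = 5 df, and standard normal.
T_CRIT_DF5 = 2.5706
Z_CRIT = 1.96

lower = smd - T_CRIT_DF5 * se
upper = smd + T_CRIT_DF5 * se
print(f"t-based CI (df = {k - 1}): [{lower:.3f}, {upper:.3f}]")

# For contrast, the narrower normal-based interval.
print(f"normal-based CI: [{smd - Z_CRIT * se:.3f}, {smd + Z_CRIT * se:.3f}]")
```

The t-based bounds agree with the reported interval to within rounding of the inputs, whereas the normal-based interval would be noticeably narrower. In both cases the interval includes zero.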
Heterogeneity statistics
Tau | Tau² | I² | H² | R² | df | Q | p |
---|---|---|---|---|---|---|---|
0.787 | 0.6196 (SE = 0.4943) | 80.2% | 5.05 | | 5.000 | 27.332 | < .001 |
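Several of the heterogeneity statistics are functions of one another, which gives a quick internal consistency check. The sketch below uses two standard identities, Tau = √Tau² and H² = 1 / (1 − I²), with inputs copied from the table; it does not re-estimate anything from the study data.

```python
import math

# Values copied from the heterogeneity table.
tau2 = 0.6196   # between-study variance
i2 = 0.802      # I^2 expressed as a proportion

tau = math.sqrt(tau2)     # between-study standard deviation
h2 = 1 / (1 - i2)         # total variability relative to sampling variability

print(f"Tau = {tau:.3f}")
print(f"H^2 = {h2:.2f}")
```

Both derived values match the reported Tau and H² to the precision shown in the table.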
Model fit statistics and information criteria
Method | log-likelihood | Deviance | AIC | BIC | AICc |
---|---|---|---|---|---|
Maximum-likelihood | –7.238 | 14.534 | 18.477 | 18.060 | 22.477 |
Restricted maximum-likelihood | –6.442 | 12.884 | 16.884 | 16.103 | 22.884 |
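The information criteria follow from the log-likelihoods under the usual definitions. The sketch below assumes two estimated parameters (the pooled SMD and Tau²) and an effective number of observations of k = 6 for maximum likelihood and k − 1 = 5 for restricted maximum likelihood; these settings are inferred from the reported numbers and are not stated in the source. The function name `information_criteria` is hypothetical.

```python
import math


def information_criteria(loglik, n_params, n_obs):
    """Return (AIC, BIC, AICc) from a log-likelihood using the standard formulas."""
    aic = -2 * loglik + 2 * n_params
    bic = -2 * loglik + n_params * math.log(n_obs)
    aicc = aic + 2 * n_params * (n_params + 1) / (n_obs - n_params - 1)
    return aic, bic, aicc


# Maximum likelihood: 2 parameters, 6 studies (assumed).
ml = information_criteria(-7.238, n_params=2, n_obs=6)
# Restricted maximum likelihood: 2 parameters, 5 effective observations (assumed).
reml = information_criteria(-6.442, n_params=2, n_obs=5)

print("ML   AIC, BIC, AICc:", [round(v, 3) for v in ml])
print("REML AIC, BIC, AICc:", [round(v, 3) for v in reml])
```

Under these assumptions the computed criteria agree with the table to within rounding of the log-likelihoods.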
Publication bias assessment
Test name | Value | p |
---|---|---|
Fail-safe N | 31.000 | < .001 |
Begg and Mazumdar rank correlation | 0.067 | 1.000 |
Egger's regression | 0.509 | 0.638 |
Trim and fill number of studies | 0.000 | |