Level 1 Criteria
-
C Class-level RCT
- Although randomization was at the student level, the intervention is tutoring delivered to individuals/small groups, which the ERCT standard treats as an allowed exception for Criterion C.
- "Small groups of up to four students are paired with the same highly-qualified tutor in 30-minute sessions four times each week." (p. 4)
- Relevant Quotes: 1) "Small groups of up to four students are paired with the same highly-qualified tutor in 30-minute sessions four times each week." (p. 4) 2) "Randomization occurred at the student level within schools and grade cohorts, ensuring that treatment and control students were directly comparable at baseline." (p. 4) 3) "Students were grouped by instructional need, with group sizes ranging from 1:1 to 1:4." (p. 8) Detailed Analysis: Criterion C requires randomization at the class (or school) level to minimize contamination, but it allows an exception for interventions that are inherently individualized (e.g., tutoring/personal teaching). This study explicitly randomizes "at the student level within schools and grade cohorts," which would normally fail a strict class-level randomization requirement. However, the intervention is clearly described as tutoring delivered in small groups (1:1 to 1:4) with a tutor, which is consistent with the ERCT exception logic: student-level assignment is acceptable when the educational delivery is tutoring rather than a whole-class pedagogy. Final sentence: Criterion C is met because this is a tutoring intervention where student-level randomization is an allowed ERCT exception.
-
E Exam-based Assessment
- The outcomes are measured using widely recognized standardized assessments (TPRI, STAAR, DIBELS, LEAP), satisfying the exam-based assessment requirement.
- "In Texas, the Texas Primary Reading Inventory (TPRI) was administered in grades 1–2, and the Texas STAAR assessment in grade 3." (p. 9)
- Relevant Quotes: 1) "In Texas, the Texas Primary Reading Inventory (TPRI) was administered in grades 1–2, and the Texas STAAR assessment in grade 3." (p. 9) 2) "In Louisiana, student performance was measured using DIBELS (grade 2) and the state’s LEAP assessments (grade 4)." (p. 9) Detailed Analysis: Criterion E requires that academic outcomes be measured with standardized, widely recognized exam-based assessments rather than custom researcher-made tests tailored to the intervention. The report explicitly names established assessments used by Texas and Louisiana (TPRI, STAAR, DIBELS, LEAP). The paper does not describe these instruments as being created for this study; they are standard measures used in the relevant education systems. Final sentence: Criterion E is met because the study uses standardized assessments (TPRI, STAAR, DIBELS, LEAP) as outcome measures.
-
T Term Duration
- The evaluation reports outcomes over the full 2024-25 school year, which exceeds the minimum one-term follow-up requirement.
- "The trial was conducted across the 2024-25 school year in a large suburban district in Louisiana as well as a district in Texas, and delivered tutoring for a full school year." (p. 4)
- Relevant Quotes: 1) "The trial was conducted across the 2024-25 school year in a large suburban district in Louisiana as well as a district in Texas, and delivered tutoring for a full school year." (p. 4) 2) "This section presents the estimated impacts of Air Reading on student literacy outcomes over the 24-25 school year." (p. 12) Detailed Analysis: Criterion T requires outcome measurement at least one academic term after the intervention begins. The paper repeatedly situates both implementation and impact estimation across the "2024-25 school year" and reports results "over the 24-25 school year." A full school year necessarily exceeds one term (typically ~3-4 months), so the minimum ERCT tracking duration requirement for T is satisfied. Final sentence: Criterion T is met because outcomes are measured over a full school year, which is longer than one academic term.
-
D Documented Control Group
- The control condition is defined as business-as-usual literacy instruction and the report provides baseline characteristics and equivalence checks for treatment vs. control.
- "Students assigned to treatment were offered Air Reading tutoring during the school day, while control students received business-as-usual literacy instruction." (p. 4)
- Relevant Quotes: 1) "Students assigned to treatment were offered Air Reading tutoring during the school day, while control students received business-as-usual literacy instruction." (p. 4) 2) "The sample included students in grades 1–4 randomized to either the Air Reading program (n = 174) or the business-as-usual control condition (n = 203)." (p. 4) 3) "Table 1 provides descriptive statistics on student demographics and baseline achievement, pooled across the full sample and disaggregated by cohort and treatment status." (p. 8) 4) "Balance checks indicated no meaningful baseline differences between treatment and control groups overall or within cohorts." (p. 8) Detailed Analysis: Criterion D requires that the control group be clearly described, including what it received, and that baseline characteristics are documented so the reader can judge comparability. The report explicitly defines the control condition as "business-as-usual literacy instruction" and gives sample sizes for each arm. It also points to baseline descriptive statistics (Table 1) and explicitly states that baseline balance checks showed no meaningful differences. Final sentence: Criterion D is met because the control condition and baseline comparability are clearly documented.