Many educational disparities between boys and girls appear early and persist through their educational journeys. They are less ready than girls upon school entry, are more likely to be diagnosed with learning disabilities—including in reading and language—earn lower grades, and are more likely to drop out before completing high school. These gaps matter and are reflected in lower rates of college enrollment, declining labor-market outcomes, and fewer stable pathways into adulthood for young men.
Given these longstanding patterns, what actually works to improve academic outcomes, especially for boys?
The honest answer from decades of U.S. education research is sobering. A huge range of educational interventions have been tried, but the results are often underwhelming. Many interventions have no significant effect on the outcomes they try to influence while others show great promise in pilot studies but end up with much smaller impacts when scaled. Given widely varying effectiveness, federal legislation has placed increasing emphasis on interventions with rigorous empirical support (see the American Rescue Plan funding for schools following the pandemic as a recent example).
High-dose tutoring, and tutoring more broadly, is one of the few interventions that shows large, replicable effects. In a recent meta-analysis of randomized controlled trials (RCTs), tutoring shows the largest average effect size (figure 1).
Figure 1
Data noteTo distinguish between tutoring and some of other classes of interventions that might seem superficially similar, Deitrichson, et al. (2017) define some of these categories as follows:
High-dose tutoring—which tends to be defined as frequent (at least three days per week), one-on-one or with small groups of students, and from a teacher or other instructor that supplements regular classroom learning—is one of the few that shows strong, positive effects. And its theory of action is straightforward:
Finally, the positive effects of high-dose tutoring may be particularly salient and important for boys since the most rigorously tested tutoring interventions tend to focus on early-literacy, boys are more likely to struggle in this area, and reading proficiency in elementary school strongly predicts later education outcomes. In one of the largest known tutoring interventions targeting struggling first-grade readers, 60% of the sample were boys. Nationally, boys account for 57% of students identified with a specific learning disability (SLD), a category that includes reading-related disabilities such as dyslexia, and they are more likely than girls to score below the NAEP “basic” level in 4th-grade reading (44% versus 36%). Using state-level NAEP data combined with enrollment counts, we find that boys constitute a majority of below-basic readers in every state, ranging from 112 boys per 100 girls in Kansas (53% boys) to 149 boys per 100 girls in West Virginia (60% boys).
Figure 2
Compared to many education interventions, the evidence base for tutoring is quite strong. Hundreds of RCTs have been performed, including many that are recent or meet the strict quality standards described by the Institute of Education Sciences’ What Works Clearinghouse (WWC). Below we highlight some of these RCTs with a preference for those that have larger samples, break out impacts by gender, provide costs, or highlight important research gaps. We also summarize several recent meta-analyses that have aggregated RCT results and note factors—like subject area, student age, and tutor training—that moderate tutoring’s effectiveness.
On average, tutoring raises achievement by roughly 0.29 SD (about four months of extra learning for a struggling elementary-school student), though impacts vary based on several factors. They tend to be lower for older students than younger students (0.13 SD for 6-11 graders vs. 0.41 for pre-K and kindergarteners), mostly due to the declining impact of literacy interventions with age. Effects also shrink as programs scale beyond about 1,000 students, either because high-quality tutors become harder to find, implementation fidelity declines, or programs enroll students that need less help. Conversely, researchers find that effects are strongest in programs with certified teachers or trained paraprofessionals, low student-teacher ratios (less than 3:1), high dosage (greater than 60 total hours), and in-school delivery. Notably, even the most comprehensive meta-analyses (like figure 2) do not analyze effects by gender, likely because there are not enough individual studies to estimate an average effect.
Where results for boys and girls are broken out separately, the gains for both tend to be similar. However, boys do make up a larger share of struggling readers, and assuming similar benefits on average, tutoring can potentially narrow early gender gaps without tailoring the curriculum by sex. More research is needed, but what we do know is that when tutoring is frequent, personalized, and run by trained adults, it delivers some of the largest, most reliable learning returns in the education literature.
Figure 3
“Not Too Late” high‑school math tutoring RCTs, Chicago (5,300 9th–10th‑graders; 2013–17)
Paraprofessional tutors hired by Saga Education worked with pairs of students for a dedicated 50‑minute class period each school day, or about 150 hours per year. Across two cohorts, treatment gains in math were 0.18 SD in the first study and 0.40 SD in the replication, with course failures cut nearly in half. Effects of 0.23 SD remained up to two years later. The model delivered large, durable impacts at a cost of roughly $3,200–$4,800 per student, illustrating that trained staff can generate sizable learning gains in math even among adolescent students. Benefits were similar for boys and girls and extended across the initial score distribution.
High‑dosage tutoring and reading achievement RCT, New York City (1,700 sixth‑graders across 60 schools; 2013‑16)
Eligible students received at least 130 hours of 4‑to‑1 guided‑reading sessions (45‑60 minutes per day) as part of an existing literacy initiative. Average effects on state ELA scores were modest and imprecise (~0.05 SD) and differed based on race: Black students gained 0.09 SD per year whereas Hispanic peers benefited little, a gap the authors link to tutor‑match quality rather than dosage. Math scores were unchanged. On average, however, attendance rose by 1.2 percentage points. The study shows that after‑school, paraprofessional‑led tutoring can boost engagement and selectively raise achievement, but that implementation details and student–tutor fit matter. It also highlights some of the greater difficulties raising reading achievement for older students. The authors did not report differential effects by gender.
Reading Corps volunteer tutoring RCTs, Minnesota and Wisconsin (622 K-3 students; 2017–18)
AmeriCorps members—typically recent college graduates or other trained community volunteers—were given several days of literacy instruction and ongoing coaching from school- and program-based reading specialists. Each tutor worked one-on-one with struggling readers for twenty minutes a day, five days a week, using a structured phonics-based curriculum. The randomized trials (replicating a similar study in 2014) took place in 24 Minnesota and 10 Wisconsin schools. In Minnesota, kindergarten students who received Reading Corps tutoring outperformed their peers by roughly 0.74 SD on a letter-sound fluency test—a very large gain for a single semester. First-graders gained 0.62 SD on decoding skills and 0.52 SD on passage reading. In Wisconsin, kindergarteners improved by 0.47 SD and first-graders by 0.43 SD on similar measures. Gains were largest in the early grades, but even second and third grade students made gains of about 0.23 in reading fluency. The study did not report any differences in effects by gender.
Level Literacy Intervention RCTs, Georgia, New York, and Colorado (747 K-2 graders; 2009-2012)
The WWC subgroup analyses of a leveled literacy intervention (LLI) found that the program improved early literacy outcomes for both boys and girls, though gains were slightly stronger for girls. In kindergarten, both male and female students showed statistically significant improvements (0.5–0.7 SD), but girls’ effects were larger. By first grade, gender differences widened: boys showed small or even negative effects on some measures, whereas girls demonstrated consistent positive impacts. By second grade, effects were small and nonsignificant for both groups. Overall, WWC evidence indicates that LLI yielded modest and more reliable benefits for girls than for boys, suggesting that boys’ literacy gains from the intervention were weaker and less consistent across grade levels.
Nickow et al. global meta‑analysis of tutoring RCTs (89 studies, 732 effect estimates; 1985‑2020)
In a meta-analysis of one-on-one and small-group preK-12 tutoring RCTs across almost 40 years, the authors found that across all grades and subjects, tutoring raised achievement by an average 0.29 SD. Effects were larger when programs used certified teachers or paraprofessionals, occurred during school, employed one‑to‑one formats, and targeted early grades; volunteer‑ and parent‑led models still helped but less so. The authors did not report pooled estimates by gender.
Dietrichson et al. meta‑analysis of low‑SES interventions (101 (36 tutoring) studies; 2000‑2014)
Among elementary and middle‑school students from low‑income families, tutoring produced the biggest average gain (0.36 SD), outpacing feedback/progress‑monitoring (0.32) and cooperative learning (0.22). Tutoring effects were robust in the subset of higher‑quality RCTs and differ substantively by program duration or whether professionals or paraprofessionals delivered instruction. The authors attempted to identify effects based on the proportion of female participants but the estimates were small and insignificant.
Kraft et al. 2024 meta-analysis of tutoring at scale (265 RCTs, 340 interventions; 1967‑2023)
Pooled effects remain large in small-to-medium programs but decline markedly with scale: roughly 0.44 SD for studies serving <100 students, 0.3 SD for 100–399, 0.21 SD for 400–999, and 0.16 SD for ≥1,000 students. These scaling penalties persist even after accounting for study quality and publication period. The largest impacts were found when tutoring was in-person, in-school during the school day, ≥3 times per week for more than 15 hours total, and with student-to-tutor ratios ≤3:1. “Best-practice” programs that include this bundle average 0.27-0.43 SD. The authors did not report pooled estimates by gender.
Overall, high-dose tutoring interventions show large, positive effects on average, especially compared to alternatives. Tutoring interventions focused on reading and literacy tend to show larger effects in earlier elementary grades and those focused on math find larger effects in middle school. We found relatively few studies that break out effects by gender although those that do tend to find similar impacts for both boys and girls.
As mentioned earlier, it’s important to note that while boys and girls may see similar benefits from tutoring interventions, when it comes to reading, boys are more often the ones falling behind. For example, in one evaluation of a literacy tutoring intervention, almost 60% of students receiving the intervention were male. Simply by virtue of who is more likely to receive it, wide-spread literacy tutoring would both raise reading achievement overall and reduce the gender gap in reading.
Finally, high-dose tutoring is no magic bullet. It works because it tends to be relatively intensive, and costly. Identifying ways to capture the most important elements that make high-dose tutoring effective while reducing cost is imperative. Programs like Reading Recovery can cost an estimated $6,000–$13,000 (all amounts in 2025 dollars) per student. Saga Education’s math tutoring program, described in the “Not Too Late” RCT, cost an estimated $3,200-$4,800 per student, though the authors note that the price has since been lowered to $1,900 per student.
Making use of volunteers and virtual options are two promising options to reduce costs. A recent (and positive) evaluation of OnYourMark virtual tutoring reported costs of approximately $1,400 per student. Reading Partners, which has also been rigorously evaluated, uses volunteers and costs schools about $1,000 per student (though total program costs are higher). Another recent RCT of an in-school math tutoring program found that high-dose computer-assisted tutoring was nearly as effective as in-person tutoring, and at only two-thirds the cost.
While there has been a significant amount of research on the effects of high-dose tutoring, important research gaps remain. We encourage researchers and policymakers to:
Despite some of these gaps, the results so far are compelling, and worth acting upon.
Do you know of any studies that we’ve missed? Please reach out at [email protected]!
Get the latest developments on the trends and issues facing boys and men.
Bhatt, Monica P., Jonathan Guryan, Salman A. Khan, Michael LaForest-Tucker, and Bhavya Mishra. 2024. Can Technology Facilitate Scale? Evidence from a Randomized Evaluation of High Dosage Tutoring. National Bureau of Economic Research. https://www.nber.org/papers/w32510.
De Ree, Joppe, Mario A. Maggioni, Bowen Paulle, Domenico Rossignoli, Nienke Ruijs, and Dawid Walentek. 2023. “Closing the Income-Achievement Gap? Experimental Evidence from High-Dosage Tutoring in Dutch Primary Education.” Economics of Education Review 94 (94): 102383.
Dietrichson, Jens, Martin Bøg, Trine Filges, and Anne-Marie Klint Jørgensen. 2017. “Academic Interventions for Elementary and Middle School Students With Low Socioeconomic Status: A Systematic Review and Meta-Analysis.” Review of Educational Research 87 (2): 243–82. https://doi.org/10.3102/0034654316687036.
Dietrichson, Jens, Trine Filges, Julie K. Seerup, et al. 2021. “Targeted School-Based Interventions for Improving Reading and Mathematics for Students with or at Risk of Academic Difficulties in Grades K-6: A Systematic Review.” Campbell Systematic Reviews 17 (2): e1152. https://doi.org/10.1002/cl2.1152.
Fiester, Leila. 2010. “Early Warning! Why Reading by the End of Third Grade Matters. KIDS COUNT Special Report.” Annie E. Casey Foundation. https://eric.ed.gov/?id=ED509795.
Fryer, Roland G., and Meghan Howard-Noveck. 2020. “High-Dosage Tutoring and Reading Achievement: Evidence from New York City.” Journal of Labor Economics 38 (2): 421–52. https://doi.org/10.1086/705882.
Gersten, Russell, Kelly Haymond, Rebecca Newman-Gonchar, Joseph Dimino, and Madhavi Jayanthi. 2020. “Meta-Analysis of the Impact of Reading Interventions for Students in the Primary Grades.” Journal of Research on Educational Effectiveness 13 (2): 401–27. https://doi.org/10.1080/19345747.2019.1689591.
Guryan, Jonathan, Jens Ludwig, Monica P. Bhatt, et al. 2023. “Not Too Late: Improving Academic Outcomes among Adolescents.” American Economic Review 113 (3): 738–65. https://doi.org/10.1257/aer.20210434.
Hollands, Fiona M., Michael J. Kieffer, Robert Shand, Yilin Pan, Henan Cheng, and Henry M. Levin. 2016. “Cost-Effectiveness Analysis of Early Reading Programs: A Demonstration With Recommendations for Future Research.” Journal of Research on Educational Effectiveness 9 (1): 30–53. https://doi.org/10.1080/19345747.2015.1055639.
Jacob, Robin, Catherine Armstrong, A. Brooks Bowden, and Yilin Pan. 2016. “Leveraging Volunteers: An Experimental Evaluation of a Tutoring Program for Struggling Readers.” Journal of Research on Educational Effectiveness 9 (sup1): 67–92. https://doi.org/10.1080/19345747.2016.1138560.
Kraft, Matthew A., Beth E. Schueler, and Grace Falken. 2024. “What Impacts Should We Expect from Tutoring at Scale? Exploring Meta-Analytic Generalizability.” EdWorking Paper, nos. 24–1031. https://www.matthewakraft.com/s/tutoringatscale_exploringmeta.pdf.
Loeb, Susanna, Sarah Novicoff, Cynthia Pollard, Carly Robinson, and Sara White. 2023. “The Effects of Virtual Tutoring on Young Readers: Results from a Randomized Controlled Trial.” National Student Support Accelerator. https://blog.onyourmarkeducation.org/hubfs/Sales%20Collateral/NSSA%20Brief%20-%20Effects%20of%20Virtual%20Tutoring%20on%20Young%20Readers.pdf.
Markovitz, Carrie E., Marc W. Hernandez, E. C. Hedberg, and Heidi W. Whitmore. 2022. “Evaluating the Effectiveness of a Volunteer One-on-One Tutoring Model for Early Elementary Reading Intervention: A Randomized Controlled Trial Replication Study.” American Educational Research Journal 59 (4): 788–819. https://doi.org/10.3102/00028312211066848.
May, Henry, Aly Blakeney, Pragya Shrestha, Mia Mazal, and Nicole Kennedy. 2024. “Long-Term Impacts of Reading Recovery through 3rd and 4th Grade: A Regression Discontinuity Study.” Journal of Research on Educational Effectiveness 17 (3): 433–58. https://doi.org/10.1080/19345747.2023.2209092.
Neitzel, Amanda J., Cynthia Lake, Marta Pellegrini, and Robert E. Slavin. 2022. “A Synthesis of Quantitative Research on Programs for Struggling Readers in Elementary Schools.” Reading Research Quarterly 57 (1): 149–79. https://doi.org/10.1002/rrq.379.
Nickow, Andre Joshua, Philip Oreopoulos, and Vincent Quan. 2020. The Impressive Effects of Tutoring on PreK-12 Learning: A Systematic Review and Meta-Analysis of the Experimental Evidence. https://doi.org/10.26300/EH0C-PC52.
Ritter, Gary W., Joshua H. Barnett, and Ginger R. Albin. 2009. “The Effectiveness of Volunteer Tutoring Programs for Elementary and Middle School Students: A Meta-Analysis.” Review of Educational Research 79 (1): 3–38. https://doi.org/10.3102/0034654308325690.
Ritter, Gary W., Joshua H. Barnett, George S. Denny, and Ginger R. Albin. 2009. “The Effectiveness of Volunteer Tutoring Programs for Elementary and Middle School Students: A Meta-Analysis.” Review of Educational Research 79 (1): 3–38. https://doi.org/10.3102/0034654308325690.
Sirinides, Philip, Abigail Gray, and Henry May. 2018. “The Impacts of Reading Recovery at Scale: Results From the 4-Year I3 External Evaluation.” Educational Evaluation and Policy Analysis 40 (3): 316–35. https://doi.org/10.3102/0162373718764828.
What Works Clearinghouse. 2017. Beginning Reading Intervention Report: Leveled Literacy Intervention. Institute of Education Sciences, U.S. Department of Education. https://ies.ed.gov/ncee/WWC/InterventionReport/679.

Men are 42% of students but 51% of stopouts. Why men leave college and how re-enrollment programs can bring them back.

Gifted programs boost college entry for low-income boys—a 28-point jump for those just over the IQ threshold.

Education degrees go mostly to women—just 17% of BAs & 21% of MAs go to men, limiting the pipeline for male teachers.