Lesson 05 of 13
Overview
This episode explores Dylan Wiliam’s Comment-Match Routine, a simple classroom strategy that gets students to match anonymous feedback to anonymous writing so they have to read, compare, and think. It also digs into Ruth Butler’s research on how grades can drown out comments, and why discussion and revision make feedback actually stick.
Welcome to the show -- and Colin, I wanna start with a routine teachers could literally use tomorrow morning: the Comment-Match Routine. It comes from Dylan Wiliam in Educational Leadership, published by ASCD in April 2016, in an article called “The Secret of Effective Feedback.” And I like it immediately because it is so specific. Not “improve feedback culture.” Not “build reflective learners.” One move. [curious] April 2016, ASCD, Dylan Wiliam -- that's a very respectable pedigree already. But go on: what is the move, exactly? Instead of writing a comment beside a grade, you put ONE comment on ONE separate slip of paper for each piece of student writing. No names. No scores. Then a group of four students gets four pieces of writing and four loose comment slips, and their only job is to figure out which comment belongs to which piece. [questioning tone] Wait -- four anonymous papers, four anonymous comments, and they have to do the matching? So the feedback becomes almost a sorting puzzle. Exactly. And Wiliam describes a language arts teacher named Charlotte Kerrigan using it, which matters to me because then it stops sounding like conference-talk magic and starts sounding like, oh, an actual teacher did this with actual kids before lunch. Charlotte Kerrigan is the bit that makes it concrete, yes. Not a laboratory condition -- a classroom. And the clever thing here is that the students can't glance, nod, and shove the paper into a backpack. To match the slip, they have to inspect the writing. [warmly] Right, and that's the whole game. Because if I hand back an essay that says, “Use more evidence to support your claim,” some kids read it, sure. A lot of kids do that thing where their eyes drop straight to the score, they have a little internal weather event, and the comment basically dies on the page. [dryly] Ah yes, the ancient pedagogical sequence: receive paper, scan for number, decide whether life is fair, ignore everything else. [laughs] That is EXACTLY it. And with this routine, there is no number to latch onto. They have to ask, okay, which paragraph actually lacks evidence? Which one has evidence but weak explanation? Which one makes a claim that's sort of floating there with no quote underneath it? The comment isn't wallpaper anymore. And because there are four papers, not one, they're comparing. Comparison matters. They're seeing what “specific evidence” looks like in one piece versus another. It's not just “teacher said improve.” It's “this comment fits HERE because this writer quotes the text but never explains the quote.” Yes. And the nice teacher move is that you don't need some giant setup. You pick a recent short piece of writing -- short is important, because nobody's got forty-seven minutes to run a forensic lab on five-page essays -- and you choose four students whose writing is different enough that the comments are not interchangeable. That phrase “not interchangeable” is doing a lot of work. Because if all four comments are basically “add more detail,” then the routine collapses, doesn't it? [skeptical] It absolutely collapses. Then the kids are not reading feedback; they're playing “guess what the teacher probably means.” So step one, pick a recent short piece of work. Step two, choose four students with different enough writing. Step three, write one specific comment per piece. Step four, strip off names and grades. Step five, hand out the four papers plus the four comments, let the group match them, and then have students write one sentence on what they will change before revising. That last sentence -- “what I will change” -- is the hinge, isn't it? Because without that, you've got a clever activity. With it, you have a bridge into revision. Now, the research spine behind this is Ruth Butler's 1988 study. And the memorable finding -- the one worth carrying around in your head -- is that when students get a grade and a comment together, they tend to ignore the comment and look only at the grade. Ruth Butler, 1988 -- that is old enough that we've had time to learn from it, and somehow we still do the same thing. [sighs] We write a thoughtful little note, tuck it next to an 82, and then act surprised when the 82 eats the note alive. [deadpan] The grade has a gravitational field. Once it's there, everything else orbits it. And Wiliam's bigger point -- which I think is the right test for all feedback -- is that feedback only matters if it changes what students do next. Not whether I wrote it. Not whether it sounded smart. Not whether it used my nicest purple pen. Did it change the next draft? Did it change the next attempt? If not, then honestly it may have been more for me than for them. Exactly. And Wiliam uses a phrase I rather like: feedback has to “cause thinking.” That's the standard. Not produce a red-ink trail. Cause thinking. If the learner doesn't have to think, compare, justify, decide -- then we're often just decorating papers. [chuckles] “Decorating papers” is rude... and fair. Because the matching task forces the thing we usually hope will happen privately. To solve it, students have to read every comment carefully, compare it against the writing, and argue with their group about the best fit. And that argument is not a side effect. It's the mechanism. If one student says, “No, this comment belongs to paragraph three because the claim is strong but the evidence is thin,” and another says, “No, paragraph two has the quote but no explanation,” they are articulating criteria. They're thinking aloud about quality. Let me give you the Tuesday version, because this happened in my 3rd-period 7th-grade English class -- literary analysis paragraphs, one paragraph each, thank God. I pulled four that were different enough to work. One kid had a sharp claim and almost no evidence. One had a beautiful quote dropped in like it fell from the sky -- no explanation. One was all summary, no analysis. And one was actually pretty solid but needed a clearer link back to the prompt. [leaning in] “Dropped in like it fell from the sky” -- that's the one I'll remember. A quote with no explanation. So what happened when they started matching At first, they did what kids do -- they tried to speed-run it. Like, “this one sounds good, put it there.” But then one table got stuck on a comment that said, “Your evidence is relevant, but you need to explain how it proves your claim.” And suddenly they're pointing at two paragraphs, reading lines aloud, saying, “No, THIS one has a quote,” “Yeah but this one actually has a claim,” “No, look, she never says why the quote matters.” [pauses] And that argument told me more than a silent reread ever would. It showed me what they noticed in writing. That's the lovely tension in it. The students' debate reveals their schema -- or lack of one. You can hear whether they understand evidence, explanation, claim, all of it. The talk becomes diagnostic for the teacher as well. Yes, and then the follow-up sentence matters: “What will you change before revising?” Just one sentence. Not a whole reflection form nobody wants. Just: “I will explain how my quote proves the character feels isolated,” or “I will replace summary with one sentence of analysis.” Small enough to do, specific enough to matter. There's also a practical bonus here. If a teacher wants help generating those comment slips, you can use AI -- carefully, and secondarily. Paste the four student responses and prompt it something like: “Draft one specific, actionable comment for each student response. Include no scores and no names. Return the comments in random order for printing.” Useful servant, terrible master. [skeptical] Exactly. The routine is the method. The tool is just helping you not stay up at 10:45 p.m. wordsmithing comment number nineteen. But the comments still have to be specific enough that they match one paper better than the others. And that's the ending, really. If students can successfully match the comment, then at last they're reading feedback AS feedback. But if they can't -- if every slip seems to fit every paper -- that may not mean the students failed. It may mean the comments were too generic to help at all. [reflective] Which is a little brutal, honestly. But useful. Because sometimes the routine doesn't just teach students to read comments better. It teaches us whether we wrote comments worth reading. Thanks for listening.