As regular readers know, I am not a fan of mandatory, high-stakes testing, which accomplishes virtually nothing, but makes up for it by costing a fortune. So I’m updating an article I wrote back in February of 2017. It’s pretty long, but I recommend you read it. You’ll gain a new appreciation for the insanity of this kind of testing, which is a massive fraud.
For the last two years, I’ve been teaching 11thgrade, which means I no longer have to spend much of the school year directly drilling for the English STAAR Test. But there is no escaping it, which is why this article by Valerie Strauss from The Washington Post caught my eye:
Badly worded or poorly conceived questions on standardized tests are not uncommon (remember the questionabout a “talking pineapple” on a New York test in 2012?). But here’s something new: The author of source material on two Texas standardized tests says she can’t actually answer the questions about her own work because they are so poorly conceived. She also says she can’t understand why at least one of her poems — which she calls her ‘most neurotic’ — was included on a standardized test for students.
Here’s one of the poems as it appeared in a STAAR test:
Notice the page number. Kids are given five hours to complete an English STAAR test. Many need every minute, and more.
By Sara Holbrook
When I realized I couldn’t answer the questions posed about two of my own poems on the Texas state assessment tests (STAAR Test), I had a flash of panic – oh, no! Not smart enough. Such a dunce. My eyes glazed over. I checked to see if anyone was looking. The questions began to swim on the page. Waves of insecurity. My brain in full spin.
The two poems in question are ‘A Real Case,’ appearing on the 2014 Grade 7 STAAR Reading Test, and ‘Midnight,’ appearing on the 2013 Grade 8 STAAR Reading Test. Both poems originally appeared in ‘Walking on the Boundaries of Change,’ Boyds Mills Press, 1998.
Let me begin by confessing that ‘A Real Case’ is my most neurotic poem. I have a pile of them to be sure, but this one is the sour cherry on top. The written evidence of my anxieties, those evil gremlins that ride around on tricycles in my mind shooting my self-confidence with water pistols. How in the name of all that’s moldy didthispoem wind up on a proficiency test?
That’s a very good question. A Real Case is so obscure I would not pick it for a Sophomore level class, to say nothing of a test that can keep kids from graduating from high school. They’re just not sufficiently cognitively developed to deal with it. To use this in a 7th grade test is madness, educational malpractice.
I apologize to those kids. I apologize to their teachers. Boy howdy, I apologize to the entire state of Texas. I know the ‘90s were supposed to be some kind of golden age, but I had my bad days and, clearly, these words are the pan drippings of one of them. Did I have a purpose for writing it?
Does survival count?
Teachers are also trying to survive as they are tasked with teaching kids how to take these tests, which they do by digging through past tests, posted online. Forget joy of language and the fun of discovery in poetry, this is line-by-line dissection, painful and delivered without anesthetic. One teacher wrote to me last month, working after 10 p.m., trying to figure out the test maker’s interpretation of my poem ‘Midnight.’ This poem isn’t quite as jarring as’“A Real Case,’ simply symptomatic of aforementioned neuroses: It’s about insomnia.
“Hello Mrs. Holbrook. My name is Sean, and I’m an 8th grade English teacher in Texas. I’m attempting to decipher the number of stanzas in your poem, ‘Midnight’. This isn’t clear from the formatting in our most recent benchmark. The assessment asks the following question:
“Dividing the poem into two stanzas allows the poet to―
- A) compare the speaker’s schedule with the train’s schedule.
B ) ask questions to keep the reader guessing about what will happen
- C) contrast the speaker’s feelings about weekends and Mondays
- D) incorporate reminders for the reader about where the action takes place.
The answer is C) to contrast the speaker’s feelings about weekends and Mondays.
How many stanzas are in this poem? Where are they located? I would appreciate your help. Thank you so much!”
Oh, goody. I’m a benchmark. Only guess what? The test prep materials neglected to insert the stanza break. I texted him an image of how the poem appeared in the original publication. Problem one solved. But guess what else? I just put that stanza break in there because when I read it aloud (I’m a performance poet), I pause there. Note: that is not an option among the answers because no one ever asked me why I did it.
These test questions were just made up, and tragically, incomprehensibly, kids’ futures and the evaluations of their teachers will be based on their ability to guess the so-called correct answer to made up questions.
Then I went online and searched Holbrook/MIDNIGHT/Texas and the results were terrifying. Dozens of districts, all dissecting this poem based on poorly formatted test prep materials.
Texas, please know, this was not the author’s purpose in writing this poem.
Now, gentle readers, for a quick trip to Texas reality. The analysis and interpretation of poetry is one of my fortes. I’m not at all snooty or pretentious, but concentrate on genuine insights. Poetry helps us find meaning in life. It cannot be relegated to a single, correct interpretation, because one can return to good poetry–and good art of all kinds–throughout their lifetime and find all manner of insights they could not possibly find earlier, because every time they return, they are new. They have new and greater experiences to add to the search for meaning.
So it was, when the STAAR test was new, replacing entirely the previous, perfect, state-of-the-art test before it, I found myself, several times a week, teaching every sophomore boy in my medium-sized Texas high school. For the first time, analysis of poetry would be required, a nod to greater academic “rigor.”
I was not teaching them how to understand and appreciate poetry, nor its literary qualities. I was teaching them, based on our best guesses of how the people who wrote and would be scoring the STAAR test, would possibly grade it.
I explained to my principal, who was honestly trying to give the kids the best possible opportunity to pass the STAAR, precisely what I’d be doing and why, and while he intellectually understood, he didn’t get it until he stopped by for a session and realized, to his horror, there were at least two actually correct answers for each question. I had to teach the kids to think like an educrat, to pick the single answer an educrat thought to be most correct. Suffice it to say few, if any, of the answers made sense in terms of the rational analysis of poetry. Let’s return to Ms. Holbrook:
At the end of this article is a question-by-question breakdown of the test questions on A REAL CASE and my thinking as I attempted to answer them. But fair warning: Youreyes are going to glaze over as you read through them. But try to hang in there. Pretend your future depends on it. That you might not be promoted into the next grade with all the other kids your height and will have to remain in seventh for the rest of your life if you don’t pass. Seventh grade! That muddy trough where kids try to keep afloat clinging to the wispy thread of: This won’t last forever. But if you don’t pass this blamed (blaming) test, it just might. Oh no! Put a pencil between your teeth, bite down, and open your test packet.
Meantime, here is my question:
- Does this guessing game mostly evidence:
A the literacy mastery of the student?
B the competency of the student’s teacher?
C the absurdity of the questions?
D the fact that the poet, although she has never put her head in an oven, definitely has issues.
Let’s go with D since I definitely have issues, including issues with these ridiculous test questions.
The same year that “Midnight” appeared on the STAAR test (2013), Texas paid Pearson some $500 million to administer the tests, reportedly without proper training to monitor the contract. Test scorers, who are routinely hired from ads on (where else?), Craiglist, also receive scant training, as reported by this seasoned test scorer.I’m not sure what the qualifications are for the people who make up the questions, but the ability to ride unicorns comes to mind.
For more than a decade I read the graded-by-people-hired-on-Craigslist essays of every student in my school. It taught me one thing: the criteria the Texas Education Agency tells teachers will be used to grade writings are not the criteria they use to actually grade them. This, as one might imagine, puts teachers at something of a disadvantage in helping their kids to pass the STAAR. What it does to the kids is criminal.
Testing isn’t about teaching and learning. It isn’t about evaluating schools–actually it is more about grading schools than doing anything for kids, but that’s another article. There are far cheaper and far more effective methods of doing those things. This is about money. Since instituting mandatory, high-stakes testing, Texas has blown untold billions. Even in Texas, that’s real money
It’s also about bureaucratic empire building. A great many educrats and other “educators” in every state with mandatory, high stakes testing hitch their career wagons to the testing money train. If the train stops, they may find themselves suddenly irrelevant and unemployable. Ms. Holbrook again:
Stop it. Just stop it.
The only way to stop this nonsense is for parents to stand up and say, no more. No more will I let my kid be judged by random questions scored by slackers from Craigslist while I pay increased taxes for results that could just as easily have been predicted by an algorithm. That’s not education, that’s idiotic.
Educrats fear parents. They should. At the beginning of the STAAR era, the test makers, state educrats and legislators went batshit crazy and mandated fifteen–FIFTEEN–separate tests for graduation from high school. A sufficient number of parents–they ignore teachers–threatened to do to the Legislature and Tea Headquarter what the peasants did to Dr. Frankenstein’s castle, and they reluctantly cut things back to the current five tests, which is five too many.
I won’t drag you through the entire dissection of my poem MIDNIGHT, just the concluding stanza:
. . . And I meander to its rhythm,
flopping like a fish.
Why can’t I get to sleep?
Why can’t I get to sleep?
14 The poet uses a simile in lines 23 and 24 to reveal that the speaker —
F wants to be outside G cannot get comfortable H does not like fishing J might be having a dream
I say G, H, and J. I can’t get comfortable with any of this, it all seems like a bad dream (which indeed can keep me awake) and correct, I don’t like fishing (ick, worms). But fishing through school testing is even creepier than a fistful of worms, especially when it’s mislabeled as a legitimate measure of student and teacher competency.
Parents, educators, legislators, readers of news reports:STOP TAKING THESE TEST RESULTS SERIOUSLY
Idiotic, hair-splitting questions pertaining to nothing, insufficient training, profit-driven motives on the part of the testing companies, and test results that simply reveal the income and education level of the parents – For this we need to pay hundreds of millions of dollars and waste 10-45 days of classroom time each year to administer them? More if you consider the amount of days spent in test prep?
What creative ideas might Sean have been cooking up at 10 p.m. on a cold Wednesday night to excite his kids about reading and learning if he hadn’t been wandering down this loopy labyrinth? Would he have been drafting a lesson plan for those kids to develop their writing and communication skills through writing their own poetry? Maybe he just would have been catching an extra hour of sleep to feel energized for the colossal task he is faced with every day, turning on adolescents to reading, writing, and learning.
Maybe by leading kids to poetry instead of force feeding it to them, Sean could have helped them sort through their own neuroses, helping to become better adults and see themselves as something other than a test score, as worthless as a leftover bath.
But we can’t know that, because at 10 p.m. on December 13, 2016, Sean was writing to me, trying to decipher misleading test prep materials he’d been handed to ready his kids for a test they will take sometime next spring.
I may be neurotic, but this is crazy.
But then, what do I know. I can’t answer the questions on my own poetry. Read below:
STARR Test, Grade 7 2014
Let’s take these questions one at a time:
32 Which lines from the poem best suggest that the speaker’s situation is temporary?
F Doubtful,/I have a fever
G Tomorrow,/I’ll come around
H TODAY!/It hit like a thwop of mashed potatoes
J I could attract mosquitoes—/maybe—not friends.
I’m guessing G, but I could make a pretty good argument for H as it (all caps) belongs to today. Mustn’t overthink this, clock’s ticking. Let’s go with G.
33 What is the most likely reason that the poet uses capitalization in line 6?
A To highlight a problem the speaker experiences
B To stress the speaker’s expectations for tomorrow
C To indicate that the speaker’s condition happens unexpectedly
DT o show the speaker’s excitement about an upcoming event
Could be A. All caps is a way to highlight a fact, right? I guess I wanted to stress the fact that the feeling belongs to TODAY, but maybe the answer is B. Let’s see, today is not tomorrow, could be that. But climbing into the test maker’s mind, I’m guessing they want the answer C. But here’s the thing: I remember adding the ALL CAPS during revision. Was it to highlight the fact it arrived today or was it to indicate that it happened unexpectedly? Not sure. Move on, lots to cover.
34 Read the following lines from the poem.
The poet includes these lines most likely suggest the speaker –
F does not wish to be pushed on a swing
G wants to deal with the situation alone
H does not often receive help from others
J is not physically strong
Definitely I don’t like being pushed around, especially on a mood swing. Or maybe F and G, clearly the speaker doesn’t want anyone in her space, pushing her around. Right? F, G, or wait, how about H? Exactly. Pushing me around when I’m in a mood is not helpful, but people do that all the time. Cheer up. Get that look off your face. Not helpful. J is just stupid, but is it a trick? (What happened to choice I? Where is I? Who am I?)
35 The imagery in lines 16 through 19 helps the reader understand –
A the shift in the speaker’s attitude
B the speaker’s unpleasantness
C why the speaker has no friends
D what the speaker thinks of others
Where is E, all of the above? C. Incoming dog breath has no friends. That’s obvious. Or B. Unpleasant yes, but that’s kind of an understatement. And of course there’s an argument to be made for A, I did shift into this mood TODAY. It wasn’t there yesterday. I obviously am not thinking much about others, that’s true (D), I was pretty much into myself and I was having a bad day.
36 The poet reveals the speaker’s feelings mainly by –
F using similes and metaphors to describe them
G explaining their effect on others
H connecting them to memories
J repeating specific words for emphasis
Now I really need that all above option. Yes F, using similes and metaphors in description. Righto. G, the phrase “could attract mosquitoes, not friends” is a pretty sure indicator my lousy mood had a bad effect on others. H? How else except through memory would I conjure up nasty dog breath and a terminal case of split ends? And then there’s J, repeated words (today, today, today). This one was the real stumper. Total guessing game on this one.
My final reflection is this: any test that questions the motivations of the author without asking the author is a big baloney sandwich. Mostly test makers do this to dead people who can’t protest. But I’m not dead.
So do I Ms. Holbrook, and so do my students–and teachers–and principals–and parents–and all other sentient beings.
Some, I’m sure, will argue for the absolute necessity, brilliance and perfection of such tests, but I know educated adults, including teachers, cannot pass an English STAAR test. They think too logically and rationally, applying real world thinking processes. Should their high school diplomas be repossessed? Or are we, perhaps, to take the word of the poet of several poems used in STAAR testing? Doesn’t she count? Don’t kids count? How about rationality?
Money and political/educratic power certainly do.