Corrected: An earlier version of this article misspelled the last name of Michelle Exstrom, the education program director at the National Conference of State Legislatures.
The Every Student Succeeds Act opened the door to two types of assessment flexibility for all states that could seriously reshape the national testing landscape. But a year after the law was signed, few states or districts appear inclined to take advantage of them.
The new federal education law allows states to permit school districts to substitute a 鈥渘ationally recognized鈥 high school test, such as the ACT or the SAT, for their states鈥 own high school exam. The law also lets states divide annual assessments into chunks, and roll those interim results into one summative score for accountability purposes.
Both options are separate from a more-limited pilot program that will let a handful of states develop a new breed of assessments on their own.
The limited pilot program has yet to get off the ground.
Separately no state has yet decided to go with the aggregated-interims option, though at least one is exploring the idea. Arizona is the only state so far that plans to try the option of a locally chosen high school test.
A law signed by Arizona Gov. Doug Ducey in March 2016 requires that state鈥檚 board of education to come up with a menu of tests that districts can use instead of the state鈥檚 own AzMerit. Districts are supposed to be able to choose from that list for grades 9-12 in the 2017-18 school year, and for grades 3-8 in 2018-19.
Board member Calvin Baker said that the panel is starting to hear presentations from vendors, who will have to prove that their products are valid for use in the state鈥檚 accountability system. Baker, the superintendent of the Vail school district, near Tucson, said he is reserving judgment about the approach until the board completes its work. He likes its flexibility, but he wants to be sure that tests other than AzMerit yield the information the state needs.
鈥淚鈥檒l want to know, do these other tests truly test our state standards? Is it possible to compare the results of one of those tests to our state test?鈥 he said. 鈥淭he bigger quest is to protect accountability. We want to be flexible, but we also need to be sure we鈥檝e got a system that identifies schools鈥 accomplishments, and where they need help.鈥
But Arizona could run into legal problems with the 鈥渕enu鈥 approach in grades 3-8. ESSA requires states to administer the same test to all students in grades 3-8. It鈥檚 only at the high school level that it allows states to substitute another assessment. State education department spokesman Charles Tack acknowledged that the new law 鈥渨ould appear to be in conflict with ESSA.鈥 U.S. Department of Education spokeswoman Jessica Allen said federal officials are 鈥渃oncerned鈥 about the menu approach to testing in grades 3-8.
鈥淲e understand that this portion of the law does not take effect until the 2018-19 school year, and we hope that the state will use this time to ensure that its schools administer high-quality assessments in accordance with the law,鈥 Allen said in an email.
Florida lawmakers in their 2016 session considered a bill that would have allowed districts to use other tests, such as the PSAT, SAT, or ACT Aspire, instead of the state鈥檚 exam in high school, but the bill died without a vote by either chamber. A Colorado law requires the department of education to 鈥渋nvestigate and review鈥 assessment options, including one that would let local districts choose their own tests. The Colorado law also requires the education department to apply for another kind of ESSA testing flexibility, the innovative assessment pilot.
Our juniors are test-fatigued. ... It鈥檚 hard to get them motivated, and it鈥檚 hard not to empathize with their test fatigue."
Testing was a hot topic in state legislatures in 2016. Key themes emerged from the hundreds of bills that passed through those chambers, according to Michelle Exstrom, the education program director at the National Conference of State Legislatures, which tracks state laws. Many of the bills dealt with switching assessments, clarifying parents鈥 rights to opt students out of tests, or giving districts more testing flexibility. But few moved states toward the kinds of testing flexibility ESSA offers, she said.
In meetings last year about testing issues, most states expressed reluctance to let districts choose their own high school tests, said Marianne Perie, who oversees an assessment working group for the Council of Chief State School Officers.
Complex Oversight
One reason for that could be the complexity of overseeing different tests, said Scott Norton, the CCSSO鈥檚 director of strategic initiatives in standards, accountability and assessment.
鈥淒istricts want to use a test that matters, and students care more about their SAT or ACT score than about their state test score,鈥 he said. 鈥淚f districts could use one of those, and get rid of their state test, they鈥檇 reduce their burden and get more [student] engagement. But states might not be that enthusiastic about managing different tests. It鈥檚 simpler for them if everyone is on the same page.鈥
Though they may get pushback from their states, many school districts find the local high school option appealing. Vermont surveyed its stakeholders and found that 83 percent of administrators and 88 percent of teachers favored a switch from Smarter Balanced, a test designed by one of two federally funded consortia, to the SAT or ACT at the high school level.
鈥淥ur juniors are test-fatigued,鈥 said Mike McGraith, the principal of Montpelier High School in Montpelier, Vt. 鈥淢any take AP tests in May, and they鈥檙e already taking the SAT multiple times, the PSAT in the fall, and the science [state test]. Then we have to turn to this group of 17-year-olds and say, 鈥榃ould you please try your hardest on this fairly involved long test that doesn鈥檛 count for you, but it counts for us?鈥 It鈥檚 hard to get them motivated, and it鈥檚 hard not to empathize with their test fatigue.鈥
In considering the option of rolling interim tests into a summative result, states and districts could enjoy some benefits, Norton said. As part of a coherent system, interim tests could help reduce testing overall by serving both as periodic, instructionally useful feedback, and also, combined, as a summative result for accountability, he said. But it would also be a heavy lift for states, since they鈥檇 have to ensure that the interims have met rigorous specifications, and give high-security tests several times a year, instead of just once, he said.
North Carolina is in the second year of a pilot that uses the aggregated results of three interim tests each year to measure student achievement. But Tammy Howard, who is overseeing the so-called 鈥淣C Check-In鈥 project as the state鈥檚 director of accountability services, said it isn鈥檛 intended as a possible substitute for North Carolina鈥檚 own end-of-grade and end-of-course tests. The state is simply interested in seeing how well 鈥渟egmented鈥 tests capture students鈥 mastery.
Instructional Feedback
Teachers love the interim tests so much, Howard said, that they鈥檙e asking the state not to use them as part of a summative system for accountability. Because the tests are for instructional feedback, they鈥檙e not held in high-security conditions, so teachers can get better access to the test questions, and detailed results, than if they were high-stakes accountability assessments, Howard said. 鈥淭eachers find the feedback so useful, they want to keep it that way,鈥 she said.
An assessment task force in Pennsylvania recommended that the state explore the possibility of using interim tests to get a summative score for accountability. Beth Olanoff, a special assistant to the state secretary of education on ESSA, said she and her colleagues are discussing the possibility and plan to conduct a feasibility study on the approach.
It鈥檚 appealing, on the one hand, because it would 鈥渃hunk鈥 summative tests into several sittings across the year, minimizing the impact of one big end-of-year test, especially on the youngest students, Olanoff said. And it could offer teachers more frequent, instructionally useful feedback. But testing is disruptive, and Pennsylvania officials still 鈥渘eed to have a fuller conversation about whether [the approach is] worth that kind of disruption more frequently,鈥 Olanoff said.
Venessa A. Keesler, Michigan鈥檚 deputy superintendent for educator, student, and school supports, said Michigan is mulling the use of interim tests in some grades, but psychometricians have raised caution flags about whether aggregated interim results would be valid for summative purposes. Teachers want tests with 鈥渕ore of a formative feel,鈥 to help them better meet students鈥 needs, Keesler said. But whether that鈥檚 possible in the context of a state accountability system isn鈥檛 entirely clear, she said.
鈥淣o state is really doing that,鈥 Keesler said. 鈥淭his is emerging science.鈥