Education Week

Software Said To Grade Essays for Content

By Debra Viadero, April 22, 1998

Researchers at the University of Colorado at Boulder have developed a computer-software program that they say can grade student essays for content just as well as teachers can.

“We’re not seeing this as a replacement for teachers,” said Peter W. Foltz, one of three developers of the program, known as Intelligent Essay Assessor. “But we think it may help teachers have more time to interact with students.”

The program was created and tested over a period of 10 years by psychologists at the university, who initially sought to create a computer model that would mimic the way people learn about words and language. But, along the way, they said, they discovered that their theoretical model could also be used for more practical purposes, such as grading essays or giving students diagnostic practice in summarizing material.

Since then, the researchers have tested the program with almost 1,200 essays.

“From 6th graders to first-year medical students, we get consistently good results,” said Thomas K. Landauer, a University of Colorado psychology professor and the project’s lead investigator. “The agreement between a human and the machine is about exactly the same as between two humans.”

But unlike humans, the computer will always give the same grade to the same essay, the researchers said.

“It doesn’t matter what else it was doing that day or if it was tired,” said Darrell Laham, a University of Colorado doctoral student who’s also taking part in the project. “It doesn’t ever have a bad day.”

The findings were presented here last week at the annual meeting of the American Educational Research Association.

Researchers have experimented for years on ways to use computers for grading essays. Ellis B. Page, a Duke University professor of psychology and education, has developed a program called Project Essay Grade, or PEG, which assesses the quality of writing in students’ essays. It has shown similarly good results. (“Making the Grade,” May 31, 1995.)

But the program unveiled last week may be among the first to grade essays for content.

The Colorado researchers have applied for a patent for the program, but say they may be a few years away from marketing it commercially. The software requires a computer with about 20 times the memory of an ordinary personal computer.

The researchers predict that in five years, however, the program will be able to run on most computers.

More Time for Teachers?

Ultimately, researchers say, such software could be a boon for professors and teachers as well as for commercial testing companies that grade thousands of exams.

“Teachers in middle schools report that they spend six-and-a-half hours a week grading essays, and high school teachers spend more than ten-and-a-half hours,” Gary B. Struck, a professor of educational psychology at the University of North Carolina at Chapel Hill, said during a conference session on the PEG project.

In addition, many teachers lack confidence in the grades they give on essays and would like some means of assessing their scoring, he said. “They’d like to have some kind of comparison.”

The developers of Intelligent Essay Assessor said their program might also be a useful supplement to textbooks published by commercial textbook companies. Students could use the program to test their knowledge of the textbook material.

Hard To Beat

The new software program grades essays by evaluating how closely the students’ words approximate content already fed into the program through textbooks and model essays that professors have already graded by hand.

“You could write something like, ‘The doctor operated on the patient,’ and if the text said, ‘The physician did surgery,’ the computer would see those things as very similar,” said Mr. Foltz, who is also an assistant professor of psychology at New Mexico State University.
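
The general idea of scoring a new essay by its closeness to hand-graded ones can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the researchers' program: the article does not detail their method, and the sketch below compares raw word counts, so unlike the system Mr. Foltz describes it would not treat "doctor" and "physician" as related. The function names (`word_counts`, `cosine_similarity`, `predict_grade`) and the parameter `k` are hypothetical.

```python
import math
import re
from collections import Counter


def word_counts(text):
    """Lowercase the text and count its words (a simple bag of words)."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two word-count vectors (0.0 to 1.0)."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def predict_grade(essay, graded_essays, k=2):
    """Similarity-weighted average of the grades of the k hand-graded
    essays closest to `essay`. Returns None if nothing overlaps.

    `graded_essays` is a list of (text, grade) pairs (hypothetical input).
    """
    target = word_counts(essay)
    scored = sorted(
        ((cosine_similarity(target, word_counts(text)), grade)
         for text, grade in graded_essays),
        reverse=True,
    )[:k]
    total_weight = sum(sim for sim, _ in scored)
    if total_weight == 0:
        return None
    return sum(sim * grade for sim, grade in scored) / total_weight
```

Given a handful of made-up graded essays, `predict_grade` returns a score pulled toward the grades of the most similar ones; an essay sharing no words with any graded essay returns `None`, which a real system might route to a human grader.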

Longer essays also get better grades because researchers have found that essay length correlates closely with the quality of the content.

Critics of the program have suggested that students could easily fool the computer by including the right content but putting it in the wrong order.

Students could still get a good grade that way, the researchers said, “but it’s hard to say good ideas without saying it well,” Mr. Foltz said.

What’s more, the computer flags essays that are “too creative” or that contain a lot of inconsistencies. It brings such essays to the attention of human graders. Plagiarized essays also get flagged.

The researchers have spent a lot of time trying to fool the system, Mr. Landauer said, but “the easiest way to cheat this system is to study hard, know the material, and write a good essay.”
