Open-ended questions test a more thorough understanding than closed-ended questions and are often the preferred assessment method. However, open-ended questions are tedious to grade and subject to personal bias, which has motivated efforts to speed up the grading process through automation. Short Answer Grading (SAG) systems aim to automatically score students’ answers in examinations. Despite growth in SAG methods and capabilities, there exists no comprehensive short-answer grading benchmark across different subjects, grading scales, and distributions, which makes it hard to assess how well current automated grading methods generalize. In this preliminary work, we introduce the combined ASAG2024 benchmark to facilitate the comparison of automated grading systems. It combines seven commonly used short-answer grading datasets under a common structure and grading scale. On our benchmark, we evaluate a set of recent SAG methods, revealing that while LLM-based approaches reach new high scores, they are still far from reaching human performance. This opens up avenues for future research on human-machine SAG systems.
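
As a rough illustration of what unifying several grading datasets under a common structure and grading scale could involve, the sketch below maps raw scores from different original scales onto a shared [0, 1] range. All dataset names, record fields, questions, and score ranges are illustrative assumptions, not the benchmark's actual format or the authors' pipeline.

```python
# Hypothetical sketch: unifying heterogeneous short-answer grading datasets
# under one record structure and a normalized [0, 1] grading scale.
# Dataset names, fields, and score ranges are assumptions for illustration.

from dataclasses import dataclass


@dataclass
class SAGRecord:
    dataset: str    # source dataset identifier
    question: str   # exam question
    reference: str  # reference (gold) answer
    answer: str     # student answer
    score: float    # score normalized to [0, 1]


def normalize_score(raw: float, min_score: float, max_score: float) -> float:
    """Map a raw score from its original grading scale onto [0, 1]."""
    return (raw - min_score) / (max_score - min_score)


# Example: one dataset graded on 0-5 points, another on 0-100 points.
records = [
    SAGRecord("dataset_a", "Define recursion.", "A function calling itself.",
              "When a function calls itself.", normalize_score(4.0, 0.0, 5.0)),
    SAGRecord("dataset_b", "State Ohm's law.", "V = I * R.",
              "Voltage equals current times resistance.",
              normalize_score(85.0, 0.0, 100.0)),
]

for r in records:
    print(f"{r.dataset}: normalized score = {r.score:.2f}")
```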
Thu 5 Dec (displayed time zone: UTC, Coordinated Universal Time)

14:00 - 14:30

14:00 (15m, Poster): Integrating Making and Computational Thinking in Early Childhood Education: Preliminary Outcomes from a Teacher Trainer Workshop on Designing an Intervention
Tobias Bahr (University of Stuttgart)

14:15 (15m, Poster): ASAG2024: A Combined Benchmark for Short Answer Grading
Gérôme Meyer (ZHAW University of Applied Sciences), Philip Breuer (ZHAW University of Applied Sciences), Jonathan Fürst (ZHAW University of Applied Sciences)