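Large-Scale Analysis of Programming Learners' Errors: 37 Million Compilations Reveal Common Novice Problems

This research paper, "37 Million Compilations: Investigating Novice Programming Mistakes in Large-Scale Student Data", is by Amjad Altadmri and Neil C. C. Brown of the School of Computing at the University of Kent, UK. Using a year's worth of compilation events from more than 250,000 students worldwide, drawn from the large Blackbox data set, the study analyzes the frequency of programming errors, the time taken to fix them, and their spread among users. It shows how these factors inter-relate and how they develop over the course of the academic year. The findings can inform course design, textbook writing, and the development of tools that target the most frequent (or hardest to fix) errors.

In programming education, it is important to understand the patterns in students' mistakes and the time they need to fix them. Earlier studies of student errors were usually limited to samples of a few hundred students within a single institution. This study differs in that it analyzes a very large body of compilation data from students around the world, covering more than 37 million compilation events, which makes its results more general and more representative.

By analyzing these compilation events, the researchers can identify the most common types of programming error, which may stem from unfamiliarity with the language syntax, faulty logic, or an incomplete grasp of programming concepts. They also examine time-to-fix, which sheds light on how students learn and solve problems when they hit an error. Some errors are corrected quickly, while others take much longer, which may indicate that students struggle to understand and resolve them.

The study also looks at how errors are spread across users. If many students make the same mistake, this may point to a shared problem in teaching methods, textbooks, or the programming environment, and suggest that teaching materials or course design need to be improved. The data can also reveal which errors are the most challenging, so that educational tools and resources can address them first.

Blackbox, one of the paper's two keywords alongside "Programming Mistakes", is a data collection project that gathers error messages and Java code from a large number of users across many institutions, allowing researchers to observe how students behave in a real programming environment. The study therefore offers programming education a data-driven way to identify the challenges that learners commonly face, with the aim of improving teaching strategies and helping students overcome them.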

37 Million Compilations: Investigating Novice Programming Mistakes in Large-Scale Student Data

Amjad Altadmri

School of Computing

University of Kent

Canterbury, Kent, UK

aa803@kent.ac.uk

Neil C. C. Brown

School of Computing

University of Kent

Canterbury, Kent, UK

nccb@kent.ac.uk

ABSTRACT

Previous investigations of student errors have typically focused on samples of hundreds of students at individual institutions. This work uses a year’s worth of compilation events from over 250,000 students all over the world, taken from the large Blackbox data set. We analyze the frequency, time-to-fix, and spread of errors among users, showing how these factors inter-relate, in addition to their development over the course of the year. These results can inform the design of courses, textbooks and also tools to target the most frequent (or hardest to fix) errors.
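As an illustration of the three measures named in the abstract, the following is a minimal sketch that computes frequency, time-to-fix, and spread among users for each kind of mistake. The record layout, the mistake labels, and the use of the median as the time-to-fix summary are all assumptions made for this example; they are not the Blackbox schema or the procedure used in the paper.

```python
from collections import defaultdict
from statistics import median

# Hypothetical event records: (user_id, mistake_label, seconds_until_fixed).
# The layout and labels are invented for illustration only.
events = [
    ("u1", "missing_semicolon", 12),
    ("u1", "missing_semicolon", 8),
    ("u2", "missing_semicolon", 20),
    ("u2", "unbalanced_brackets", 95),
    ("u3", "unbalanced_brackets", 40),
    ("u3", "type_mismatch", 300),
]


def summarize(events):
    """Return {mistake: (frequency, median_time_to_fix, distinct_users)}."""
    times = defaultdict(list)   # mistake -> list of fix times in seconds
    users = defaultdict(set)    # mistake -> set of users who made it
    for user, mistake, seconds in events:
        times[mistake].append(seconds)
        users[mistake].add(user)
    return {
        mistake: (len(times[mistake]), median(times[mistake]), len(users[mistake]))
        for mistake in times
    }


summary = summarize(events)
for mistake, (freq, med, spread) in sorted(summary.items()):
    print(f"{mistake}: frequency={freq}, median_fix={med}s, users={spread}")
```

Keeping the three values side by side per mistake makes it possible to see, for instance, that a mistake can be frequent yet quick to fix, or rare yet slow to fix.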

Categories and Subject Descriptors

K.3.2 [Computers And Education]: Computer and Information Science Education

General Terms

Experimentation

Keywords

Programming Mistakes; Blackbox

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

SIGCSE’15, March 4–7, 2015, Kansas City, MO, USA.

© 2015 ACM 978-1-4503-2966-8/15/03 ...$15.00.

http://dx.doi.org/10.1145/2676723.2677258.

1. INTRODUCTION

Knowledge about students’ mistakes and the time taken to fix errors is useful for many reasons. For example, Sadler et al. [10] suggest that understanding student misconceptions is important to educator efficacy. Knowing which mistakes novices are likely to make or find challenging informs the writing of instructional materials, such as textbooks, and can help improve the design and impact of beginners’ IDEs or other educational programming tools.

Previous studies that have investigated student errors during [Java] programming have focused on cohorts of up to 600 students at a single institution [1, 4, 5, 7, 8, 13]. However, the recently launched Blackbox data collection project [3] affords an opportunity to observe the mistakes of a large number of students across many institutions: for example, in one year of data, the project collected error messages and Java code from around 265,000 users worldwide. A previous study by the authors utilized four months of data from Blackbox to study educators’ opinions against the frequency of mistakes [2]. The contribution in our proposed paper is to go further, and provide a more detailed investigation into characteristics of the mistakes, trying to answer the following research questions:

• What are the most frequent mistakes in a large-scale multi-institution data set?

• What are the most common errors, and common classes of errors?

• Which errors take the shortest or longest time to fix?

• How do these errors evolve during the academic terms and academic year?
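The last research question concerns change over time. One simple way to look at it, sketched below under stated assumptions, is to bin recorded mistakes by calendar month and track what share of each month's mistakes belongs to a given kind. The dates, labels, and the choice of monthly bins are invented for this example and are not taken from the study.

```python
from collections import Counter
from datetime import date

# Hypothetical records: (date_of_compilation, mistake_label).
# Dates and labels are made up for illustration.
observations = [
    (date(2013, 9, 10), "missing_semicolon"),
    (date(2013, 9, 24), "missing_semicolon"),
    (date(2013, 9, 25), "type_mismatch"),
    (date(2014, 2, 3), "type_mismatch"),
    (date(2014, 2, 17), "type_mismatch"),
    (date(2014, 2, 20), "missing_semicolon"),
]


def monthly_share(observations, mistake):
    """Fraction of each month's recorded mistakes that match `mistake`."""
    totals = Counter()  # (year, month) -> all mistakes recorded that month
    hits = Counter()    # (year, month) -> occurrences of the chosen mistake
    for day, label in observations:
        key = (day.year, day.month)
        totals[key] += 1
        if label == mistake:
            hits[key] += 1
    return {key: hits[key] / totals[key] for key in sorted(totals)}


shares = monthly_share(observations, "missing_semicolon")
for (year, month), share in shares.items():
    print(f"{year}-{month:02d}: {share:.2f}")
```

Using a share rather than a raw count keeps months with different activity levels comparable, which matters when usage varies across terms.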

2. RELATED WORK

The concept of monitoring student programming behavior and mistakes has a long history in computing education research. The series of workshops on Empirical Studies of Programming [11] in the 1980s had several papers making use of this technique for Pascal and other languages. More recently, there have been many such studies specifically focused on Java, which is also the topic of this study.

Many of these studies used compiler error messages to classify mistakes. Jadud [8] looked in detail at student mistakes in Java and how students went about solving them. Tabanao et al. [13] looked at the association between errors and student course performance. Denny et al. [4] looked at how long students take to solve different errors. Dy and Rodrigo [5] looked at improving the error messages given to students. Ahmadzadeh et al. [1] looked at student error frequencies and debugging behavior. Jackson et al. [7] identified the most frequent errors among their novice programming students. All six of these studies looked at cohorts of (up to 600) students from a single institution. These studies used compiler error messages to classify errors, while early results from McCall and Kölling [9] suggest that compiler error messages have an imperfect (many-to-many) mapping to student misconceptions.
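To make the many-to-many point concrete, the sketch below stores message and misconception pairs in two indexes and checks whether the relation runs in both directions. The labels are deliberately abstract placeholders; they are assumptions for this example and do not come from McCall and Kölling's data.

```python
from collections import defaultdict

# Hypothetical (compiler_message, misconception) pairs with placeholder names.
pairs = [
    ("message_A", "misconception_1"),
    ("message_A", "misconception_2"),
    ("message_B", "misconception_2"),
    ("message_C", "misconception_3"),
]


def build_mapping(pairs):
    """Index the pairs in both directions."""
    by_message = defaultdict(set)        # message -> misconceptions behind it
    by_misconception = defaultdict(set)  # misconception -> messages it produces
    for message, misconception in pairs:
        by_message[message].add(misconception)
        by_misconception[misconception].add(message)
    return by_message, by_misconception


def is_many_to_many(by_message, by_misconception):
    """True if some message has several causes and some cause several messages."""
    return (any(len(v) > 1 for v in by_message.values())
            and any(len(v) > 1 for v in by_misconception.values()))


by_message, by_misconception = build_mapping(pairs)
print(is_many_to_many(by_message, by_misconception))
```

When the check holds, counting messages alone cannot tell which misconception a student had, which is the limitation the cited early results point to.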
