Results – Evidence Library – Artificial Intelligence in Measurement and Education

Automatic item generation: foundations and machine learning-based approaches for assessments

Ruhan Circi, Juanita Hicks, Emmanuel Sik... | May 10th, 2023 | journalArticle

This mini review summarizes the current state of knowledge about automatic item generation in the context of educational assessment and discusses key points in the item generation pipeline. Assessment is critical in all learning systems and digitalized assessments have shown significant growth over the last decade. This leads to an urgent need to generate more items in a fast and efficient manner. Continuous improvements in computational power and advancements in methodological approaches,...

Can ChatGPT and Bard generate aligned assessment items? A reliability analysis against human performance

Abdolvahab Khademi Khademi University o... | May 10th, 2023 | journalArticle

ChatGPT and Bard are AI chatbots based on Large Language Models (LLM) that are slated to promise different applications in diverse areas. In education, these AI technologies have been tested for applications in assessment and teaching. In assessment, AI has long been used in automated essay scoring and automated item generation. One psychometric property that these tools must have to assist or replace humans in assessment is high reliability in terms of agreement between AI scores and human...

Can ChatGPT and Bard Generate Aligned Assessment Items? A Reliability Analysis against Human Performance

Abdolvahab Khademi | May 10th, 2023 | journalArticle

ChatGPT and Bard are AI chatbots based on Large Language Models (LLM) that are slated to promise different applications in diverse areas. In education, these AI technologies have been tested for applications in assessment and teaching. In assessment, AI has long been used in automated essay scoring and automated item generation. One psychometric property that these tools must have to assist or replace humans in assessment is high reliability in terms of agreement between AI scores and human...

The Writing Synth Hypothesis

sbs on March 24th, 2023 | May 5th, 2023 | blogPost

Assessment redesign for generative AI: A taxonomy of options and their viability

May 2nd, 2023 | webpage

With Sarah Howard and Jaclyn Broadbent. Since the seemingly sudden emergence of ChatGPT at the end of 2022, there has been significant debate surrounding the impact of text-based generative AI in education. Many jurisdictions initially attempted to ban access to these tools, citing concerns that stud

Study Finds ChatGPT Outperforms Physicians in High-Quality, Empathetic Answers to Patient Questions

May 1st, 2023 | webpage

A new study published in JAMA Internal Medicine compared written responses from physicians with those from ChatGPT to real-world health questions. A panel of licensed healthcare professionals preferred ChatGPT’s responses 79% of the time.

How AI Could Save (Not Destroy) Education | Sal Khan | TED

TED | May 1st, 2023 | videoRecording

Sal Khan, the founder and CEO of Khan Academy, thinks artificial intelligence could spark the greatest positive transformation education has ever seen. He shares the opportunities he sees for students and educators to collaborate with AI tools, including the potential of a personal AI tutor for every student and an AI teaching assistant for every teacher, and demos some new features for Khan Academy's educational chatbot, Khanmigo.

Artificial Intelligence and the Future of Teaching and Learning

Department of Education | May 24th, 2023 | report

MathGPT is incoming; how it fits into undergraduate math education - The Arizona State Press

Apr 30th, 2023 | webpage

Arizona State University's independent student-run news organization covering Tempe, Phoenix, Mesa and Glendale.

Responsible Use of Generative AI | Deloitte US

Apr 28th, 2023 | webpage

Yann LeCun on LinkedIn: A survey of LLMs with a practical guide and evolutionary tree. Number of… | 21 comments

Apr 28th, 2023 | webpage

A survey of LLMs with a practical guide and evolutionary tree. Number of LLMs from Meta = 7 Number of open source LLMs from Meta = 7 The architecture… | 21 comments on LinkedIn

A New Approach To Mitigating AI’s Negative Impact

Apr 26th, 2023 | webpage

Stanford launches an Ethics and Society Review Board that asks researchers to take an early look at the impact of their work.

The Stanford MOOCPosts Data Set

Akshay Agrawal, Andreas Paepcke | Apr 26th, 2023 | conferencePaper

Office Overachievers Won't Be Happy About ChatGPT, Study Finds

Apr 24th, 2023 | webpage

Customer support agents given access to a generative AI chatbot were 14% more productive, but those gains were much higher for lower-performing workers.

Artificial intelligence in higher education: the state of the field

Helen Crompton, Diane Burke | Apr 24th, 2023 | journalArticle

This systematic review provides unique findings with an up-to-date examination of artificial intelligence (AI) in higher education (HE) from 2016 to 2022. Using PRISMA principles and protocol, 138 articles were identified for a full examination. Using a priori, and grounded coding, the data from the 138 articles were extracted, analyzed, and coded. The findings of this study show that in 2021 and 2022, publications rose nearly two to three times the number of previous years. With this rapid...

Meet RedPajama: An AI Project to Create Fully Open-Source Large Language Models Beginning with the Release of a 1.2 Trillion Token Dataset

Niharika Singh | Apr 21st, 2023 | blogPost

The most advanced foundation models for AI are only partially open-source and are only available through commercial APIs. This restricts their use and limits research and customization. However, a project called RedPajama now aims to create leading, fully open-source models. The first step of this project, reproducing the LLaMA training dataset, has been completed. Open-source models have made significant progress recently, and AI is experiencing a moment similar to the Linux movement....

5 Things Every School District and Educator Should Know About ChatGPT - TalkingPoints

Heather Dooley | Apr 20th, 2023 | webpage

In schools and conferences across the country, everyone seems to be talking about AI (Artificial Intelligence) and ChatGPT. Teachers are using ChatGPT to create lesson plans. Students are using ChatGPT to do their homework. And bloggers are using ChatGPT to draft content (although we promise not this one!). We understand the opportunities these resources can

Automated Scoring of Speaking and Writing: Starting to Hit its Stride

Daniel Marc Jones, Liying Cheng, Gregory... | Apr 20th, 2023 | journalArticle

This article reviews recent literature (2011–present) on the automated scoring (AS) of writing and speaking. Its purpose is to first survey the current research on automated scoring of language, then highlight how automated scoring impacts the present and future of assessment, teaching, and learning. The article begins by outlining the general background of AS issues in language assessment and testing. It then positions AS research with respect to technological advancements. Section two...

Can Large Language Models Provide Feedback to Students? A Case Study on ChatGPT

Wei Dai, Jionghao Lin, Flora Jin | Apr 13th, 2023 | preprint

Educational feedback has been widely acknowledged as an effective approach to improving student learning. However, scaling effective practices can be laborious and costly, which motivated researchers to work on automated feedback systems (AFS). Inspired by the recent advancements in the pre-trained language models (e.g., ChatGPT), we posit that such models might advance the existing knowledge of textual feedback generation in AFS because of their capability to offer natural-sounding and...

MxML (Exploring the paradigmatic relationship between measurement and machine learning in the history, current time, and future): Current state-of-the-field

Yi Zheng, Steven Nydick, Sijia Huang | Apr 12th, 2023 | conferencePaper

The recent surge of machine learning (ML) has impacted many disciplines, including educational and psychological measurement (hereafter shortened as measurement, “M”). The measurement literature has seen a rapid growth in studies that explore using ML methods to solve measurement problems. However, there exist gaps between the typical paradigm of ML and fundamental principles of measurement. The MxML project was created to explore how the measurement community might potentially redefine the...
