research

In reversed chronological order.

DICE: A Framework for Dimensional and Contextual Evaluation of Language Models
Aryan Shrivastava and Paula Akemi Aoyagui
Under Review, 2024

Moving Beyond Medical Exam Questions: A Clinician-Annotated Dataset of Real-World Tasks and Ambiguity in Mental Healthcare
Max Lamparth, Declan Grabb, Amy Franks, Scott Gershan, Kaitlyn N. Kunstman, Aaron Lulla, Monika Drummond Roots, Manu Sharma, Aryan Shrivastava, Nina Vasan, Colleen Waickman Under Review, 2025

Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis Simulations
Aryan Shrivastava, Jessica Hullman, Max Lamparth
NeurIPS 2024 Socially Responsible Language Modelling Research Workshop; MILA 2024 Harms and Risks of AI in Military Workshop, 2024