The Corpus of Utah Dual Language Immersion (CUDLI), developed by Fernando Rubio and Erin Schnur at the University of Utah’s Second Language Teaching and Research Center, is a growing multilingual corpus focused on second language writing.
Texts in CUDLI are drawn from 3rd to 12th grade student responses to the presentational writing section of the ACTFL Assessment of Performance toward Proficiency in Languages (AAPPL), administered in Utah's Dual Language Immersion (DLI) program. CUDLI encompasses nearly 150,000 texts, amounting to approximately 15 million words across five languages: Chinese, French, German, Portuguese, and Spanish.
CUDLI offers rich metadata, allowing for searches across a wide array of variables such as language, gender, grade level, immersion type, years of study, test form, and topics, as well as students' proficiency ratings in writing, reading, and listening sections of the AAPPL test.
The corpus offers extensive searchability across various factors including language, gender, grade level, immersion type, years of study, test form, topic, and proficiency ratings in writing, reading, and listening sections of the AAPPL test.
CUDLI serves as a valuable resource for teachers, researchers, and materials developers, offering insights for a wide range of pedagogical purposes. These include identifying learner challenges, comparing performance across proficiency levels, pinpointing common errors, analyzing vocabulary use, and crafting data-driven educational materials.
How to Cite CUDLI
Rubio, F. & Schnur, E. (2019). The corpus of Utah dual language immersion (CUDLI) [Learner corpus]. University of Utah. Retrieved Month Day, Year, from https://l2trec.utah.edu/learner-corpora/cudli/
Choose Access Level
Allows users to open individual corpus files (TXT) in a web browser. Users won't be able to download the corpus for linguistic analyses. Temporary Online Access will automatically terminate three months after the registration date.
- Project Description for Temporary Access
- Provide a brief description of your project, specifically focusing on how your project intends to explore second language development using CUDLI. (100-300 words)
Allows users to download the whole corpus for offline use. Users are required to show proof of IRB approval or exemption to receive full offline access. Online Access to the corpus will expire on the expected project completion date unless you request an extension.
- Project Description for Full Access
- Provide a concise yet detailed description of your research purpose, specifically focusing on how your project intends to explore second language development using CUDLI. Emphasize how the anticipated outcomes will contribute to second language learning and/or teaching. (200-500 words)
- Expected Project Completion Date
- Please be aware that your online access to the corpus files is set to expire on the date you have chosen. Should you require continued access beyond this date, you must submit a request for an extension.
- Proof of IRB Approval or Exemption for Full Access
Fill Out Database
NOTICE: You must prepare the required materials for the desired access level listed in Step 1 before filling out the form.
Sign a Material Transfer Agreement
Sign a Material Transfer Agreement (Only for Non-UofU Affiliates)
Upon submission of the access request form, you will receive an email that includes the Material Transfer Agreement (MTA) from the Technology Transfer Office of the University of Utah. This email will also provide detailed instructions for completing and returning the MTA. The MTA outlines the terms for using the MuSSeL and CUDLI language data. Please be aware that the review and approval process for the MTA may take up to two weeks.