Design
Starting point
The Spanish tasks in this corpus were developed to provide a research design for learner pronunciation that is systematic, comparable, and still manageable for learners. For the broader project frame and the history of the overall initiative, the general project page What this project is about provides the right context.
In addition, the project pages Project structure, Data & methods, and Team & contributors place the Spanish design within the shared platform logic, methodological infrastructure, and collaborative setup.
The starting point was the observation that existing models offer valuable groundwork, but can only partly be transferred directly to the targeted study of Spanish learner pronunciation. This concerns lexical difficulty, content-related distraction in reading passages, and the question of which pronunciation phenomena actually need to be elicited repeatedly and under controlled conditions in learner data. The guiding principles are therefore not proximity to native-speaker norms or mere tradition, but intelligibility, controlled elicitation, and materials that make sense for learners.
Previous work and empirical starting position
An important intermediate step was the earlier project MAR.ELE – Corpus sobre la pronunciación del español por aprendientes de ELE en Marburg. In that smaller pilot project, 22 recordings with students from Marburg University were produced. MAR.ELE made Spanish learner pronunciation accessible as corpus material and open to empirical analysis. At the same time, practical work with this corpus also revealed the limits of a design that had been adopted too directly. Those experiences were decisive for the more extensive revision in the present project and are closely tied to the broader project development outlined on What this project is about.
For MAR.ELE, the wordlist from the (I)FEC project was adopted in full in order to remain compatible with an established corpus-phonological design for Spanish. Methodologically, that made sense, but it also exposed clear problems in work with learners: some items turned out to be unnecessary lexical stumbling blocks, while other phenomena that are particularly revealing for learner pronunciation were not distributed optimally or were not represented strongly enough. The current design therefore responds not out of mere preference for a different model, but to concrete experience gained in practice.
Interview
The interview is, finally, a project-wide extension of the design. It adds a reflective component to the controlled reading tasks: learners are not only recorded, but also asked about their own pronunciation, perceived difficulties, and striking phenomena observed during the recording process. This means that the corpus includes not only external observation, but also the learners' own perspective. For the study of learner pronunciation, this is especially important because it documents not only realisations, but also metalinguistic judgments and subjective perceptions of difficulty.
At the same time, the interview component shows that the Spanish corpus is part of a broader working structure in which research, data collection, and material development are thought together from the outset. Information on the contributors to the overall project is gathered on Team & contributors.
References
- (I)FEC: (Inter-)Fonología del Español Contemporáneo.
- MAR.ELE: Corpus sobre la pronunciación del español por aprendientes de ELE en Marburg.