Compilation of a learners’ Corpus of Chinese Students learning Spanish

Authors

  • Rodrigo Muñoz Cabrera, Universidad Internacional de La Rioja
  • Lan Zhang, Universidad Complutense de Madrid

DOI:

https://doi.org/10.32870/mycp.v13i39.898

Keywords:

corpus linguistics, error analysis, interlanguage, linguistic transfer, Spanish as a second language

Abstract

Analyzing what learners of a foreign language produce is crucial for defining the parameters by which the language is acquired, which makes the development and use of learner corpora indispensable. The Covid-19 pandemic changed teaching methodology: because in-person classes were impossible and students had to submit their work electronically for correction, it became possible to compile a corpus from the essays of Chinese learners studying Spanish at Nankai University (China). This article details the steps for designing and compiling a textual corpus and the tools for its analysis, specifically the program AntConc. After creating and analyzing the corpus, the authors conclude that it makes it possible to identify the deficiencies and difficulties students face, which in turn supports the formation of linguistic learning models and the improvement of existing ones. Moreover, incorporating the corpus in the classroom enables the individualization of students’ linguistic learning and the generation of theories about language use.

Published

2024-08-27 (updated on 2024-08-29)

How to Cite

Muñoz Cabrera, R., & Zhang, L. (2024). Compilation of a learners’ Corpus of Chinese Students learning Spanish. México Y La Cuenca Del Pacífico, 13(39), 81-110. https://doi.org/10.32870/mycp.v13i39.898 (Original work published August 27, 2024)