Construction of a Learner Corpus for Japanese Language Learners: Natane and Nutmeg
AbstractJapanese language learners aim to acquire reading, listening, writing and speaking skills. We at the Hinoki project (https://hinoki-project.org/) have recently been working on the Natsume collocation search system (https://hinoki-project.org/natsume/), the Natane learner corpus to support Natsume (https://hinoki-project.org/natane/) and the Nutmeg writing support system (http://hinoki-project.org/nutmeg/). In order to test the effectiveness of Nutmeg, we conducted an online experiment with 36 participants who used the system's register misuse identification feature to correct four writing assignments. Results show that Nutmeg can be an effective tool in correcting common register-related errors, especially those involving auxiliary verbs. However, the accuracy of verb and adverb identification was too low, suggesting the need for improvements in the variety of corpora used for identifying register misuse.
Cao, H., Yagi, Y., Kuroda, F., & Nishina, K. (2012, August). Construction of learner corpus Natane and possible application. (pp. 1–4). 5th international conference on Computer Assisted Systems for Teaching & Learning Japanese (CASTEL/J). Nagoya. Retrieved from http://2012castelj.kshinagawa.com/proceedings/Poster/Poster5_Cao.pdf
Halliday, M. K & Hasan, R. (1976). Cohesion in English. London: Longman.
Hodošček, B., Abekawa, T., Bekeš, A., & Nishina, K. (2011). Assisting co-occurrence production in report writing: Evaluation of writing assistance tool Natsume. Journal of Technical Japanese Education, 13, 33–40. doi:10.11448/jtje.13.33
Hodošček, B. & Nishina, K. (2011, August). On the treatment of register in writing assistance systems. (Vol. 2, pp. 522–523). International Conference on Japanese Language Education 2011. Tianjin, China.
Hodošček, B. & Nishina, K. (2012). Japanese learning support systems: Hinoki project report. Acta Linguistica Asiatica, 2(3) Lexicography of Japanese as a Second/Foreign Language (Part 2), 95–124. DOI: 10.4312/ala.2.3.95-124. Retrieved from http://revije.ff.uni-lj.si/ala/article/view/221
Ichikawa, Y. (1997). A Dictionary of Japanese Language Learners’ Errors. Bonjinsha.
Imai, S. & Kuroda, F. (2012). A method of associating j-cat with other tests. Departmental Bulletin of the Tsukuba University International Student Center on Japanese Language Education, (27), 57–66.
Ishioka, T. (2008, January). Latest trends in automated essay scoring and evaluation. Journal of Japanese Society for Artificial Intelligence (Special Issue on the Automatic Evaluation of Text), 23(1), 17–24. Retrieved from http://ci.nii.ac.jp/naid/110006570441/en/
Nishina, K., Kamada, M., Cao, H., Utashiro, T., & Muraoka, T. (Eds.). (2012). Nihongo gakushūsien no kōchiku: Gengo kyōiku kōpasu shisutemu kaihatu [Constructing Japanese language learning: Language education, corpus and system development]. Tokyo, Japan: Bonjinsha
Teramura, H. (1990). Gaikokujin gakushūsha no nihongo goyōreishū [Collection of errors from learners of Japanese as a foreign language]. Osaka University. Retrieved from http://www.ninjal.ac.jp/teramuragoyoureishu/pdf
Yagi, Y., Hodošček, B., & Nishina, K. (2012, March). BCCWJ to gakushūsha sakubun kōpasu o riyōshita nihongo sakubun shien [Japanese writing assistance using the BCCWJ and a learner corpus]. (pp. 315-320). In Dai ikkai kōpasu nihongogaku wākushoppu yokōshū [Proceeding of the ﬁrst workshop on Japanese corpus linguistics]. Dai ikkai nihongo kōpasu wākushoppu [First workshop on Japanese corpus linguistics]. Tokyo.
Yagi, Y., Hodošček, B., Abekawa, T., & Nishina, K. (2014, March). Evaluation of Error Detection in Japanese Composition Support System “Nutmeg”. In Dai gokai kōpasu nihongo wākushoppu yokōshū [Proceedings of the 5th Workshop on Japanese Corpus Linguistics] (pp. 167–170). Dai gokai nihongo kōpasu wākushoppu [5th Workshop on Japanese Corpus Linguistics]. NINJAL.
Copyright (c) 2014 Kikuko NISHINA, Bor HODOŠČEK, Yutaka YAGI, Takeshi ABEKAWA
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors are confirming that they are the authors of the submitting article, which will be published online in journal Acta Linguistica Asiatica by Ljubljana University Press, Faculty of Arts (University of Ljubljana, Faculty of Arts, Aškerčeva 2, 1000 Ljubljana, Slovenia). Author’s name will be evident in the article in journal. All decisions regarding layout and distribution of the work are in hands of the publisher.
- Authors guarantee that the work is their own original creation and does not infringe any statutory or common-law copyright or any proprietary right of any third party. In case of claims by third parties, authors commit their self to defend the interests of the publisher, and shall cover any potential costs.
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-ShareAlike 4.0 International License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.