Song, Sanghoun & Song, Ji Young. 2014. A Data Compilation of Multiple Case-marking Constructions: Using the Sejong Spoken Corpus. Language Information. Volume 19. 57-90. The present study builds a language dataset of multiple case-marking constructions in Korean. Exploiting the Sejong Spoken Corpus, we extracted 1,021 sentences in which the nominative marker ‘-i/ka’ or the accusative marker ‘-ul/lul’ occurs twice or more. These sentences were annotated with respect to 47 linguistic parameters that previous studies assume to interact with multiple case-marking constructions. The parameters are divided into five subgroups: (i) distribution, (ii) semantic relation, (iii) nominal category, (iv) predication, and (v) discourse. The constructed data are analyzed numerically, and their content characteristics are also examined. The numerical analysis looks into the proportion of each parameter and the correlation between pairs of parameters. The content analysis focuses on how multiple case-marking constructions are realized in naturally occurring conversations. The entire dataset constructed in this study will be distributed so that other linguists can readily use it for their own research purposes.

 

Key words: multiple case-marking, nominative marker, accusative marker, the Sejong Spoken Corpus, online workbench, corpus annotation