Paper ID: 2302.14708

Is Japanese CCGBank empirically correct? A case study of passive and causative constructions

Daisuke Bekki, Hitomi Yanaka

The Japanese CCGBank serves as training and evaluation data for developing Japanese CCG parsers. However, since it is automatically generated from the Kyoto Corpus, a dependency treebank, its linguistic validity still needs to be sufficiently verified. In this paper, we focus on the analysis of passive/causative constructions in the Japanese CCGBank and show that, together with the compositional semantics of ccg2lambda, a semantic parsing system, it yields empirically wrong predictions for the nested construction of passives and causatives.

Submitted: Feb 28, 2023

Topics

Case Study
Causal Structure
Categorial Grammar
Compositional Semantics
Japanese Corpus
Japanese Business Domain

Links

arXiv PDF