Institute of Global Studies
Tokyo University of Foreign Studies
Investigation of Words in a Japanese Closed Caption TV Corpus
For Japanese learners, we describe the specific details of TV program vocabulary and investigate what kinds of words are necessary for understanding the contents of TV scripts. We use our closed caption TV corpus over 1 billion words in size for the investigation of vocabulary. In this paper we will show different word statistics from various viewpoints such as the difference in years and the difference in parts-of-speech.