The Chinese Valence-Arousal Text (CVAT) is an affective corpus containing 2,009 sentences extracted from the web with six different categories: news articles, political discussion forums, car discussion forums, hotel reviews, book reviews, and laptop reviews. Each sentence is manually annotated with a real-valued score for both valence and arousal dimensions. The valence represents the degree of positive and negative sentiment, and arousal represents the degree of calm and excitement. Both dimensions range from 1 (highly negative or calm) to 9 (highly positive or excited). The scatter plot of the CVAT is shown below.
Click here to download CVAT 1.0 (2,009 sentences; released on August 1, 2016).
Liang-Chih Yu, Lung-Hao Lee, Shuai Hao, Jin Wang, Yunchao He, Jun Hu, K. Robert Lai, and Xuejie Zhang. 2016. Building Chinese affective resources in valence-arousal dimensions. In Proceedings of NAACL/HLT-16, pages 540-545.
Department of Information Management, Yuan Ze Universitylcyu@saturn.yzu.edu.tw