Introduction

Chinese Valence-Arousal Words (CVAW) is an affective lexicon containing 2,802 single words. Each word is manually annotated with real-valued scores on two dimensions, valence and arousal. Valence represents the degree of positive or negative sentiment, and arousal represents the degree of calm or excitement. Both dimensions range from 1 (highly negative or calm) to 9 (highly positive or excited). The scatter plot of the CVAW is shown below.




Format

No.   Word  Valence_Mean  Valence_SD  Arousal_Mean  Arousal_SD  Frequency
286   乏味  3.4           0.800       3.0           1.414       25
559   放鬆  6.2           0.748       2.0           0.894       78
983   勝利  7.8           0.748       7.2           1.166       174
1099  痛苦  2.4           0.490       6.8           0.748       369

  • Valence_Mean, Valence_SD, Arousal_Mean, Arousal_SD: Mean and standard deviation of the valence and arousal ratings for each word.
  • Frequency: Word frequency retrieved from the Sinica Corpus 4.0.
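
The column layout above can be read with a few lines of Python. This is only a minimal sketch: it assumes the rows are whitespace-separated exactly as in the sample shown here (the delimiter of the distributed file is not stated on this page), and the names `CvawEntry`, `parse_cvaw`, and `SAMPLE` are hypothetical, introduced for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class CvawEntry:
    """One lexicon row; field names mirror the column headers."""
    number: int
    word: str
    valence_mean: float
    valence_sd: float
    arousal_mean: float
    arousal_sd: float
    frequency: int


def parse_cvaw(text: str) -> List[CvawEntry]:
    """Parse whitespace-separated rows (assumed layout), skipping the header and blank lines."""
    entries = []
    for line in text.splitlines():
        fields = line.split()
        # A data row has exactly seven fields and starts with a numeric index.
        if len(fields) != 7 or not fields[0].isdigit():
            continue
        entries.append(
            CvawEntry(
                number=int(fields[0]),
                word=fields[1],
                valence_mean=float(fields[2]),
                valence_sd=float(fields[3]),
                arousal_mean=float(fields[4]),
                arousal_sd=float(fields[5]),
                frequency=int(fields[6]),
            )
        )
    return entries


# The four sample rows from the Format section.
SAMPLE = """\
No. Word Valence_Mean Valence_SD Arousal_Mean Arousal_SD Frequency
286 乏味 3.4 0.800 3.0 1.414 25
559 放鬆 6.2 0.748 2.0 0.894 78
983 勝利 7.8 0.748 7.2 1.166 174
1099 痛苦 2.4 0.490 6.8 0.748 369
"""

entries = parse_cvaw(SAMPLE)
by_word = {entry.word: entry for entry in entries}
print(by_word["勝利"].valence_mean, by_word["勝利"].arousal_mean)
```

Indexing the entries by word, as in `by_word`, makes it easy to look up the valence and arousal ratings of individual tokens. If the actual file uses tabs or commas, the `line.split()` call would need to be adapted.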

Download

CVAW 1.0 (1,653 words; released on May 2, 2016): The official training set used in the IALP 2016 Shared Task on Dimensional Sentiment Analysis for Chinese Words.

CVAW 2.0 (2,802 words; released on February 22, 2017): CVAW 1.0 plus the official test set used in the same shared task.


Reference

Liang-Chih Yu, Lung-Hao Lee, Shuai Hao, Jin Wang, Yunchao He, Jun Hu, K. Robert Lai, and Xuejie Zhang. 2016. Building Chinese affective resources in valence-arousal dimensions. In Proceedings of NAACL/HLT-16, pages 540-545.


Contact

Liang-Chih Yu
Associate Professor
Department of Information Management, Yuan Ze University
lcyu@saturn.yzu.edu.tw