Notes for using Chinese LIWC
1. Macintosh only; text files must be saved in UTF-8 format
Currently, Chinese LIWC runs only on Macintosh systems, and the text files must be saved in UTF-8 format.
2. Chinese words must first be segmented with spaces
In English texts, words are separated by spaces, but Chinese sentences contain no spaces. For LIWC to analyze a Chinese text accurately, some preliminary work is therefore required: the words in the Chinese text files must first be separated by spaces. This can be done manually, but we suggest using software for the task. One reliable service, among others, is the Chinese Word Segmentation System (CWSS, http://ckipsvr.iis.sinica.edu.tw/), developed by Academia Sinica in Taiwan. Please note that the CWSS output adds extra information or markers in parentheses after each word, and this added information has to be removed.
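The cleanup step described above can be sketched in Python. This is a minimal illustration, not the authors' interface program. It assumes the segmenter writes each word followed by a marker of ASCII letters or digits in half-width parentheses, for example 我(Nh), and that words may be separated by full-width spaces. The function name strip_segmenter_tags and the marker pattern are hypothetical; adjust the pattern to match the output you actually receive.

```python
import re

# Assumed marker shape: ASCII letters/digits inside half-width parentheses,
# e.g. "(Nh)". This is an assumption about the segmenter output, not a spec.
TAG_PATTERN = re.compile(r"\([A-Za-z][A-Za-z0-9_]*\)")


def strip_segmenter_tags(line: str) -> str:
    """Remove parenthesized markers and separate words with single spaces."""
    without_tags = TAG_PATTERN.sub("", line)
    # str.split() with no argument splits on any Unicode whitespace,
    # including the full-width space U+3000.
    return " ".join(without_tags.split())


if __name__ == "__main__":
    sample = "我(Nh)\u3000喜歡(VK)\u3000蘋果(Na)"
    print(strip_segmenter_tags(sample))
```

Parenthesized Chinese text in the original document is left untouched, because the pattern only matches markers made of ASCII letters and digits.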
3. Convert full-width punctuation to half-width
If punctuation use is of interest to your research, note that all full-width (full-form) punctuation marks must be converted to their half-width (half-form) equivalents in order to be recognized correctly by LIWC2007. We have also provided a table showing the corresponding punctuation marks in Chinese and English texts.
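The conversion can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' table. It relies on the Unicode layout in which the full-width forms U+FF01 to U+FF5E sit exactly 0xFEE0 above their ASCII counterparts, and it converts only punctuation, leaving full-width letters and digits alone. The EXTRA_MAP entries for marks outside that block (such as the ideographic full stop) are hypothetical choices; the correspondence table mentioned above should take precedence.

```python
import string

# Hypothetical mappings for CJK marks outside the full-width ASCII block.
# Replace these pairs with the ones from the correspondence table you use.
EXTRA_MAP = {"\u3002": ".", "\u3001": ","}


def build_punctuation_table() -> dict:
    """Build a str.translate table from full-width to half-width punctuation."""
    table = {}
    for code in range(0xFF01, 0xFF5F):  # U+FF01 .. U+FF5E inclusive
        half = chr(code - 0xFEE0)       # matching ASCII character
        if half in string.punctuation:  # skip full-width letters and digits
            table[code] = half
    for full, half in EXTRA_MAP.items():
        table[ord(full)] = half
    return table


PUNCT_TABLE = build_punctuation_table()


def to_half_width_punctuation(text: str) -> str:
    """Replace full-width punctuation marks with half-width equivalents."""
    return text.translate(PUNCT_TABLE)


if __name__ == "__main__":
    print(to_half_width_punctuation("你好\uff0c世界\uff01"))
```

Using str.translate keeps the conversion to a single pass over the text, and the table can be extended without changing the function.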
4. Interface program for integrating the processing needed for CKIPS output
We have developed an interface program that integrates the processing steps needed for CKIPS output. The program has been tested and found to be reliable. It is ready for release; please contact us if you need it.