A multi-domain sentiment classification model based on sample filtering and transfer learning

doi:10.3969/j.issn.0253-2778.2019.01.002

Abstract

Most sentiment classification models are trained and tested on a single dataset. However, parameters learned on one dataset do not transfer well to another, so such models are not generic. This paper proposes a multi-domain sentiment classification model (MDSC). Through sample filtering and transfer learning, the trained model can be applied to datasets from multiple domains, making it more widely applicable and extensible. Specifically, a document is first mapped to a domain distribution, which serves as a bridge between domain classification and sentiment classification; sentiment classification is then performed. To make the model more generic, representative data samples must be selected: MDSC constructs a domain-independent sentiment lexicon to filter the sentences within each document and thereby obtain a high-quality training dataset. In addition, to improve classification accuracy and reduce training time, parameter-based transfer learning with neural networks is used to obtain the document embeddings used for classification. Extensive experiments on datasets covering 15 different domains show that the proposed model outperforms traditional models when applied to datasets from multiple domains.
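
The sample-filtering step can be pictured with a minimal sketch. It is not the authors' implementation: the abstract does not say how the lexicon is built or what the filtering criterion is, so the tiny `SENTIMENT_LEXICON`, the naive sentence splitter, and the rule "keep a sentence if it contains at least one lexicon word" are all assumptions made for illustration.

```python
import re

# Hypothetical, tiny domain-independent sentiment lexicon (an assumption for
# illustration; the abstract does not give the paper's actual lexicon).
SENTIMENT_LEXICON = {"good", "great", "excellent", "bad", "poor", "terrible"}


def split_sentences(document):
    """Naively split a document after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", document.strip())
    return [p for p in parts if p]


def filter_sentences(document, lexicon=SENTIMENT_LEXICON):
    """Keep only sentences that contain at least one lexicon word.

    This keep/drop rule is an assumed criterion, not the paper's.
    """
    kept = []
    for sentence in split_sentences(document):
        tokens = set(re.findall(r"[a-z']+", sentence.lower()))
        if tokens & lexicon:
            kept.append(sentence)
    return kept


doc = (
    "I bought this phone last week. The screen is great. "
    "It arrived on Tuesday. The battery is terrible!"
)
filtered = filter_sentences(doc)
```

In this sketch, sentences without any sentiment-bearing word are dropped, so only the opinionated sentences of each document would reach the training set.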

Key words: sentiment classification, sample filtering, transfer learning, sentiment lexicon, neural network

QU Zhaowei,ZHAO Yanjiao,WANG Xiaoru. A multi-domain sentiment classification model based on sample filtering and transfer learning[J]. Journal of University of Science and Technology of China, 2019, 49(1): 8-14.

