摘要:In this paper, we propose a cost-sensitive twin SVM (cs-tsvm) and apply it to imbalanced data. A weight is added to each instance according to its cost of misclassification which is related to its position. In preprocessing part, features are selected by their difference of majority and minority classes. The feature is selected when its difference value is higher than average one. The experiment is conducted on UCI datasets and G-mean, AUC and accuracy are evaluation metrics. The experimental results show that Feature selection with CS-TWSVM is useful for datasets with high dimension.
关键词:Keywords:enTwin SVMcost-sensitivefeature selectionimbalanced data