I need to build a classifier for text. Right now I am using TfidfVectorizer and SelectKBest to select features, as follows:
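from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.feature_selection import SelectKBest, chi2

# charset_error was renamed to decode_error in later scikit-learn releases
vectorizer = TfidfVectorizer(sublinear_tf=True, max_df=0.5, stop_words='english', decode_error='strict')
X_train_features = vectorizer.fit_transform(data_train.data)
y_train_labels = data_train.target
# keep the 1000 features with the highest chi-squared scores
ch2 = SelectKBest(chi2, k=1000)
X_train_features = ch2.fit_transform(X_train_features, y_train_labels)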
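After selecting the K best features, I would like to print the names (the text) of the selected features. Is there a way to do that? I only need to print the selected feature names. Should I perhaps use CountVectorizer instead?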