A New Method of Text Categorization on Imbalanced Datasets

机译：一个新的简单数据集文本分类方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper aims at improving the categorization performance of the small number of samples in the imbalance datasets, and dealing with data re-sampling from the perspective of data. The main idea is to make the number of various types of texts by increasing some texts. The experiment indicates that the system has improved the accuracy of text-categorization effectively.

机译：本文旨在提高不平衡数据集中少量样本的分类性能，并从数据的角度处理数据重新采样。主要思想是通过增加一些文本来制作各种文本的数量。实验表明，该系统有效地提高了文本分类的准确性。

著录项

来源
《International Workshop on Education Technology and Trainin》|2008年||共4页
会议地点
作者
LI Xin-fu; YU Yan; YIN Peng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 P3-53;
关键词
Text categorization; Imbalanced dataset; SVM;

机译：文本分类;不平衡数据集;SVM;

相似文献

外文文献
中文文献
专利

1. Improved Feature-Selection Method Considering the Imbalance Problem in Text Categorization [J] . JiemingYang, ZhaoyangQu, ZhiyingLiu ScientificWorldJournal . 2014,第3期

机译：提高了考虑文本分类中的不平衡问题的特征选择方法
2. Evaluation of feature selection methods for text classification with small datasets using multiple criteria decision-making methods [J] . Kou Gang, Yang Pei, Peng Yi, Applied Soft Computing . 2020,第期

机译：使用多种标准决策方法对小型数据集的文本分类特征选择方法的评估
3. Term evaluation metrics in imbalanced text categorization [J] . Naderalvojoud Behzad, Sezer Ebru Akcapinar Natural language engineering . 2020,第1期

机译：不平衡文本分类中的术语评估指标
4. A New Method of Text Categorization on Imbalanced Datasets [C] . Li Xin-fu, Yu Yan, Yin Peng Education Technology and Training, 2008. and 2008 International Workshop on Geoscience and Remote Sensing. ETT and GRS 2008 . 2009

机译：不平衡数据集文本分类的新方法
5. Active learning with support vector machines for imbalanced datasets and a method for stopping active learning based on stabilizing predictions. [D] . Bloodgood, Michael. 2009

机译：支持向量机用于不平衡数据集的主动学习，以及一种基于稳定预测的主动学习停止方法。
6. Improved Feature-Selection Method Considering the Imbalance Problem in Text Categorization [O] . Jieming Yang, Zhaoyang Qu, Zhiying Liu -1

机译：文本分类中考虑不平衡问题的改进特征选择方法
7. Handling imbalanced dataset in multi-label text categorization using Bagging and Adaptive Boosting [O] . Genta Indra Winata, Masayu Leylia Khodra 2015

机译：使用袋装和自适应升压处理多标签文本分类中的不平衡数据集

A New Method of Text Categorization on Imbalanced Datasets

摘要

著录项

相似文献

相关主题

期刊订阅