KNN is one of the most frequently used methods for text
categorization. High feature dimensionality and a skewed class
distribution degrade the performance of the classifier. An
improved KNN method for skewed class distributions is introduced in
this paper to solve the problem that large classes, which contain
more texts, are easily favored during
neighbor selection. First, text feature selection is performed
by an improved information gain method that makes more efficient
use of the class distribution information in the sample
training set. Then an improved KNN classifier based on the
skewed class distribution is used for categorization, which
addresses the bias toward large classes in the training set.
Experiments show that this method improves classification
performance.
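
The bias toward large classes during neighbor selection can be illustrated with a minimal sketch. The abstract does not specify how the improved classifier corrects for skew, so the code below assumes one simple option: each neighbor's similarity vote is divided by the size of its class in the training set. The function names (`cosine`, `knn_predict`), the `skew_aware` flag, and the toy data are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_predict(train, labels, query, k, skew_aware=True):
    """Similarity-weighted KNN vote.

    With skew_aware=True, each vote is divided by the size of the
    neighbor's class (an assumed correction, not the paper's formula).
    """
    class_size = Counter(labels)
    # Rank training documents by similarity to the query, keep the top k.
    neighbors = sorted(
        zip(train, labels),
        key=lambda pair: cosine(pair[0], query),
        reverse=True,
    )[:k]
    scores = {}
    for vec, label in neighbors:
        weight = cosine(vec, query)
        if skew_aware:
            weight /= class_size[label]
        scores[label] = scores.get(label, 0.0) + weight
    return max(scores, key=scores.get)


# Toy training set: 2 documents in a small class, 6 in a large class.
TRAIN = [
    [1.0, 0.0], [0.9, 0.1],
    [0.7, 0.7], [0.6, 0.8], [0.5, 0.9],
    [0.4, 1.0], [0.3, 1.0], [0.2, 1.0],
]
LABELS = ["small", "small", "big", "big", "big", "big", "big", "big"]
QUERY = [1.0, 0.2]  # closest to the two "small" documents

plain = knn_predict(TRAIN, LABELS, QUERY, k=5, skew_aware=False)
adjusted = knn_predict(TRAIN, LABELS, QUERY, k=5, skew_aware=True)
print(plain, adjusted)
```

With k = 5 the neighborhood holds both small-class documents and three large-class documents. The unadjusted vote is carried by the large class through sheer count, while the class-size-normalized vote selects the small class whose documents are actually nearest to the query.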