Results on R2001(Topic part) data (101 categories in training set,103 categories in test set,excluding "None" category)

3.0 kNN(standard)

Result tuned for microf1 and Result tuned for macrof1

Result tuned for microf0.5 and Result tuned for macrof0.5
The setting is: k=100, fs=8000, fbr=0.1(for macro avg. f0.5) and 0.5(for micro avg. f0.5)
 

3.2 Rocchio on 2001t
The result tuned for microf1 and the result tuned for macrof1   (I also tried rcut and the result is much worse)
The result tuned for both micro and macro f0.5

The graph tuning feature selection number (5000 for both micro and macro avg. performance)
The graph tuning fbr score(0.3 for micro avg. performance and 0.2 for macro avg. performance, generally, 0.2 is OK)
The graph tuning pmax (3000 for both micro and macro avg. performance)
The graph tuning beta(-1 for both micro and macro avg. performance)

3.3 NB(rainbow)
The result tuned for microf1 and the result tuned for macrof1
(for micro avg. result, all the features are used. fbr=0.3.   For macro avg. result, 3000 top features are used. fbr=0.)
 

3.4 SVM on 2001t

Result tuned for microf1
Result tuned for macrof1
Result tuned for macrof0.5
 
 
 

Conclusion:

bar graph