Max features sets the maximum number of features to be used to characterize texts in the training/classification process. This number affects how many computation resources are needed to train the model. More features means more computation times. Also classification times are affected.

Intuitively, adding more features should improve results, but this is not always the case, adding more features could decrease accuracy and also increase computation times.

The default value is set in 10,000 features which is a reasonable value for text mining applications. More features may allow better representation of the model, but too many may have the risk of overfitting the data, and decrease the generalization performance.

Did this answer your question?