Under "Parameters" you can change the language of your custom module.

This setting should match the language of your samples. We currently support the following languages:

  • English
  • Dutch
  • French
  • German
  • Italian
  • Portuguese
  • Spanish
  • Russian
  • Chinese
  • Japanese
  • Korean
  • Arabic
  • Danish
  • Swedish
  • Romanian
  • Hungarian
  • Finnish
  • Norwegian
  • Other / Multi-language

Selecting the correct language is important: MonkeyLearn uses this information for stemming and tokenization, and to select the default stopwords.
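To illustrate why the language setting matters, here is a minimal Python sketch of language-dependent preprocessing. It is not MonkeyLearn's implementation: the stopword lists, suffix lists, and the preprocess function are hypothetical toy examples, and a real stemmer is far more careful.

```python
import re

# Toy, hand-picked resources for illustration only (assumptions, not
# MonkeyLearn's actual stopword lists or stemming rules).
STOPWORDS = {
    "english": {"the", "is", "are", "a", "of", "and"},
    "spanish": {"el", "la", "los", "las", "es", "son", "de", "y"},
}
SUFFIXES = {
    "english": ("ing", "ed", "s"),
    "spanish": ("ando", "iendo", "es", "s"),
}


def preprocess(text, language):
    """Tokenize, drop stopwords, and crudely stem for the given language."""
    tokens = re.findall(r"\w+", text.lower())
    # Unknown languages fall back to no stopwords and no stemming.
    stopwords = STOPWORDS.get(language, set())
    suffixes = SUFFIXES.get(language, ())
    result = []
    for token in tokens:
        if token in stopwords:
            continue
        for suffix in suffixes:
            # Strip at most one suffix and keep at least three characters.
            if token.endswith(suffix) and len(token) - len(suffix) >= 3:
                token = token[: -len(suffix)]
                break
        result.append(token)
    return result


print(preprocess("The cats are sleeping", "english"))  # ['cat', 'sleep']
print(preprocess("The cats are sleeping", "spanish"))  # ['the', 'cat', 'are', 'sleeping']
```

With the matching language, the stopwords are removed and the remaining words are reduced to their stems. With the wrong language, the English stopwords are kept and the stemming is inconsistent, which gives the classifier noisier features.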

If we don’t support the language of your data yet, you should definitely try the Other / Multi-language option. You can get very good results without stemming, and you can override the default stopwords with your own list if you need to.
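As a rough picture of what processing without stemming and with your own stopwords looks like, here is a minimal Python sketch. The function name, the Catalan sample, and the stopword list are hypothetical and chosen only for illustration; this is not MonkeyLearn's code.

```python
import re


def tokenize_with_custom_stopwords(text, stopwords):
    """Lowercase and tokenize the text, then drop the given stopwords.

    No stemming is applied, so every remaining word is kept as written.
    """
    tokens = re.findall(r"\w+", text.lower())
    return [token for token in tokens if token not in stopwords]


# Hypothetical custom stopword list for a language without built-in support.
catalan_stopwords = {"el", "la", "a", "i", "de"}

print(tokenize_with_custom_stopwords("El gat dorm a la casa", catalan_stopwords))
# ['gat', 'dorm', 'casa']
```

Even without stemming, removing the most frequent function words leaves the content words that usually carry the signal for classification.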

If your samples use more than one language, for example in a language detection classifier, the Other / Multi-language option is the right choice.

If you edit this setting, remember to save the new configuration and then retrain the project (and redeploy it if you are using a live tree). The change will not take effect when classifying until you do.

