Bahasa Indonesia Open Source NLP Resource

moved from here

Few people may know about the open-source resources available for Bahasa Indonesia NLP, since they are scattered all over GitHub. Here are a few that I know of; I hope they help others get started on their NLP projects:

Negative and Positive Unigrams


Stopword List



  1. (Python)
  2. (Java)

MWE (Multi Word Expression) Lists

  1. (see the resources folder)

Twitter Sample Corpus

  1. (in the CSV files)
Written on October 8, 2016