What is AMENA?
- AMENA is a machine learning project that focuses on analyzing social media such as Twitter and News. Specifically, it is built for investigating political phenomena in Indonesia based on big data computation and the latest artificial intelligent approach. AMENA aims to provide accountable insight and information for government, policymakers, and society related to political events or the status quo in social media.
- AMENA is a tool for analyzing and investigating public opinions on political developments in Indonesia through automatic text analysis.
Preliminary works (partially used as benchmark and knowledge base for this project) has been published in some international conferences, including:
- Fajri Koto, and Gemala Y. Rahmaningtyas “InSet Lexicon: Evaluation of a Word List for Indonesian Sentiment Analysis in Microblogs”. IEEE in the 21st International Conference on Asian Language Processing (IALP), Singapore, 2017. [paper]
- Fajri Koto, and Mirna Adriani. “HBE: Hashtag-Based Emotion Lexicons for Twitter Sentiment Analysis”. ACM in The 6th Forum for Information Retrieval (FIRE 2015), Gandhinagar, India, December 2015 [paper]
- Fajri Koto, and Mirna Adriani. “A Comparative Study on Twitter Sentiment Analysis: Which Features are Good?”. Springer in The 20th International Conference on Applications of Natural Language To Information Systems (NLDB 2015), Passau, Germany, June 2015 [paper]
- Fajri Koto, and Mirna Adriani. “The Use of POS Sequence for Analyzing Sentence Pattern in Twitter Sentiment Analysis”. IEEE in The 8th International Symposium on Mining and Web (MAW15), Gwangju, Korea (join with the 29th AINA Conference), March 2015 [paper].
What are the features of AMENA?
Currently, we have built three primary functions of AMENA. In the future, we are planning to enhance the model and features.
- Crawler
- Analyzer: Network analysis and Sentiment Analysis
- Data visualization
How is AMENA constructed?
According to our knowledge, most of the machine learning systems in Indonesia (especially for analyzing text in the political domain) still use old-fashion approaches such as n-gram, lexicon-based technique, or traditional classification such as Naive Bayes, SVM, or regression. AMENA is different as it implements the state of the art of NLP for Bahasa Indonesia. Please refer to the recent NLP lecture from Stanford to get some understanding about sequence-to-sequence model. In the future, we will enhance this architecture with pre-trained-network-models such as BERT (from Google) and GPT (from Open AI).
Currently, this is our neural architecture for classifying sentiment. Please keep in mind that we do it in two stages, as we involve neutral vs non-neutral, and postive vs negative prediction.
What has AMENA been used for?
AMENA has collaborated with NextPolicy on these following projects:
- Analisis Pelantikan Kabinet / Menteri Jokowi (September-November 2019). Featured in Tempo, Merdeka, Republika, Jawapos, Tribunnews, LineToday, Media Indonesia, Tirto, BeritaSatu, SindoNews, Radar Madura.
- Analisis 100 hari kepresidenan Jokowi (ongoing project, December 2019 on wards)