Rahmad Mahendra (@rmahendrarm) 's Twitter Profile
Rahmad Mahendra

@rmahendrarm

NLP, Badminton 🇮🇩 | PhD student @ARC_AIMedTech @RMITComputing | @FASILKOM_UI

ID: 1394792713006981122

Joined: 18-05-2021 23:11:36

18 Tweets

78 Followers

323 Following

Aran Komatsuzaki (@arankomatsuzaki) 's Twitter Profile Photo

Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets By manually auditing the quality of 205 language-specific corpora, they find that lower-resource corpora have systematic issues in quality. arxiv.org/abs/2103.12028

Anna Rogers (@annargrs) 's Twitter Profile Photo

#NLPaperAlert: QA Dataset Explosion!🔥 A survey of 200+ QA/RC datasets proposing a taxonomy of formats & reasoning skills. Also in the bag: modalities, conversational QA, domains & beyond-English data. Honored to work on this with Matt Gardner & Isabelle Augenstein arxiv.org/abs/2107.12708

Jimmy Lin (@lintool) 's Twitter Profile Photo

Yesterday Rodrigo Nogueira, Andrew Yates, and I wrapped up the final preproduction version of "Pretrained Transformers for Text Ranking: BERT and Beyond" - posted on arXiv as v3: arxiv.org/abs/2010.06467 now in the hands of Morgan & Claypool Publishers and will be in print soon!

Rahmad Mahendra (@rmahendrarm) 's Twitter Profile Photo

We are excited to share that our paper, "IndoNLI: A Natural Language Inference Dataset for Indonesian", is accepted at EMNLP 2021 main conference. Thanks and congrats to my co-authors: Clara Vania Alham Fikri Aji samuel_louvan Fahrurrozi Rahman #EMNLP2021 #NLProc

Jia-Bin Huang (@jbhuang0604) 's Twitter Profile Photo

How to write a paper that looks like a good one? You worked super hard and did great research, but somehow the reviewer 2 just doesn't buy it. Why? 🤔 It's probably because your paper does not look like a good paper *visually*. 🙄 How? 👇👇👇 #AcademicTwitter

Alham Fikri Aji (@alhamfikri) 's Twitter Profile Photo

Did you know that 700+ languages are spoken among 200M+ people in 🇮🇩Indonesia? Yet only a tiny portion of them has been explored in the NLP world. Our upcoming #acl2022nlp paper describes Indonesian NLP's progress, challenges & opportunities. arxiv.org/abs/2203.13357 [1/6]

Alham Fikri Aji (@alhamfikri) 's Twitter Profile Photo

Finding Indonesian NLP resources is difficult, let's change that! If you have any NLP resources for Indonesian languages, you can share them through 🇮🇩NusaCrowd initiative, and be our co-author for our upcoming paper📜! Check our Github github.com/IndoNLP/nusa-c…

Rahmad Mahendra (@rmahendrarm) 's Twitter Profile Photo

Multilingual sentiment analysis dataset in Acehnese, Balinese, Banjarese, Buginese, Toba Batak, Madurese, Minangkabau, Javanese, (Dayak) Ngaju, Sundanese

Samuel Cahyawijaya (@scahyawijaya) 's Twitter Profile Photo

🚨 Exciting news! We are delighted to announce NusaCrowd, a new open-source initiative to collect and unite Indonesian NLP resources! 🇮🇩🇮🇩🇮🇩 Through NusaCrowd, we have gathered 137 datasets and 117 standardized data loaders covering text, audio, and image modalities💪🏾✨💕🌍

Genta Winata (@gentaiscool) 's Twitter Profile Photo

It has been 3⃣ years since we started the first initiative on the Indonesian benchmark, IndoNLU, and built IndoBERT as the foundation of IndoNLP 🇮🇩. We have seen so much progress 🥳 Repo: github.com/IndoNLP/indonlu follow the🧵to explore the journey ⛵️ #indonesian #indonlp @NLProc

Alham Fikri Aji (@alhamfikri) 's Twitter Profile Photo

🚨Join us on May 3rd to see our #eacl2023 poster and presentation on "NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages". This is our initiative to create an NLP resource for underrepresented 🇮🇩Indonesian languages. arxiv.org/abs/2205.15960

Alham Fikri Aji (@alhamfikri) 's Twitter Profile Photo

🇮🇩NusaX is awarded with Outstanding Paper Award 🎉 Amazing work by all coauthors. More work to come from Indonesian NLP community, stay tuned.
