Skip to content
View fajri91's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report fajri91

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[EACL 2026 Main] Framework to construct a Cultural Commonsense Knowledge Graph( CCKG) that have geographical context.

Python 3 Updated Jan 28, 2026

paper list, dataset, and tools for radiology report generation

410 38 Updated Apr 15, 2026

A curated list of research papers and resources on Cultural LLM.

52 3 Updated Sep 26, 2024
Python 29 8 Updated Sep 17, 2024

Open Implementations of LLM Analyses

Jupyter Notebook 108 10 Updated Oct 8, 2024

A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.

Python 98 55 Updated Mar 16, 2026

NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented and extremely low-resource Indonesian local languages.

Jupyter Notebook 28 1 Updated Sep 27, 2024
Python 38 14 Updated Oct 10, 2023

Multicultural Proverbs and Sayings

Python 13 Updated Jan 11, 2025
Python 2 Updated Dec 6, 2022

CMMLU: Measuring massive multitask language understanding in Chinese

Python 808 68 Updated Dec 6, 2024

A Multilingual Replicable Instruction-Following Model

Python 97 3 Updated Jun 11, 2023
Python 1 Updated Sep 16, 2022

Discourse Probing of Pretrained Language Models. In Proceedings of NAACL 2021.

Jupyter Notebook 10 1 Updated Jun 27, 2022

A framework for assessing and improving classification fairness.

Jupyter Notebook 33 9 Updated Jun 12, 2023

High-quality parallel resource on sentiment analysis for 10 low-resource Indonesian languages, English, and Indonesian (Outstanding Paper at EACL 2023)

Jupyter Notebook 110 10 Updated May 8, 2023

Minangkabau NLP corpus. PACLIC 2020

Python 10 2 Updated Jun 7, 2021

Evaluating the Efficacy of Summarization Evaluation across Languages. In Findings of ACL 2021.

Jupyter Notebook 2 1 Updated Jul 26, 2021

Indonesia Sentiment Lexicon

140 30 Updated Aug 5, 2019

IndoNLI

Python 18 3 Updated Dec 4, 2021

KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation

Python 30 7 Updated Aug 31, 2021

EACL 2021

Python 11 4 Updated May 4, 2021

IndoBERTweet is the first large-scale pretrained model for Indonesian Twitter. Published at EMNLP 2021 (main conference)

Python 72 6 Updated Sep 13, 2021

Complete Web Scraping of TED.com for Metadata, Transcript, Audio, Video, Images using Parallel Programming

Jupyter Notebook 11 6 Updated Jun 25, 2020

Classification of twitter user's personality based on their tweets. Big Five Model used to classify the personality.

Python 15 6 Updated Aug 30, 2020

The Dataset for Hate Speech Detection in Indonesian (Bahasa Indonesia)

29 16 Updated Jul 6, 2022
Next