The Old Javanese Wordnet (OJW) is a lexical resource for Old Javanese, built from vocabulary extracted from the digitised Old Javanese–English Dictionary (Zoetmulder, 1982). It uses the Princeton WordNet synset hierarchy as a backbone and is linked to English and Indonesian.
The resource is developed by David Moeljadi and Zakariya Pamuji Aminullah.
Browse the wordnet at davidmoeljadi.github.io/OJW or download the databases from the latest release.
The source data is in wn-kaw.tab, one entry per line:
synset<TAB>lemma<TAB>variants
synset is the offset-pos from Princeton WordNet 3.0.
09815790-n pariwṛta pariwṛtta, pariwarta
00460735-a ahĕniṅ mahĕniṅ, ahniṅ
01332730-v tumutupi
The build script produces a WordNet LMF XML package and two Cygnet SQLite
databases, then deploys the web UI to docs/ for GitHub Pages.
- uv
curl,tar,xz,wget,xmlstarlet,python3libxml2-dev,libxslt-dev- The cygnet repository checked out as a
sibling directory:
../cygnet
# Debian/Ubuntu
sudo apt-get install -y curl tar xz-utils wget xmlstarlet libxml2-dev libxslt-dev
# macOS
brew install curl wget xmlstarlet libxml2 libxsltbash build.shThis:
- Clones required external data (CILI map, OMW scripts) into
external/ - Converts
wn-kaw.tabto WordNet LMF XML and validates it against the DTD - Packages the XML as
build/wnkaw-VERSION.tar.xz - Calls cygnet's build pipeline to produce two gzipped SQLite databases
- Copies the web UI and databases to
docs/
| File | Description |
|---|---|
build/wnkaw-VERSION.tar.xz |
WordNet LMF package (for release) |
docs/kaw-cygnet.db.gz |
Main Cygnet database — synsets, senses, forms, relations |
docs/kaw-provenance.db.gz |
Provenance database — per-row source attribution |
docs/index.html etc. |
Web UI (for GitHub Pages) |
bash run.shOpens http://localhost:8801 (or similar) serving docs/ — the full web UI
with the locally-built databases.
Releases are tagged YYYY.MM.DD and include three assets:
| Asset | Contents |
|---|---|
wnkaw-VERSION.tar.xz |
WordNet LMF XML package |
kaw-cygnet.db.gz |
Main Cygnet database |
kaw-provenance.db.gz |
Provenance database |
To create a release:
git tag 2026.03.14
git push origin 2026.03.14
gh release create 2026.03.14 \
--title "OJW 2026.03.14" \
--notes "Description of changes." \
docs/kaw-cygnet.db.gz \
docs/kaw-provenance.db.gz \
build/wnkaw-2026.03.14.tar.xzThe live web UI at davidmoeljadi.github.io/OJW fetches the databases from the
latest release automatically.
The web UI (docs/index.html) is the
Cygnet browser, configured for OJW via
docs/local.json. It runs entirely in the browser using
sql.js — no server-side component is needed.
Configuration is in etc/local.json; the build script copies it to docs/.
See cygnet/CUSTOMIZE.md for full documentation on
all available local.json fields.
If you use the Old Javanese Wordnet, please cite:
David Moeljadi and Zakariya Pamuji Aminullah (2020). Building the Old Javanese Wordnet. In Proceedings of LREC 2020, pages 2940–2946. ELRA.
For the web interface, please cite:
Rowan Hall Maudslay and Francis Bond (2026). Cygnet: A merged multilingual wordnet browser.
The Old Javanese Wordnet data is released under CC BY 4.0.