-
-
-
pkuseg-python Public
Forked from lancopku/pkuseg-pythonThe pkuseg toolkit for multi-domain Chinese word segmentation
Python MIT License UpdatedJan 7, 2026 -
fairseq2 Public
Forked from facebookresearch/fairseq2FAIR Sequence Modeling Toolkit 2
Python MIT License UpdatedJul 18, 2025 -
-
fairseq Public
Forked from facebookresearch/fairseqFacebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License UpdatedJan 9, 2025 -
python-ruwordnet Public
A Python wrapper for the RuWordNet thesaurus
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedNov 8, 2024 -
lingtrain-aligner Public
Forked from averkij/lingtrain-alignerLingtrain Aligner — ML powered library for the accurate texts alignment.
Python GNU General Public License v3.0 UpdatedSep 23, 2024 -
awesome-translations Public
Forked from mbiesiad/awesome-translations😎 Awesome lists about Internationalization & localization stuff. l10n, g11n, m17n, i18n. Translations! 🌎🌍
-
awesome-machine-translation Public
Forked from maidis/awesome-machine-translationA list of awesome Machine Translation frameworks, libraries, software and papers
Creative Commons Zero v1.0 Universal UpdatedSep 18, 2024 -
encodechka Public
The tiniest sentence encoder for Russian language
-
arm_treebank_tokenizer Public
Forked from Armtreebank/TokenizerTokenization module of ArmTreeBank
Python Other UpdatedJun 19, 2024 -
-
compress-fasttext Public
Tools for shrinking fastText models (in gensim format)
-
-
sentence-transformers Public
Forked from huggingface/sentence-transformersMultilingual Sentence & Image Embeddings with BERT
Python Apache License 2.0 UpdatedMar 13, 2024 -
flores-OLDI Public
Forked from openlanguagedata/floresThe FLORES+ Machine Translation Benchmark
TeX Creative Commons Attribution Share Alike 4.0 International UpdatedFeb 20, 2024 -
apertium-python Public
Forked from apertium/apertium-pythonnow you can even use apertium from python
Python GNU General Public License v3.0 UpdatedFeb 19, 2024 -
mteb Public
Forked from embeddings-benchmark/mtebMTEB: Massive Text Embedding Benchmark
Python Apache License 2.0 UpdatedFeb 13, 2024 -
stopes Public
Forked from facebookresearch/stopesA library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.
Python MIT License UpdatedNov 30, 2023 -
weirdMath Public
some examples of "high math" that we can play with
-
-
dialogic Public
Yet another common Python wrapper for Alice and Salut skills and bots in Telegram, VK, and Facebook
-
-
-
-
AI4TALK Public
Forked from AIRI-Institute/AI4TALKPython Creative Commons Attribution Share Alike 4.0 International UpdatedNov 12, 2022 -
transformer-contributions-nmt Public
Forked from mt-upc/transformer-contributions-nmtJupyter Notebook Apache License 2.0 UpdatedOct 6, 2022 -
RussianSuperGLUE Public
Forked from RussianNLP/RussianSuperGLUERussian SuperGLUE benchmark
Jupyter Notebook MIT License UpdatedJun 23, 2022