Popular repositories Loading
-
cjclassifier
cjclassifier PublicLanguage classifier to disambiguate Chinese (simplied,traditional) vs Japanese ideograph sequences.
Java 1
-
langidentify
langidentify PublicLangIdentify is a fast, high-accuracy language detection library for Java. It combines ngram classification with a "topwords" signal that boosts accuracy on short or ambiguous text. Models were tra…
Java 1
Repositories
- cjclassifier Public
Language classifier to disambiguate Chinese (simplied,traditional) vs Japanese ideograph sequences.
jlpka/cjclassifier’s past year of commit activity - langidentify Public
LangIdentify is a fast, high-accuracy language detection library for Java. It combines ngram classification with a "topwords" signal that boosts accuracy on short or ambiguous text. Models were trained on the Wikipedia corpus and cover 80+ languages.
jlpka/langidentify’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…