- Budapest, Hungary
-
19:50
(UTC +01:00) - https://gyorgy.orosz.link
- in/oroszgy
Highlights
NLP tools
🪼 a python library for doing approximate and phonetic matching of strings.
This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.
Library for clinical NLP with spaCy.
Open Source Data Annotation & Labeling Tools
spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface
This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
A simple library for training named entity recognition model from partially annotated data
Parse natural language time expressions in python
A list of publications on NLP interpretability (Welcome PR)
Python binding to Modest and Lexbor engines. Fast HTML5 parser with CSS selectors for Python.
A visual labeling system implemented in Jupyter widgets.
REMERGE - Multi-Word Expression discovery algorithm
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
MTEB: Massive Text Embedding Benchmark
OpenRefine is a free, open source power tool for working with messy data and improving it
Unsupervised text tokenizer focused on computational efficiency
✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3
Export Hugging Face models to Core ML and TensorFlow Lite
Zero and Few shot named entity & relationships recognition