spaCy is a free, open-source library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems, or to pre-process text for deep learning.
Software
Prodigy is a modern annotation tool for creating training data for machine learning models. It’s so efficient that data scientists can do the annotation themselves, enabling a new level of rapid iteration. Whether you’re working on entity recognition, intent detection or image classification, Prodigy can help you train and evaluate your models faster.
Ellf is an interactive AI-powered assistant for Natural Language Processing (NLP) and machine learning projects. It integrates with your coding assistant like Claude Code and makes it proficient at planning and developing NLP solutions. The platform lets you plug in your own data-private cluster and makes it easy to execute annotation tasks, auto-annotation agents, training experiments and more, and collaborate on development with your team.