Rubrix is a production-ready Python framework for exploring, annotating, and managing data in NLP projects.

Open: Rubrix is free, open-source, and 100% compatible with major NLP libraries (Hugging Face transformers, spaCy, Stanford Stanza, Flair, etc.). In fact, you can use and combine your preferred libraries without implementing any specific interface.

End-to-end: Once a model goes into production, you want to monitor and analyze its predictions, and collect more data to improve your model over time. Rubrix enables you to iterate as much as you need.

User and Developer Experience: The key to sustainable NLP solutions is to make it easier for everyone to contribute to projects. Domain experts should feel comfortable interpreting and annotating data. Rubrix optimizes the experience to make your teams more productive.

Beyond hand-labeling: Easily combine hand-labeling with active learning, bulk-labeling, zero-shot models, and weak-supervision in novel data annotation workflows.