Software
This is a summary of open source projects that I recently worked on. Find more on GitHub.
Natural language processing
- Dact: search tool for Alpino treebanks.
- finalfrontier: word embeddings with subword units. 🦀
- finalfusion: word embedding package with support for subword units, quantization, and memory mapping. 🦀
- spaCy: industrial-strength natural language processing library. 🐍
- SyntaxDot: multi-task transformer-based syntax annotator. 🦀
Machine learning
- Curated Transformers: PyTorch-based transformer model library for spaCy. 🐍
- reductive: (optimized) product quantization. 🦀
- Thinc: deep learning library with a refreshing functional approach. 🐍
Miscellaneous
- os-signpost: Python wrapper for the macOS signpost API to mark events for visualization in Instruments.app. 🐍