Software
This is a summary of open source projects that I recently worked on. Find more on GitHub.
Natural language processing
- Dact: search tool for Alpino treebanks. C++
- finalfrontier: word embeddings with subword units. Rust
- finalfusion: word embedding package with support for subword units, quantization, and memory mapping. Rust
- spaCy: industrial-strength natural language processing library. Python
- SyntaxDot: multi-task transformer-based syntax annotator. Rust
Machine learning
- Curated Transformers: PyTorch-based transformer model library for spaCy. Python
- reductive: (optimized) product quantization. Rust
- Thinc: deep learning library with a refreshing functional approach. Python
Miscellaneous
- os-signpost: Python wrapper for the macOS signpost API to mark events for visualization in Instruments.app. Python