Benchmark performance of shaping individual words.
Found some more performance optimizations. Use dupablePerformIO where relevant, & it doesn't incur excessive allocation. Ensure amounts of output are dereferenced as quickly as large amounts are.
Optimize Harfbuzz-Pure.
Add benchmark, against Dracula by Bram Stoker.