I ported budoux to zig (and C)
It lets you find “word” boundaries in japanese and chinese, useful for word wrapping and perhaps even for indexing if you remove the particles.
7 Likes
Very interesting. What would the use cases be for this. Maybe doing some analysis on word frequency?
1 Like
The main use case is for proper “word wrapping” for which I already use it for on the frontend, but on the backend I’m testing it out as a core component for creating search index for japanese text.
2 Likes
I see it, very nice!