Zig-budoux: Wrap that Japanese and Chinese text properly

I ported budoux to zig (and C)
It lets you find “word” boundaries in japanese and chinese, useful for word wrapping and perhaps even for indexing if you remove the particles.

7 Likes

Very interesting. What would the use cases be for this. Maybe doing some analysis on word frequency?

1 Like

The main use case is for proper “word wrapping” for which I already use it for on the frontend, but on the backend I’m testing it out as a core component for creating search index for japanese text.

2 Likes

Examples from my frontend:

Without budoux (standard browser word wrap)

With budoux

5 Likes

I see it, very nice!