Skip to content

Lingua 1.3.5

Compare
Choose a tag to compare
@github-actions github-actions released this 03 Apr 12:33
· 22 commits to main since this release

Improvements

  • The language models are now stored in dictionaries instead of NumPy arrays. This change leads to significantly improved runtime performance at the cost of higher memory consumption (up to 3 GB for all models). As the runtime performance was much too slow with the former approach, this change makes sense because adding more memory is quite cheap.

  • The language model files are now compressed with the Brotli algorithm which reduces the file size by 15 %, on average.

  • The characters Щщ are now correctly identified as possible indicators for the Ukrainian language, leading to slightly higher accuracy when identifying Ukrainian texts.

Miscellaneous

  • All dependencies have been updated to their latest versions.