Skip to content

PyThaiNLP v5.3.2 Released!

Choose a tag to compare

@bact bact released this 19 Mar 16:19
· 156 commits to dev since this release
6ddcc19

This release focuses on security improvements related to path traversal and renaming functions to conform with PEP 8 and follow NLTK convention. Old function names are still accessible, but migration to new names are recommended as old function names will be removed in a future version.

Install/upgrade:

pip install -U pythainlp

What's changed

Added

  • pythainlp.chunk module: canonical home for chunking/phrase-structure parsing, following the NLTK nltk.chunk naming convention.

Deprecated

The following names are deprecated and will be removed in 6.0 (#1339):

  • pythainlp.util.isthaichar(): use pythainlp.util.is_thai_char().
  • pythainlp.util.isthai(): use pythainlp.util.is_thai().
  • pythainlp.util.countthai(): use pythainlp.util.count_thai().
  • pythainlp.tag.crfchunk.CRFchunk: use pythainlp.chunk.CRFChunkParser.
  • pythainlp.tag.chunk_parse(): use pythainlp.chunk.chunk_parse().

Security

  • Prevent path traversal: validate that paths stay within their expected base directory (#1342)

Full Changelog: v5.3.1...v5.3.2