PyThaiNLP v5.3.2 Released!
This release focuses on security improvements related to path traversal and renaming functions to conform with PEP 8 and follow NLTK convention. Old function names are still accessible, but migration to new names are recommended as old function names will be removed in a future version.
Install/upgrade:
pip install -U pythainlp- Documentation: https://pythainlp.github.io/docs/5.3
- Report bug: https://github.com/PyThaiNLP/pythainlp/issues
What's changed
Added
pythainlp.chunkmodule: canonical home for chunking/phrase-structure parsing, following the NLTKnltk.chunknaming convention.
Deprecated
The following names are deprecated and will be removed in 6.0 (#1339):
pythainlp.util.isthaichar(): usepythainlp.util.is_thai_char().pythainlp.util.isthai(): usepythainlp.util.is_thai().pythainlp.util.countthai(): usepythainlp.util.count_thai().pythainlp.tag.crfchunk.CRFchunk: usepythainlp.chunk.CRFChunkParser.pythainlp.tag.chunk_parse(): usepythainlp.chunk.chunk_parse().
Security
- Prevent path traversal: validate that paths stay within their expected base directory (#1342)
Full Changelog: v5.3.1...v5.3.2