path: root/meta-python/recipes-devtools/python3-nltk
Commit message | Author | Age | Files | Lines
* python3-nltk: upgrade 3.9.3 -> 3.9.4 | Gyorgy Sarvari | 2026-04-01 | 1 | -1/+1

    Changes:
    * Support Python 3.14
    * Fix bug in Levenshtein distance when substitution_cost > 2
    * Fix bug in Treebank detokeniser re quote ordering
    * Fix bug in Jaro similarity for empty strings
    * Several security enhancements
    * Fix GHSA-rf74-v2fm-23pw: unbounded recursion in JSONTaggedDecoder
    * Implement TextTiling vocabulary introduction method (Hearst 1997)
    * Fix ALINE feature matrix errors and add comprehensive tests
    * Support multiple VerbNet versions, fix longid/shortid regex for VerbNet ids
    * Let downloader fallback to md5 when sha256 is unavailable
    * Several other minor bugfixes and code cleanups

    Signed-off-by: Gyorgy Sarvari <skandigraun@gmail.com>
    Signed-off-by: Khem Raj <khem.raj@oss.qualcomm.com>
* python3-nltk: mark CVE-2026-0846 patched | Gyorgy Sarvari | 2026-03-18 | 1 | -0/+1

    Details: https://nvd.nist.gov/vuln/detail/CVE-2026-0846

    It has been fixed in version 3.9.3, however NVD tracks it without
    CPE/version info.

    Signed-off-by: Gyorgy Sarvari <skandigraun@gmail.com>
    Signed-off-by: Khem Raj <raj.khem@gmail.com>
* python3-nltk: upgrade 3.9.2 -> 3.9.3 | Gyorgy Sarvari | 2026-02-24 | 1 | -1/+1

    Contains fix for CVE-2025-14009.

    Changelog:
    * Fix CVE-2025-14009: secure ZIP extraction in nltk.downloader
    * Block path traversal/arbitrary reads in nltk.data for protocol-less refs
    * Block path traversal/abs paths in corpus readers and FS pointers
    * Validate external StanfordSegmenter JARs using SHA256
    * Add optional sandbox enforcement for filestring()
    * Maintenance: downloader/zipped models, CI/tooling updates

    Signed-off-by: Gyorgy Sarvari <skandigraun@gmail.com>
    Signed-off-by: Khem Raj <raj.khem@gmail.com>
* python3-nltk: set CVE_PRODUCT | Gyorgy Sarvari | 2025-12-31 | 1 | -0/+2

    The CVEs for this project are tracked under nltk:nltk CPE, which doesn't
    match the default python:nltk CPE.

    See CVE db query:

    sqlite> select * from products where PRODUCT like 'nltk';
    CVE-2019-14751|nltk|nltk|||3.4.5|<
    CVE-2021-3828|nltk|nltk|||3.6.3|<=
    CVE-2021-3842|nltk|nltk|||3.6.6|<
    CVE-2021-43854|nltk|nltk|||3.6.5|<

    Set the CVE_PRODUCT so it can be used to match CVEs.

    Signed-off-by: Gyorgy Sarvari <skandigraun@gmail.com>
    Signed-off-by: Khem Raj <raj.khem@gmail.com>
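The vendor mismatch described in the commit above can be illustrated with a small, self-contained sketch. It rebuilds the four rows from the commit message in an in-memory SQLite table and compares a lookup for python:nltk with one for nltk:nltk. The column names and the matching_cves() helper are assumptions made for illustration, inferred from the pipe-separated query output; the real CVE database and the matching logic of the build tooling may differ.

```python
import sqlite3

# Hypothetical in-memory stand-in for the CVE database. The column names
# are assumptions inferred from the query output in the commit message.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE products (id TEXT, vendor TEXT, product TEXT, "
    "version_start TEXT, operator_start TEXT, "
    "version_end TEXT, operator_end TEXT)"
)

# The four rows shown in the commit message.
rows = [
    ("CVE-2019-14751", "nltk", "nltk", "", "", "3.4.5", "<"),
    ("CVE-2021-3828", "nltk", "nltk", "", "", "3.6.3", "<="),
    ("CVE-2021-3842", "nltk", "nltk", "", "", "3.6.6", "<"),
    ("CVE-2021-43854", "nltk", "nltk", "", "", "3.6.5", "<"),
]
conn.executemany("INSERT INTO products VALUES (?, ?, ?, ?, ?, ?, ?)", rows)


def matching_cves(conn, cve_product):
    """Return CVE ids whose vendor and product both match 'vendor:product'.

    Illustrative helper only, not part of any real tool.
    """
    vendor, product = cve_product.split(":", 1)
    cur = conn.execute(
        "SELECT id FROM products WHERE vendor = ? AND product = ? ORDER BY id",
        (vendor, product),
    )
    return [row[0] for row in cur.fetchall()]


# Vendor "python" matches nothing; vendor "nltk" matches all four rows.
default_hits = matching_cves(conn, "python:nltk")
fixed_hits = matching_cves(conn, "nltk:nltk")
print("python:nltk ->", default_hits)
print("nltk:nltk   ->", fixed_hits)
```

Under these assumptions, a lookup keyed on the python vendor finds no entries even though the product name is identical, which is the gap the commit closes by setting CVE_PRODUCT.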
* python3-nltk: upgrade 3.9.1 -> 3.9.2 | Wang Mingyu | 2025-10-14 | 1 | -1/+1

    Changelog:
    =============
    * Update download checksums to use SHA256 in built index
    * Fix percentage escape in new-style string formatting
    * replace shortened URLs using goo.gl
    * Make Wordnet interoperable with various taggers and tagged corpora
    * Fix saving PerceptronTagger
    * Document how to reproduce old Wordnet studies
    * properly initialize Portuguese corpus reader
    * support for mixed rules conversion into Chomsky Normal Form
    * only import tkinter if a GUI is needed
    * issue #2112 with Corenlp
    * new environment variable NLTK_DOWNLOADER_FORCE_INTERACTIVE_SHELL
    * Lesk defaults to most frequent sense in case of ties

    Signed-off-by: Wang Mingyu <wangmy@fujitsu.com>
    Signed-off-by: Khem Raj <raj.khem@gmail.com>
* python3-nltk: add a recipe for version 3.9.1 | Thomas Perrot | 2024-11-20 | 1 | -0/+24

    The Natural Language Toolkit (NLTK) is a Python package for natural
    language processing.

    Signed-off-by: Thomas Perrot <thomas.perrot@bootlin.com>
    Signed-off-by: Khem Raj <raj.khem@gmail.com>