The SentencePiece is an unsupervised text tokenizer for Neural Network-based
text generation.
It is an unsupervised text tokenizer and detokenizer mainly for
Neural Network-based text generation systems where the vocabulary size is
predetermined prior to the neural model training.
SentencePiece implements subword units and unigram language model with the
extension of direct training from raw sentences.
SentencePiece allows us to make a purely end-to-end system that does not
depend on language-specific pre/post-processing.
Update Information:
Update to 0.2.1
* Tue May 19 2026 Peter Robinson
[ 1 ] Bug #2432139 - CVE-2026-1260 sentencepiece: Sentencepiece: Invalid memory access leading to potential arbitrary code execution via a crafted model file. [fedora-43]
https://bugzilla.redhat.com/show_bug.cgi?id=2432139
This update can be installed with the "dnf" update program. Use su -c 'dnf upgrade --advisory FEDORA-2026-314504fd18' at the command line. For more information, refer to the dnf documentation available at http://dnf.readthedocs.io/en/latest/command_ref.html#upgrade-command-label
Get the latest Linux and open source security news straight to your inbox.