Apply ITEM pretokenization to a given dataset.
Functions
adapt_dataset_from_parquet(self, data, path, ...)
adapt_dataset_from_parquet
Classes
Pretokenizer([writer, executor, ...])
Pretokenizer
Bases: Dataset
Dataset
Contents: