datasetsΒΆ
Dataset building scripts and reusable pipeline utilities.
Modules
A dataset harvested from packages available via Advanced Package Tool. |
|
Assemblage datasets, built from open-source code. |
|
Base classes and utilities for datasets. |
|
The Google Code Jam programming competition archives. |
|
The HumanEval-X multilingual code benchmark. |
|
A dataset built from packages available via the Nix package manager. |
|
Custom pipeline steps for processing data. |
|
Dataset schema definition and enforcement. |
|
Dataset parsing scripts and utilities. |
|
The XLCost text-to-code generation benchmark. |