Recommended Reading¶

We use a lot of external libraries in Undertale - if you’re not already pretty familiar with the following, it’s worth reading through their documentation and possibly completing their tutorial(s) before contributing.

datatrove

The dataset building pipeline library from the folks at HuggingFace. We use this to codify all of our dataset building pipelines and parallelize them across compute infrastructure.

PyTorch

The deep learning library. If you’re not already deeply familiar with PyTorch, the textbook Deep Learning with Pytorch is an excellent resource.

PyTorch Lightning

All of our models are written in PyTorch and wrapped in Lightning modules. We largely let Lightning handle the complexities of multi-node, multi-GPU training, validation, and integration with tensorboard for monitoring training.

Tensorboard

The visualization tool we use for tracking training runs.

Sphinx

A software documentation library. All of our documentation is written in reStructuredText and built with Sphinx. We also use autodoc with Google-style Python docstrings for automatically generated reference documentation.

pyinstrument

The statistical profiler for Python. We use this occasionally for performance testing.

 
  • ← Architecture
  • Conventions →

Undertale

Navigation

Contents:

  • Usage
  • Development
    • Architecture
    • Recommended Reading
    • Conventions
    • Workflows
    • Setup
  • Reference
  • Glossary
  • About

Related Topics

  • Documentation overview
    • Development
      • Previous: Architecture
      • Next: Conventions
©2025, MIT Lincoln Laboratory. | Page source