Pretrain a model on a Dataflow Prediction (DP) task.
Contents:
TransformerEncoder
TransformerEncoderForMaskedLM
TransformerEncoderForSequenceSimilarity
TransformerEncoderForSequenceClassification
TransformerEncoderForSequenceSummarizationGPT2