C++ → Python transformer

A 16.4M-parameter encoder-decoder transformer for C++ to Python code translation, trained on the XLCoST dataset on a GTX 1650. Validation loss: 2.0474.
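
To put the validation loss in more familiar terms, it can be converted to a perplexity. The sketch below assumes the reported figure is a mean per-token cross-entropy measured in nats (natural log), which the description above does not state; if the loss was computed differently, the conversion does not apply. The helper name `perplexity_from_loss` is purely illustrative.

```python
import math


def perplexity_from_loss(mean_nll_nats: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(mean_nll_nats)


# Reported validation loss; ASSUMED to be mean per-token cross-entropy in nats.
val_loss = 2.0474
ppl = perplexity_from_loss(val_loss)
print(f"perplexity ~ {ppl:.2f}")  # prints "perplexity ~ 7.75"
```

Under that assumption, a perplexity of about 7.75 means the model is, on average, roughly as uncertain as if it were choosing uniformly among 7 to 8 tokens at each step.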