Hello
I am trying to fine-tune the model on a couple hundred megabytes of data using a v3-8 TPU on Google Cloud.
I got the following error:
RuntimeError: Internal: Replica group has size 8, but all replica groups in an all-to-all with N operands must have size N:
I would appreciate an expert's help with this.
Thanks