Training a GPT-like model on non-language series [R]
r/MachineLearning
I am responsible for a research project that aims to train a GPT-like model (a decoder-only Transformer) in three sizes: 100M, 250M, and 500M parameters.
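For sizing the three variants, a rough parameter count can be sketched as below. This assumes a GPT-2 style block (biased linear layers, 4x MLP width, learned position embeddings, tied input/output embeddings), which the post does not specify. The function name `gpt_param_count`, the vocabulary size of 1024 (for example, a quantized series), the context length, and the layer/width pairs are all hypothetical choices for illustration.

```python
def gpt_param_count(n_layer, d_model, vocab_size, n_ctx, tie_embeddings=True):
    """Approximate parameter count for a GPT-2 style decoder-only Transformer."""
    # Attention: Q, K, V and output projections, each d_model x d_model plus bias.
    attn = 4 * d_model * d_model + 4 * d_model
    # MLP: d_model -> 4*d_model -> d_model, with biases on both layers.
    mlp = 8 * d_model * d_model + 5 * d_model
    # Two LayerNorms per block, each with a scale and a shift vector.
    norms = 4 * d_model
    per_layer = attn + mlp + norms

    # Token embeddings plus learned position embeddings.
    embeddings = vocab_size * d_model + n_ctx * d_model
    final_norm = 2 * d_model

    total = n_layer * per_layer + embeddings + final_norm
    if not tie_embeddings:
        # A separate output projection adds another vocab_size x d_model matrix.
        total += vocab_size * d_model
    return total


# Hypothetical configurations (assumptions, not taken from the post),
# picked to land near the 100M / 250M / 500M targets with a small vocabulary.
candidates = {
    "100M": dict(n_layer=14, d_model=768),
    "250M": dict(n_layer=20, d_model=1024),
    "500M": dict(n_layer=25, d_model=1280),
}

for name, cfg in candidates.items():
    n = gpt_param_count(vocab_size=1024, n_ctx=1024, **cfg)
    print(f"{name}: {n / 1e6:.1f}M parameters for {cfg}")
```

As a sanity check, GPT-2 small's settings (12 layers, width 768, vocabulary 50257, context 1024) give 124,439,808, which matches the commonly cited 124M figure. With a small non-language vocabulary the embedding table is nearly negligible, so almost all of the budget goes into the blocks, at roughly 12 * n_layer * d_model^2.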