Does this idea sound fun? [R]

r/MachineLearning

The idea is inference-time learning in a Mixture-of-Experts (MoE) model: insert a few experts that are specialized for updating the weights of their sibling experts. All the components needed were already there, but no one had tried it inside MoE, so I did a small proof of concept (PoC). It kind of worked. I'd love to hear what you think. submitted by /u/max6296
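To make the idea in the post concrete, here is a minimal toy sketch of how an "updater" expert could write into a sibling expert's weights during a forward pass. This is not the author's PoC: the class `TinyMoE`, the top-1 routing, the rank-1 (fast-weight style) update rule, and the `lr` parameter are all assumptions chosen only for illustration, and it uses pure Python lists instead of a real ML framework.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def softmax(z):
    """Numerically stable softmax over a list of floats."""
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


class TinyMoE:
    """Toy MoE layer with linear experts and one hypothetical updater expert."""

    def __init__(self, dim, n_experts, lr=0.1, seed=0):
        rng = random.Random(seed)

        def rand_matrix(rows, cols):
            return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]

        self.experts = [rand_matrix(dim, dim) for _ in range(n_experts)]
        self.router = rand_matrix(n_experts, dim)
        # Assumption: the updater is itself a linear map that proposes the
        # "write" direction v for a rank-1 change to a sibling expert.
        self.updater = rand_matrix(dim, dim)
        self.lr = lr

    def forward(self, x, learn=True):
        gates = softmax(matvec(self.router, x))
        k = max(range(len(gates)), key=gates.__getitem__)  # top-1 routing
        y = matvec(self.experts[k], x)  # output uses the pre-update weights
        if learn:
            # Inference-time update: W_k += lr * gate_k * outer(v, x).
            # No gradients or labels are involved in this toy rule.
            v = matvec(self.updater, x)
            W = self.experts[k]
            scale = self.lr * gates[k]
            for i in range(len(W)):
                for j in range(len(x)):
                    W[i][j] += scale * v[i] * x[j]
        return y, k


moe = TinyMoE(dim=4, n_experts=3)
token = [1.0, -0.5, 0.25, 2.0]
out_first, chosen = moe.forward(token)               # also updates expert `chosen`
out_second, _ = moe.forward(token, learn=False)      # same token, changed weights
print(chosen, out_first != out_second)
```

In this sketch only the routed expert is modified, so the other experts keep their original weights. A real design would also need to decide how the updater is trained and how to keep repeated updates from drifting, which the post does not specify.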