https://casualganpapers.com/cross-modal-fully-attentional-transformer/PerceiverIO-explained.html
Check out this paper summary