Learning to rebalance multi-modal optimization by adaptively masking subnetworks