Spatio-temporal encoding (SPEN) delivers single-scan images with increased robustness to shift and susceptibility artifacts. These acquisitions are usually carried out in a “hybrid” mode that prevents a sufficiently dense sampling along the SPEN domain. Alleviating this resolution loss had so far demanded the acquisition of multiple interleaved scans. The present study demonstrates that by relying on multiple sensors, a similar resolution enhancement can be achieved in a single shot. The principles and potential of the ensuing Super-resolved SPEN with SENSE (SUSPENSE) is demonstrated, with sub-mm single-shot 3T image acquisitions on phantoms and humans.