Objective: The objective of this study was to demonstrate soft palate MRI at 1.5 and 3 T with high temporal resolution on clinical scanners. Methods: Six volunteers were imaged while speaking, using both four real-time steady-state free-precession (SSFP) sequences at 3 T and four balanced SSFP (bSSFP) at 1.5 T. Temporal resolution was 9-20 frames s 21 (fps), spatial resolution 1.661.6610.0-2.762.7610.0 mm 3 . Simultaneous audio was recorded. Signal-to-noise ratio (SNR), palate thickness and image quality score (1-4, non-diagnostic-excellent) were evaluated. Results: SNR was higher at 3 T than 1.5 T in the relaxed palate (nasal breathing position) and reduced in the elevated palate at 3 T, but not 1.5 T. Image quality was not significantly different between field strengths or sequences (p5NS). At 3 T, 40% acquisitions scored 2 and 56% scored 3. Most 1.5 T acquisitions scored 1 (19%) or 4 (46%). Image quality was more dependent on subject or field than sequence. SNR in static images was highest with 1.961.9610.0 mm 3 resolution (10 fps) and measured palate thickness was similar (p5NS)