The standard method for sleep state classification is thresholding the amplitudes of electroencephalography (EEG) and electromyography (EMG) data, followed by manual correction by an expert. Although popular, this method has some shortcomings: (1) the time-consuming manual correction by human experts is sometimes a bottleneck hindering sleep studies, (2) EEG electrodes on the skull interfere with wide-field imaging of the cortical activity of a head-fixed mouse under a microscope, (3) invasive surgery to fix the electrodes on the thin mouse skull risks brain tissue injury, and (4) metal electrodes for EEG and EMG recording are difficult to apply to some experimental apparatus such as that for functional magnetic resonance imaging. To overcome these shortcomings, we propose a pupil dynamicsbased vigilance state classification method for a head-fixed mouse using a long short-term memory (LSTM) model, a variant of a recurrent neural network, for multi-class labeling of NREM, REM, and WAKE states. For supervisory hypnography, EEG and EMG recording were performed on head-fixed mice. This setup was combined with left eye pupillometry using a USB camera and a markerless tracking toolbox, DeepLabCut. Our open-source LSTM model with feature inputs of pupil diameter, pupil location, pupil velocity, and eyelid opening for 10 s at a 10 Hz sampling rate achieved vigilance state estimation with a higher classification performance (macro F1 score, 0.77; accuracy, 86%) than a feedforward neural network. Findings from a diverse range of pupillary dynamics implied possible subdivision of the vigilance states defined by EEG and EMG. Pupil dynamics-based hypnography can expand the scope of alternatives for sleep stage scoring of head-fixed mice.