State-of-the-art semantic segmentation models are characterized by high parameter counts and slow inference times, making them unsuitable for deployment in resource-constrained environments. To address this challenge, we propose AUTO-COMPRESSING SUBSET PRUNING, ACOSP, as a new online compression method. The core of ACOSP consists of learning a channel selection mechanism for individual channels of each convolution in the segmentation model based on an effective temperature annealing schedule. We show a crucial interplay between providing a highcapacity model at the beginning of training and the compression pressure forcing the model to compress concepts into retained channels. We apply ACOSP to SegNet and PSPNet architectures and show its success when trained on the CAMVID, CITYSCAPES, PASCAL VOC2012, and ADE20K datasets. The results are competitive with existing baselines for compression of segmentation models at low compression ratios and outperform them significantly at high compression ratios, yielding acceptable results even when removing more than 93% of the parameters. In addition, ACOSP is conceptually simple, easy to implement, and can readily be generalized to other data modalities, tasks, and architectures. Our code is available at https://github.com/merantix/acosp.