This paper describes two generally applicable approaches towards significant improvement of performance of state-of-the-art extractive question answering (EQA) systems. Firstly, contrary to a common belief, it demonstrates that using the objective with independence assumption for span probability P (a s , a e ) = P (a s )P (a e ) of span starting at position a s and ending at position a e may have adverse effects. Therefore we propose a new compound objective that models joint probability P (a s , a e ) directly, while still keeping the objective with independency assumption as an auxiliary objective. Our second approach shows the beneficial effect of distantly semi-supervised sharednormalization objective known from (Clark and Gardner, 2017). We show that normalizing over set of documents similar to golden passage, and marginalizing over all groundtruth answer string positions leads to improvement of results from smaller statistical models. Our results are supported via experiments with three QA models (BidAF, BERT, AL-BERT) over six datasets. The proposed approaches do not use any additional data. Our code, analysis, pretrained models and individual results will be available online.