Recently, neural networks have shown promising results on Document-level Aspect Sentiment Classification (DASC). However, these approaches often offer little transparency w.r.t. their inner working mechanisms and lack interpretability. In this paper, to simulating the steps of analyzing aspect sentiment in a document by human beings, we propose a new Hierarchical Reinforcement Learning (HRL) approach to DASC. This approach incorporates clause selection and word selection strategies to tackle the data noise problem in the task of DASC. First, a high-level policy is proposed to select aspect-relevant clauses and discard noisy clauses. Then, a low-level policy is proposed to select sentiment-relevant words and discard noisy words inside the selected clauses. Finally, a sentiment rating predictor is designed to provide reward signals to guide both clause and word selection. Experimental results demonstrate the impressive effectiveness of the proposed approach to DASC over the state-of-the-art baselines. * Corresponding author Review Document [ [ [This hotel is close to railway station] ] ]Clause1 [ [ [and very convenient to eat around] ] ]Clause2 [ [ [but room of Hilton is a little uncomfortable .]Clause3 [ [ [I'm often nitpicking for room decoration.] ] ]Clause4 [ [ [Besides, the price is very expensive ] ] ]Clause5 [ [ [although the staff service is professional .] ] ]Clause6