“…That is, when we fairly apply the rules to all lexicons and ML algorithms, we achieve better correlation coefficients (mean r increase of 5.2%) and better accuracies (mean F1 increase of 2.1%). Consistent with prior work (Agarwal, Xie, Vovsha, Rambow, & Passonneau, 2011;Davidov et al, 2010;Shastri, Parvathy, Kumar, Wesley, & Balakrishnan, 2010), we find that grammatical features (conventions of use for punctuation and capitalization) and consideration for degree modifiers like "very" or "extremely" prove to be useful cues for distinguishing differences in sentiment intensity. Other syntactical considerations identified via qualitative analysis (negation, degree modifiers, and contrastive conjunctions) also help make VADER successful, and is consistent with prior work (Agarwal et al, 2011;Ding, Liu, & Yu, 2008;Lu, Castellanos, Dayal, & Zhai, 2011;Socher et al, 2013).…”