The reality that the scientific, industry and research communities have to deal with is the potential of 'Big Data'. The high-dimensional data (in digitised format) at our disposal can create opportunities such as discovery of new knowledge, creation of new online communities, and improvement on product and services delivery. The challenge however is that there are only few research, architectural designs and tools that can aid data mining processes from NoSQL databases. By focusing on terms and topic mining, this work proposes a data analytics framework that enables knowledge discovery through information retrieval and filtering from document-based NoSQL (specifically, CouchDB). The tool is algorithmically built and tested based on two methodologies namely: the inference-based apriori and the Baum-Welch algorithm. Preliminary test results of the proposed tool are also discussed based on the accuracy of each proposed algorithm where the inference-based apriori model performs better.