This paper presents NilmDB, a comprehensive framework designed to solve the "big data" problem of non-intrusive load monitoring and diagnostics. It provides the central component of a flexible, distributed architecture for the storage, transfer, manipulation, and analysis of time-series data. NilmDB is network-transparent and facilitates remote viewing and management of large data sets by utilizing efficient data reduction and indexing techniques.Index Terms-Big data, intelligent power metering, nonintrusive load monitoring, smart grid. E NERGY monitoring and smart grid applications have rapidly developed into a multi-billion dollar market [1]. The continued growth and utility of monitoring technologies is predicated upon a necessity to economically extract actionable information from acquired data streams. User and operator needs define the nature of relevant information regarding power consumption and operation of the distribution system. The scale of this information can vary greatly in time, frequency, and amplitude or dynamic range. Basic energy-scorekeeping might be accomplished with time series data of real and reactive power consumption; essentially, information at or near line frequency. Power quality monitoring might require knowledge of line current and line voltage harmonics an order of magnitude higher in frequency. Diagnostic monitoring might require knowledge of non-integer-multiple frequencies of the line, e.g., tracking the principal slot harmonic of an important rotating machine. All of this data, and other streams as well, might be needed on time scales ranging from fractions of a second, for a transient study, to months or years, for energy scorekeeping and behavior tracking.One of the largest roadblocks to effective analytics for power data arises from the disparities of scale inherent in data collection and processing, which often limits the speed and resolution at which data can be captured and effectively managed. This, in turn, affects the ability to extract actionable information from the data. Furthermore, existing database systems are ill-suited to power monitoring. Traditional SQL databases provide a powerful query structure but do not perform well with very large record sets. Partly for this reason, several NoSQL variants have Manuscript