The study of biological sequences often relies on using reference genomes, yet achieving accurate assemblies remains challenging. Consequently, de novo analysis directly from raw reads, without pre-processing, is frequently more practical. We identify a very commonly shared need across various applications: identifying reads containing a specific kmer in a dataset. This kmer-to-reads association would be pivotal in multiple contexts, including genotyping, bacterial strain resolution, profiling, data compression, error correction or assembly. While this challenge appears similar to the extensively researched colored de Bruijn graph problem, resolving it at the read level would be prohibitively resource-intensive in practical applications. In this work, we demonstrate its tractable resolution by leveraging certain assumptions for sequencing dataset indexing. To tackle this challenge, we introduce the Tinted de Bruijn Graph concept, an altered version of the colored de Bruijn graph where each read within a sequencing dataset represents a unique source. We developed K2R, a highly scalable index that implement such search efficiently within this framework. K2Rs performance, in terms of index size, memory footprint, throughput, and construction time, is benchmarked against leading methods, including hashing techniques (e.g., Short Read Connector) and full-text indexing (e.g., Spumoni and Movi), across various datasets. K2R consistently outperforms contemporary solutions in most metrics and is the only tool capable of scaling to larger datasets. To prove K2R scalability we indexed two human datasets of the T2T consortium: the 126X coverage ONT dataset was indexed in 18hours using 19GB of RAM for a final index of 9.5GB and the 56X coverage HiFi dataset was constructed in 90 minutes using 5Gb of RAM for a final index of 207Mb. The K2R index, developed in C++, is open source and available on Github github.com/LeaVandamme/K2R.