The data array in other modules is similarly arranged 62 4.3 One tap of FIR filter implemented in four rows of LUTs in an RFC module with 8 columns of LUTs. The blank LUTs do not participate in computation mode, but function normally in memory mode 4.4 Two successive processing elements (PEs) for DCT/IDCT, imple mented in four rows of LUTs in an RFC module with 8 columns of LUTs. The blank LUTs do not participate in computation mode, but function normally in memory mode 4.5 Access time, at 0.8 micron CMOS technology, in conventional cache and the RFC for Convolution algorithm implementation. x-axis indicates the cache associativity from 2-to 32-way, for each cache size in KB 4.6 Energy dissipation, at 0.8 micron CMOS technology, in conventional cache and the RFC for Convolution algorithm implementation. x-axis indicates the cache associativity from 2-to 32-way, for each cache size in KB 4.7 rfc instructions for loading and storing "word" type of data ... 4.8 State transition for the RFC cache 4.9 Performance of ABC processor vs. base processor for a 32KB 4-way cache with a line size of 32B 4.10 Component power utilization in ABC processor vs. base processor for MPEG2 decode application. X 4.11 Component power utilization in ABC processor vs. base processor for MPEG2 encode application 86 4.12 Component power utilization in ABC processor vs. base proces sor for FIR application 4.13 Component power utilization in ABC processor vs. base proces sor for cjpeg application 88 4.14 Component power utilization in ABC processor vs. base proces sor for IIR application 4.15 Total power utilization in ABC processor vs. base processor using (a) Alpha processor model (b) Pentium-Pro model 90 5.1 Reconfigurable Functional Cache (RFC) organizations and ad dress mapping with (a) 4 cache modules (b) 16 cache modules.. 5.2 Normalized execution cycles in base processor without RFC and ABC processor with RFC, with varying cache organizations.. . 5.3 Distribution of core functions in MPEG applications 5.4 Variation of cache blocking factor with the fraction of core function. 5.5 Variation of cache blocking factor with speed-up in core function. 109 5.6 Normalized execution cycles in the base processor without RFC, and the ABC processor with different RFC configuration schemes. 5.7 Relative performance improvement in two RFC configuration schemes. 114 5.8 Architectural Design Space 6.1 Packet routing based on longest prefix matching mechanism.. . 6.2 Function / = %oZi + ^1^2 + ^2^0 represented as (a) Binary decision tree and (b) BDD 6.3 Binary decision tree for the sample routing table. Dotted nodes are redundant. 6.4 Binary decision tree for (a) (b) with all effective nodes assigned with output. Dotted nodes are redundant xi 6.5 BDDs for (a)N^Pi. (b)AWo 6.6 (a) Nodes with assigned NHP ports. (b),(c),(d),(e) Output bits assigned to each of the nodes in 4-bit binary encoding of NHP. Dotted nodes are redundant 6.7 CLB mapping in FPGA 6.8 (a) Binary decision tree representation of the modified routing table. Dotted nodes are redundant, (b)...