Implementation of a 32-bit RISC processor for the data-intensive architecture processing-in-memory chip

Draper, J.; Sondeen, J.; Mediratta, S.D.; Kim, Ihn

doi:10.1109/asap.2002.1030716

Cited by 20 publications

(15 citation statements)

References 11 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The Field Programmable Compute Array (FPCA), the internal computation engine of HPPS, was developed to perform efficient stream processing. By modifying basic computational structures in the FPCA to support Wide Word [8] functionality, the arithmetic and memory clusters of FPCA can "morph" between stream and thread modes. MONARCH has also adapted HPPS high bandwidth I/O and fault tolerance features to facilitate sensor input as well as to enable tiling of multiple chips.…”

Section: Monarch Overviewmentioning

confidence: 99%

“…The control signals required for mapping of WideWord ALU functionality onto an FPCA core-tile for thread level parallelism are provided by the RISC processor on the node. The MONARCH node thread processor is largely derived from the DIVA [6] PIM processor model [7] and thus supports single-issue, in-order execution, with 32-bit instructions and 32-bit addresses. In contrast to the dedicated WideWord Unit implemented in DIVA [8], the arithmetic cluster is a morphable unit that can be configured to operate independently as a streaming engine or under control of the threaded execution unit as a wide threaded processor.…”

Section: Monarch Overviewmentioning

confidence: 99%

See 1 more Smart Citation

PBuf: An On-Chip Packet Transfer Engine for MONARCH

Bhatti

Steele

Draper

2006

2006 49th IEEE International Midwest Symposium on Circuits and Systems

View full text Add to dashboard Cite

Abstract-This paper describes the architecture and implementation of an on-chip packet interface/router called the Packet Buffer (PBuf) employed in the MOrphable Networked microARCHitecture (MONARCH). This work provides a brief overview of MONARCH and its subsystems to provide motivation for the PBuf design. MONARCH employs a hierarchy with various levels of address spaces. To connect the subsystems and keep the network complexities low, communication packets undergo an address translation process while passing across the address space boundaries. The PBuf provides protected translation in the midst of superior and inferior address spaces while also serving as an on-chip packet switching router. Additional features, such as its 6 memory to memory block transfer (MMBT) engines, enable it to provide high rate data transfer capabilities.

show abstract

Section: Monarch Overviewmentioning

confidence: 99%

Section: Monarch Overviewmentioning

confidence: 99%

PBuf: An On-Chip Packet Transfer Engine for MONARCH

Bhatti

Steele

Draper

2006

2006 49th IEEE International Midwest Symposium on Circuits and Systems

View full text Add to dashboard Cite

show abstract

“…Similarly, logic for converting to/from the internal number format and rounding logic are shared between both datapaths. DIVA execution control is a simple in-order single-issue instruction pipeline [4][8], therefore combining common datapaths does not suffer any performance penalty. The pipeline registers for the ALU and the Mul/Div blocks are controlled by separate enable signals so that only one of the datapaths is active for each instruction.…”

Section: B Monarch Fpu (Add-multiply Configuration)mentioning

confidence: 99%

“…At an architectural level, the MONARCH chip contains functional units that may serve as the central elements in a dataflow architecture for highly efficient stream computing or through morphing they may become the basis of vector extension units controlled by embedded threaded processors, such as a simple RISC design. In the latter mode, the configuration of the computational elements strongly resembles the WideWord operation of DIVA [4]. To achieve highperformance stream processing capability in MONARCH, FPU throughput should be maximized, even at the expense of area.…”

Section: Introductionmentioning

confidence: 99%

Design Trade-Offs in Floating-Point Unit Implementation for Embedded and Processing-In-Memory Systems

Kwon¹,

Sondeen²,

Draper³

2005 IEEE International Symposium on Circuits and Systems

Self Cite

View full text Add to dashboard Cite

“…The DIVA WideWord Processor speeds up multimedia applications by use of data parallelism. It treats a 256-bit WideWord operand as a packed array of objects of 8, 16, or 32 bits in size [7]. DIVA PIMs support standard memory accesses and have been recently fabricated with SRAM.…”

Section: Introductionmentioning

confidence: 99%

Precise exception handling in discontinuous control flow scenarios for area-constrained systems

Kang¹,

Draper²

2008

2008 51st Midwest Symposium on Circuits and Systems

View full text Add to dashboard Cite

Abstract-Exception handling is one of the most complicated issues in pipelined processors. Several incomplete instructions are in process in the pipeline at any instant in time, and an exception may cause a state change of the processor [5] at any such instant. Prior research efforts have proposed mechanisms for precise exception handling, but it is difficult to achieve precise exception handling in minimal area as required by embedded and processing-in-memory systems. This paper presents a correct and efficient exception handling scheme with a modest hardware resource. The presented idea maintains precise exception handling in the case of discrete control flow and has been implemented in 90nm technology.

show abstract

Implementation of a 32-bit RISC processor for the data-intensive architecture processing-in-memory chip

Cited by 20 publications

References 11 publications

PBuf: An On-Chip Packet Transfer Engine for MONARCH

PBuf: An On-Chip Packet Transfer Engine for MONARCH

Design Trade-Offs in Floating-Point Unit Implementation for Embedded and Processing-In-Memory Systems

Precise exception handling in discontinuous control flow scenarios for area-constrained systems

Contact Info

Product

Resources

About