Title: Memory system design in superscalar processing
Authors: Lu, NP
Chung, CP
National Chiao Tung University
Institute of Computer Science and Engineering
Keywords: superscalar processing; load/store port; cache; memory consistency model
Issue Date: 1-Sep-1995
Abstract: In this paper, we study the memory system design for superscalar processing. Benchmarking is used to examine the execution behavior of load/store instructions, such as load/store parallelism and memory load/store port utilization. It is found that the use of only a single load/store port forms a system bottleneck. A superscalar processor benefits from multiple load/store ports and system performance saturates with two load/store ports. The memory system must be carefully designed if multiple load/store ports are supported in a superscalar processor. Thus, we consider the design of the data cache subsystem. The data cache configurations we investigate include multiported cache, multibank cache, and duplicated cache. Through benchmarking, we find that the duplicated cache performs well in most benchmarks. Yet the cost of a duplicated cache is higher. In a superscalar multiprocessing environment, in order to properly maintain memory consistency, we must consider the load/store ordering of the processors. In superscalar processors, the load/store ordering may be in one of three forms: total ordering, load bypassing, and load forwarding. In this research, we conclude that to support the sequential consistency model, the load/store instructions must be totally ordered. Load bypassing and load forwarding are sufficient to support the processor consistency model.
URI: http://dx.doi.org/10.1142/S0129053395000233
ISSN: 0129-0533
DOI: 10.1142/S0129053395000233
Volume: 7
Issue: 3
Begin Page: 421
End Page: 443
Appears in Collections: Articles