作者: Kathirgamar Aingaran , Sumti Jairath , Georgios Konstadinidis , Serena Leung , Paul Loewenstein
DOI: 10.1109/MM.2015.35
关键词: Shared memory 、 CPU cache 、 Parallel computing 、 Pipeline burst cache 、 Memory hierarchy 、 Multiprocessing 、 Computer science 、 Non-uniform memory access 、 Memory bandwidth 、 Multi-core processor 、 Garbage collection 、 Cache 、 Operating system 、 Memory management 、 Uniform memory access
摘要: The Oracle Sparc M7 processor more than triples the throughput of M6 processor, while increasing per-thread performance, power efficiency, and I/O bandwidth. contains 32 8-thread, dual-issue, out-of-order cores. To minimize L3 cache hit latency, features a partitioned cache, with novel on-chip network for communication between partitions, coherence control, memory. Memory bandwidth is 3 times that processor. On-chip accelerators providing query filtering decompression can provide another order magnitude performance increase these tasks. keeps consumption at minimum through multiple power-saving techniques. scales to 32-processor shared memory multiprocessing systems (SMPs). Coherent also supported SMPs in cluster. Such cluster robust against both SMP failure. Application data integrity guards pointer-related software vulnerabilities without slowing Fine-grained migration supports concurrent garbage collection.