Note
There are multiple asynchronous bridges between the Cortex-A76 core and the DSU. Only the CPU
bridge between the Cortex-A76 core and the DSU can be configured to run synchronously, however it
does not affect the other interfaces such as debug, trace, and GIC which are always asynchronous. For
more information on how to set the CPU bridge to run either synchronously or asynchronously, see
Configuration Guidelines
in the
Arm
®
DynamIQ
™
Shared Unit Configuration and Sign-off Guide
.
A2.1.1
Instruction fetch
The instruction fetch unit fetches instructions from the L1 instruction cache and delivers the instruction
stream to the instruction decode unit.
The instruction fetch unit includes:
• A 64KB, 4-way, set associative L1 instruction cache with 64-byte cache lines and optional parity
protection.
• A fully associative L1 instruction TLB with native support for 4KB, 16KB, 64KB, 2MB, and 32MB
page sizes.
• A dynamic branch predictor.
A2.1.2
Instruction decode
The instruction decode unit supports the A32, T32, and A64 instruction sets. It also supports Advanced
SIMD and floating-point instructions in each instruction set.
A2.1.3
Register rename
The register rename unit performs register renaming to facilitate out-of-order execution and dispatches
decoded instructions to various issue queues.
A2.1.4
Instruction issue
The instruction issue unit controls when the decoded instructions are dispatched to the execution
pipelines. It includes issue queues for storing instruction pending dispatch to execution pipelines.
A2.1.5
Execution pipeline
The execution pipeline includes:
• Integer execute unit that performs arithmetic and logical data processing operations.
• Vector execute unit that performs Advanced SIMD and floating point operations. Optionally, it can
execute the cryptographic instructions.
A2.1.6
L1 data memory system
The L1 data memory system executes load and store instructions and encompasses the L1 data side
memory system. It also services memory coherency requests.
The load/store unit includes:
• A 64KB, 4-way, set associative L1 data cache with 64-byte cache lines and optional ECC protection
per 32 bits.
• A fully associative L1 data TLB with native support for 4KB, 16KB, 64KB, 2MB, and 512MB page
sizes.
A2.1.7
L2 memory system
The L2 memory system services L1 instruction and data cache misses from the Cortex-A76 core.
A2 Technical overview
A2.1 Components
100798_0300_00_en
Copyright © 2016–2018 Arm Limited or its affiliates. All rights
reserved.
A2-36
Non-Confidential
Summary of Contents for Cortex-A76 Core
Page 4: ......
Page 22: ......
Page 23: ...Part A Functional description ...
Page 24: ......
Page 119: ...Part B Register descriptions ...
Page 120: ......
Page 363: ...Part C Debug descriptions ...
Page 364: ......
Page 401: ...Part D Debug registers ...
Page 402: ......
Page 589: ...Part E Appendices ...
Page 590: ......