- What is this used for?
- Instructions in the IST get sent to another queue
- Define address generating instructions
- 128 entry 2-way set-associative least-recently-used replacement policy
- Explain the register dependency table
- Step by step example for Figure 2 (multiple loop iterations)
- Only address-generating instructions, not data generating! e.g. a store will have a data register and an address register
- At the end, figure on cumulative distribution of the number of address generating instructions found by subsequent IBDA iterations
- Two instruction queues (A for general-purpose, B for address generating)
- Additional blocks for IST, RDT
- Based on a two-wide in order superscalar design
- Fun dummy slide (flash it for a second) of architecture parameters
- Sniper multi-core simulator... "cycle-level"
- Giant figure on relative performance
- Computed using CACTI 6.5
- Area-normalized and Power-normalized performance
- IST-size experiments
- Power <= 45W
- Area <= 350mm^2
- "load-slice is better" figure