

## MIPS32® microAptiv<sup>™</sup> UP Processor Core Family Software User's Manual

Document Number: MD00942 Revision 01.02 July 30, 2014 Unpublished rights (if any) reserved under the copyright laws of the United States of America and other countries.

This document contains information that is proprietary to MIPS Tech, LLC, a Wave Computing company ("MIPS") and MIPS' affiliates as applicable. Any copying, reproducing, modifying or use of this information (in whole or in part) that is not expressly permitted in writing by MIPS or MIPS' affiliates as applicable or an authorized third party is strictly prohibited. At a minimum, this information is protected under unfair competition and copyright laws. Violations thereof may result in criminal penalties and fines. Any document provided in source format (i.e., in a modifiable form such as in FrameMaker or Microsoft Word format) is subject to use and distribution restrictions that are independent of and supplemental to any and all confidentiality restrictions. UNDER NO CIRCUMSTANCES MAY A DOCUMENT PROVIDED IN SOURCE FORMAT BE DISTRIBUTED TO A THIRD PARTY IN SOURCE FORMAT WITHOUT THE EXPRESS WRITTEN PERMISSION OF MIPS (AND MIPS' AFFILIATES AS APPLICABLE) reserve the right to change the information contained in this document to improve function, design or otherwise.

MIPS and MIPS' affiliates do not assume any liability arising out of the application or use of this information, or of any error or omission in such information. Any warranties, whether express, statutory, implied or otherwise, including but not limited to the implied warranties of merchantability or fitness for a particular purpose, are excluded. Except as expressly provided in any written license agreement from MIPS or an authorized third party, the furnishing of this document does not give recipient any license to any intellectual property rights, including any patent rights, that cover the information in this document.

The information contained in this document shall not be exported, reexported, transferred, or released, directly or indirectly, in violation of the law of any country or international law, regulation, treaty, Executive Order, statute, amendments or supplements thereto. Should a conflict arise regarding the export, reexport, transfer, or release of the information contained in this document, the laws of the United States of America shall be the governing law.

The information contained in this document constitutes one or more of the following: commercial computer software, commercial computer software documentation or other commercial items. If the user of this information, or any related documentation of any kind, including related technical data or manuals, is an agency, department, or other entity of the United States government ("Government"), the use, duplication, reproduction, release, modification, disclosure, or transfer of this information, or any related documentation of any kind, is restricted in accordance with Federal Acquisition Regulation 12.212 for civilian agencies and Defense Federal Acquisition Regulation Supplement 227.7202 for military agencies. The use of this information by the Government is further restricted in accordance with the terms of the license agreement(s) and/or applicable contract terms and conditions covering this information from MIPS Technologies or an authorized third party.

MIPS, MIPS I, MIPS II, MIPS III, MIPS IV, MIPS V, MIPSr3, MIPS32, MIPS64, microMIPS32, microMIPS64, MIPS-3D, MIPS16, MIPS16e, MIPS-Based, MIPSsim, MIPSpro, MIPS-VERIFIED, Aptiv logo, microAptiv logo, interAptiv logo, microMIPS logo, MIPS Technologies logo, MIPS-VERIFIED logo, proAptiv logo, 4K, 4Kc, 4Km, 4Kp, 4KE, 4KEc, 4KEm, 4KEp, 4KS, 4KSc, 4KSd, M4K, M14K, 5K, 5Kc, 5Kf, 24K, 24Kc, 24Kf, 24KE, 24KEc, 24KEf, 34K, 34Kc, 34Kf, 74K, 74Kc, 74Kf, 1004K, 1004Kc, 1004Kf, 1074K, 1074Kc, 1074Kf, R3000, R4000, R5000, Aptiv, ASMACRO, Atlas, "At the core of the user experience.", BusBridge, Bus Navigator, CLAM, CorExtend, CoreFPGA, CoreLV, EC, FPGA View, FS2, FS2 FIRST SILICON SOLUTIONS logo, FS2 NAVIGATOR, HyperDebug, HyperJTAG, IASim, iFlowtrace, interAptiv, JALGO, Logic Navigator, Malta, MDMX, MED, MGB, microAptiv, microMIPS, Navigator, OCI, PDtrace, the Pipeline, proAptiv, Pro Series, SEAD-3, SmartMIPS, SOC-it, and YAMON are trademarks or registered trademarks of MIPS and MIPS' affiliates as applicable in the United States and other countries.

All other trademarks referred to herein are the property of their respective owners.



## **Table of Contents**

| Chapter 1: Introduction to the MIPS32® microAptiv™ UP Processor Core                  |  |
|---------------------------------------------------------------------------------------|--|
| 1.1: Features                                                                         |  |
| 1.2: microAptiv™ UP Core Block Diagram                                                |  |
| 1.2.1: Required Logic Blocks                                                          |  |
| 1.2.2: Optional Logic Blocks                                                          |  |
| Chapter 2: Pipeline of the microAptiv™ UP Core                                        |  |
| 2.1: Pipeline Stages                                                                  |  |
| 2.1.1: I Stage: Instruction Fetch                                                     |  |
| 2.1.2: E Stage: Execution                                                             |  |
| 2.1.3: M Stage: Memory Fetch                                                          |  |
| 2.1.4: A Stage: Align                                                                 |  |
| 2.1.5: W Stage: Writeback                                                             |  |
| 2.2: Instruction Cache Miss                                                           |  |
| 2.3: Data Cache Miss                                                                  |  |
| 2.4: Multiply/Divide Operations                                                       |  |
| 2.5: MDU Pipeline with DSP Module Enabled                                             |  |
| 2.5.1: MDU                                                                            |  |
| 2.5.2: DSP Module Instruction Latencies                                               |  |
| 2.5.3: High-performance MDU Pipeline Stages                                           |  |
| 2.5.4: High-performance MDU Divide Operations                                         |  |
| 2.6: MDU Pipeline - High-performance MDU with DSP Module Disabled                     |  |
| 2.6.1: 32x16 Multiply (High-Performance MDU)                                          |  |
| 2.6.2: 32x32 Multiply (High-Performance MDU)                                          |  |
| 2.6.3: Divide (High-Performance MDU)                                                  |  |
| 2.7: MDU Pipeline - Area-Efficient MDU with DSP Module Disabled                       |  |
| 2.7.1: Multiply (Area-Efficient MDU).                                                 |  |
| 2.7.2: Multiply Accumulate (Area-Efficient MDU)<br>2.7.3: Divide (Area-Efficient MDU) |  |
| 2.8: Branch Delay                                                                     |  |
| 2.9: Data Bypassing                                                                   |  |
| 2.9. Data Dypassing                                                                   |  |
| 2.9.1: Load Delay                                                                     |  |
| 2.10: Coprocessor 2 Instructions                                                      |  |
| 2.10: Coprocessor 2 instructions                                                      |  |
| 2.12: Slip Conditions.                                                                |  |
| 2.13: Instruction Interlocks.                                                         |  |
| 2.14: Hazards                                                                         |  |
| 2.14.1: Types of Hazards                                                              |  |
| 2.14.2: Instruction Listing                                                           |  |
| 2.14.3: Eliminating Hazards                                                           |  |
| Chapter 3: The MIPS® DSP Module                                                       |  |
| 3.1: Additional Register State for the DSP Module                                     |  |
| 3.1.1: HI-LO Registers                                                                |  |
| 3.1.2: DSPControl Register                                                            |  |
| 3.2: Software Detection of the DSP Module Revision 2                                  |  |
|                                                                                       |  |

| Chapter 4: Memory Management of the microAptiv <sup>™</sup> UP Core   |    |
|-----------------------------------------------------------------------|----|
| 4.1: Introduction                                                     |    |
| 4.1.1: Memory Management Unit (MMU)                                   |    |
| 4.2: Modes of Operation                                               |    |
| 4.2.1: Virtual Memory Segments                                        |    |
| 4.2.2: User Mode                                                      |    |
| 4.2.3: Kernel Mode                                                    |    |
| 4.2.4: Debug Mode                                                     |    |
| 4.3: Translation Lookaside Buffer                                     |    |
| 4.3.1: Joint TLB                                                      |    |
| 4.3.2: Instruction TLB                                                |    |
| 4.3.3: Data TLB                                                       |    |
| 4.4: Virtual-to-Physical Address Translation.                         |    |
| 4.4.1: Hits, Misses, and Multiple Matches                             |    |
| 4.4.2: Memory Space                                                   |    |
| 4.4.3: TLB Instructions                                               |    |
| 4.5: Fixed Mapping MMU                                                |    |
| 4.6: System Control Coprocessor                                       |    |
| 4.0. System Control Coprocessor                                       |    |
| Chapter 5: Exceptions and Interrupts in the microAptiv™ UP Core       | 06 |
| 5.1: Exception Conditions                                             |    |
|                                                                       |    |
| 5.2: Exception Priority                                               |    |
| 5.3: Interrupts                                                       |    |
| 5.3.1: Interrupt Modes                                                |    |
| 5.3.2: Generation of Exception Vector Offsets for Vectored Interrupts |    |
| 5.3.3: MCU ASE Enhancement for Interrupt Handling                     |    |
| 5.4: GPR Shadow Registers.                                            |    |
| 5.5: Exception Vector Locations                                       |    |
| 5.6: General Exception Processing                                     |    |
| 5.7: Debug Exception Processing                                       |    |
| 5.8: Exception Descriptions                                           |    |
| 5.8.1: Reset/SoftReset Exception                                      |    |
| 5.8.2: Debug Single Step Exception                                    |    |
| 5.8.3: Debug Interrupt Exception                                      |    |
| 5.8.4: Non-Maskable Interrupt (NMI) Exception                         |    |
| 5.8.5: Machine Check Exception                                        |    |
| 5.8.6: Interrupt Exception                                            |    |
| 5.8.7: Debug Instruction Break Exception                              |    |
| 5.8.8: Watch Exception — Instruction Fetch or Data Access             |    |
| 5.8.9: Address Error Exception — Instruction Fetch/Data Access        |    |
| 5.8.10: TLB Refill Exception — Instruction Fetch or Data Access       |    |
| 5.8.11: TLB Invalid Exception — Instruction Fetch or Data Access      |    |
| 5.8.12: Execute-Inhibit Exception                                     |    |
| 5.8.13: Read-Inhibit Exception                                        |    |
| 5.8.14: Cache/SPRAM Parity Error Exception                            |    |
| 5.8.15: Bus Error Exception — Instruction Fetch or Data Access        |    |
| 5.8.16: Debug Software Breakpoint Exception                           |    |
| 5.8.17: Execution Exception — System Call                             |    |
| 5.8.18: Execution Exception — Breakpoint                              |    |
| 5.8.19: Execution Exception — Reserved Instruction                    |    |
| 5.8.20: Execution Exception — Coprocessor Unusable                    |    |
| 5.8.21: Execution Exception — CorExtend Unusable                      |    |
| 5.8.22: Execution Exception — DSP Module State Disabled               |    |

| 5.8.23: Execution Exception — Coprocessor 2 Exception                    | 127 |
|--------------------------------------------------------------------------|-----|
| 5.8.24: Execution Exception — Implementation-Specific 1 Exception        |     |
| 5.8.25: Execution Exception — Integer Overflow                           |     |
| 5.8.26: Execution Exception — Trap                                       | 128 |
| 5.8.27: Debug Data Break Exception                                       | 128 |
| 5.8.28: Complex Break Exception                                          | 128 |
| 5.8.29: TLB Modified Exception — Data Access                             | 129 |
| 5.9: Exception Handling and Servicing Flowcharts                         | 129 |
|                                                                          |     |
| Chapter 6: CP0 Registers of the microAptiv™ UP Core                      |     |
| 6.1: CP0 Register Summary                                                |     |
| 6.2: CP0 Register Descriptions                                           |     |
| 6.2.1: Index Register (CP0 Register 0, Select 0)                         |     |
| 6.2.2: Random Register (CP0 Register 1, Select 0)                        |     |
| 6.2.3: EntryLo0 and EntryLo1 Registers (CP0 Registers 2 and 3, Select 0) |     |
| 6.2.4: Context Register (CP0 Register 4, Select 0)                       |     |
| 6.2.5: UserLocal Register (CP0 Register 4, Select 2)                     |     |
| 6.2.6: PageMask Register (CP0 Register 5, Select 0)                      |     |
| 6.2.7: PageGrain Register (CP0 Register 5, Select 1)                     |     |
| 6.2.8: Wired Register (CP0 Register 6, Select 0)                         |     |
| 6.2.9: HWREna Register (CP0 Register 7, Select 0)                        |     |
| 6.2.10: BadVAddr Register (CP0 Register 8, Select 0)                     |     |
| 6.2.11: BadInstr Register (CP0 Register 8, Select 1)                     |     |
| 6.2.12: BadInstrP Register (CP0 Register 8, Select 2)                    |     |
| 6.2.13: Count Register (CP0 Register 9, Select 0)                        |     |
| 6.2.14: EntryHi Register (CP0 Register 10, Select 0)                     |     |
| 6.2.15: Compare Register (CP0 Register 11, Select 0)                     |     |
| 6.2.16: Status Register (CP0 Register 12, Select 0)                      |     |
| 6.2.17: IntCtl Register (CP0 Register 12, Select 1)                      |     |
| 6.2.18: SRSCtl Register (CP0 Register 12, Select 2)                      |     |
| 6.2.19: SRSMap Register (CP0 Register 12, Select 3)                      |     |
| 6.2.20: View_IPL Register (CP0 Register 12, Select 4)                    |     |
| 6.2.21: SRSMap2 Register (CP0 Register 12, Select 5)                     |     |
| 6.2.22: Cause Register (CP0 Register 13, Select 0)                       |     |
| 6.2.23: View_RIPL Register (CP0 Register 13, Select 4)                   |     |
| 6.2.24: NestedExc (CP0 Register 13, Select 5)                            |     |
| 6.2.25: Exception Program Counter (CP0 Register 14, Select 0)            |     |
| 6.2.26: NestedEPC (CP0 Register 14, Select 2)                            |     |
| 6.2.27: Processor Identification (CP0 Register 15, Select 0)             |     |
| 6.2.28: EBase Register (CP0 Register 15, Select 1)                       |     |
| 6.2.29: CDMMBase Register (CP0 Register 15, Select 2)                    |     |
| 6.2.30: Config Register (CP0 Register 16, Select 0)                      |     |
| 6.2.31: Config1 Register (CP0 Register 16, Select 1)                     |     |
| 6.2.32: Config2 Register (CP0 Register 16, Select 2)                     |     |
| 6.2.33: Config3 Register (CP0 Register 16, Select 3)                     |     |
| 6.2.34: Config4 Register (CP0 Register 16, Select 4)                     |     |
| 6.2.35: Config5 Register (CP0 Register 16, Select 5)                     |     |
| 6.2.36: Config7 Register (CP0 Register 16, Select 7)                     |     |
| 6.2.37: Load Linked Address (CP0 Register 17, Select 0)                  | 186 |
| 6.2.38: WatchLo Register (CP0 Register 18, Select 0-7)                   | 187 |
| 6.2.39: WatchHi Register (CP0 Register 19, Select 0-7)                   |     |
| 6.2.40: Debug Register (CP0 Register 23, Select 0)                       | 188 |
| 6.2.41: Trace Control Register (CP0 Register 23, Select 1)               |     |

| 6.2.42: Trace Control2 Register (CP0 Register 23, Select 2)                                            | 195 |
|--------------------------------------------------------------------------------------------------------|-----|
| 6.2.43: User Trace Data1 Register (CP0 Register 23, Select 3)/User Trace Data2 Register (CP0 F         |     |
| 24, Select 3)                                                                                          | -   |
| 6.2.44: TraceBPC Register (CP0 Register 23, Select 4)                                                  |     |
| 6.2.45: Debug2 Register (CP0 Register 23, Select 6)                                                    |     |
| 6.2.46: Debug Exception Program Counter Register (CP0 Register 24, Select 0)                           | 200 |
| 6.2.47: Performance Counter Register (CP0 Register 25, select 0-3)                                     | 201 |
| 6.2.48: ErrCtl Register (CP0 Register 26, Select 0)                                                    |     |
| 6.2.49: CacheErr Register (CP0 Register 27, Select 0)                                                  | 209 |
| 6.2.50: TagLo Register (CP0 Register 28, Select 0)                                                     | 211 |
| 6.2.51: DataLo Register (CP0 Register 28, Select 1)                                                    | 212 |
| 6.2.52: ErrorEPC (CP0 Register 30, Select 0)                                                           | 213 |
| 6.2.53: DeSave Register (CP0 Register 31, Select 0)                                                    | 213 |
| 6.2.54: KScratchn Registers (CP0 Register 31, Selects 2 to 3)                                          | 214 |
| Chapter 7: Hardware and Software Initialization of the microAptiv™ UP Core                             |     |
| 7.1: Hardware-Initialized Processor State                                                              |     |
| 7.1.1: Coprocessor 0 State                                                                             |     |
| 7.1.2: TLB Initialization                                                                              |     |
| 7.1.3: Bus State Machines                                                                              |     |
| 7.1.4: Static Configuration Inputs                                                                     |     |
| 7.1.5: Fetch Address                                                                                   |     |
| 7.2: Software Initialized Processor State                                                              |     |
| 7.2.1: Register File                                                                                   |     |
| 7.2.2: TLB                                                                                             |     |
| 7.2.3: Caches                                                                                          |     |
| 7.2.4: Coprocessor 0 State                                                                             |     |
|                                                                                                        |     |
| Chapter 8: Caches of the microAptiv <sup>™</sup> UP Core                                               | 218 |
| 8.1: Cache Configurations                                                                              |     |
| 8.2: Cache Protocols                                                                                   |     |
| 8.2.1: Cache Organization                                                                              |     |
| 8.2.2: Cacheability Attributes                                                                         |     |
| 8.2.3: Replacement Policy                                                                              |     |
| 8.2.4: Virtual Aliasing                                                                                |     |
| 8.3: Instruction Cache                                                                                 |     |
| 8.4: Data Cache                                                                                        | 224 |
| 8.5: CACHE Instruction                                                                                 | 225 |
| 8.6: Software Cache Testing                                                                            |     |
| 8.6.1: I-Cache/D-cache Tag Arrays                                                                      | 226 |
| 8.6.2: I-Cache Data Array                                                                              |     |
| 8.6.3: I-Cache WS Array                                                                                |     |
| 8.6.4: D-Cache Data Array                                                                              |     |
| 8.6.5: D-cache WS Array                                                                                | 227 |
| 8.7: Memory Coherence Issues                                                                           |     |
| Chapter 9: Power Management of the microAptiv™ UP Core                                                 | 228 |
| 9.1: Register-Controlled Power Management                                                              |     |
| 9.2: Instruction-Controlled Power Management                                                           |     |
| Chanter 40: E ITAC Debug Cumpert in the minut Anti-TH UD Com-                                          | 000 |
| Chapter 10: EJTAG Debug Support in the microAptiv <sup>™</sup> UP Core<br>10.1: Debug Control Register |     |
|                                                                                                        |     |

| 10.2: Hardware Breakpoints                                                                                                                                                                                                                                                                                                                                                                                                                                                                | 235                                                  |
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------|
| 10.2.1: Data Breakpoints                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | 236                                                  |
| 10.2.2: Complex Breakpoints                                                                                                                                                                                                                                                                                                                                                                                                                                                               | 236                                                  |
| 10.2.3: Conditions for Matching Breakpoints                                                                                                                                                                                                                                                                                                                                                                                                                                               | 236                                                  |
| 10.2.4: Debug Exceptions from Breakpoints                                                                                                                                                                                                                                                                                                                                                                                                                                                 | 240                                                  |
| 10.2.5: Breakpoint Used as Triggerpoint                                                                                                                                                                                                                                                                                                                                                                                                                                                   |                                                      |
| 10.2.6: Instruction Breakpoint Registers                                                                                                                                                                                                                                                                                                                                                                                                                                                  |                                                      |
| 10.2.7: Data Breakpoint Registers                                                                                                                                                                                                                                                                                                                                                                                                                                                         |                                                      |
| 10.2.8: Complex Breakpoint Registers                                                                                                                                                                                                                                                                                                                                                                                                                                                      |                                                      |
| 10.3: Complex Breakpoint Usage                                                                                                                                                                                                                                                                                                                                                                                                                                                            |                                                      |
| 10.3.1: Checking for Presence of Complex Break Support                                                                                                                                                                                                                                                                                                                                                                                                                                    |                                                      |
| 10.3.2: General Complex Break Behavior                                                                                                                                                                                                                                                                                                                                                                                                                                                    | 258                                                  |
| 10.3.3: Usage of Pass Counters                                                                                                                                                                                                                                                                                                                                                                                                                                                            |                                                      |
| 10.3.4: Usage of Tuple Breakpoints                                                                                                                                                                                                                                                                                                                                                                                                                                                        |                                                      |
| 10.3.5: Usage of Priming Conditions                                                                                                                                                                                                                                                                                                                                                                                                                                                       |                                                      |
| 10.3.6: Usage of Data Qualified Breakpoints                                                                                                                                                                                                                                                                                                                                                                                                                                               |                                                      |
| 10.3.7: Usage of Stopwatch Timers                                                                                                                                                                                                                                                                                                                                                                                                                                                         |                                                      |
| 10.4: Test Access Port (TAP)                                                                                                                                                                                                                                                                                                                                                                                                                                                              |                                                      |
| 10.4.1: EJTAG Internal and External Interfaces                                                                                                                                                                                                                                                                                                                                                                                                                                            | 261                                                  |
| 10.4.2: Test Access Port Operation                                                                                                                                                                                                                                                                                                                                                                                                                                                        |                                                      |
| 10.4.3: Test Access Port (TAP) Instructions                                                                                                                                                                                                                                                                                                                                                                                                                                               |                                                      |
| 10.5: EJTAG TAP Registers                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | 268                                                  |
| 10.5.1: Instruction Register                                                                                                                                                                                                                                                                                                                                                                                                                                                              |                                                      |
| 10.5.2: Data Registers Overview                                                                                                                                                                                                                                                                                                                                                                                                                                                           |                                                      |
| 10.5.3: Processor Access Address Register                                                                                                                                                                                                                                                                                                                                                                                                                                                 |                                                      |
| 10.5.4: Fastdata Register (TAP Instruction FASTDATA)                                                                                                                                                                                                                                                                                                                                                                                                                                      |                                                      |
| 10.6: TAP Processor Accesses                                                                                                                                                                                                                                                                                                                                                                                                                                                              | 279                                                  |
| 10.6.1: Fetch/Load and Store from/to EJTAG Probe Through dmseg                                                                                                                                                                                                                                                                                                                                                                                                                            | 280                                                  |
| 10.7: SecureDebug                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | 281                                                  |
| 10.7.1: Disabling EJTAG Debugging                                                                                                                                                                                                                                                                                                                                                                                                                                                         | 281                                                  |
| 10.7.2: EJTAG Features Unmodified by SecureDebug                                                                                                                                                                                                                                                                                                                                                                                                                                          | 282                                                  |
| 10.8: iFlowtrace™ Mechanism                                                                                                                                                                                                                                                                                                                                                                                                                                                               | 282                                                  |
| 10.8.1: A Simple Instruction-Only Tracing Scheme                                                                                                                                                                                                                                                                                                                                                                                                                                          | 283                                                  |
| 10.8.2: Special Trace Modes                                                                                                                                                                                                                                                                                                                                                                                                                                                               | 284                                                  |
| 10.8.3: ITCB Overview                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | 287                                                  |
| 10.8.4: ITCB iFlowtrace Interface                                                                                                                                                                                                                                                                                                                                                                                                                                                         | 287                                                  |
| 10.8.5: TCB Storage Representation                                                                                                                                                                                                                                                                                                                                                                                                                                                        |                                                      |
| 10.8.6: ITCB Register Interface for Software Configurability                                                                                                                                                                                                                                                                                                                                                                                                                              | 289                                                  |
| 10.8.7: ITCB iFlowtrace Off-Chip Interface                                                                                                                                                                                                                                                                                                                                                                                                                                                |                                                      |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | 293                                                  |
| 10.8.8: Breakpoint-Based Enabling of Tracing                                                                                                                                                                                                                                                                                                                                                                                                                                              | 294                                                  |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | 295                                                  |
| 10.8.8: Breakpoint-Based Enabling of Tracing<br>10.9: PC/Data Address Sampling<br>10.9.1: PC Sampling in Wait State                                                                                                                                                                                                                                                                                                                                                                       | 295                                                  |
| 10.8.8: Breakpoint-Based Enabling of Tracing<br>10.9: PC/Data Address Sampling                                                                                                                                                                                                                                                                                                                                                                                                            |                                                      |
| 10.8.8: Breakpoint-Based Enabling of Tracing<br>10.9: PC/Data Address Sampling<br>10.9.1: PC Sampling in Wait State                                                                                                                                                                                                                                                                                                                                                                       | 295                                                  |
| 10.8.8: Breakpoint-Based Enabling of Tracing.<br>10.9: PC/Data Address Sampling.<br>10.9.1: PC Sampling in Wait State.<br>10.9.2: Cache Miss PC Sampling.<br>10.9.3: Data Address Sampling.<br>10.10: Fast Debug Channel.                                                                                                                                                                                                                                                                 | 295                                                  |
| 10.8.8: Breakpoint-Based Enabling of Tracing.         10.9: PC/Data Address Sampling.         10.9.1: PC Sampling in Wait State.         10.9.2: Cache Miss PC Sampling.         10.9.3: Data Address Sampling.         10.10: Fast Debug Channel.         10.10.1: Common Device Memory Map.                                                                                                                                                                                             | 295<br>296                                           |
| <ul> <li>10.8.8: Breakpoint-Based Enabling of Tracing.</li> <li>10.9: PC/Data Address Sampling.</li> <li>10.9.1: PC Sampling in Wait State.</li> <li>10.9.2: Cache Miss PC Sampling.</li> <li>10.9.3: Data Address Sampling.</li> <li>10.10: Fast Debug Channel.</li> <li>10.10.1: Common Device Memory Map.</li> <li>10.10.2: Fast Debug Channel Interrupt.</li> </ul>                                                                                                                   | 295<br>296<br>296                                    |
| <ul> <li>10.8.8: Breakpoint-Based Enabling of Tracing.</li> <li>10.9: PC/Data Address Sampling.</li> <li>10.9.1: PC Sampling in Wait State.</li> <li>10.9.2: Cache Miss PC Sampling.</li> <li>10.9.3: Data Address Sampling.</li> <li>10.10: Fast Debug Channel.</li> <li>10.10.1: Common Device Memory Map.</li> <li>10.10.2: Fast Debug Channel Interrupt.</li> <li>10.10.3: microAptiv™ UP Core FDC Buffers.</li> </ul>                                                                | 295<br>296<br>296<br>296                             |
| <ul> <li>10.8.8: Breakpoint-Based Enabling of Tracing.</li> <li>10.9: PC/Data Address Sampling.</li> <li>10.9.1: PC Sampling in Wait State.</li> <li>10.9.2: Cache Miss PC Sampling</li> <li>10.9.3: Data Address Sampling.</li> <li>10.10: Fast Debug Channel.</li> <li>10.10.1: Common Device Memory Map.</li> <li>10.10.2: Fast Debug Channel Interrupt.</li> <li>10.10.3: microAptiv™ UP Core FDC Buffers.</li> <li>10.10.4: Sleep mode.</li> </ul>                                   | 295<br>296<br>296<br>296<br>298                      |
| <ul> <li>10.8.8: Breakpoint-Based Enabling of Tracing.</li> <li>10.9: PC/Data Address Sampling.</li> <li>10.9.1: PC Sampling in Wait State.</li> <li>10.9.2: Cache Miss PC Sampling</li> <li>10.9.3: Data Address Sampling.</li> <li>10.10: Fast Debug Channel.</li> <li>10.10.1: Common Device Memory Map.</li> <li>10.10.2: Fast Debug Channel Interrupt.</li> <li>10.10.3: microAptiv™ UP Core FDC Buffers</li> <li>10.10.4: Sleep mode.</li> <li>10.10.5: FDC TAP Register</li> </ul> | 295<br>296<br>296<br>298<br>298                      |
| <ul> <li>10.8.8: Breakpoint-Based Enabling of Tracing.</li> <li>10.9: PC/Data Address Sampling.</li> <li>10.9.1: PC Sampling in Wait State.</li> <li>10.9.2: Cache Miss PC Sampling</li> <li>10.9.3: Data Address Sampling.</li> <li>10.10: Fast Debug Channel.</li> <li>10.10.1: Common Device Memory Map.</li> <li>10.10.2: Fast Debug Channel Interrupt.</li> <li>10.10.3: microAptiv™ UP Core FDC Buffers.</li> <li>10.10.4: Sleep mode.</li> </ul>                                   | 295<br>296<br>296<br>296<br>298<br>298<br>298<br>298 |

| 11.1: CPU Instruction Formats                                                                                                                                                                                                                                                                                                                                                                                                    |                                                                                                                                                                             |
|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 11.2: Load and Store Instructions                                                                                                                                                                                                                                                                                                                                                                                                |                                                                                                                                                                             |
| 11.2.1: Scheduling a Load Delay Slot                                                                                                                                                                                                                                                                                                                                                                                             |                                                                                                                                                                             |
| 11.2.2: Defining Access Types                                                                                                                                                                                                                                                                                                                                                                                                    |                                                                                                                                                                             |
| 11.3: Computational Instructions                                                                                                                                                                                                                                                                                                                                                                                                 |                                                                                                                                                                             |
| 11.3.1: Cycle Timing for Multiply and Divide Instructions                                                                                                                                                                                                                                                                                                                                                                        |                                                                                                                                                                             |
| 11.4: Jump and Branch Instructions                                                                                                                                                                                                                                                                                                                                                                                               |                                                                                                                                                                             |
| 11.4.1: Overview of Jump Instructions                                                                                                                                                                                                                                                                                                                                                                                            |                                                                                                                                                                             |
| 11.4.2: Overview of Branch Instructions                                                                                                                                                                                                                                                                                                                                                                                          |                                                                                                                                                                             |
| 11.5: Control Instructions                                                                                                                                                                                                                                                                                                                                                                                                       |                                                                                                                                                                             |
| 11.6: Coprocessor Instructions                                                                                                                                                                                                                                                                                                                                                                                                   |                                                                                                                                                                             |
| 11.7: Enhancements to the MIPS Architecture                                                                                                                                                                                                                                                                                                                                                                                      |                                                                                                                                                                             |
| 11.7.1: CLO - Count Leading Ones                                                                                                                                                                                                                                                                                                                                                                                                 |                                                                                                                                                                             |
| 11.7.2: CLZ - Count Leading Zeros                                                                                                                                                                                                                                                                                                                                                                                                |                                                                                                                                                                             |
| 11.7.3: MADD - Multiply and Add Word                                                                                                                                                                                                                                                                                                                                                                                             |                                                                                                                                                                             |
| 11.7.4: MADDU - Multiply and Add Word                                                                                                                                                                                                                                                                                                                                                                                            |                                                                                                                                                                             |
| 11.7.5: MSUB - Multiply and Subtract Word                                                                                                                                                                                                                                                                                                                                                                                        |                                                                                                                                                                             |
|                                                                                                                                                                                                                                                                                                                                                                                                                                  |                                                                                                                                                                             |
| 11.7.6: MSUBU - Multiply and Subtract Unsigned Word                                                                                                                                                                                                                                                                                                                                                                              |                                                                                                                                                                             |
| 11.7.7: MUL - Multiply Word                                                                                                                                                                                                                                                                                                                                                                                                      |                                                                                                                                                                             |
| 11.7.8: SSNOP- Superscalar Inhibit NOP                                                                                                                                                                                                                                                                                                                                                                                           |                                                                                                                                                                             |
| 11.8: MCU ASE Instructions                                                                                                                                                                                                                                                                                                                                                                                                       |                                                                                                                                                                             |
| 11.8.1: ACLR                                                                                                                                                                                                                                                                                                                                                                                                                     |                                                                                                                                                                             |
| 11.8.2: ASET                                                                                                                                                                                                                                                                                                                                                                                                                     |                                                                                                                                                                             |
| 11.8.3: IRET                                                                                                                                                                                                                                                                                                                                                                                                                     |                                                                                                                                                                             |
|                                                                                                                                                                                                                                                                                                                                                                                                                                  |                                                                                                                                                                             |
| Chapter 12: microAptiv™ UP Processor Core Instructions                                                                                                                                                                                                                                                                                                                                                                           |                                                                                                                                                                             |
|                                                                                                                                                                                                                                                                                                                                                                                                                                  |                                                                                                                                                                             |
| 12.1: Understanding the Instruction Descriptions                                                                                                                                                                                                                                                                                                                                                                                 |                                                                                                                                                                             |
| 12.2: microAptiv™ UP Core Opcode Map                                                                                                                                                                                                                                                                                                                                                                                             |                                                                                                                                                                             |
| 12.2: microAptiv™ UP Core Opcode Map<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core                                                                                                                                                                                                                                                                                                                                |                                                                                                                                                                             |
| <ul> <li>12.2: microAptiv™ UP Core Opcode Map.</li> <li>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core</li></ul>                                                                                                                                                                                                                                                                                                      |                                                                                                                                                                             |
| <ul> <li>12.2: microAptiv™ UP Core Opcode Map.</li> <li>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core</li> <li>ACLR</li></ul>                                                                                                                                                                                                                                                                                        |                                                                                                                                                                             |
| <ul> <li>12.2: microAptiv™ UP Core Opcode Map.</li> <li>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core</li></ul>                                                                                                                                                                                                                                                                                                      |                                                                                                                                                                             |
| <ul> <li>12.2: microAptiv™ UP Core Opcode Map.</li> <li>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core.</li> <li>ACLR.</li> <li>ACLR.</li> <li>ASET.</li> <li>CACHE.</li> </ul>                                                                                                                                                                                                                                       |                                                                                                                                                                             |
| 12.2: microAptiv™ UP Core Opcode Map.<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core<br>ACLR<br>ACLR<br>ASET<br>CACHE                                                                                                                                                                                                                                                                                              |                                                                                                                                                                             |
| 12.2: microAptiv <sup>™</sup> UP Core Opcode Map.<br>12.3: MIPS32® Instruction Set for the microAptiv <sup>™</sup> UP Core<br>ACLR.<br>ACLR.<br>ASET.<br>CACHE.<br>IRET.<br>IRET.                                                                                                                                                                                                                                                | 310<br>313<br>320<br>321<br>322<br>324<br>330<br>334                                                                                                                        |
| 12.2: microAptiv™ UP Core Opcode Map.<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core<br>ACLR<br>ACLR<br>ASET<br>CACHE                                                                                                                                                                                                                                                                                              | 310<br>313<br>320<br>321<br>322<br>324<br>330<br>334                                                                                                                        |
| 12.2: microAptiv <sup>™</sup> UP Core Opcode Map.<br>12.3: MIPS32® Instruction Set for the microAptiv <sup>™</sup> UP Core<br>ACLR.<br>ACLR.<br>ASET.<br>CACHE.<br>IRET.<br>IRET.                                                                                                                                                                                                                                                | 310<br>313<br>320<br>321<br>322<br>324<br>330<br>330<br>334<br>338                                                                                                          |
| 12.2: microAptiv <sup>™</sup> UP Core Opcode Map.<br>12.3: MIPS32® Instruction Set for the microAptiv <sup>™</sup> UP Core.<br>ACLR.<br>ACLR.<br>ASET.<br>CACHE.<br>IRET.<br>IRET.<br>LL                                                                                                                                                                                                                                         | 310<br>313<br>320<br>321<br>322<br>324<br>330<br>334<br>338<br>339                                                                                                          |
| 12.2: microAptiv <sup>™</sup> UP Core Opcode Map<br>12.3: MIPS32® Instruction Set for the microAptiv <sup>™</sup> UP Core<br>ACLR<br>ACLR<br>ASET<br>CACHE<br>IRET<br>IRET<br>LL<br>PREF                                                                                                                                                                                                                                         |                                                                                                                                                                             |
| 12.2: microAptiv <sup>™</sup> UP Core Opcode Map<br>12.3: MIPS32® Instruction Set for the microAptiv <sup>™</sup> UP Core<br>ACLR<br>ACLR<br>ASET<br>CACHE<br>IRET<br>IRET<br>LL<br>PREF<br>SC                                                                                                                                                                                                                                   |                                                                                                                                                                             |
| 12.2: microAptiv™ UP Core Opcode Map<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core<br>ACLR<br>ACLR<br>ASET<br>CACHE<br>IRET<br>IRET<br>IRET<br>LL<br>PREF<br>SC<br>SYNC                                                                                                                                                                                                                                           | 310<br>313<br>320<br>321<br>322<br>324<br>330<br>334<br>338<br>339<br>342<br>342<br>345                                                                                     |
| 12.2: microAptiv™ UP Core Opcode Map.<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core                                                                                                                                                                                                                                                                                                                               | 310<br>313<br>320<br>321<br>322<br>324<br>330<br>330<br>334<br>338<br>339<br>342<br>342<br>344<br>344<br>345<br>346                                                         |
| 12.2: microAptiv™ UP Core Opcode Map.<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core                                                                                                                                                                                                                                                                                                                               |                                                                                                                                                                             |
| 12.2: microAptiv™ UP Core Opcode Map.<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core.<br>ACLR.<br>ACLR.<br>ASET.<br>CACHE.<br>IRET.<br>IRET.<br>IRET.<br>LL<br>PREF.<br>SC.<br>SYNC<br>TLBR.<br>TLBWI.<br>TLBWR.                                                                                                                                                                                                   |                                                                                                                                                                             |
| 12.2: microAptiv™ UP Core Opcode Map<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core<br>ACLR<br>ACLR<br>ASET<br>CACHE<br>IRET<br>IRET<br>IRET<br>LL<br>PREF<br>SC<br>SYNC<br>TLBR<br>TLBWI<br>TLBWR<br>WAIT                                                                                                                                                                                                         |                                                                                                                                                                             |
| 12.2: microAptiv™ UP Core Opcode Map.<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core.<br>ACLR.<br>ACLR.<br>ASET.<br>CACHE.<br>IRET.<br>IRET.<br>IL.<br>PREF.<br>SC.<br>SYNC.<br>TLBR.<br>TLBWI.<br>TLBWR.<br>WAIT.<br>Chapter 13: microMIPS™ Instruction Set Architecture                                                                                                                                          |                                                                                                                                                                             |
| 12.2: microAptiv™ UP Core Opcode Map<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core<br>ACLR.<br>ACLR.<br>ASET.<br>CACHE.<br>IRET.<br>IRET.<br>IRET.<br>LL.<br>PREF.<br>SC.<br>SYNC.<br>TLBR.<br>TLBWI.<br>TLBWR.<br>WAIT.<br>Chapter 13: microMIPS™ Instruction Set Architecture<br>13.1: Overview.                                                                                                                | 310<br>313<br>320<br>321<br>322<br>324<br>330<br>330<br>334<br>338<br>339<br>342<br>342<br>344<br>345<br>345<br>346<br>347<br>348<br>349<br>349                             |
| 12.2: microAptiv™ ŬP Core Opcode Map<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core.<br>ACLR<br>ACLR<br>ASET<br>CACHE<br>IRET<br>IRET<br>IRET<br>LL<br>PREF<br>SC<br>SYNC<br>TLBR<br>TLBWI<br>TLBWI<br>TLBWR<br>WAIT<br>Chapter 13: microMIPS™ Instruction Set Architecture                                                                                                                                        | 310<br>313<br>320<br>321<br>322<br>324<br>330<br>330<br>334<br>338<br>339<br>342<br>345<br>345<br>345<br>345<br>345<br>345<br>345<br>345<br>345<br>345                      |
| 12.2: microAptiv™ UP Core Opcode Map<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core.<br>ACLR<br>ACLR<br>ACLR<br>ASET<br>CACHE<br>IRET<br>IRET<br>IRET<br>LL<br>PREF<br>SC<br>SYNC<br>TLBR<br>TLBWI<br>TLBWI<br>TLBWR<br>WAIT<br>Chapter 13: microMIPS™ Instruction Set Architecture<br>13.1.1: MIPSr3™ Architecture<br>13.1.2: Default ISA Mode                                                                    | 310<br>313<br>320<br>321<br>322<br>324<br>330<br>330<br>334<br>338<br>339<br>342<br>344<br>345<br>345<br>345<br>345<br>345<br>346<br>347<br>348<br>349<br>349<br>349<br>350 |
| 12.2: microAptiv™ UP Core Opcode Map<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core<br>ACLR<br>ACLR<br>ASET<br>CACHE<br>IRET<br>IRET<br>IRET<br>LL<br>PREF<br>SC<br>SYNC<br>TLBR<br>TLBWI<br>TLBWR<br>WAIT<br>Chapter 13: microMIPS™ Instruction Set Architecture<br>13.1: Overview<br>13.1.1: MIPSr3™ Architecture<br>13.1.2: Default ISA Mode<br>13.1.3: Software Detection                                      |                                                                                                                                                                             |
| 12.2: microAptiv™ UP Core Opcode Map<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core<br>ACLR<br>ACLR<br>ASET<br>CACHE<br>IRET<br>IRET<br>IRET<br>LL<br>PREF<br>SC<br>SYNC<br>TLBR<br>TLBWI<br>TLBWR<br>WAIT<br>Chapter 13: microMIPS™ Instruction Set Architecture<br>13.1: Overview<br>13.1.1: MIPSr3™ Architecture<br>13.1.2: Default ISA Mode<br>13.1.3: Software Detection<br>13.1.4: Compliance and Subsetting | 310<br>313<br>320<br>321<br>322<br>324<br>330<br>334<br>338<br>339<br>342<br>344<br>345<br>345<br>346<br>347<br>347<br>348<br><b>349</b><br>349<br>349<br>350<br>350        |
| 12.2: microAptiv™ UP Core Opcode Map<br>12.3: MIPS32® Instruction Set for the microAptiv™ UP Core<br>ACLR<br>ACLR<br>ASET<br>CACHE<br>IRET<br>IRET<br>IRET<br>LL<br>PREF<br>SC<br>SYNC<br>TLBR<br>TLBWI<br>TLBWR<br>WAIT<br>Chapter 13: microMIPS™ Instruction Set Architecture<br>13.1: Overview<br>13.1.1: MIPSr3™ Architecture<br>13.1.2: Default ISA Mode<br>13.1.3: Software Detection                                      | 310<br>313<br>320<br>321<br>322<br>324<br>330<br>334<br>338<br>339<br>342<br>344<br>345<br>345<br>345<br>345<br>345<br>345<br>345<br>345<br>345                             |

| 13.1.7: Coprocessor Unusable Behavior                  |  |
|--------------------------------------------------------|--|
| 13.2: Instruction Formats                              |  |
| 13.2.1: Instruction Stream Organization and Endianness |  |
| 13.3: microMIPS Re-encoded Instructions                |  |
| 13.3.1: 16-Bit Category                                |  |
| 13.3.2: 16-bit Instruction Register Set                |  |
| 13.3.3: 32-Bit Category                                |  |
| Appendix A: References                                 |  |
| Appendix B: Revision History                           |  |

## **List of Figures**

| Figure 1.1: microAptiv™ UP Processor Core Block Diagram                                                  | 25              |
|----------------------------------------------------------------------------------------------------------|-----------------|
| Figure 1.2: microAptiv™ UP Core Virtual Address Map                                                      |                 |
| Figure 1.3: Address Translation During Cache Access with FMT Implementation                              | 30              |
| Figure 1.4: Address Translation During a Cache Access with TLB Implementation                            | 31              |
| Figure 1.5: FDC Overview                                                                                 | 39              |
| Figure 1.6: cJTAG Support                                                                                | 40              |
| Figure 2.1: microAptiv™ UP Core Pipeline Stages with high-performance MDU and optional TLB MMU           | 42              |
| Figure 2.2: microAptiv™ UP Core Pipeline Stages with area-efficient MDU                                  |                 |
| Figure 2.3: microAptiv™ UP Core Pipeline Stages with optional TLB MMU                                    | 43              |
| Figure 2.4: Instruction Cache Miss Timing                                                                | 45              |
| Figure 2.5: Load/Store Cache Miss Timing                                                                 | 46              |
| Figure 2.6: Multiply Pipeline                                                                            | 51              |
| Figure 2.7: Multiply With Dependency From ALU                                                            | 51              |
| Figure 2.8: Multiply With Dependency From Load Hit                                                       | 51              |
| Figure 2.9: Multiply With Dependency From Load Miss                                                      |                 |
| Figure 2.10: subtractMUL Bypassing Result to Integer Instructions                                        | 52              |
| Figure 2.11: MDU Pipeline Flow During a 8-bit Divide (DIV) Operation                                     |                 |
| Figure 2.12: MDU Pipeline Flow During a 16-bit Divide (DIV) Operation                                    | 53              |
| Figure 2.13: MDU Pipeline Flow During a 24-bit Divide (DIV) Operation                                    | 53              |
| Figure 2.14: MDU Pipeline Flow During a 32-bit Divide (DIV) Operation                                    | 53              |
| Figure 2.15: MDU Pipeline Behavior During Multiply Operations                                            | 55              |
| Figure 2.16: MDU Pipeline Flow During a 32x16 Multiply Operation                                         | <mark>56</mark> |
| Figure 2.17: MDU Pipeline Flow During a 32x32 Multiply Operation                                         |                 |
| Figure 2.18: High-Performance MDU Pipeline Flow During a 8-bit Divide (DIV) Operation                    | 57              |
| Figure 2.19: High-Performance MDU Pipeline Flow During a 16-bit Divide (DIV) Operation                   | 57              |
| Figure 2.20: High-Performance MDU Pipeline Flow During a 24-bit Divide (DIV) Operation                   |                 |
| Figure 2.21: High-Performance MDU Pipeline Flow During a 32-bit Divide (DIV) Operation                   |                 |
| Figure 2.22: microAptiv™ UP Area-Efficient MDU Pipeline Flow During a Multiply Operation                 |                 |
| Figure 2.23: microAptiv™ UP Core Area-Efficient MDU Pipeline Flow During a Multiply Accumulate Operation |                 |
| Figure 2.24: microAptiv™ UP Core Area-Efficient MDU Pipeline Flow During a Divide (DIV) Operation        |                 |
| Figure 2.25: IU Pipeline Branch Delay                                                                    |                 |
| Figure 2.26: IU Pipeline Data bypass                                                                     |                 |
| Figure 2.27: IU Pipeline M to E bypass                                                                   |                 |
| Figure 2.28: IU Pipeline A to E Data bypass                                                              |                 |
| Figure 2.29: IU Pipeline Slip after a MFHI                                                               | 64              |
| Figure 2.30: Coprocessor 2 Interface Transactions                                                        |                 |
| Figure 2.31: Instruction Cache Miss Slip                                                                 | 67              |
| Figure 3.1: MIPS32® DSP Module Control Register (DSPControl) Format                                      |                 |
| Figure 4.1: Address Translation During a Cache Access with TLB MMU                                       |                 |
| Figure 4.2: Address Translation During a Cache Access with FMT MMU                                       |                 |
| Figure 4.3: microAptiv™ UP processor core Virtual Memory Map                                             |                 |
| Figure 4.4: User Mode Virtual Address Space                                                              |                 |
| Figure 4.5: Kernel Mode Virtual Address Space                                                            |                 |
| Figure 4.6: Debug Mode Virtual Address Space                                                             |                 |
| Figure 4.7: JTLB Entry (Tag and Data)                                                                    |                 |
| Figure 4.8: JTLB Entry (Tag and Data)                                                                    |                 |
| Figure 4.9: Overview of a Virtual-to-Physical Address Translation in the microAptiv™ UP Core             | 89              |

| Figure 4.10: 32-bit Virtual Address Translation                                      | 90    |
|--------------------------------------------------------------------------------------|-------|
| Figure 4.11: TLB Address Translation Flow in the microAptiv™ UP Processor Core       | 92    |
| Figure 4.12: FMT Memory Map (ERL=0) in the microAptiv <sup>™</sup> UP Processor Core |       |
| Figure 4.13: FMT Memory Map (ERL=1) in the microAptiv™ UP Processor Core             |       |
| Figure 5.1: Interrupt Generation for Vectored Interrupt Mode                         |       |
| Figure 5.2: Interrupt Generation for External Interrupt Controller Interrupt Mode    |       |
| Figure 5.3: General Exception Handler (HW)                                           |       |
| Figure 5.4: General Exception Servicing Guidelines (SW)                              |       |
| Figure 5.5: TLB Miss Exception Handler (HW)                                          |       |
| Figure 5.6: TLB Exception Servicing Guidelines (SW)                                  |       |
| Figure 5.7: Reset, Soft Reset and NMI Exception Handling and Servicing Guidelines    |       |
| Figure 6.1: Index Register Format                                                    |       |
| Figure 6.2: Random Register Format                                                   |       |
| Figure 6.3: EntryLo0, EntryLo1 Register Format                                       |       |
| Figure 6.4: Context Register Format                                                  |       |
| Figure 6.5: UserLocal Register Format                                                |       |
| Figure 6.6: PageMask Register Format                                                 |       |
| Figure 6.7: PageGrain Register Format                                                |       |
| Figure 6.8: Wired and Random Entries in the TLB                                      |       |
| Figure 6.9: Wired Register Format                                                    |       |
| Figure 6.10: HWREna Register Format                                                  |       |
| Figure 6.11: BadVAddr Register Format                                                |       |
| Figure 6.12: BadInstr Register Format                                                |       |
| Figure 6.13: BadInstrP Register Format                                               |       |
| Figure 6.14: Count Register Format                                                   |       |
| Figure 6.15: EntryHi Register Format                                                 |       |
| Figure 6.16: Compare Register Format                                                 |       |
| Figure 6.17: Status Register Format                                                  |       |
| Figure 6.18: IntCtl Register Format                                                  |       |
| Figure 6.19: SRSCtl Register Format                                                  |       |
| Figure 6.20: SRSMap Register Format                                                  |       |
| Figure 6-21: View_IPL Register Format                                                |       |
| Figure 6-22: SRSMap Register Format                                                  |       |
| Figure 6.23: Cause Register Format                                                   |       |
| Figure 6-24: View_RIPL Register Format                                               |       |
| Figure 6-25: NestedExc Register Format                                               |       |
|                                                                                      |       |
| Figure 6.26: EPC Register Format<br>Figure 6-27: NestedEPC Register Format           |       |
|                                                                                      |       |
| Figure 6.28: PRId Register Format                                                    |       |
| Figure 6.29: EBase Register Format                                                   |       |
| Figure 6.30: CDMMBase Register Format.                                               |       |
| Figure 6.31: Config Register Format — Select 0                                       |       |
| Figure 6.32: Config1 Register Format — Select 1                                      |       |
| Figure 6.33: Config2 Register Format — Select 2                                      |       |
| Figure 6-34: Config3 Register Format                                                 |       |
| Figure 6-35: Config4 Register Format                                                 |       |
| Figure 6-36: Config5 Register Format                                                 |       |
| Figure 6.37: Config7 Register Format                                                 |       |
| Figure 6.38: LLAddr Register Format                                                  |       |
| Figure 6.39: WatchLo Register Format                                                 |       |
| Figure 6.40: WatchHi Register Format                                                 |       |
| Figure 6.41: Debug Register Format                                                   |       |
| Figure 6.42: TraceControl Register Format                                            | . 193 |

| Figure 6.43: TraceControl2 Register Format       19         Figure 6.44: User Trace Data1/User Trace Data2 Register Format       19         Figure 6.45: Trace BPC Register Format       19 | 97        |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------|
|                                                                                                                                                                                             |           |
|                                                                                                                                                                                             | 28        |
| Figure 6.46: Debug2 Register Format                                                                                                                                                         |           |
| Figure 6.47: DEPC Register Format                                                                                                                                                           |           |
| Figure 6.48: Performance Counter Control Register                                                                                                                                           |           |
| Figure 6.49: Performance Counter Count Register                                                                                                                                             |           |
| Figure 6.50: ErrCtl Register Format                                                                                                                                                         |           |
| Figure 6.51: CacheErr Register (Primary Caches)                                                                                                                                             | na        |
| Figure 6.52: TagLo Register Format (ErrCtl <sub>WST</sub> =0)                                                                                                                               |           |
| Figure 6.53: TagLo Register Format (ErrCtl <sub>WST</sub> =1)                                                                                                                               |           |
| Figure 6.54: DataLo Register Format                                                                                                                                                         |           |
| Figure 6.55: ErrorEPC Register Format                                                                                                                                                       |           |
| Figure 6.56: DeSave Register Format                                                                                                                                                         |           |
| Figure 6-57: KScratchn Register Format                                                                                                                                                      |           |
| Figure 8.1: I-Cache and D-Cache Array Formats                                                                                                                                               |           |
| Figure 8.2: I-Cache Array Formats                                                                                                                                                           |           |
| Figure 8.3: D-Cache Array Formats                                                                                                                                                           |           |
| Figure 0.5: D-Cache Array Formats                                                                                                                                                           |           |
| Figure 10.2: IBS Register Format                                                                                                                                                            |           |
| Figure 10.2: IBS Register Format                                                                                                                                                            |           |
| Figure 10.3: IBAN Register Format                                                                                                                                                           |           |
|                                                                                                                                                                                             |           |
| Figure 10.5: IBASIDn Register Format                                                                                                                                                        |           |
| Figure 10.6: IBCn Register Format                                                                                                                                                           |           |
| Figure 10.7: IBCCn Register Format                                                                                                                                                          |           |
| Figure 10.8: IBPCn Register Format                                                                                                                                                          |           |
| Figure 10.9: DBS Register Format                                                                                                                                                            |           |
| Figure 10.10: DBAn Register Format                                                                                                                                                          |           |
| Figure 10.11: DBMn Register Format                                                                                                                                                          |           |
| Figure 10.12: DBASIDn Register Format                                                                                                                                                       |           |
| Figure 10.13: DBCn Register Format                                                                                                                                                          |           |
| Figure 10.14: DBVn Register Format                                                                                                                                                          |           |
| Figure 10.15: DBCCn Register Format                                                                                                                                                         |           |
| Figure 10.16: DBPCn Register Format                                                                                                                                                         |           |
| Figure 10.17: DVM Register Format                                                                                                                                                           |           |
| Figure 10.18: CBTC Register Format                                                                                                                                                          | 53<br>- ▲ |
| Figure 10.19: PrCndA Register Format                                                                                                                                                        |           |
| Figure 10.20: STCtl Register Format                                                                                                                                                         |           |
| Figure 10.21: STCnt Register Format                                                                                                                                                         |           |
| Figure 10.22: TAP Controller State Diagram                                                                                                                                                  |           |
| Figure 10.23: Concatenation of the EJTAG Address, Data and Control Registers                                                                                                                |           |
| Figure 10.24: TDI to TDO Path When in Shift-DR State and FASTDATA Instruction is Selected                                                                                                   |           |
| Figure 10.25: Device Identification Register Format                                                                                                                                         |           |
| Figure 10.26: Implementation Register Format                                                                                                                                                |           |
| Figure 10.27: EJTAG Control Register Format                                                                                                                                                 |           |
| Figure 10.28: Endian Formats for the PAD Register                                                                                                                                           |           |
| Figure 10.29: Fastdata Register Format                                                                                                                                                      |           |
| Figure 10.30: Trace Logic Overview                                                                                                                                                          |           |
| Figure 10.31: Control/Status Register                                                                                                                                                       |           |
| Figure 10.32: ITCBTW Register Format                                                                                                                                                        |           |
| Figure 10.33: ITCBRDP Register Format                                                                                                                                                       |           |
| Figure 10.34: ITCBWRP Register Format                                                                                                                                                       |           |
| Figure 10.35: PCSAMPLE TAP Register Format (MIPS32)                                                                                                                                         | 94        |

| 297 |
|-----|
| 298 |
| 299 |
| 300 |
| 301 |
| 302 |
| 302 |
| 303 |
| 305 |
| 324 |
| 352 |
| 353 |
| 353 |
|     |

### **List of Tables**

| Table 1.1: Instruction and Data Cache Attributes                                           | 34  |
|--------------------------------------------------------------------------------------------|-----|
| Table 2.1: High-performance MDU Stalls                                                     | 47  |
| Table 2.2: DSP Instruction Delays                                                          | 48  |
| Table 2.3: Delays for Interesting Sequences with DSPControl Dependency                     | 50  |
| Table 2.4: MDU Instruction Latencies (High-Performance MDU)                                | 54  |
| Table 2.5: MDU Instruction Repeat Rates (High-Performance MDU)                             | 55  |
| Table 2.6: microAptiv™ UP Core Instruction Latencies (Area-Efficient MDU)                  | 58  |
| Table 2.7: Pipeline Interlocks                                                             |     |
| Table 2.8: Instruction Interlocks                                                          | 67  |
| Table 2.9: Execution Hazards                                                               | 69  |
| Table 2.10: Instruction Hazards                                                            | 70  |
| Table 2.11: Hazard Instruction Listing                                                     | 70  |
| Table 3.1: MIPS® DSP Module Control Register (DSPControl) Field Descriptions               | 73  |
| Table 3.2: DSPControl ouflag Bits                                                          |     |
| Table 4.1: User Mode Segments                                                              | 80  |
| Table 4.2: Kernel Mode Segments                                                            | 81  |
| Table 4.3: Physical Address and Cache Attributes for dseg, dmseg, and drseg Address Spaces | 83  |
| Table 4.4: CPU Access to drseg Address Range                                               | 83  |
| Table 4.5: CPU Access to dmseg Address Range                                               | 84  |
| Table 4.6: TLB Tag Entry Fields                                                            | 86  |
| Table 4.7: TLB Data Entry Fields                                                           | 86  |
| Table 4.8: TLB Instructions                                                                |     |
| Table 4.9: Cache Coherency Attributes                                                      |     |
| Table 4.10: Cacheability of Segments with Block Address Translation                        | 93  |
| Table 5.1: Priority of Exceptions                                                          | 97  |
| Table 5.2: Interrupt Modes                                                                 |     |
| Table 5.3: Relative Interrupt Priority for Vectored Interrupt Mode                         |     |
| Table 5.4: Exception Vector Offsets for Vectored Interrupts                                | 107 |
| Table 5.5: Exception Vector Base Addresses.                                                |     |
| Table 5.6: Exception Vector Offsets                                                        |     |
| Table 5.7: Exception Vectors                                                               | 111 |
| Table 5.8: Value Stored in EPC, ErrorEPC, or DEPC on an Exception                          |     |
| Table 5.9: Debug Exception Vector Addresses                                                |     |
| Table 5.10: Register States an Interrupt Exception                                         |     |
| Table 5.11: Register States on a Watch Exception                                           |     |
| Table 5.12: CP0 Register States on an Address Exception Error.                             |     |
| Table 5.13: CP0 Register States on a TLB Refill Exception                                  |     |
| Table 5.14: CP0 Register States on a TLB Invalid Exception                                 |     |
| Table 5.15: CP0 Register States on a Execute-Inhibit Exception                             |     |
| Table 5.16: CP0 Register States on a Read-Inhibit Exception                                |     |
| Table 5.17: CP0 Register States on a Cache/SPRAM Parity Error Exception                    |     |
| Table 5.18: Register States on a Coprocessor Unusable Exception                            |     |
| Table 5.19: Register States on a TLB Modified Exception                                    |     |
| Table 6.1: CP0 Registers                                                                   |     |
| Table 6.2: CP0 Register R/W Field Types                                                    |     |
| Table 6.3: Index Register Field Descriptions                                               |     |
| Table 6.4: Random Register Field Descriptions                                              | 139 |

| Table 6.5: EntryLo0, EntryLo1 Register Field Descriptions       14         Table 6.6: Cache Coherency Attributes       14         Table 6.7: Context Register Field Descriptions       14         Table 6.7: Context Register Field Descriptions       14 | 41<br>41  |
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------|
| Table 6.7: Context Register Field Descriptions                                                                                                                                                                                                            | 11        |
|                                                                                                                                                                                                                                                           |           |
| Table 6.8: UserLocal Register Field Descriptions                                                                                                                                                                                                          |           |
| Table 6.9: PageMask Register Field Descriptions                                                                                                                                                                                                           |           |
| Table 6.10: Values for the Mask and MaskX <sup>1</sup> Fields of the PageMask Register                                                                                                                                                                    |           |
| Table 6.11: PageGrain Register Field Descriptions                                                                                                                                                                                                         |           |
| Table 6.12: Wired Register Field Descriptions                                                                                                                                                                                                             |           |
| Table 6.13: HWREna Register Field Descriptions                                                                                                                                                                                                            |           |
| Table 6.15: BadInstr Register Field Descriptions                                                                                                                                                                                                          |           |
| Table 6.14: BadVAddr Register Field Description                                                                                                                                                                                                           |           |
| Table 6.16: BadInstrP Register Field Descriptions                                                                                                                                                                                                         | 19        |
| Table 6.17: Count Register Field Description                                                                                                                                                                                                              | 50        |
| Table 6.18: EntryHi Register Field Descriptions                                                                                                                                                                                                           | 50        |
| Table 6.19: Compare Register Field Description                                                                                                                                                                                                            |           |
| Table 6.20: Status Register Field Descriptions                                                                                                                                                                                                            | 52        |
| Table 6.21: IntCtl Register Field Descriptions                                                                                                                                                                                                            | 57        |
| Table 6.22: SRSCtl Register Field Descriptions                                                                                                                                                                                                            | 61        |
| Table 6.23: Sources for new SRSCtl <sub>CSS</sub> on an Exception or Interrupt                                                                                                                                                                            | 63        |
| Table 6.24: SRSMap Register Field Descriptions                                                                                                                                                                                                            |           |
| Table 6.25: View_IPL Register Field Descriptions                                                                                                                                                                                                          |           |
| Table 6.26: SRSMap Register Field Descriptions                                                                                                                                                                                                            | 65        |
| Table 6.27: Cause Register Field Descriptions                                                                                                                                                                                                             | 65        |
| Table 6.28: Cause Register ExcCode Field                                                                                                                                                                                                                  | <u>89</u> |
| Table 6.29: View_RIPL Register Field Descriptions                                                                                                                                                                                                         | 70        |
| Table 6.30: NestedExc Register Field Descriptions                                                                                                                                                                                                         | 71        |
| Table 6.31: EPC Register Field Description                                                                                                                                                                                                                | 72        |
| Table 6.32: NestedEPC Register Field Descriptions                                                                                                                                                                                                         | 73        |
| Table 6.33: PRId Register Field Descriptions                                                                                                                                                                                                              | 73        |
| Table 6.34: EBase Register Field Descriptions                                                                                                                                                                                                             | 75        |
| Table 6.35: CDMMBase Register Field Descriptions                                                                                                                                                                                                          | 75        |
| Table 6.36: Config Register Field Descriptions                                                                                                                                                                                                            | 76        |
| Table 6.37: Cache Coherency Attributes                                                                                                                                                                                                                    |           |
| Table 6.38: Config1 Register Field Descriptions — Select 1                                                                                                                                                                                                | 79        |
| Table 6.40: Config3 Register Field Descriptions                                                                                                                                                                                                           |           |
| Table 6.39: Config2 Register Field Descriptions — Select 1    18                                                                                                                                                                                          | 31        |
| Table 6.41: Config4 Register Field Descriptions                                                                                                                                                                                                           | 35        |
| Table 6.42: Config5 Register Field Descriptions    18                                                                                                                                                                                                     | 35        |
| Table 6.43: Config7 Register Field Descriptions                                                                                                                                                                                                           | 36        |
| Table 6.44: LLAddr Register Field Descriptions                                                                                                                                                                                                            | 36        |
| Table 6.45: WatchLo Register Field Descriptions                                                                                                                                                                                                           | 37        |
| Table 6.46: WatchHi Register Field Descriptions                                                                                                                                                                                                           | 38        |
| Table 6.47: Debug Register Field Descriptions         18                                                                                                                                                                                                  | 39        |
| Table 6.48: TraceControl Register Field Descriptions         19                                                                                                                                                                                           | 93        |
| Table 6.49: TraceControl2 Register Field Descriptions         19                                                                                                                                                                                          | 95        |
| Table 6.50: UserTraceData1/UserTraceData2 Register Field Descriptions         19                                                                                                                                                                          | 97        |
| Table 6.51: TraceBPC Register Field Descriptions                                                                                                                                                                                                          | 8         |
| Table 6.52: Debug2 Register Field Descriptions    19                                                                                                                                                                                                      | 99        |
| Table 6.53: DEPC Register Formats                                                                                                                                                                                                                         |           |
| Table 6.54: Performance Counter Register Selects    20                                                                                                                                                                                                    |           |
| Table 6.55: Performance Counter Control Register Field Descriptions         20                                                                                                                                                                            |           |
| Table 6.56: Performance Counter Events Sorted by Event Number    20                                                                                                                                                                                       |           |
| Table 6.57: Performance Counter Event Descriptions Sorted by Event Type    20                                                                                                                                                                             | )4        |

| Table 6.58: Performance Counter Count Register Field Descriptions           |     |
|-----------------------------------------------------------------------------|-----|
| Table 6.59: Errctl Register Field Descriptions                              |     |
| Table 6.60: CacheErr Register Field Descriptions (Primary Caches)           |     |
| Table 6.61: TagLo Register Field Descriptions                               |     |
| Table 6.62: DataLo Register Field Description                               |     |
| Table 6.63: ErrorEPC Register Field Description                             |     |
| Table 6.64: DeSave Register Field Description                               |     |
| Table 6.65: KScratchn Register Field Descriptions                           | 214 |
| Table 8.1: Instruction and Data Cache Attributes                            | 218 |
| Table 8.2: Instruction and Data Cache Sizes                                 | 219 |
| Table 8.3: LRU and Dirty Width in Way-Select Array                          | 221 |
| Table 8.4: Potential Virtual Aliasing Bits                                  | 224 |
| Table 8.5: Way Selection Encoding, 4 Ways                                   | 225 |
| Table 8.6: Way Selection Encoding, 3 Ways                                   | 225 |
| Table 8.7: Way Selection Encoding, 2 Ways                                   | 226 |
| Table 10.1: DCR Register Field Descriptions                                 | 231 |
| Table 10.2: Addresses for Instruction Breakpoint Registers                  |     |
| Table 10.3: IBS Register Field Descriptions                                 |     |
| Table 10.4: IBAn Register Field Descriptions                                |     |
| Table 10.5: IBMn Register Field Descriptions                                |     |
| Table 10.6: IBASIDn Register Field Descriptions                             |     |
| Table 10.7: IBCn Register Field Descriptions                                |     |
| Table 10.8: IBCCn Register Field Descriptions                               |     |
| Table 10.9: IBPCn Register Field Descriptions                               |     |
| Table 10.10: Addresses for Data Breakpoint Registers                        |     |
| Table 10.11: DBS Register Field Descriptions                                |     |
| Table 10.13: DBMn Register Field Descriptions                               |     |
| Table 10.14: DBASIDn Register Field Descriptions                            |     |
| Table 10.12: DBAn Register Field Descriptions                               |     |
| Table 10.15: DBCn Register Field Descriptions                               |     |
| Table 10.17: DBCCn Register Field Descriptions                              |     |
| Table 10.16: DBVn Register Field Descriptions                               |     |
| Table 10.18: DBPCn Register Field Descriptions                              |     |
| Table 10.20: Addresses for Complex Breakpoint Registers                     |     |
| Table 10.19: DVM Register Field Descriptions                                |     |
| Table 10.21: CBTC Register Field Descriptions                               |     |
| Table 10.23: Priming Conditions and Register Values for 6I/2D Configuration |     |
| Table 10.24: Priming Conditions and Register Values for 8I/4D Configuration |     |
| Table 10.22: PrCndA Register Field Descriptions                             |     |
| Table 10.25: STCtl Register Field Descriptions                              |     |
| Table 10.26: STCtl Register Field Descriptions                              |     |
| Table 10.27: EJTAG Interface Pins                                           |     |
| Table 10.28: Implemented EJTAG Instructions                                 |     |
| Table 10.30: Implementation Register Descriptions                           |     |
| Table 10.29: Device Identification Register                                 |     |
| Table 10.31: EJTAG Control Register Descriptions                            |     |
| Table 10.32: Fastdata Register Field Description                            |     |
| Table 10.33: Operation of the FASTDATA access                               |     |
| Table 10.34: EJ_DisableProbeDebug Signal Overview                           |     |
| Table 10.35: Data Bus Encoding                                              |     |
| Table 10.36: Tag Bit Encoding                                               |     |
| Table 10.37: Control/Status Register Field Descriptions                     |     |
| Table 10.38: ITCBTW Register Field Descriptions                             |     |

| Table 10.39: ITCBRDP Register Field Descriptions                                      | 292 |
|---------------------------------------------------------------------------------------|-----|
| Table 10.40: ITCBWRP Register Field Descriptions                                      | 292 |
| Table 10.41: drseg Registers that Enable/Disable Trace from Breakpoint-Based Triggers | 293 |
| Table 10.42: FDC TAP Register Field Descriptions                                      | 298 |
| Table 10.43: FDC Register Mapping                                                     |     |
| Table 10.44: FDC Access Control and Status Register Field Descriptions                | 299 |
| Table 10.45: FDC Configuration Register Field Descriptions                            | 300 |
| Table 10.46: FDC Status Register Field Descriptions                                   | 301 |
| Table 10.47: FDC Receive Register Field Descriptions                                  | 302 |
| Table 10.49: FDTXn Address Decode                                                     | 303 |
| Table 10.48: FDC Transmit Register Field Descriptions                                 | 303 |
| Table 11.1: Byte Access Within a Word                                                 |     |
| Table 12.1: Encoding of the Opcode Field                                              | 311 |
| Table 12.2: Special Opcode Encoding of Function Field                                 | 311 |
| Table 12.3: Special2 Opcode Encoding of Function Field                                | 311 |
| Table 12.4: Special3 Opcode Encoding of Function Field                                | 312 |
| Table 12.5: RegImm Encoding of rt Field                                               |     |
| Table 12.6: COP2 Encoding of rs Field                                                 |     |
| Table 12.7: COP2 Encoding of rt Field When rs=BC2                                     | 312 |
| Table 12.8: COP0 Encoding of rs Field                                                 |     |
| Table 12.9: COP0 Encoding of Function Field When rs=CO                                |     |
| Table 12.10: Instruction Set                                                          |     |
| Table 12.11: Usage of Effective Address                                               | 324 |
| Table 12.12: Encoding of Bits[17:16] of CACHE Instruction                             | 325 |
| Table 12.13: Encoding of Bits [20:18] of the CACHE Instruction                        | 326 |
| Table 13.1: 16-Bit Re-encoding of Frequent MIPS Instructions                          | 356 |
| Table 13.2: 16-Bit Re-encoding of Frequent MIPS Instruction Sequences                 | 357 |
| Table 13.3: Instruction-Specific Register Specifiers and Immediate Field Values       |     |
| Table 13.4: 16-Bit Instruction General-Purpose Registers - \$2-\$7, \$16, \$17        | 359 |
| Table 13.5: SB16, SH16, SW16 Source Registers - \$0, \$2-\$7, \$17                    |     |
| Table 13.6: 16-Bit Instruction Implicit General-Purpose Registers                     |     |
| Table 13.7: 16-Bit Instruction Special-Purpose Registers                              |     |
| Table 13.8: 32-bit Instructions introduced within microMIPS                           | 361 |

Chapter 1

# Introduction to the MIPS32® microAptiv<sup>™</sup> UP Processor Core

The MIPS32® microAptiv<sup>™</sup> UP core from MIPS Technologies is a high-performance, low-power, 32-bit MIPS RISC processor core intended for custom system-on-silicon applications. The core is designed for semiconductor manufacturing companies, ASIC developers, and system OEMs who want to rapidly integrate their own custom logic and peripherals with a high-performance RISC processor. The microAptiv UP core is fully synthesizable to allow maximum flexibility; it is highly portable across processes and can easily be integrated into full system-on-silicon designs. This allows developers to focus their attention on end-user specific characteristics of their product.

The microAptiv UP core is ideally positioned to support new products for emerging segments of the digital consumer, network, systems, and information-management markets, enabling new tailored solutions for embedded applications.

The microAptiv UP core implements the MIPS Architecture in a 5-stage pipeline. It includes support for the micro-MIPS<sup>TM</sup> ISA, an Instruction Set Architecture with optimized MIPS32 16-bit and 32-bit instructions that provides a significant reduction in code size with a performance equivalent to MIPS32. The microAptiv UP core is a successor to the M14Kc<sup>TM</sup>, designed from the same microarchitecture, including the Microcontroller Application-Specific Extension (MCU<sup>TM</sup> ASE), enhanced interrupt handling, lower interrupt latency, native AMBA®-3 AHB-Lite Bus Interface Unit (BIU), with additional power saving, debug, and profiling features.

The microAptiv UP core has an option to include the MIPS Architecture DSP Module Revision 2 that provides digital signal processing capabilities, with support for a number of powerful data processing operations.

On the microAptiv UP core, instruction and data caches are optional and are fully programmable from 0 - 64 Kbytes in size. In addition, each cache can be organized as direct-mapped, 2-way, 3-way, or 4-way set associative. On a cache miss, loads are blocked only until the first critical word becomes available. The pipeline resumes execution while the remaining words are being written to the cache. Both caches are virtually indexed and physically tagged. Virtual indexing allows the cache to be indexed in the same clock in which the address is generated rather than waiting for the virtual-to-physical address translation in the TLB.

A distinguishing characteristic of the microAptiv UP family is the inclusion of a configurable MIPS DSP Module. The MIPS DSP Module provides support for a number of powerful data processing operations. It includes instructions for executing fractional arithmetic (Q15/Q31) and saturating arithmetic. Additionally, for smaller data sizes, SIMD operations are supported, allowing 2x16b or 4x8b operations to occur simultaneously. Another feature of the DSP Module is the inclusion of additional HI/LO accumulator registers that improve the parallelization of independent accumulation routines.

The core includes one of two different Multiply/Divide Unit (MDU) implementations, selectable at build-time if the DSP Module is not configured in, allowing the user to trade-off performance and area for integer multiply and divide operations. The high-performance MDU option implements single-cycle multiply and multiply-accumulate (MAC) instructions that enable DSP algorithms to be performed efficiently. It allows 32-bit x 16-bit MAC instructions to be issued every cycle, while a 32-bit x 32-bit MAC instruction can be issued every other cycle. The area-efficient MDU option handles multiplies with a one-bit-per-clock iterative algorithm.

If the core is configured with the DSP Module, the Multiply-Divide Unit (MDU) is fully pipelined and supports a maximum issue rate of one 32x32 multiply (MUL/MULT/MULTU), multiply-add (MADD/MADDU), or multiply-subtract (MSUB/MSUBU) operations per clock.

The MMU of the microAptiv UP core has two configurable selections: 1) a 4-entry instruction TLB (ITLB), a 4-entry data TLB (DTLB), and a 16 or 32 dual-entry joint TLB (JTLB) with variable page sizes; 2) a simplified Fixed Mapping Translation (FMT) mechanism, for applications that do not require the full capabilities of a TLB.

The basic Enhanced JTAG (EJTAG) features provide CPU run control with stop, single-stepping and re-start, and with software breakpoints using the SDBBP instruction. Additional EJTAG features such as instruction and data virtual address hardware breakpoints, complex hardware breakpoints, connection to an external EJTAG probe through the Test Access Port (TAP), and PC/Data tracing, may be included as an option.

#### 1.1 Features

- 5-stage pipeline
- 32-bit Address and Data Paths
- MIPS32 Instruction Set Architecture
- MIPS32 Enhanced Architecture Features
  - Vectored interrupts and support for external interrupt controller
  - Programmable exception vector base
  - Atomic interrupt enable/disable
  - GPR shadow registers (one, three, seven, or fifteen additional shadows can be optionally added to minimize latency for interrupt handlers)
  - Bit field manipulation instructions
  - Virtual memory support (smaller page sizes and hooks for more extensive page table manipulation)
- microMIPS Instruction Set Architecture
  - microMIPS ISA is a build-time configurable option that reduces code size over MIPS32, while maintaining MIPS32 performance.
  - Combining both 16-bit and 32-bit opcodes, microMIPS supports all MIPS32 instructions (except branch-likely instructions) with new optimized encoding. Frequently used MIPS32 instructions are available as 16-bit instructions.
  - Added fifteen new 32-bit instructions and thirty-nine 16-bit instructions.
  - Stack pointer implicit in instruction.
  - MIPS32 assembly and ABI-compatible.
  - Supports MIPS architecture Modules and User-defined Instructions (UDIs).

#### • MCU<sup>TM</sup> ASE

- Increases the number of interrupt hardware inputs from 6 to 8 for Vectored Interrupt (VI) mode, and from 63 to 255 for External Interrupt Controller (EIC) mode.
- Separate priority and vector generation. 16-bit vector address is provided.
- Hardware assist combined with the use of Shadow Register Sets to reduce interrupt latency during the prologue and epilogue of an interrupt.
- An interrupt return with automated interrupt epilogue handling instruction (IRET) improves interrupt latency.
- Supports optional interrupt chaining.
- Two memory-to-memory atomic read-modify-write instructions (ASET and ACLR) eases commonly used semaphore manipulation in microcontroller applications. Interrupts are automatically disabled during the operation to maintain coherency.
- Programmable Cache Sizes
  - Individually configurable instruction and data caches
  - Sizes from 0 64KB
  - Direct Mapped, 2-, 3-, or 4-Way Set Associative
  - Loads block only until critical word is available
  - Write-back and write-through support
  - 128-bit (16-byte) cache line size, word sectored suitable for standard 32-bit wide single-port SRAM
  - Virtually indexed, physically tagged
  - Cache line locking support
  - Non-blocking prefetches
- Scratchpad RAM (SPRAM) Support
  - Can optionally replace 1 way of the I- and/or D-cache with a fast scratchpad RAM
  - Independent external pin interfaces for I- and D-scratchpads
  - 20 index address bits allow access of arrays up to 1MB
  - Interface allows back-stalling the core
- MIPS32 Privileged Resource Architecture (PRA)
  - Count/Compare registers for real-time timer interrupts

- I and D watch registers for SW breakpoints
- Memory Management Unit
  - Simple Fixed Mapping Translation (FMT) mechanism, or
  - 4-entry instruction and data Translation Lookaside Buffers (ITLB/DTLB) and a 16 or 32 dual-entry joint TLB (JTLB) with variable page sizes. Read, write, and execute page-protection attributes individually programmable.
- Bus Interface Unit (BIU)
  - Supports AMBA-3 AHB-Lite protocol
  - All I/O's fully registered
  - Separate unidirectional 32-bit address and data buses
  - Two 16-byte collapsing write buffers
  - Support for variable CPU and bus clock ratios to allow the bus to run at a lower speed than the CPU.
- Parity Support
  - The I-cache, D-cache, ISPRAM, and DSPRAM support optional parity detection.
  - MIPS DSP Module (Revision 2.0)
  - Support for MAC operations with 3 additional pairs of Hi/Lo accumulator registers (Ac0 Ac3)
  - Fractional data types (Q15, Q31) with rounding support
  - Saturating arithmetic with overflow handling
  - SIMD instructions operate on 2x16-bit or 4x8-bit operands simultaneously
  - Separate MDU pipeline with full-sized hardware multiplier to support back-to-back operations
  - The DSP Module is build-time configurable.
- Multiply/Divide Unit (area-efficient configuration without DSP)
  - 32 clock latency on multiply
  - 34 clock latency on multiply-accumulate
  - 33-35 clock latency on divide (sign-dependent)
- Multiply/Divide Unit (high-performance configuration without DSP)
  - Maximum issue rate of one 32x16 multiply per clock via on-chip 32x16 hardware multiplier array.
  - Maximum issue rate of one 32x32 multiply every other clock

- Early-in iterative divide. Minimum 11 and maximum 34 clock latency (dividend (*rs*) sign extension-dependent)
- Multiply/Divide Unit (with DSP configuration)
  - Maximum issue rate of one 32x32 multiply per clock via on-chip 32x32 hardware multiplier array
  - Maximum issue rate of one 32x32 multiply every clock
  - Early-in iterative divide. Minimum 12 and maximum 38 clock latency (dividend (*rs*) sign extension-dependent)
- CorExtend® User-Defined Instruction Set Extensions
  - Allows user to define and add instructions to the core at build time
  - Maintains full MIPS32 compatibility
  - Supported by industry-standard development tools
  - Single or multi-cycle instructions
- Coprocessor 2 interface
  - 32-bit interface to an external coprocessor
- Power Control
  - Minimum frequency: 0 MHz
  - Power-down mode (triggered by WAIT instruction)
  - Support for software-controlled clock divider
  - Support for extensive use of local gated clocks
  - Optional power-saving mode in organizing individual cache memory array per way
- EJTAG Debug/Profiling and iFlowtrace<sup>™</sup> Mechanism
  - CPU control with start, stop, and single stepping
  - Virtual instruction and data address/value breakpoints
  - Hardware breakpoint supports both address match and address range triggering
  - Optional simple hardware breakpoints on virtual addresses; 8I/4D, 6I/2D, 4I/2D, 2I/1D breakpoints, or no breakpoints
  - Optional complex hardware breakpoints with 8I/4D, 6I/2D simple breakpoints
  - TAP controller is chainable for multi-CPU debug

- Supports EJTAG (IEEE 1149.1) and compatible with cJTAG 2-wire (IEEE 1149.7) extension protocol
- Cross-CPU breakpoint support
- iFlowtrace support for real-time instruction PC and special events
- PC and/or load/store address sampling for profiling
- Performance Counters
- Support for Fast Debug Channel (FDC)
- SecureDebug
  - An optional feature that disables access via EJTAG in an untrusted environment
- Testability
  - Full scan design achieves test coverage in excess of 99% (dependent on library and configuration options)
  - Optional memory BIST for internal SRAM arrays. Two memory BIST algorithms are provided and selectable by an input pin.

#### 1.2 microAptiv<sup>™</sup> UP Core Block Diagram

The microAptiv UP core contains both required and optional blocks, as shown in the block diagram in Figure 1.1. Required blocks are the lightly shaded areas of the block diagram and are always present in any core implementation. Optional blocks may be added to the base core, depending on the needs of a specific implementation. The required blocks are as follows:

- Instruction Decode
- Execution Unit
- General Purposed Registers (GPR)
- Multiply/Divide Unit (MDU)
- System Control Coprocessor (CP0)
- Memory Management Unit (MMU)
- I/D Cache Controllers
- Bus Interface Unit (BIU)
- Power Management

Optional blocks include:

Instruction Cache

- Data Cache
- Scratchpad RAM interface
- Configurable instruction decoder supporting three ISA modes: MIPS32-only, MIPS32 and microMIPS, or microMIPS-only
- DSP (integrated with MDU)
- Coprocessor 2 interface
- CorExtend® User-Defined Instruction (UDI) interface
- Debug/Profiling with Enhanced JTAG (EJTAG) Controller, Break points, Sampling, Performance counters, Fast Debug Channel, and iFlowtrace logic





#### 1.2.1 Required Logic Blocks

The following subsections describe the required logic blocks of the microAptiv UP processor core.

#### 1.2.1.1 Execution Unit

The microAptiv UP core execution unit implements a load/store architecture with single-cycle ALU operations (logical, shift, add, subtract) and an autonomous multiply/divide unit.

The execution unit includes:

- Arithmetic Logic Unit (ALU) for performing arithmetic and bitwise logical operations. Shared adder for arithmetic operations, load/store address calculation, and branch target calculation.
- Address unit for calculating the next PC and next fetch address selection muxes.

- Load Aligner.
- Shifter and Store Aligner.
- Branch condition comparator.
- Trap condition comparator.
- Bypass muxes to advance result between two adjacent instructions with data dependency.
- Leading Zero/One detect unit for implementing the CLZ and CLO instructions.
- Actual execution of the Atomic Instructions defined in the MCU ASE.
- A separate DSP ALU and Logic block for performing part of DSP Module instructions, such as arithmetic/shift/compare operations, if the DSP function is configured.

#### 1.2.1.2 General Purposed Register (GPR) Shadow Registers

The microAptiv UP core contains thirty-two 32-bit general-purpose registers used for integer operations and address calculation. Optionally, one, three, seven or fifteen additional register file shadow sets (each containing thirty-two registers) can be added to minimize context switching overhead during interrupt/exception processing. The register file consists of two read ports and one write port and is fully bypassed to minimize operation latency in the pipeline.

#### 1.2.1.3 Multiply/Divide Unit (MDU)

The microAptiv UP core includes a multiply/divide unit (MDU) that contains a separate, dedicated pipeline for integer multiply/divide operations, and DSP Module multiply instructions. This pipeline operates in parallel with the integer unit (IU) pipeline and does not stall when the IU pipeline stalls. This allows the long-running MDU operations to be partially masked by system stalls and/or other integer unit instructions.

The MIPS architecture defines that the result of a multiply or divide operation be placed in a pair (without DSP enabled) or one of 4 pairs (with DSP enabled) of *HI* and *LO* registers. Using the Move-From-HI (MFHI) and Move-From-LO (MFLO) instructions, these values can be transferred to the general-purpose register file.

There are three configuration options for the MDU: 1) a full 32x32 multiplier block; 2) a higher performance 32x16 multiplier block; 3) an area-efficient iterative multiplier block. Option 2 and 3 are available if the DSP configuration option is disabled. If the DSP configuration option is enabled, option 1 is the default. The selection of the MDU style allows the implementor to determine the appropriate performance and area trade-off for the application.

#### MDU with 32x32 DSP Multiplier

With the DSP configuration option enabled, the MDU supports execution of one 16x16, 32x16, or 32x32 multiply or multiply-accumulate operation every clock cycle with the built in 32x32 multiplier array. The multiplier is shared with DSP Module operations.

The MDU also implements various shift instructions operating on the HI/LO register and multiply instructions as defined in the DSP Module. It supports all the data types required for this purpose and includes three extra HI/LO registers as defined by the Module.

#### MDU with 32x16 High-Performance Multiplier

The high-performance MDU consists of a 32x16 Booth-recoded multiplier, a pair of result/accumulation registers (*HI* and *LO*), a divide state machine, and the necessary multiplexers and control logic. The first number shown ('32' of

32x16) represents the *rs* operand. The second number ('16' of 32x16) represents the *rt* operand. The microAptiv UP core only checks the value of the *rt* operand to determine how many times the operation must pass through the multiplier. The 16x16 and 32x16 operations pass through the multiplier once. A 32x32 operation passes through the multiplier twice.

The MDU supports execution of one 16x16 or 32x16 multiply or multiply-accumulate operation every clock cycle; 32x32 multiply operations can be issued every other clock cycle. Appropriate interlocks are implemented to stall the issuance of back-to-back 32x32 multiply operations. The multiply operand size is automatically determined by logic built into the MDU.

#### MDU with Area-Efficient Option

With the area-efficient option, multiply and divide operations are implemented with a simple 1-bit-per-clock iterative algorithm. Any attempt to issue a subsequent MDU instruction while a multiply/divide is still active causes an MDU pipeline stall until the operation is completed.

Regardless of the multiplier array implementation, divide operations are implemented with a simple 1-bit-per-clock iterative algorithm. An early-in detection checks the sign extension of the dividend (*rs*) operand. If *rs* is 8 bits wide, 23 iterations are skipped. For a 16-bit-wide *rs*, 15 iterations are skipped, and for a 24-bit-wide *rs*, 7 iterations are skipped. Any attempt to issue a subsequent MDU instruction while a divide is still active causes an IU pipeline stall until the divide operation has completed.

#### 1.2.1.4 System Control Coprocessor (CP0)

In the MIPS architecture, CP0 is responsible for the virtual-to-physical address translation and cache protocols, the exception control system, the processor's diagnostics capability, the operating modes (kernel, user, and debug), and whether interrupts are enabled or disabled. Configuration information, such as cache size and set associativity, presence of build-time options like microMIPS, CorExtend Module or Coprocessor 2 interface, is also available by accessing the CP0 registers.

Coprocessor 0 also contains the logic for identifying and managing exceptions. Exceptions can be caused by a variety of sources, including boundary cases in data, external events, or program errors.

#### Interrupt Handling

The microAptiv UP core includes support for eight hardware interrupt pins, two software interrupts, and a timer interrupt. These interrupts can be used in any of three interrupt modes, as defined by Release 2 of the MIPS32 Architecture:

- Interrupt compatibility mode, which acts identically to that in an implementation of Release 1 of the Architecture.
- Vectored Interrupt (VI) mode, which adds the ability to prioritize and vector interrupts to a handler dedicated to that interrupt, and to assign a GPR shadow set for use during interrupt processing. The presence of this mode is denoted by the *Vlnt* bit in the *Config3* register. This mode is architecturally optional; but it is always present on the microAptiv UP core, so the *Vlnt* bit will always read as a 1 for the microAptiv UP core.
- External Interrupt Controller (EIC) mode, which redefines the way in which interrupts are handled to provide full support for an external interrupt controller handling prioritization and vectoring of interrupts. The presence of this mode denoted by the *VEIC* bit in the *Config3* register. Again, this mode is architecturally optional. On the microAptiv UP core, the *VEIC* bit is set externally by the static input, *SI\_EICPresent*, to allow system logic to indicate the presence of an external interrupt controller.

The reset state of the processor is interrupt compatibility mode, such that a processor supporting Release 2 of the Architecture, the microAptiv UP core for example, is fully compatible with implementations of Release 1 of the Architecture.

VI or EIC interrupt modes can be combined with the optional shadow registers to specify which shadow set should be used on entry to a particular vector. The shadow registers further improve interrupt latency by avoiding the need to save context when invoking an interrupt handler.

In the microAptiv UP core, interrupt latency is reduced by:

- Speculative interrupt vector prefetching during the pipeline flush.
- Interrupt Automated Prologue (IAP) in hardware: Shadow Register Sets remove the need to save GPRs, and IAP removes the need to save specific Control Registers when handling an interrupt.
- Interrupt Automated Epilogue (IAE) in hardware: Shadow Register Sets remove the need to restore GPRs, and IAE removes the need to restore specific Control Registers when returning from an interrupt.
- Allow interrupt chaining. When servicing an interrupt and interrupt chaining is enabled, there is no need to return from the current Interrupt Service Routine (ISR) if there is another valid interrupt pending to be serviced. The control of the processor can jump directly from the current ISR to the next ISR without IAE and IAP.

#### **GPR Shadow Registers**

The MIPS32 Architecture optionally removes the need to save and restore GPRs on entry to high-priority interrupts or exceptions, and to provide specified processor modes with the same capability. This is done by introducing multiple copies of the GPRs, called *shadow sets*, and allowing privileged software to associate a shadow set with entry to kernel mode via an interrupt vector or exception. The normal GPRs are logically considered shadow set zero.

The number of GPR shadow sets is a build-time option. The microAptiv UP core allows 1 (the normal GPRs), 2, 4, 8, or 16 shadow sets. The highest number actually implemented is indicated by the *SRSCt/HSS* field. If this field is zero, only the normal GPRs are implemented.

Shadow sets are new copies of the GPRs that can be substituted for the normal GPRs on entry to kernel mode via an interrupt or exception. When a shadow set is bound to a kernel-mode entry condition, references to GPRs operate exactly as one would expect, but they are redirected to registers that are dedicated to that condition. Privileged software may need to reference all GPRs in the register file, even specific shadow registers that are not visible in the current mode, and the RDPGPR and WRPGPR instructions are used for this purpose. The *CSS* field of the *SRSCtl* register provides the number of the current shadow register set, and the *PSS* field of the *SRSCtl* register provides the number of the previous shadow register set that was current before the last exception or interrupt occurred.

If the processor is operating in VI interrupt mode, binding of a vectored interrupt to a shadow set is done by writing to the *SRSMap* register. If the processor is operating in EIC interrupt mode, the binding of the interrupt to a specific shadow set is provided by the external interrupt controller and is configured in an implementation-dependent way. Binding of an exception or non-vectored interrupt to a shadow set is done by writing to the *ESS* field of the *SRSCtl* register. When an exception or interrupt occurs, the value of *SRSCtl*<sub>CSS</sub> is copied to *SRSCtl*<sub>PSS</sub>, and *SRSCtl*<sub>CSS</sub> is set to the value taken from the appropriate source. On an ERET, the value of *SRSCtl*<sub>PSS</sub> is copied back into *SRSCtl*<sub>CSS</sub> to restore the shadow set of the mode to which control returns.

Refer to Chapter 6, "CP0 Registers of the microAptiv<sup>TM</sup> UP Core" on page 135 for more information on the CP0 registers. Refer to Chapter 10, "EJTAG Debug Support in the microAptiv<sup>TM</sup> UP Core" on page 230 for more information on EJTAG debug registers.

#### 1.2.1.5 Memory Management Unit (MMU)

#### Modes of Operation

The microAptiv UP core implements three modes of operation:

- *User mode* is most often used for applications programs.
- *Kernel mode* is typically used for handling exceptions and operating-system kernel functions, including CP0 management and I/O device accesses.
- *Debug mode* is used during system bring-up and software development. Refer to the EJTAG section for more information on debug mode.

Figure 1.2 shows the virtual address map of the MIPS Architecture.

| 0xFFFFFFFF               |                                                  | )           |
|--------------------------|--------------------------------------------------|-------------|
|                          | Memory Mapped                                    |             |
| 0xFF400000<br>0xFF3FFFFF | Memory/EJTAG <sup>1</sup>                        | ,<br>≻kseg3 |
| 0xFF200000<br>0xF1FFFFFF | Memory/EditAd                                    | l           |
| 0xE0000000               | Memory Mapped                                    |             |
| 0xDFFFFFFF               | Kernel Virtual Address Space                     | kseg2       |
| 0xC0000000               | Mapped, 512 MB                                   |             |
| 0xBFFFFFFF               | Kernel Virtual Address Space<br>Unmapped, 512 MB | kseg1       |
| 0xA0000000               | Uncached                                         |             |
| 0x9FFFFFFF               | Kernel Virtual Address Space<br>Unmapped, 512 MB | kseg0       |
| 0x80000000               |                                                  |             |
| 0x7FFFFFF                | User Virtual Address Space<br>Mapped, 2048 MB    | kuseg       |
| 0x00000000               |                                                  | l           |
| 1 This space i           | e manned to momeny in year or karnal             | modo        |

#### Figure 1.2 microAptiv<sup>™</sup> UP Core Virtual Address Map

1. This space is mapped to memory in user or kernel mode, and by the EJTAG module in debug mode.

#### Memory Management Unit (MMU)

The microAptiv UP core offers one of the two choices of MMU that interfaces between the execution unit and the cache controller, namely Translation Lookaside Buffer (TLB) and Fixed Mapping Translation (FMT).

• Fixed Mapping Translation (FMT)

A FMT is smaller and simpler than a TLB. Like a TLB, the FMT performs virtual-to-physical address translation and provides attributes for the different segments. Those segments that are unmapped in a TLB implementation (kseg0 and kseg1) are translated identically by the FMT.

• Translation Lookaside Buffer (TLB)

A TLB-based MMU consists of three translation buffers: a 16 or 32 dual-entry fully associative Joint TLB (JTLB), a 4-entry fully associative Instruction TLB (ITLB), and a 4-entry fully associative data TLB (DTLB).

When an instruction address is calculated, the virtual address is compared to the contents of the 4-entry ITLB. If the address is not found in the ITLB, the JTLB is accessed. If the entry is found in the JTLB, that entry is then written into the ITLB. If the address is not found in the JTLB, a TLB refill exception is taken.

When a data address is calculated, the virtual address is compared to both the 4-entry DTLB and the JTLB. If the address is not found in the DTLB, but is found in the JTLB, that address is immediately written to the DTLB. If the address is not found in the JTLB, a TLB refill exception is taken.

The microAptiv UP core TLB allows pages to be protected by a read-inhibit and an execute-inhibit attribute in addition to the write-protection attribute defined by the MIPS32 PRA.

Figure 1.3 shows how the FMT is implemented in the microAptiv UP core.





Figure 1.4 shows how the ITLB, DTLB, and JTLB are implemented in the microAptiv UP core.



#### Figure 1.4 Address Translation During a Cache Access with TLB Implementation

The TLB consists of three address translation buffers:

- 1. 16 dual-entry fully associative Joint TLB (JTLB)
- 2. 4-entry fully associative Instruction TLB (ITLB)
- 3. 4-entry fully associative Data TLB (DTLB)
- Joint TLB (JTLB)

The microAptiv UP core implements a 16 or 32 dual-entry, fully associative JTLB that maps 32 virtual pages to their corresponding physical addresses. The purpose of the TLB is to translate virtual addresses and their corresponding ASIDs into a physical memory address. The translation is performed by comparing the upper bits of the virtual address (along with the ASID) against each of the entries in the *tag* portion of the joint TLB structure.

The JTLB is organized as pairs of even and odd entries containing pages that range in size from 4-Kbytes (or 1-Kbyte) to 256-Mbytes into the 4-Gbyte physical address space. By default, the minimum page size is normally 4-Kbytes on the microAptiv UP core; as a build time option, it is possible to specify a minimum page size of 1-Kbyte.

The JTLB is organized in page pairs to minimize the overall size. Each *tag* entry corresponds to 2 data entries: an even page entry and an odd page entry. The highest order virtual address bit not participating in the tag comparison is used to determine which of the data entries is used. Since page size can vary on a page-pair basis, the determination of which address bits participate in the comparison and which bit is used to make the even-odd determination is decided dynamically during the TLB lookup.

• Instruction TLB (ITLB)

The ITLB is a small 4-entry, fully associative TLB dedicated to performing translations for the instruction stream. The ITLB only maps minimum sized pages/subpages. The minimum page size is either 1-Kbyte or 4-Kbyte, depending on the *PageGrain* and *Config3* registers.

The ITLB is managed by hardware and is transparent to software. The larger JTLB is used as a backing store for the ITLB. If a fetch address cannot be translated by the ITLB, the JTLB is used to attempt to translate it in the following clock cycle. If successful, the translation information is copied into the ITLB for future use. There is a two-cycle ITLB miss penalty.

• Data TLB (DTLB)

The DTLB is a small 4-entry, fully associative TLB dedicated to performing translations for loads and stores. Similar to the ITLB, the DTLB only maps either 1-Kbyte or 4-Kbyte pages/subpages depending on the *PageGrain* and *Config3* registers.

The DTLB is managed by hardware and is transparent to software. The larger JTLB is used as a backing store for the DTLB. The JTLB is looked-up in parallel with the DTLB to minimize the DTLB miss penalty. If the JTLB translation is successful, the translation information is copied into the DTLB for future use. There is a one cycle DTLB miss penalty.

#### 1.2.1.6 Cache Controllers

The microAptiv UP core instruction and data cache controllers support caches of various sizes, organizations, and set-associativity. For example, the data cache can be 2 Kbytes in size and 2-way set associative, while the instruction cache can be 8 Kbytes in size and 4-way set associative. Each cache can each be accessed in a single processor cycle. In addition, each cache has its own 32-bit data path, and both caches can be accessed in the same pipeline clock cycle. Refer to "Optional Logic Blocks" on page 33 for more information on instruction and data cache organization.

The cache controllers also have built-in support for replacing one way of the cache with a scratchpad RAM. See the section entitled "Scratchpad RAM" on page 35 for more information on scratchpad RAMs.

Each cache controller contains and manages a one-line fill buffer. Besides accumulating data to be written to the cache, the fill buffer is accessed in parallel with the cache and data can be bypassed back to the core.

Refer to Chapter 8, "Caches of the microAptiv<sup>TM</sup> UP Core" on page 218 for more information on the instruction and data cache controllers.

#### 1.2.1.7 Bus Interface Unit (BIU)

The Bus Interface Unit (BIU) serves as the interface between the microAptiv UP core and the outside world. Primarily, the BIU receives read/write requests from the cache controller. These requests will be arbitrated and turned into bus transactions via the AMBA-3 AHB-lite protocol. The characteristics of the BIU are:

- AHB-Lite is a subset of the AHB bus protocol that supports a single bus master. It does not support complex Split/Retry operations.
- Shared 32-bit read/write address bus
- Two unidirectional 32-bit data buses for read and write operations
- Single read/write and burst (WRAP mode) read/write are supported.

#### 1.2.1.8 Power Management

The microAptiv UP core offers a number of power management features, including low-power design, active power management, and power-down modes of operation. The core is a static design that supports slowing or halting the clocks, which reduces system power consumption during idle periods.

The microAptiv UP core provides two mechanisms for system-level low-power support:

• Register-controlled power management

• Instruction-controlled power management

#### **Register-Controlled Power Management**

The *RP* bit in the CP0 *Status* register provides a software mechanism for placing the system into a low-power state. The state of the *RP* bit is available externally via the *SI\_RP* signal. The external agent then decides whether to place the device in a low-power mode, such as reducing the system clock frequency.

Three additional bits, *StatusEXL*, *StatusERL*, and *DebugDM* support the power management function by allowing the user to change the power state if an exception or error occurs while the microAptiv UP core is in a low-power state. Depending on what type of exception is taken, one of these three bits will be asserted and reflected on the *SI\_EXL*, *SI\_ERL*, or *EJ\_DebugM* outputs. The external agent can look at these signals and determine whether to leave the low-power state to service the exception.

The following four power-down signals are part of the system interface and change state as the corresponding bits in the CP0 registers are set or cleared:

- The SI\_RP signal represents the state of the RP bit (27) in the CP0 Status register.
- The SI\_EXL signal represents the state of the EXL bit (1) in the CP0 Status register.
- The SI\_ERL signal represents the state of the ERL bit (2) in the CP0 Status register.
- The *EJ\_DebugM* signal represents the state of the DM bit (30) in the CP0 *Debug* register.

#### Instruction-Controlled Power Management

The second mechanism for invoking power-down mode is by executing the WAIT instruction. When the WAIT instruction is executed, the internal clock is suspended; however, the internal timer and some of the input pins (*SI\_Int[5:0]*, *SI\_NMI*, *SI\_Reset*, and *SI\_ColdReset*) continue to run. When the CPU is in instruction-controlled power management mode, any interrupt, NMI, or reset condition causes the CPU to exit this mode and resume normal operation.

The microAptiv UP core asserts the  $SI_Sleep$  signal, which is part of the system interface bus, whenever the WAIT instruction is executed. The assertion of  $SI_Sleep$  indicates that the clock has stopped and the microAptiv UP core is waiting for an interrupt.

#### Local clock gating

The majority of the power consumed by the microAptiv UP core is in the clock tree and clocking registers. The core has support for extensive use of local gated clocks. Power-conscious implementors can use these gated clocks to significantly reduce power consumption within the core.

Refer to Chapter 9, "Power Management of the microAptiv<sup>™</sup> UP Core" on page 228 for more information on power management.

#### **1.2.2 Optional Logic Blocks**

The core consists of the following optional logic blocks as shown in the block diagram in Figure 1.1.

#### 1.2.2.1 Instruction Cache

The instruction cache is an optional on-chip memory block of up to 64 Kbytes. Because the instruction cache is virtually indexed, the virtual-to-physical address translation occurs in parallel with the cache access rather than having to wait for the physical address translation. The tag holds 22 bits of physical address, a valid bit, and a lock bit. The LRU replacement bits (0-6b per set depending on associativity) are stored in a separate array.

The instruction cache block also contains and manages the instruction line fill buffer. Besides accumulating data to be written to the cache, instruction fetches that reference data in the line fill buffer are serviced either by a bypass of that data, or data coming from the external interface. The instruction cache control logic controls the bypass function.

The microAptiv UP core supports instruction-cache locking. Cache locking allows critical code or data segments to be locked into the cache on a "per-line" basis, enabling the system programmer to maximize the efficiency of the system cache.

The cache-locking function is always available on all instruction-cache entries. Entries can then be marked as locked or unlocked on a per entry basis using the CACHE instruction.

#### 1.2.2.2 Data Cache

The data cache is an optional on-chip memory block of up to 64 Kbytes. This virtually indexed, physically tagged cache is protected. Because the data cache is virtually indexed, the virtual-to-physical address translation occurs in parallel with the cache access. The tag holds 22 bits of physical address, a valid bit, and a lock bit. There is an additional array holding dirty bits and LRU replacement algorithm bits (0-6b depending on associativity) for each set of the cache.

In addition to instruction-cache locking, the microAptiv UP core also supports a data-cache locking mechanism identical to the instruction cache. Critical data segments are locked into the cache on a "per-line" basis. The locked contents can be updated on a store hit, but cannot be selected for replacement on a cache miss.

The cache-locking function is always available on all data cache entries. Entries can then be marked as locked or unlocked on a per-entry basis using the CACHE instruction.

#### **Cache Memory Configuration**

The microAptiv UP core incorporates on-chip instruction and data caches that can each be accessed in a single processor cycle. Each cache has its own 32-bit data path and can be accessed in the same pipeline clock cycle. Table 1.1 lists the microAptiv UP core instruction and data cache attributes.

| Parameter    | Instruction                    | Data                           |
|--------------|--------------------------------|--------------------------------|
| Size         | 0 - 64 Kbytes                  | 0 - 64 Kbytes                  |
| Organization | 1 - 4 way set asso-<br>ciative | 1 - 4 way set asso-<br>ciative |
| Line Size    | 16 bytes                       | 16 bytes                       |
| Read Unit    | 32 bits                        | 32 bits                        |

#### **Table 1.1 Instruction and Data Cache Attributes**

| Parameter                      | Instruction | Data                                                                                                                        |
|--------------------------------|-------------|-----------------------------------------------------------------------------------------------------------------------------|
| Write Policies                 | NA          | write-through with<br>write allocate,<br>write-through<br>without write allo-<br>cate,<br>write-back with<br>write allocate |
| Miss restart after transfer of | miss word   | miss word                                                                                                                   |
| Cache Locking                  | per line    | per line                                                                                                                    |

#### Table 1.1 Instruction and Data Cache Attributes (Continued)

#### **Cache Protocols**

The microAptiv UP core supports the following cache protocols:

- Uncached: Addresses in a memory area indicated as uncached are not read from the cache. Stores to such addresses are written directly to main memory, without changing cache contents.
- Write-through, no write allocate: Loads and instruction fetches first search the cache, reading main memory only if the desired data does not reside in the cache. On data store operations, the cache is first searched to see if the target address is cache resident. If it is resident, the cache contents are updated, and main memory is also written. If the cache lookup misses, only main memory is written.
- Write-through, write allocate: Similar to above, but stores missing in the cache will cause a cache refill. The store data is then written to both the cache and main memory.
- Write-back, write allocate: Stores that miss in the cache will cause a cache refill. Store data, however, is only written to the cache. Caches lines that are written by stores will be marked as dirty. If a dirty line is selected for replacement, the cache line will be written back to main memory.

#### 1.2.2.3 Scratchpad RAM

The microAptiv UP core also supports replacing up to one way of each cache with a scratchpad RAM. Scratchpad RAM is accessed via independent external pin interfaces for instruction and data scratchpads. The external block which connects to a scratchpad interface is user-defined and can consist of a variety of devices. The main requirement is that it must be accessible with timing similar to an internal cache RAM. Normally, this means that an index will be driven one cycle, a tag will be driven the following clock, and the scratchpad must return a hit signal and the data in the second clock. The scratchpad can easily contain a large RAM/ROM or memory-mapped registers. Unlike the fixed single-cycle cache timing, however, the scratchpad interface can also accommodate back-stalling the core pipeline if data is not available in a single clock. This back-stalling capability can be useful for operations which require multi-cycle latency. It can also be used to enable arbitration of external accesses to a shared scratchpad memory.

The core's functional interface to a scratchpad RAM is slightly different from the interface to a regular cache RAM. Additional index bits allow access to a larger array, with 1MB of scratchpad RAM versus 4KB for a cache way. These bits come from the virtual address, so on a microAptiv UP core care must be taken to avoid virtual aliasing. The core does not automatically refill the scratchpad way and will not select it for replacement on cache misses.

#### 1.2.2.4 microMIPS™ ISA

The microAptiv UP core supports the microMIPS ISA, which contains all MIPS32 ISA instructions (except for branch-likely instructions) in a new 32-bit encoding scheme, with some of the commonly used instructions also available in 16-bit encoded format. This ISA improves code density through the additional 16-bit instructions while maintaining a performance similar to MIPS32 mode. In microMIPS mode, 16-bit or 32-bit instructions will be fetched and recoded to legacy MIPS32 instruction opcodes in the pipeline's I stage, so that the microAptiv UP core can have the same M14Kc microarchitecture. Because the microMIPS instruction stream can be intermixed with 16-bit halfword or 32-bit word size instructions on halfword or word boundaries, additional logic is in place to address the word misalignment issues, thus minimizing performance loss.

Depending on the optimization preference when both MIPS32 and microMIPS ISAs are configured, the microMIPS can be configured in performance mode, with multiple recoding blocks being executed in parallel with Tag compare for each Way Associativity, or with a single recoding block after the Tag compare logic to improve area usage.

#### 1.2.2.5 DSP Module

The microAptiv UP core implements an optional DSP Module to benefit a wide range of DSP, Media, and DSP-like algorithms. The DSP module is highly integrated with the Execution Unit and the MDU in order to share common logic and to include support for operations on fractional data types, saturating arithmetic, and register SIMD operations. Fractional data types Q15 and Q31 are supported. Register SIMD operations can perform up to four simultaneous add, subtract, or shift operations and two simultaneous multiply operations.

In addition, the DSP Module includes some key features that efficiently address specific problems often encountered in DSP applications. These include, for example, support for complex multiply, variable-bit insert and extract, and implementation and use of virtual circular buffers. The extension also makes available three additional sets of HI-LO accumulators to better facilitate common accumulate functions such as filter operation and convolutions.

#### 1.2.2.6 Coprocessor 2 Interface

The microAptiv UP core can be configured to have an interface for an on-chip coprocessor. This coprocessor can be tightly coupled to the processor core, allowing high-performance solutions integrating a graphics accelerator or DSP, for example.

The coprocessor interface is extensible and standardized on MIPS cores, allowing for design reuse. The microAptiv UP core supports a subset of the full coprocessor interface standard: 32b data transfer, no Coprocessor 1 support, single issue in-order data transfer to coprocessor, and one out-of-order data transfer from coprocessor.

The coprocessor interface is designed to ease integration with customer IP. The interface allows high-performance communication between the core and coprocessor. There are no late or critical signals on the interface.

Refer to Chapter 12, "microAptiv<sup>™</sup> UP Processor Core Instructions" on page 310 for more information on the Coprocessor 2 supported instructions.

#### 1.2.2.7 CorExtend® User-defined Instruction Extensions

An optional CorExtend User-defined Instruction (UDI) block enables the implementation of a small number of application-specific instructions that are tightly coupled to the core's execution unit. The interface to the UDI block is external to the microAptiv UP core.

Such instructions may operate on a general-purpose register, immediate data specified by the instruction word, or local state stored within the UDI block. The destination may be a general-purpose register or local UDI state. The operation may complete in one cycle or multiple cycles, if desired.

Refer to Table 12.3 "Special2 Opcode Encoding of Function Field" for a specification of the opcode map available for user-defined instructions.

### 1.2.2.8 EJTAG Debug Support

The microAptiv UP core provides for an Enhanced JTAG (EJTAG) interface for use in the software debug of application and kernel code. In addition to standard user and kernel modes of operation, the microAptiv UP core provides a Debug mode that is entered after a debug exception (derived from a hardware breakpoint, single-step exception, etc.) is taken and continues until a debug exception return (DERET) instruction is executed. During this time, the processor executes the debug exception-handler routine.

The EJTAG interface operates through the Test Access Port (TAP), a serial communication port used for transferring test data in and out of the microAptiv UP core. In addition to the standard JTAG instructions, special instructions defined in the EJTAG specification specify which registers are selected and how they are used.

### **Debug Registers**

Four debug registers (*DEBUG*, *DEBUG*2, *DEPC*, and *DESAVE*) have been added to the MIPS Coprocessor 0 (CP0) register set. The *DEBUG* and *DEBUG*2 registers show the cause of the debug exception and are used for setting up single-step operations. The *DEPC* (Debug Exception Program Counter) register holds the address on which the debug exception was taken, which is used to resume program execution after the debug operation finishes. Finally, the *DESAVE* (Debug Exception Save) register enables the saving of general-purpose registers used during execution of the debug exception handler.

To exit debug mode, a Debug Exception Return (DERET) instruction is executed. When this instruction is executed, the system exits debug mode, allowing normal execution of application and system code to resume.

### EJTAG Hardware Breakpoints

There are several types of *simple* hardware breakpoints defined in the EJTAG specification. These stop the normal operation of the CPU and force the system into debug mode. There are two types of simple hardware breakpoints implemented in the microAptiv UP core: Instruction breakpoints and Data breakpoints. Additionally, *complex* hardware breakpoints can be included, which allow detection of more intricate sequences of events.

The microAptiv UP core can be configured with the following breakpoint options:

- No data or instruction, or complex breakpoints
- One data and two instruction breakpoints, without complex breakpoints
- Two data and four instruction breakpoints, without complex breakpoints
- Two data and six instruction breakpoints, with or without complex breakpoints
- Four data and eight instruction breakpoints, with or without complex breakpoints

Instruction breakpoints occur on instruction execution operations, and the breakpoint is set on the virtual address.Instruction breakpoints can also be made on the ASID value used by the MMU. A mask can be applied to the virtual address to set breakpoints on a binary range of instructions.

Data breakpoints occur on load/store transactions, and the breakpoint is set on a set of virtual address and ASID values, with the same single address or binary address range as the Instruction breakpoint. Data breakpoints can be set on a load, a store, or both. Data breakpoints can also be set to match on the operand value of the load/store operation, with byte-granularity masking. Finally, masks can be applied to both the virtual address and the load/store value.

In addition, the microAptiv UP core has a configurable feature to support data and instruction address-range triggered breakpoints, where a breakpoint can occur when a virtual address is either within or outside a pair of 32-bit addresses. Unlike the traditional address-mask control, address-range triggering is not restricted to a power-of-two binary boundary.

Complex breakpoints utilize the simple instruction and data breakpoints and break when combinations of events are seen. Complex break features include:

- Pass Counters Each time a matching condition is seen, a counter is decremented. The break or trigger will only be enabled when the counter has counted down to 0.
- Tuples A tuple is the pairing of an instruction and a data breakpoint. The tuple will match if both the virtual address of the load or store instruction matches the instruction breakpoint, and the data breakpoint of the resulting load or store address and optional data value matches.
- Priming This allows a breakpoint to be enabled only after other break conditions have been met. Also called *sequential* or *armed triggering*.
- Qualified This feature uses a data breakpoint to qualify when an instruction breakpoint can be taken. When a load matches the data address and the data value, the instruction break will be enabled. If a load matches the address, but has mis-matching data, the instruction break will be disabled.

#### **Performance Counters**

Performance counters are used to accumulate occurrences of internal predefined events/cycles/conditions for program analysis, debug, or profiling. A few examples of event types are clock cycles, instructions executed, specific instruction types executed, loads, stores, exceptions, and cycles while the CPU is stalled. There are two, 32-bit counters. Each can count one of the 64 internal predefined events selected by a corresponding control register. A counter overflow can be programmed to generate an interrupt, where the interrupt-handler software can maintain larger total counts.

#### PC/Address Sampling

This sampling function is used for program profiling and hot-spots analysis. Instruction PC and/or Load/Store addresses can be sampled periodically. The result is scanned out through the EJTAG port. The *Debug Control Register (DCR)* is used to specify the sample period and the sample trigger.

#### Fast Debug Channel (FDC)

The microAptiv UP core includes an optional FDC as a mechanism for high bandwidth data transfer between a debug host/probe and a target. FDC provides a FIFO buffering scheme to transfer data serially, with low CPU overhead and minimized waiting time. The data transfer occurs in the background, and the target CPU can choose either to check the status of the transfer periodically or to be interrupted at the end of the transfer.



Figure 1.5 FDC Overview

#### iFlowtrace™

The microAptiv UP core has an option for a simple trace mechanism named iFlowtrace. This mechanism only traces the instruction PC, not data addresses or values. This simplification allows the trace block to be smaller and the trace compression to be more efficient. iFlowtrace memory can be configured as off-chip, on-chip, or both.

iFlowtrace also offers special-event trace modes when normal tracing is disabled, namely:

- Function Call/Return and Exception Tracing mode to trace the PC value of function calls and returns and/or exceptions and returns.
- Breakpoint Match mode traces the breakpoint ID of a matching breakpoint and, for data breakpoints, the PC value of the instruction that caused it.
- Filtered Data Tracing mode traces the ID of a matching data breakpoint, the load or store data value, access type and memory access size, and the low-order address bits of the memory access, which is useful when the data breakpoint is set up to match a binary range of addresses.
- User Trace Messages. The user can instrument their code to add their own 32-bit value messages into the trace by writing to the Cop0 UTM register.
- Delta Cycle mode works in combination with the above trace modes to provide a timestamp between stored events. It reports the number of cycles that have elapsed since the last message was generated and put into the trace.

Refer to Chapter 10, "EJTAG Debug Support in the microAptiv<sup>™</sup> UP Core" on page 230 for more information on the EJTAG features.

#### cJTAG Support

The microAptiv UP core provides an external conversion block which converts the existing EJTAG (IEEE 1149.1) 4-wire interface at the microAptiv UP core to a cJTAG (IEEE 1149.7) 2-wire interface. cJTAG reduces the number of wires from 4 to 2 and enables the support of Star-2 scan topology in the system debug environment.



### Figure 1.6 cJTAG Support

### SecureDebug

SecureDebug improves security by disabling untrusted EJTAG debug access. An input signal is used to disable debug features, such as Probe Trap, Debug Interrupt Exception (EjtagBrk and DINT), EJTAGBOOT instruction, and PC Sampling.

Chapter 2

# Pipeline of the microAptiv<sup>™</sup> UP Core

The microAptiv UP processor core implements a 5-stage pipeline similar to the original 4KE pipeline. The pipeline allows the processor to achieve high frequency while minimizing device complexity, reducing both cost and power consumption. This chapter contains the following sections:

- Section 2.1 "Pipeline Stages"
- Section 2.2 "Instruction Cache Miss"
- Section 2.3 "Data Cache Miss"
- Section 2.4 "Multiply/Divide Operations"
- Section 2.5 "MDU Pipeline with DSP Module Enabled"
- Section 2.6 "MDU Pipeline High-performance MDU with DSP Module Disabled"
- Section 2.7 "MDU Pipeline Area-Efficient MDU with DSP Module Disabled"
- Section 2.8 "Branch Delay"
- Section 2.9 "Data Bypassing"
- Section 2.11 "Interlock Handling"
- Section 2.12 "Slip Conditions"
- Section 2.13 "Instruction Interlocks"
- Section 2.14 "Hazards"

## 2.1 Pipeline Stages

The microAptiv UP core implements a 5-stage pipeline with a performance similar to the M14Kc pipeline. The pipeline allows the processor to achieve high frequency while minimizing device complexity, reducing both cost and power consumption.

The microAptiv UP core pipeline consists of five stages:

- Instruction (I Stage)
- Execution (E Stage)
- Memory (M Stage)

- Align (A Stage)
- Writeback (W stage)

The microAptiv UP core implements a bypass mechanism that allows the result of an operation to be forwarded directly to the instruction that needs it without having to write the result to the register and then read it back.

The microAptiv UP soft core includes a build-time option that determines the type of multiply/divide unit (MDU) implemented. The MDU can be either a high-performance 32x16 multiplier array or an iterative, area-efficient array when the DSP Module configuration is not selected. The MDU choice has a significant effect on the MDU pipeline, and the latency of multiply/divide instructions executed on the core. Software can query the type of MDU present on a specific implementation of the core by querying the MDU bit in the Config register (CP0 register 16, select 0); see Chapter 6, "CP0 Registers of the microAptiv<sup>TM</sup> UP Core" on page 135 for more details. When the DSP Module configuration is selected, the multiply/divide unit (MDU) of the microAptiv UP soft core is always implemented with a fully pipelined 32x32 multiplier array for maximum performance.

The microAptiv UP has a build-time MMU type option between TLB or FMT implementation.

Figure 2.1 shows the operations performed in each pipeline stage of the microAptiv UP processor, when the high-performance multiplier is present when the DSP Module is disabled.





Figure 2.2 shows the operations performed in each pipeline stage of the microAptiv UP processor core, when the area-efficient multiplier is present when the DSP Module is disabled.



#### Figure 2.2 microAptiv<sup>™</sup> UP Core Pipeline Stages with area-efficient MDU

Figure 2.3 shows the operations performed in each pipeline stage of the microAptiv UP processor when the DSP Module is enabled.

### Figure 2.3 microAptiv™ UP Core Pipeline Stages with optional TLB MMU



### 2.1.1 I Stage: Instruction Fetch

During the Instruction fetch stage:

- An instruction is fetched from the instruction cache.
- The I-TLB performs a virtual-to-physical address translation.
- If both MIPS32 and microMIPS ISAs are supported, microMIPS instructions are converted to MIPS32-like instructions. If the MIPS32 ISA is not supported, 16-bit microMIPS instructions will be first recoded into 32-bit microMIPS equivalent instructions, and then decoded in native microMIPS ISA format.

## 2.1.2 E Stage: Execution

During the Execution stage:

- Operands are fetched from the register file.
- Operands from the M and A stage are bypassed to this stage.
- The Arithmetic Logic Unit (ALU) begins the arithmetic or logical operation for register-to-register instructions.
- The ALU calculates the data virtual address for load and store instructions.
- The ALU determines whether the branch condition is true and calculates the virtual branch target address for branch instructions.
- Instruction logic selects an instruction address.
- All multiply and divide operations begin in this stage.

## 2.1.3 M Stage: Memory Fetch

During the Memory fetch stage:

- The arithmetic ALU operation completes.
- The data cache access and the data virtual-to-physical address translation are performed for load and store instructions.
- Data cache lookup is performed and a hit/miss determination is made.
- A 16x16, 32x16 or 32x32 multiply calculation completes (with DSP configuration).
- A 32x32 multiply operation stalls the MDU pipeline for one clock in the M stage (high-performance MDU option without DSP configuration).
- A multiply operation stalls the MDU pipeline for 31 clocks in the M stage (area-efficient MDU option without DSP configuration).
- A multiply-accumulate operation stalls the MDU pipeline for 33 clocks in the M stage (area-efficient MDU option without DSP configuration).
- A divide operation stalls the MDU pipeline for a maximum of 38 clocks in the M stage. Early-in sign extension detection on the dividend will skip 7, 15, or 23 stall clocks (only the divider in the fast MDU option supports early-in detection).

### 2.1.4 A Stage: Align

During the Align stage:

- Load data is aligned to its word boundary.
- A multiply/divide operation updates the HI/LO registers (area-efficient MDU option).

- Multiply operation performs the carry-propagate-add. The actual register writeback is performed in the W stage (high-performance MDU option).
- A MUL operation makes the result available for writeback. The actual register writeback is performed in the W stage.
- EJTAG complex break conditions are evaluated.

### 2.1.5 W Stage: Writeback

During the Writeback stage:

• For register-to-register or load instructions, the result is written back to the register file.

## 2.2 Instruction Cache Miss

When the instruction cache is indexed, the instruction address is translated to determine if the required instruction resides in the cache. An instruction cache miss occurs when the requested instruction address does not reside in the instruction cache. When a cache miss is detected in the I stage, the core transitions to the E stage. The pipeline stalls in the E stage until the miss is resolved. The bus interface unit must select the address from multiple sources. If the address bus is busy, the request will remain in this arbitration stage (B-ASel in Figure 2.4) until the bus is available. The core drives the selected address onto the bus. The number of clocks before data is returned is then determined by the array containing the data.

When the data is returned to the core, the critical word is written to the instruction register for immediate use. The bypass mechanism allows the core to use the data as soon as it arrives, as opposed to having the entire cache line written to the instruction cache, then reading out the required word.

Figure 2.4 shows a timing diagram of an instruction cache miss.



#### Figure 2.4 Instruction Cache Miss Timing

\* Contains all of the cycles that address and data are utilizing the bus.

## 2.3 Data Cache Miss

When the data cache is indexed, the data address is translated to determine if the required data resides in the cache. A data cache miss occurs when the requested data address does not reside in the data cache.

When a data cache miss is detected in the M stage (D-TLB), the core transitions to the A stage. The pipeline stalls in the A stage until the miss is resolved (requested data is returned). The bus interface unit arbitrates between multiple requests and selects the correct address to be driven onto the bus (B-ASel in Figure 2.5). The core drives the selected

address onto the bus. The number of clocks before data is returned is then determined by the array containing the data.

When the data is returned to the core, the critical word of data passes through the aligner before being forwarded to the execution unit. The bypass mechanism allows the core to use the data as soon as it arrives, as opposed to having the entire cache line written to the data cache, then reading out the required word.

Figure 2.5 shows a timing diagram of a data cache miss.

Figure 2.5 Load/Store Cache Miss Timing



\* Contains all of the time that address and data are utilizing the bus.

## 2.4 Multiply/Divide Operations

The microAptiv UP core implements the standard MIPS II<sup>™</sup> multiply and divide instructions. Additionally, several new instructions were standardized in the MIPS32 architecture for enhanced performance.

The targeted multiply instruction, MUL, specifies that multiply results be placed in the general-purpose register file instead of the HI/LO register pair. By avoiding the explicit MFLO instruction, required when using the LO register, and by supporting multiple destination registers, the throughput of multiply-intensive operations is increased.

Four instructions, multiply-add (MADD), multiply-add-unsigned (MADDU), multiply-subtract (MSUB), and multiplyply-subtract-unsigned (MSUBU), are used to perform the multiply-accumulate and multiply-subtract operations. The MADD/MADDU instruction multiplies two numbers and then adds the product to the current contents of the HI and LO registers. Similarly, the MSUB/MSUBU instruction multiplies two operands and then subtracts the product from the HI and LO registers. The MADD/MADDU and MSUB/MSUBU operations are commonly used in DSP algorithms.

All multiply operations (except the MUL instruction) write to the HI/LO register pair. All integer operations write to the general purpose registers (GPR). Because MDU operations write to different registers than integer operations, integer instructions that follow can execute before the MDU operation has completed. The MFLO and MFHI instructions are used to move data from the HI/LO register pair to the GPR file. If an MFLO or MFHI instruction is issued before the MDU operation completes, it will stall to wait for the data.

## 2.5 MDU Pipeline with DSP Module Enabled

The microAptiv UP processor core contains a high-performance Multiply-Divide Unit (MDU) and a DSP unit to handle integer multiply, divide, and DSP Module instructions. The autonomous multiply/divide unit (MDU) has a separate pipeline for multiply and divide operations. This pipeline operates in parallel with the integer unit (ALU) pipeline and does not stall when the ALU pipeline stalls. This allows multi-cycle MDU operations, such as a divide, to be partially masked by system stalls and/or other integer unit instructions.

The following subsections describe the MDU pipeline in more detail.

### 2.5.1 MDU

The high-performance MDU consists of a 32x32 Booth-recoded multiplier array, separate carry-lookahead adders for multiply and divide, result/accumulation registers (*HI* and *LO*), multiply and divide state machines, and all necessary multiplexers and control logic.

Due to the multiplier array, the high-performance MDU supports execution of a multiply operation every clock cycle. Divide operations are implemented with a simple 1 bit-per-clock iterative algorithm with an early in detection of sign extension on the dividend (*rs*). An attempt to issue a subsequent MDU instruction which would access the *HI* or *LO* register before the divide completes causes a delay in starting the subsequent MDU instruction. Some concurrency is enabled by the separate adders for the multiply and divide data paths. The MDU instruction may start executing when the divide is ensured of writing to the *HI* and *LO* registers before the MDU instruction will access them. A MUL instruction, which does not access the *HI* or *LO* register, may start executing anytime relative to a previous divide instruction.

Table 2.1 lists the number of stall cycles incurred between two dependent instructions. A stall of 0 clock cycles means that the first and second instructions can be issued back-to-back in the code, without the MDU causing any stalls in the ALU pipeline.

| Size of Operand                | Instructio                                  | Delay                                   |                   |
|--------------------------------|---------------------------------------------|-----------------------------------------|-------------------|
| 1st Instruction <sup>[1]</sup> | 1st Instruction                             | 2nd Instruction                         | Clocks            |
| 32 bit                         | MULT/MULTU,<br>MADD/MADDU, or<br>MSUB/MSUBU | MADD/MADDU,<br>MSUB/MSUBU, or MFHI/MFLO | 0                 |
| 32 bit                         | MUL                                         | Integer operation <sup>[1]</sup>        | 3                 |
| 8 bit                          | DIVU                                        | MFHI/MFLO                               | 8                 |
| 16 bit                         | DIVU MFHI/MFLO                              |                                         | 16                |
| 24 bit                         | 24 bit DIVU MFHI/MFLO                       |                                         | 24                |
| 32 bit                         | DIVU                                        | MFHI/MFLO                               | 32                |
| 8 bit                          | DIV                                         | MFHI/MFLO                               | 10 <sup>[2]</sup> |
| 16 bit                         | DIV                                         | MFHI/MFLO                               | 18 <sup>[2]</sup> |
| 24 bit                         | DIV                                         | MFHI/MFLO                               | 26 <sup>[2]</sup> |
| 32 bit                         | DIV                                         | MFHI/MFLO                               | 34 <sup>[2]</sup> |
| any                            | MFHI/MFLO                                   | Integer operation <sup>[1]</sup>        | 1                 |
| any                            | any MTHI/MTLO MADD/MADDU,<br>MSUB/MSUBU     |                                         | 1                 |
| any                            | MTHI/MTLO                                   | MFHI/MFLO                               | 1                 |

### Table 2.1 High-performance MDU Stalls

| Size of Operand                | Instruction     | Delay                                                                         |  |
|--------------------------------|-----------------|-------------------------------------------------------------------------------|--|
| 1st Instruction <sup>[1]</sup> | 1st Instruction | tion 2nd Instruction                                                          |  |
|                                | , e             | ses the result of a previous MDU op<br>stages are bypassed. Delay is then the |  |

#### Table 2.1 High-performance MDU Stalls (Continued)

### 2.5.2 DSP Module Instruction Latencies

The microAptiv UP processor core includes support for DSP Module. Logic for these instructions is located primarily in the ALU and MDU blocks. Any DSP instructions accessing the accumulators or performing multiplication are implemented in the MDU. All others are implemented in the ALU. In addition to the "normal" MIPS32 HI/LO accumulator, the DSP Module introduces three additional HI/LO accumulator pairs.

The latency and repeat rate for the BPOSGE32 instruction is similar to those for a MIPS32 conditional branch instruction. However, unlike a MIPS32 conditional branch instruction, BPOSGE32 is dependent on *DSPControl.Pos* and not on a GPR. The LHX and LWX instructions are treated as non-blocking loads by the core; they have dependencies on the index and base registers. The delay and repeat rates for other DSP instructions are shown in the following tables. The 'delay' in Table 2.2 is in terms of pipeline clocks and refers to the number of cycles the pipeline must stall the second instruction in order to wait for the result of the first instruction. A delay of zero means that the first and second instructions can be issued back-to-back without stalling the pipeline. A delay of one means that if issued back-to-back, the pipeline will stall for one cycle.

|                            | Instruction                                                                                                    | Delay                                                   |        |
|----------------------------|----------------------------------------------------------------------------------------------------------------|---------------------------------------------------------|--------|
| Dependency on <sup>1</sup> | 1st Instruction                                                                                                | 2nd Instruction                                         | Clocks |
| GPR                        | MUL*, EXT*, MFHI, MFLO<br>(multiplies or HI/LO reads that<br>write to a GPR)                                   | Instruction with GPR input                              | 3      |
| GPR                        | Other (ALU) DSP instruction<br>with GPR result                                                                 | Instruction with GPR input                              | 0      |
| HI/LO                      | DPAQ*, DPSQ*, MULSAQ*,<br>MAQ*, MADD*, MSUB*,<br>MULT*, MTHI, MTLO, MTTR,<br>SHILO*, MTHLIP<br>(HI/LO writes)  | MFHI, MFLO, MFTR<br>(HI/LO reads)                       | 1      |
| HI/LO                      | *_SA<br>(MAC's that saturates after accu-<br>mulate)                                                           | DPAQ*, DPSQ*, MULSAQ*,<br>MAQ*, MADD*, MSUB*<br>(MAC's) | 1      |
| HI/LO                      | DPAQ_S.*, DPSQ_S.*, MUL-<br>SAQ*, MAQ_S.*, MADD*,<br>MSUB*<br>(MAC's that do not saturate after<br>accumulate) | DPAQ*, DPSQ*, MULSAQ*,<br>MAQ*, MADD*, MSUB*<br>(MAC's) | 1      |
| HI/LO                      | MTHI, MTLO, MTTR, SHILO*,<br>MTHLIP<br>(HI/LO writes that are not multi-<br>plies)                             | DPAQ*, DPSQ*, MULSAQ*,<br>MAQ*, MADD*, MSUB*<br>(MAC's) | 1      |

### Table 2.2 DSP Instruction Delays

|                            | Instruction                                                                                                         | Sequence                       | Delay  |
|----------------------------|---------------------------------------------------------------------------------------------------------------------|--------------------------------|--------|
| Dependency on <sup>1</sup> | 1st Instruction                                                                                                     | 2nd Instruction                | Clocks |
| HI/LO                      | DPAQ*, DPSQ*, MULSAQ*,<br>MAQ*, MADD*, MSUB*,<br>MULT*, MTHI, MTLO, MTTR,<br>EXT*, SHILO*, MTHLIP<br>(HI/LO writes) | EXT*, SHILO*<br>(HI/LO shifts) | 3      |
| HI/LO                      | DPAQ*, DPSQ*, MULSAQ*,<br>MAQ*, MADD*, MSUB*,<br>MULT*, MTHI, MTLO, MTTR,<br>SHILO*, MTHLIP<br>(HI/LO writes)       | MTHLIP                         | 3      |

#### Table 2.2 DSP Instruction Delays (Continued)

1. For dependencies on a HI/LO accumulator, the delay clocks shown assume that the 1st and 2nd instruction are operating on the same accumulator.

The delays shown in table 2.2 with a dependency on a HI/LO accumulator pair assume that the dependent instruction sequence is operating on the *same* accumulator pair. This is the worst case situation. The delay clock value can be reduced when the second instruction operates on a different accumulator. For example, consider the following sequence:

MULT (writing to accumulator 0) MADD (writing to accumulator 1) MSUB (writing to accumulator 2) EXTR (reading from accumulator n)

If the EXTR instruction is reading accumulator 2 (n=2), then a delay of 3 cycles would apply between the MSUB and EXTR operation, as indicated in Table 2.2. If the EXTR reads accumulator 1, then a delay of 2 cycles would apply between the MADD and EXTR, since there is already one unrelated instruction between the dependent ones. If the EXTR reads accumulator 0, then a delay of 1 would apply between the MULT and EXTR. Finally, if the EXTR instruction is reading accumulator 3, no delay would be incurred in the sequence.

Table 2.3 shows the repeat rates of all possible instruction sequences between two integer arithmetic, multiply, divide, or DSP instructions, with and without data dependencies.

|                         | Ins                 | truction | Repea               | it Rate |                               |                               |
|-------------------------|---------------------|----------|---------------------|---------|-------------------------------|-------------------------------|
| MIPS32                  | 1st Instruction     |          | 2nd Instruct        | ion     | Without Data                  | With Data                     |
| or<br>microMIPS         | Instruction<br>Type | Target   | Instruction<br>Type | Target  | Dependency                    | Dependency                    |
|                         |                     | GPR      | Integer Arithmetic  | GPR     | 1                             | 1                             |
|                         | Integer Arithmetic  |          | Multiply            | GPR     | 1                             | 1                             |
|                         |                     |          | Multiply            | Hi/Lo   | 1                             | 1                             |
|                         |                     |          | Divide              | Hi/Lo   | 1                             | 1                             |
|                         |                     | GPR      | Integer Arithmetic  | GPR     | 3                             | 4                             |
|                         | Multiply            |          | Multiply            | GPR     | 1                             | 4                             |
|                         | in an pro-          |          | Multiply            | Hi/Lo   | 1                             | 4                             |
| Normal                  |                     |          | Divide              | Hi/Lo   | 1                             | 4                             |
| Integer<br>Instructions |                     | Hi/Lo    | Integer Arithmetic  | GPR     | 1                             | 1                             |
|                         | Multiply            |          | Multiply            | GPR     | 1                             | 1                             |
|                         | ivialipiy           |          | Multiply            | Hi/Lo   | 1                             | 1                             |
|                         |                     |          | Divide              | Hi/Lo   | 1                             | 1                             |
|                         |                     | Hi/Lo    | Integer Arithmetic  | GPR     | 1                             | 1                             |
|                         | Divide              |          | Multiply            | GPR     | 10, 18, 26, 34 <sup>[1]</sup> | 10, 18, 26, 34 <sup>[1]</sup> |
|                         |                     |          | Multiply            | Hi/Lo   | 10, 18, 26, 34 <sup>[1]</sup> | 10, 18, 26, 34 <sup>[1]</sup> |
|                         |                     |          | Divide              | Hi/Lo   | 10, 18, 26, 34 <sup>[1]</sup> | 10, 18, 26, 34 <sup>[1]</sup> |
|                         |                     | GPR      | Integer Arithmetic  | GPR     | 1                             | 1                             |
|                         | Integer Arithmetic  |          | Multiply            | GPR     | 1                             | 1                             |
|                         |                     |          | Multiply            | Hi/Lo   | 1                             | 1                             |
|                         |                     | GPR      | Integer Arithmetic  | GPR     | 3                             | 4                             |
| DSP Module              | Multiply            |          | Multiply            | GPR     | 1                             | 4                             |
| Instructions            |                     |          | Multiply            | Hi/Lo   | 1                             | 4                             |
|                         |                     | Hi/Lo    | Integer Arithmetic  | GPR     | 1                             | 1                             |
|                         | Multiply            |          | Multiply            | GPR     | 1                             | 1                             |
|                         |                     |          | Multiply            | Hi/Lo   | 1                             | 1, 2 <sup>[2]</sup>           |

Table 2.3 Delays for Interesting Sequences with DSPControl Dependency

[1] : The number cycles depends on the size of input operands.

[2] : An extra cycle is needed if Saturation arithmetic is needed.

### 2.5.3 High-performance MDU Pipeline Stages

The multiply operation begins in stage  $B_{MDU}$ , which would be the E stage in the integer pipeline. The Booth-recoding function occurs at this time. The multiply calculation requires three clocks and occurs in the M1<sub>MDU</sub>, M2<sub>MDU</sub>,

and  $M3_{MDU}$  stages. The carry-lookahead-add (CLA) function occurs at the end of the  $M3_{MDU}$  stage. In the  $A_{MDU}$  stage, the result is selected from the multiply data path, *HI* register, and *LO* register to be returned to the ALU for the MFHI, MFLO, and MUL instructions. If the MDU instruction is not one of these, the result is selected to be written into the *HI/LO* registers instead. The result is ready to be read from the *HI/LO* registers in the W<sub>MDU</sub> stage.

The following figures illustrate a multiply (accumulate) instruction and the interaction with the main integer pipeline. These figures are applicable to MUL, MULT, MULTU, MADD, MADDU, MSUB, and MSUBU instructions

Figure 2.6 Multiply Pipeline







Figure 2.8 Multiply With Dependency From Load Hit







Figure 2.10 shows the results of the GPR-targeted MUL instruction being bypassed to a later instruction. Independent instructions can execute while the multiply is occurring. If a dependent instruction is found, it will stall until the result is available. When the MUL completes, it will arbitrate for access to the write port of the register file. If the integer pipe is busy with other instructions, the MDU pipeline will stall until the result can be written.

If the MUL target is being used as the base address for a load or store instruction, it needs to be bypassed by the AG stage, and thus one extra cycle will be required.

| MUL                                        | I | B <sub>MDU</sub> | M1 <sub>MDU</sub> | M2 <sub>MDU</sub> | M3 <sub>MDU</sub> | A <sub>MDU</sub> | N | Re | esult byp | ass |   |  |
|--------------------------------------------|---|------------------|-------------------|-------------------|-------------------|------------------|---|----|-----------|-----|---|--|
|                                            |   | I                | E                 | М                 | A                 | W                |   |    |           |     |   |  |
|                                            |   |                  | I                 | E                 | М                 | A                |   | w  |           |     |   |  |
|                                            |   |                  |                   | I                 | E                 | М                |   | А  | W         |     |   |  |
|                                            |   |                  |                   |                   | I                 | E                | , | м  | A         | W   |   |  |
| Earliest dependent ALU instn               |   |                  |                   |                   | tn                | Ι                |   | E  | М         | A   | W |  |
| Earliest dependent load/store base address |   |                  |                   |                   |                   | I                | E | Μ  | A         | W   |   |  |

### Figure 2.10 subtractMUL Bypassing Result to Integer Instructions

## 2.5.4 High-performance MDU Divide Operations

Divide operations are implemented using a simple non-restoring division algorithm. This algorithm works only for positive operands, and thus the first cycle of the  $M_{MDU}$  stage is used to negate the *rs* operand (RS Adjust), if needed. Note that this cycle is spent even if the adjustment is not necessary. In cycle 2, the first add/subtract iteration is executed. In cycle 3, an early-in detection is performed. The adjusted *rs* operand is detected to be zero-extended on the uppermost 8, 16, or 24 bits. If this is the case, the following 7, 15, or 23 cycles of the add/subtract iterations are skipped. During the next maximum 31 cycles (4-34), the remaining iterative add/subtract loop is executed.

The remainder adjust (Rem Adjust) cycle is required if the remainder was negative. Note that this cycle is spent even if the remainder was positive. A sign-adjust is performed on the quotient and/or remainder, if necessary. The sign adjust stages are skipped if both operands are positive.

Figure 2.18, Figure 2.19, Figure 2.13 and Figure 2.14 show the worst-case latencies for 8, 16, 24, and 32 bit divide operations respectively. The worst case repeat rate is either 14, 22,, 30, or 38 cycles (two less if the *sign adjust* stage is skipped).



| Clock | 1              | 2              | 3              | 4-10           | 11            | 12            | 13             | 14           |
|-------|----------------|----------------|----------------|----------------|---------------|---------------|----------------|--------------|
|       | ← IDLE Stage → | ← DIV1 Stage → | ← ERLY Stage → | ← DIV Stages → | ← RMD Stage → | ← SGN Stage → | ← SGN2 Stage → |              |
|       | RS Adjust      | Add/Subtract   | Early In       | Add/subtract   | Rem. Adjust   | Sign Adjust 1 | Sign Adjust 2  | Result Ready |

| Clock | 1              | 2              | 3              | 4-18           | 19            | 20            | 21             | 22             |
|-------|----------------|----------------|----------------|----------------|---------------|---------------|----------------|----------------|
|       | ← IDLE Stage → | ← DIV1 Stage → | ← ERLY Stage → | ← DIV Stages → | ← RMD Stage → | ← SGN Stage → | ← SGN2 Stage → | ◆ IDLE Stage → |
|       | RS Adjust      | Add/Subtract   | Early In       | Add/subtract   | Rem. Adjust   | Sign Adjust 1 | Sign Adjust 2  | Result Ready   |

#### Figure 2.12 MDU Pipeline Flow During a 16-bit Divide (DIV) Operation

Figure 2.13 MDU Pipeline Flow During a 24-bit Divide (DIV) Operation



Figure 2.14 MDU Pipeline Flow During a 32-bit Divide (DIV) Operation



## 2.6 MDU Pipeline - High-performance MDU with DSP Module Disabled

The microAptiv UP processor core contains an autonomous multiply/divide unit (MDU) with a separate pipeline for multiply and divide operations. This pipeline operates in parallel with the integer unit (IU) pipeline and does not stall when the IU pipeline stalls. This allows multi-cycle MDU operations, such as a divide, to be partially masked by system stalls and/or other integer unit instructions.

The MDU consists of a 32x16 Booth-encoded multiplier array, a carry propagate adder, result/accumulation registers (HI and LO), multiply and divide state machines, and all necessary multiplexers and control logic. The first number shown ('32' of 32x16) represents the *rs* operand. The second number ('16' of 32x16) represents the *rt* operand. The core only checks the latter (*rt*) operand value to determine how many times the operation must pass through the multiplier array. The 16x16 and 32x16 operations pass through the multiplier array once. A 32x32 operation passes through the multiplier array twice.

The MDU supports execution of a 16x16 or 32x16 multiply operation every clock cycle; 32x32 multiply operations can be issued every other clock cycle. Appropriate interlocks are implemented to stall the issue of back-to-back 32x32 multiply operations. Multiply operand size is automatically determined by logic built into the MDU. Divide operations are implemented with a simple 1 bit per clock iterative algorithm with an early in detection of sign extension on the dividend (*rs*). Any attempt to issue a subsequent MDU instruction while a divide is still active causes an IU pipeline stall until the divide operation is completed.

Table 2.4 lists the latencies (number of cycles until a result is available) for multiply, and divide instructions. The latencies are listed in terms of pipeline clocks. In this table 'latency' refers to the number of cycles necessary for the first instruction to produce the result needed by the second instruction.

| Size of Operand                                                                      | Instruction Sequence                                                                |                                                                                                                                                   |                   |  |  |
|--------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------|-------------------|--|--|
| 1st Instruction <sup>[1]</sup>                                                       | 1st Instruction                                                                     | 2nd Instruction                                                                                                                                   | Latency<br>Clocks |  |  |
| 16 bit                                                                               | MULT/MULTU,<br>MADD/MADDU,<br>MSUB/MSUBU                                            | MADD/MADDU,<br>MSUB/MSUBU or<br>MFHI/MFLO                                                                                                         | 1                 |  |  |
| 32 bit                                                                               | MULT/MULTU,<br>MADD/MADDU, or<br>MSUB/MSUBU                                         | MADD/MADDU,<br>MSUB/MSUBU or<br>MFHI/MFLO                                                                                                         | 2                 |  |  |
| 16 bit                                                                               | MUL                                                                                 | Integer operation <sup>[2]</sup>                                                                                                                  | 2 <sup>[3]</sup>  |  |  |
| 32 bit                                                                               | MUL                                                                                 | Integer operation <sup>[2]</sup>                                                                                                                  | 2 <sup>[3]</sup>  |  |  |
| 8 bit                                                                                | DIVU                                                                                | MFHI/MFLO                                                                                                                                         | 9                 |  |  |
| 16 bit                                                                               | DIVU                                                                                | MFHI/MFLO                                                                                                                                         | 17                |  |  |
| 24 bit                                                                               | DIVU                                                                                | MFHI/MFLO                                                                                                                                         | 25                |  |  |
| 32 bit                                                                               | DIVU                                                                                | MFHI/MFLO                                                                                                                                         | 33                |  |  |
| 8 bit                                                                                | DIV                                                                                 | MFHI/MFLO                                                                                                                                         | 10 <sup>[4]</sup> |  |  |
| 16 bit                                                                               | DIV                                                                                 | MFHI/MFLO                                                                                                                                         | 18[4]             |  |  |
| 24 bit                                                                               | DIV                                                                                 | MFHI/MFLO                                                                                                                                         | 26 <sup>[4]</sup> |  |  |
| 32 bit                                                                               | DIV                                                                                 | MFHI/MFLO                                                                                                                                         | 34 <sup>[4]</sup> |  |  |
| any                                                                                  | MFHI/MFLO                                                                           | Integer operation <sup>[2]</sup>                                                                                                                  | 2                 |  |  |
| any                                                                                  | MTHI/MTLO                                                                           | MADD/MADDU or<br>MSUB/MSUBU                                                                                                                       | 1                 |  |  |
| [1] For multiply operation<br>[2] Integer Operation ref<br>[3] This does not include | ons, this is the <i>rt</i> operand. For dividential of any integer instruction that | MSUB/MSUBU<br>de operations, this is the <i>rs</i> operand<br>uses the result of a previous MDU operation<br>it or 32 bit) that the MUL operation | operation.        |  |  |

Table 2.4 MDU Instruction Latencies (High-Performance MDU)

spective of the following instruction. These stalls do not add to the latency of 2.

[4] If both operands are positive, then the Sign Adjust stage is bypassed. Latency is then the same as for DIVU.

In Table 2.4, a latency of one means that the first and second instructions can be issued back-to-back in the code, without the MDU causing any stalls in the IU pipeline. A latency of two means that if issued back-to-back, the IU pipeline will be stalled for one cycle. MUL operations are special, because the MDU needs to stall the IU pipeline in order to maintain its register file write slot. As a result, the MUL 16x16 or 32x16 operation will always force a onecycle stall of the IU pipeline, and the MUL 32x32 will force a two-cycle stall. If the integer instruction immediately following the MUL operation uses its result, an additional stall is forced on the IU pipeline.

Table 2.5 lists the repeat rates (peak issue rate of cycles until the operation can be reissued) for multiply accumulate/subtract instructions. The repeat rates are listed in terms of pipeline clocks. In this table 'repeat rate' refers to the case where the first MDU instruction (in the table below) if back-to-back with the second instruction.

| Operand Size of 1st | Instruction Sequence                     |                           |                |  |  |
|---------------------|------------------------------------------|---------------------------|----------------|--|--|
| Instruction         | 1st Instruction                          | 2nd Instruction           | Repeat<br>Rate |  |  |
| 16 bit              | MULT/MULTU,<br>MADD/MADDU,<br>MSUB/MSUBU | MADD/MADDU,<br>MSUB/MSUBU | 1              |  |  |
| 32 bit              | MULT/MULTU,<br>MADD/MADDU,<br>MSUB/MSUBU | MADD/MADDU, MSUB/MSUBU    | 2              |  |  |

Table 2.5 MDU Instruction Repeat Rates (High-Performance MDU)

Figure 2.15 below shows the pipeline flow for the following sequence:

- 1. 32x16 multiply (Mult<sub>1</sub>)
- 2. Add
- 3. 32x32 multiply (Mult<sub>2</sub>)

Sub

4. Subtract (Sub)

The 32x16 multiply operation requires one clock of each pipeline stage to complete. The 32x32 multiply operation requires two clocks in the  $M_{MDU}$  pipe-stage. The MDU pipeline is shown as the shaded areas of Figure 2.15 and always starts a computation in the final phase of the E stage. As shown in the figure, the  $M_{MDU}$  pipe-stage of the MDU pipeline occurs in parallel with the M stage of the IU pipeline, the  $A_{MDU}$  stage occurs in parallel with the A stage, and the  $W_{MDU}$  stage occurs in parallel with the W stage. In general this need not be the case. Following the 1st cycle of the M stages, the two pipelines need not be synchronized. This does not present a problem because results in the MDU pipeline are written to the HI and LO registers, while the integer pipeline results are written to the register file.



М

A

Е

W



The following is a cycle-by-cycle analysis of Figure 2.15.

1. The first 32x16 multiply operation (Mult<sub>1</sub>) is fetched from the instruction cache and enters the I stage.

I

- 2. An Add operation enters the I stage. The Mult<sub>1</sub> operation enters the E stage. The integer and MDU pipelines share the I and E pipeline stages. At the end of the E stage in cycle 2, the MDU pipeline starts processing the multiply operation (Mult<sub>1</sub>).
- 3. In cycle 3, a 32x32 multiply operation (Mult<sub>2</sub>) enters the I stage and is fetched from the instruction cache. Since the Add operation has not yet reached the M stage by cycle 3, there is no activity in the M stage of the integer pipeline at this time.
- 4. In cycle 4, the Subtract instruction enters I stage. The second multiply operation (Mult<sub>2</sub>) enters the E stage. And the Add operation enters M stage of the integer pipe. Since the Mult<sub>1</sub> multiply is a 32x16 operation, only one clock is required for the M<sub>MDU</sub> stage, hence the Mult<sub>1</sub> operation passes to the A<sub>MDU</sub> stage of the MDU pipeline.
- In cycle 5, the Subtract instruction enters E stage. The Mult<sub>2</sub> multiply enters the M<sub>MDU</sub> stage. The Add operation enters the A stage of the integer pipeline. The Mult<sub>1</sub> operation completes and is written back in to the HI/LO register pair in the W<sub>MDU</sub> stage.
- 6. Since a 32x32 multiply requires two passes through the multiplier, with each pass requiring one clock, the 32x32 Mult<sub>2</sub> remains in the M<sub>MDU</sub> stage in cycle 6. The Sub instruction enters M stage in the integer pipeline. The Add operation completes and is written to the register file in the W stage of the integer pipeline.
- 7. The Mult<sub>2</sub> multiply operation progresses to the A<sub>MDU</sub> stage, and the Sub instruction progress to the A stage.
- 8. The Mult<sub>2</sub> operation completes and is written to the HI/LO registers pair in the the W<sub>MDU</sub> stage, while the Sub instruction writes to the register file in the W stage.

### 2.6.1 32x16 Multiply (High-Performance MDU)

The 32x16 multiply operation begins in the last phase of the E stage, which is shared between the integer and MDU pipelines. In the latter phase of the E stage, the *rs* and *rt* operands arrive and the Booth-recoding function occurs at this time. The multiply calculation requires one clock and occurs in the  $M_{MDU}$  stage. In the  $A_{MDU}$  stage, the carry-propagate-add (CPA) function occurs and the operation is completed. The result is ready to be read from the HI/LO registers in the  $W_{MDU}$  stage.

Figure 2.16 shows a diagram of a 32x16 multiply operation.





### 2.6.2 32x32 Multiply (High-Performance MDU)

The 32x32 multiply operation begins in the last phase of the E stage, which is shared between the integer and MDU pipelines. In the latter phase of the E stage, the *rs* and *rt* operands arrive and the Booth-recoding function occurs at this time. The multiply calculation requires two clocks and occurs in the  $M_{MDU}$  stage. In the  $A_{MDU}$  stage, the CPA function occurs and the operation is completed.

Figure 2.17 shows a diagram of a 32x32 multiply operation.



## 2.6.3 Divide (High-Performance MDU)

Divide operations are implemented using a simple non-restoring division algorithm. This algorithm works only for positive operands, hence the first cycle of the  $M_{MDU}$  stage is used to negate the *rs* operand (RS Adjust) if needed. Note that this cycle is spent even if the adjustment is not necessary. During the next maximum 32 cycles (3-34) an iterative add/subtract loop is executed. In cycle 3 an early-in detection is performed in parallel with the add/subtract. The adjusted *rs* operand is detected to be zero extended on the upper most 8, 16 or 24 bits. If this is the case the following 7, 15 or 23 cycles of the add/subtract iterations are skipped.

The remainder adjust (Rem Adjust) cycle is required if the remainder was negative. Note that this cycle is spent even if the remainder was positive. A sign adjust is performed on the quotient and/or remainder if necessary. The sign adjust stage is skipped if both operands are positive. In this case the Rem Adjust is moved to the  $A_{MDU}$  stage.

Figure 2.18, Figure 2.19, Figure 2.20 and Figure 2.21 show the latency for 8, 16, 24 and 32 bit divide operations, respectively. The repeat rate is either 11, 19, 27 or 35 cycles (one less if the *sign adjust* stage is skipped) as a second divide can be in the *RS Adjust* stage when the first divide is in the *Reg WR* stage.



Figure 2.18 High-Performance MDU Pipeline Flow During a 8-bit Divide (DIV) Operation







### Figure 2.20 High-Performance MDU Pipeline Flow During a 24-bit Divide (DIV) Operation





## 2.7 MDU Pipeline - Area-Efficient MDU with DSP Module Disabled

The area-efficient multiply/divide unit (MDU) is a separate autonomous block for multiply and divide operations. The MDU is not pipelined, but rather performs the computations iteratively in parallel with the integer unit (IU) pipeline and does not stall when the IU pipeline stalls. This allows the long-running MDU operations to be partially masked by system stalls and/or other integer unit instructions.

The MDU consists of one 32-bit adder result-accumulate registers (HI and LO), a combined multiply/divide state machine, and all multiplexers and control logic. A simple 1-bit-per-clock recursive algorithm is used for both multiply and divide operations. Using Booth's algorithm all multiply operations complete in 32 clocks. Two extra clocks are needed for multiply-accumulate. The non-restoring algorithm used for divide operations will not work with negative numbers. Adjustment before and after are thus required depending on the sign of the operands. All divide operations complete in 33 to 35 clocks.

Table 2.6 lists the latencies (number of cycles until a result is available) for multiply and divide instructions. The latencies are listed in terms of pipeline clocks. In this table 'latency' refers to the number of cycles necessary for the second instruction to use the results of the first.

| Operand Signs of<br>1st Instruction | Instruction               | Latanav                                    |                   |
|-------------------------------------|---------------------------|--------------------------------------------|-------------------|
| (Rs,Rt)                             | 1st Instruction           | 2nd Instruction                            | Latency<br>Clocks |
| any, any                            | MULT/MULTU                | MADD/MADDU,<br>MSUB/MSUBU, or<br>MFHI/MFLO | 32                |
| any, any                            | MADD/MADDU,<br>MSUB/MSUBU | MADD/MADDU,<br>MSUB/MSUBU, or<br>MFHI/MFLO | 34                |
| any, any                            | MUL                       | Integer operation <sup>[1]</sup>           | 32                |
| any, any                            | DIVU                      | MFHI/MFLO                                  | 33                |

| Instruction                                                                                                                                                                                                                                                                                                                                            |                                   |                                                                                                                                  |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------|----------------------------------------------------------------------------------------------------------------------------------|
| 1st Instruction                                                                                                                                                                                                                                                                                                                                        | 2nd Instruction                   | Latency<br>Clocks                                                                                                                |
| Instruction Sequence         1st Instruction       2nd Instruction         DIV       MFHI/MFLO         DIV       MFHI/MFLO         DIV       MFHI/MFLO         MFHI/MFLO       Integer operation <sup>[1]</sup> MTHI/MFLO       MADD/MADDU,<br>MSUB/MSUBU         n refers to any integer instruction that uses the result of a previous MDU operation | 33                                |                                                                                                                                  |
| DIV                                                                                                                                                                                                                                                                                                                                                    | MFHI/MFLO                         | 34                                                                                                                               |
| DIV                                                                                                                                                                                                                                                                                                                                                    | MFHI/MFLO                         | 35                                                                                                                               |
| MFHI/MFLO                                                                                                                                                                                                                                                                                                                                              | Integer operation <sup>[1]</sup>  | 2                                                                                                                                |
| MTHI/MTLO                                                                                                                                                                                                                                                                                                                                              | ,                                 | 1                                                                                                                                |
| -                                                                                                                                                                                                                                                                                                                                                      | 1st InstructionDIVDIVDIVMFHI/MFLO | 1st Instruction2nd InstructionDIVMFHI/MFLODIVMFHI/MFLODIVMFHI/MFLOMFHI/MFLOInteger operation <sup>[1]</sup> MTHI/MTLOMADD/MADDU, |

Table 2.6 microAptiv<sup>™</sup> UP Core Instruction Latencies (Area-Efficient MDU)

## 2.7.1 Multiply (Area-Efficient MDU)

Multiply operations are executed using a simple iterative multiply algorithm. Using Booth's approach, this algorithm works for both positive and negative operands. The operation uses 32 cycles in  $M_{MDU}$  stage to complete a multiplication. The register writeback to HI and LO are done in the A stage. For MUL operations, the register file writeback is done in the  $W_{MDU}$  stage.

Figure 2.22 shows the latency for a multiply operation. The repeat rate is 33 cycles as a second multiply can be in the first  $M_{MDU}$  stage when the first multiply is in  $A_{MDU}$  stage.

### Figure 2.22 microAptiv<sup>™</sup> UP Area-Efficient MDU Pipeline Flow During a Multiply Operation



### 2.7.2 Multiply Accumulate (Area-Efficient MDU)

Multiply-accumulate operations use the same multiply machine as used for multiply only. Two extra stages are needed to perform the addition/subtraction. The operations uses 34 cycles in  $M_{MDU}$  stage to complete the multiply-accumulate. The register writeback to HI and LO are done in the A stage.

Figure 2.23 shows the latency for a multiply-accumulate operation. The repeat rate is 35 cycles as a second multiply-accumulate can be in the E stage when the first multiply is in the last  $M_{MDU}$  stage.

#### Figure 2.23 microAptiv<sup>™</sup> UP Core Area-Efficient MDU Pipeline Flow During a Multiply Accumulate Operation



## 2.7.3 Divide (Area-Efficient MDU)

Divide operations also implement a simple non-restoring algorithm. This algorithm works only for positive operands, hence the first cycle of the  $M_{MDU}$  stage is used to negate the rs operand (RS Adjust) if needed. Note that this cycle is executed even if negation is not needed. The next 32 cycle (3-34) executes an interactive add/subtract-shift function.

Two sign adjust (Sign Adjust 1/2) cycles are used to change the sign of one or both the quotient and the remainder. Note that one or both of these cycles are skipped if they are not needed. The rule is, if both operands were positive or if this is an unsigned division; both of the sign adjust cycles are skipped. If the *rs* operand was negative, one of the sign adjust cycles is skipped. If only the *rs* operand was negative, none of the sign adjust cycles are skipped. Register writeback to HI and LO are done in the A stage.

Figure 2.24 shows the pipeline flow for a divide operation. The repeat rate is either 34, 35 or 36 cycles (depending on how many sign adjust cycles are skipped) as a second divide can be in the E stage when the first divide is in the last  $M_{MDU}$  stage.

### Figure 2.24 microAptiv<sup>™</sup> UP Core Area-Efficient MDU Pipeline Flow During a Divide (DIV) Operation



## 2.8 Branch Delay

The pipeline has a branch delay of one cycle. The one-cycle branch delay is a result of the branch decision logic operating during the E pipeline stage. This allows the branch target address to be used in the I stage of the instruction following 2 cycles after the branch instruction. By executing the 1st instruction following the branch instruction sequentially before switching to the branch target, the intervening branch delay slot is utilized. This avoids bubbles being injected into the pipeline on branch instructions. Both the address calculation and the branch condition check are performed in the E stage.

The pipeline begins the fetch of either the branch path or the fall-through path in the cycle following the delay slot. After the branch decision is made, the processor continues with the fetch of either the branch path (for a taken branch) or the fall-through path (for the non-taken branch).

The branch delay means that the instruction immediately following a branch is always executed, regardless of the branch direction. If no useful instruction can be placed after the branch, then the compiler or assembler must insert a NOP instruction in the delay slot.

Figure 2.25 illustrates the branch delay.



#### Figure 2.25 IU Pipeline Branch Delay

## 2.9 Data Bypassing

Most MIPS32 instructions use one or two register values as source operands. These operands are fetched from the register file in the first part of E stage. The ALU straddles the E-to-M boundary, and can present the result early in the M stage. However, the result is not written to the register file before the W stage. If no precautions were taken, it would take 3 cycles before the result was available for the following instructions. To avoid this, data bypassing is implemented.

Between the register file and the ALU a data-bypass multiplexer is placed on both operands (see figure below). This enables the microAptiv UP core to forward data from a preceding instruction whose target is a source register of a following instruction. An M to E bypass and an A to E bypass feed the bypass multiplexers. A W to E bypass is not needed, as the register file is capable of making an internal bypass of Rd write data directly to the Rs and Rt read ports.



#### Figure 2.26 IU Pipeline Data bypass

Figure 2.27 shows the data bypass for an Add<sub>1</sub> instruction followed by a Sub<sub>2</sub> and another Add<sub>3</sub> instruction. The Sub<sub>2</sub> instruction uses the output from the Add<sub>1</sub> instruction as one of the operands, and thus the M to E bypass is used. The following Add<sub>3</sub> uses the result from both the first Add<sub>1</sub> instruction and the Sub<sub>2</sub> instruction. Since the Add<sub>1</sub> data is now in A stage, the A to E bypass is used, and the M to E bypass is used to bypass the Sub<sub>2</sub> data to the Add<sub>2</sub> instruction.



#### Figure 2.27 IU Pipeline M to E bypass

### 2.9.1 Load Delay

Load delay refers to the fact that data fetched by a load instruction is not available in the integer pipeline until after the load aligner in A stage. All instructions need the source operands available in the E stage. An instruction immediately following a load instruction will, if it has the same source register as was the target of the load, cause an instruction interlock pipeline slip in the E stage (see 2.13 "Instruction Interlocks" on page 67). If an instruction following the load by 1 or 2 cycles uses the data from the load, the A to E bypass (see Figure 2.26) serves to reduce or avoid stall cycles. An instruction flow of this is shown in Figure 2.28.



Figure 2.28 IU Pipeline A to E Data bypass

### 2.9.2 Move from HI/LO and CP0 Delay

As indicated in Figure 2.26, not only load data, but also data moved from the HI or LO registers (MFHI/MFLO) and data moved from CP0 (MFC0) enters the IU-Pipeline in the A stage. That is, data is not available in the integer pipeline until early in the A stage. The A to E bypass is available for this data. But as for Loads, an instruction following immediately after one of these move instructions must be paused for one cycle if the target of the move is among the sources of the following instruction and this causes an interlock slip in the E stage (see 2.13 "Instruction Interlocks" on page 67). An interlock slip after a MFHI is illustrated in Figure 2.29.

|                  | One Cycle      | One Cycle | One Cycle |
|------------------|-----------|-----------|-----------|-----------|----------------|-----------|-----------|
| MFHI (to R3) 🛛 🔶 | Ι         | E         | М         | А         | W              |           |           |
|                  |           |           |           | Data by   | pass from A to | E         |           |
| ADD (R4=R3+R5)   | >         | Ι         | E (slip)  | Ē         | М              | А         | W         |
|                  |           |           |           |           |                |           |           |

#### Figure 2.29 IU Pipeline Slip after a MFHI

## 2.10 Coprocessor 2 Instructions

If a coprocessor 2 is attached to the microAptiv UP core, a number of transactions must take place on the CP2 Interface for each coprocessor 2 instruction. First, if the CU[2] bit in the CP0 *Status* register is not set, then no coprocessor 2 related instruction will start a transaction on the CP2 Interface; instead, a Coprocessor Unusable exception will be signaled. If the CU[2] bit is set, and a coprocessor 2 instruction is fetched, the following transactions will occur on the CP2 Interface:

- 1. The Instruction is presented on the instructions bus in E stage. Coprocessor 2 can do a decode in the same cycle.
- 2. The Instruction is validated from the core in M stage. From this point, the core will accept control and data signals back from coprocessor 2. All control and data signals from coprocessor 2 are captured on input latches to the core.
- If all the expected control and data signals were presented to the core in the previous M stage, the core will proceed to execute the A stage. If some return information is missing, the A stage will not advance and cause a slip in all I, E, and M stages (see 2.12 "Slip Conditions" on page 66). If this instruction sent data from the core to coprocessor 2, this data is sent in the A stage.
- 4. The instruction completion is signaled to coprocessor 2 in the W stage. Potential data from the coprocessor is written to the register file.

Figure 2.30 shows the timing relationship between the microAptiv UP core and coprocessor 2 for all coprocessor 2 instructions.





As can be seen in the Figure, all control and data from the coprocessor must occur in the M stage. If this is not the case, the A stage will start slipping in the following cycle and thus stall the I, E, M. and A stages; but if all expected control and data is available in the M stage, coprocessor 2 instructions can execute with no pipeline stalls. The only exception to this is the Branch on Coprocessor conditions (BC2) instruction. All branch instructions, including the regular BEQ, BNE, etc., must be resolved in the E stage. The microAptiv UP core does not have branch prediction logic, and thus the target address must be available before the end of the E stage. The BC2 instruction has to follow the same protocol as all other coprocessor 2 instructions on the CP2 Interface. All core interface operations belonging to the E, M, and A stages will have to occur in the E stage for BC2 instructions. This means that a BC2 instruction always slips for a minimum of 2 cycles int the E stage, and any delay in the return of branch information from coprocessor 2 will add to the number of slip cycles. All other Coprocessor 2 instructions can operate without slips, provided that all control and data information from coprocessor 2 is transferred in the M stage.

## 2.11 Interlock Handling

Smooth pipeline flow is interrupted when cache misses occur or when data dependencies are detected. Interruptions handled entirely in hardware, such as cache misses, are referred to as *interlocks*. At each cycle, interlock conditions are checked for all active instructions.

Table 2.7 lists the types of pipeline interlocks for the microAptiv UP processor core.

| Interlock Type | Sources                              | Slip Stage |  |  |
|----------------|--------------------------------------|------------|--|--|
| ITLB Miss      | Instruction TLB                      | I Stage    |  |  |
| ICache Miss    | Instruction cache                    | E Stage    |  |  |
| Instruction    | Producer-consumer hazards            | E/M Stage  |  |  |
|                | Hardware Dependencies (MDU/TLB)      | E Stage    |  |  |
|                | BC2 waiting for COP2 Condition Check |            |  |  |
| DTLB Miss      | Data TLB                             | M Stage    |  |  |

### **Table 2.7 Pipeline Interlocks**

| Interlock Type                | Sources                                                     | Slip Stage |  |  |
|-------------------------------|-------------------------------------------------------------|------------|--|--|
| Data Cache Miss               | Load that misses in data cache                              | A Stage    |  |  |
|                               | Multi-cycle cache Op                                        |            |  |  |
|                               | Sync                                                        |            |  |  |
|                               | Store when write thru buffer full                           |            |  |  |
|                               | EJTAG breakpoint on store                                   |            |  |  |
|                               | VA match needing data value comparison                      |            |  |  |
|                               | Store hitting in fill buffer                                |            |  |  |
| Coprocessor 2 completion slip | Coprocessor 2 control and/or data delay<br>from coprocessor | A Stage    |  |  |

### Table 2.7 Pipeline Interlocks (Continued)

In general, MIPS processors support two types of hardware interlocks:

- Stalls, which are resolved by halting the pipeline
- Slips, which allow one part of the pipeline to advance while another part of the pipeline is held static

In the microAptiv UP processor core, all interlocks are handled as slips.

## 2.12 Slip Conditions

On every clock, internal logic determines whether each pipe stage is allowed to advance. These slip conditions propagate backwards down the pipe. For example, if the M stage does not advance, neither does the E or I stage.

Slipped instructions are retried on subsequent cycles until they issue. The back end of the pipeline advances normally during slips. This resolves the conflict when the slip was caused by a missing result. NOPs are inserted into the bubble in the pipeline. Figure 2.31 shows an instruction cache miss that causes a two-cycle slip.



### Figure 2.31 Instruction Cache Miss Slip

In the first clock cycle in Figure 2.31, the pipeline is full and the cache miss is detected. Instruction  $I_0$  is in the A stage, instruction  $I_1$  is in the M stage, instruction  $I_2$  is in the E stage, and instruction  $I_3$  is in the I stage. The cachemiss occurs in clock 2 when the  $I_4$  instruction fetch is attempted.  $I_4$  advances to the E stage and waits for the instruction to be fetched from main memory. In this example, two clocks (3 and 4) are required to fetch the  $I_4$  instruction from memory. After the cache miss has been resolved in clock 4 and the instruction is bypassed to the E stage, the pipeline is restarted, causing  $I_4$  to finally execute it's E-stage operations.

## 2.13 Instruction Interlocks

Most instructions can be issued at a rate of one per clock cycle. In order to adhere to the sequential programming model, the issue of an instruction must sometimes be delayed to ensure that the result of a prior instruction is available. Table 2.8 details the instruction interactions that prevent an instruction from advancing in the processor pipeline.

|                                                 | l       | nstruction Interlocks                 |                                  |            |
|-------------------------------------------------|---------|---------------------------------------|----------------------------------|------------|
| First Instructio                                | n       | Second Instruction                    | lssue Delay (in<br>Clock Cycles) | Slip Stage |
| LB/LBU/LH/LHU/LL/LW/LWL/LWR                     |         | Consumer of load data                 | 1                                | E stage    |
| MFC0                                            |         | Consumer of destination regis-<br>ter | 1                                | E stage    |
| MULTx/MADDx/MSUBx                               | 16bx32b | MFLO/MFHI                             | 0                                |            |
| (high-performance MDU with DSP Module disabled) | 32bx32b |                                       | 1                                | M stage    |
| MUL                                             | 16bx32b | Consumer of target data               | 2                                | E stage    |
| (high-performance MDU with DSP Module disabled) | 32bx32b |                                       | 3                                | E stage    |

|                                                                                                 | In      | struction Interlocks                                  |                                  |            |  |
|-------------------------------------------------------------------------------------------------|---------|-------------------------------------------------------|----------------------------------|------------|--|
| First Instruction                                                                               |         | Second Instruction                                    | lssue Delay (in<br>Clock Cycles) | Slip Stage |  |
| MUL                                                                                             | 16bx32b | Non-Consumer of target data                           | 1                                | E stage    |  |
| (high-performance MDU with DSP Module disabled)                                                 | 32bx32b |                                                       | 2                                | E stage    |  |
| MFHI/MFLO                                                                                       |         | Consumer of target data                               | 1                                | E stage    |  |
| MULTx/MADDx/MSUBx                                                                               | 16bx32b | MULT/MUL/MADD/MSUB<br>MTHI/MTLO/DIV                   | 0 <sup>[1]</sup>                 | E stage    |  |
| (high-performance MDU with DSP Module disabled)                                                 | 32bx32b | MIHI/MILO/DIV                                         | 1 <sup>[1]</sup>                 | E stage    |  |
| DIV                                                                                             |         | MUL/MULTx/MADDx/<br>MSUBx/MTHI/MTLO/<br>MFHI/MFLO/DIV | Until DIV completes              | E stage    |  |
| MULT/MUL/MADD/MSUB/MTHI/MTLO/MFH<br>I/MFLO/DIV<br>(area-efficient MDU with DSP Module disabled) |         | MULT/MUL/MADD/MSUB/<br>MTHI/MTLO/MFHI/MFLO/<br>DIV    | Until 1st MDU op<br>completes    | E stage    |  |
| MUL<br>(area-efficient MDU with DSP Module disabled)                                            |         | Any Instruction                                       | Until MUL completes              | E stage    |  |
| MFC0/MFC2/CFC2                                                                                  |         | Consumer of target data                               | 1                                | E stage    |  |
| TLBWR/TLBWI                                                                                     |         | Load/Store/PREF/CACHE/                                | 2                                | E stage    |  |
| TLBR                                                                                            |         | СОР0 ор                                               | 1                                | E stage    |  |

### Table 2.8 Instruction Interlocks (Continued)

## 2.14 Hazards

In general, the microAptiv UP core ensures that instructions are executed following a fully sequential program model in which each instruction in the program sees the results of the previous instruction. There are some deviations to this model, referred to as *hazards*.

Prior to Release 2 of the MIPS Architecture, hazards (primarily CP0 hazards) were relegated to implementation-dependent cycle-based solutions, primarily based on the SSNOP instruction. This has been an insufficient and error-prone practice that must be addressed with a firm compact between hardware and software. As such, new instructions have been added to Release 2 of the architecture which act as explicit barriers that eliminate hazards. To the extent that it was possible to do so, the new instructions have been added in such a way that they are backward-compatible with existing MIPS processors.

## 2.14.1 Types of Hazards

With one exception, all hazards were eliminated in Release 1 of the Architecture for unprivileged software. The exception occurs when unprivileged software writes a new instruction sequence and then wishes to jump to it. Such an operation remained a hazard, and is addressed by the capabilities of Release 2.

In privileged software, there are two types of hazards: execution hazards and instruction hazards.

Execution hazards are those created by the execution of one instruction, and seen by the execution of another instruction. Table 2.9 lists execution hazards.

| Producer     | $\rightarrow$ | Consumer                                                                    | Hazard On                                           | Spacing<br>(Instructions) |
|--------------|---------------|-----------------------------------------------------------------------------|-----------------------------------------------------|---------------------------|
| TLBWR, TLBWI | $\rightarrow$ | TLBP, TLBR                                                                  | TLB entry                                           | 0                         |
|              |               | Load/store using new TLB entry                                              | TLB entry                                           | 0                         |
| MTC0         | $\rightarrow$ | Load/store affected by new state                                            | WatchHi<br>WatchLo                                  | 0                         |
| LL           | $\rightarrow$ | MFC0                                                                        | LLAddr                                              | 1                         |
| MTC0         | $\rightarrow$ | Coprocessor instruction execution depends on the new value of $Status_{CU}$ | Status <sub>CU</sub>                                | 1                         |
| MTC0         | $\rightarrow$ | ERET                                                                        | EPC<br>DEPC<br>ErrorEPC                             | 1                         |
| MTC0         | $\rightarrow$ | ERET                                                                        | Status                                              | 0                         |
| MTC0, EI, DI | $\rightarrow$ | Interrupted Instruction                                                     | Status <sub>IE</sub>                                | 1                         |
| MTC0         | $\rightarrow$ | Interrupted Instruction                                                     | Cause <sub>IP</sub>                                 | 3                         |
| TLBR         | $\rightarrow$ | MFC0                                                                        | EntryHi,<br>EntryLo0,<br>EntryLo1,<br>PageMask      | 0                         |
| TLBP         | $\rightarrow$ | MFC0                                                                        | Index                                               | 0                         |
| MTC0         | $\rightarrow$ | TLBR<br>TLBWI<br>TLBWR                                                      | EntryHi                                             | 1                         |
| MTC0         | $\rightarrow$ | TLBP<br>Load/store affected by new state                                    | EntryHi <sub>ASID</sub>                             | 1                         |
| MTC0         | $\rightarrow$ | TLBWI<br>TLBWR                                                              | EntryLo0<br>EntryLo1                                | 0                         |
| MTC0         | $\rightarrow$ | TLBWI<br>TLBWR                                                              | Index                                               | 1                         |
| MTC0         | $\rightarrow$ | RDPGPR<br>WRPGPR                                                            | SRSCtl <sub>PSS</sub>                               | 1                         |
| MTC0         | $\rightarrow$ | Instruction not seeing a Timer Interrupt                                    | Compare<br>update that<br>clears Timer<br>Interrupt | 4 <sup>1</sup>            |
| MTC0         | $\rightarrow$ | Instruction affected by change                                              | Any other CP0<br>register                           | 2                         |

### **Table 2.9 Execution Hazards**

1. This is the minimum value. Actual value is system-dependent since it is a function of the sequential logic between the *Sl\_TimerInt* output and the external logic which feeds *Sl\_TimerInt* back into one of the *Sl\_Int* inputs, or a function of the method for handling *Sl\_TimerInt* in an external interrupt controller.

Instruction hazards are those created by the execution of one instruction, and seen by the instruction fetch of another instruction. Table 2.10 lists instruction hazards.

| Producer                              | $\rightarrow$ | Consumer                                                                                                                       | Hazard On               | Spacing<br>(Instructions)          |
|---------------------------------------|---------------|--------------------------------------------------------------------------------------------------------------------------------|-------------------------|------------------------------------|
| TLBWR, TLBWI                          | $\rightarrow$ | Instruction fetch using new TLB entry                                                                                          | TLB entry               | 3                                  |
| MTC0                                  | $\rightarrow$ | Instruction fetch seeing the new value (including a change to ERL fol-<br>lowed by an instruction fetch from the useg segment) | Status                  |                                    |
| MTC0                                  | $\rightarrow$ | Instruction fetch seeing the new value                                                                                         | EntryHi <sub>ASID</sub> | 3                                  |
| MTC0                                  | $\rightarrow$ | Instruction fetch seeing the new value                                                                                         | WatchHi<br>WatchLo      | 2                                  |
| Instruction stream<br>write via CACHE | $\rightarrow$ | Instruction fetch seeing the new instruction stream                                                                            | Cache entries           | 3                                  |
| Instruction stream<br>write via store | $\rightarrow$ | Instruction fetch seeing the new instruction stream                                                                            | Cache entries           | System-depen-<br>dent <sup>1</sup> |

### **Table 2.10 Instruction Hazards**

1. This value depends on how long it takes for the store value to propagate through the system.

## 2.14.2 Instruction Listing

Table 2.11 lists the instructions designed to eliminate hazards. See the document titled *MIPS32*® *Architecture for Programmers Volume II: The MIPS32*® *Instruction Set* (MD00086) for a more detailed description of these instructions.

| Mnemonic Function |                                                   |  |  |  |  |  |
|-------------------|---------------------------------------------------|--|--|--|--|--|
| EHB               | Clear execution hazard                            |  |  |  |  |  |
| JALR.HB           | Clear both execution and instruction hazards      |  |  |  |  |  |
| JR.HB             | Clear both execution and instruction hazards      |  |  |  |  |  |
| SYNCI             | Synchronize caches after instruction stream write |  |  |  |  |  |

Table 2.11 Hazard Instruction Listing

### 2.14.2.1 Instruction Encoding

The EHB instruction is encoded using a variant of the NOP/SSNOP encoding. This encoding was chosen for compatibility with the Release 1 SSNOP instruction, such that existing software may be modified to be compatible with both Release 1 and Release 2 implementations. See the EHB instruction description for additional information.

The JALR.HB and JR.HB instructions are encoding using bit 10 of the *hint* field of the JALR and JR instructions. These encodings were chosen for compatibility with existing MIPS implementations, including many which pre-date the MIPS architecture. Because a pipeline flush clears hazards on most early implementations, the JALR.HB or JR.HB instructions can be included in existing software for backward and forward compatibility. See the JALR.HB and JR.HB instructions for additional information.

The SYNCI instruction is encoded using a new encoding of the REGIMM opcode. This encoding was chosen because it causes a Reserved Instruction exception on all Release 1 implementations. As such, kernel software running on processors that don't implement Release 2 can emulate the function using the CACHE instruction.

### 2.14.3 Eliminating Hazards

The Spacing column shown in Table 2.9 and Table 2.10 indicates the number of unrelated instructions (such as NOPs or SSNOPs) that, prior to the capabilities of Release 2, would need to be placed between the producer and consumer of the hazard in order to ensure that the effects of the first instruction are seen by the second instruction. Entries in the table that are listed as 0 are traditional MIPS hazards which are not hazards on the microAptiv UP core.

With the hazard elimination instructions available in Release 2, the preferred method to eliminate hazards is to place one of the instructions listed in Table 2.11 between the producer and consumer of the hazard. Execution hazards can be removed by using the EHB, JALR.HB, or JR.HB instructions. Instruction hazards can be removed by using the JALR.HB or JR.HB instructions, in conjunction with the SYNCI instruction.

# The MIPS® DSP Module

The microAptiv UP includes support for the MIPS DSP Module Revision 2 that provides enhanced performance capabilities for a wide range of signal-processing applications, with computational support for fractional data types, SIMD, saturation, and other operations that are commonly used in these applications.

Refer to *MIPS*® *Architecture For Programmers Volume IV-e* [14] or [14] for a general description of the DSP Module and detailed descriptions of the DSP instructions. Additional programming information is contained in *Five Methods of Utilizing the MIPS*® *DSP Module* [16], *Efficient DSP Module Programming in C: Tips* and *Tricks* [17], and *Accelerating DSP Filter Loops with MIPS*® *CorExtend*® *Instructions* [18].

## 3.1 Additional Register State for the DSP Module

The DSP Module defines three additional accumulator registers and one additional control/status register, as described below. These registers require the operating system to recognize the presence of the DSP Module and to include these additional registers in the context save and restore operations.

### 3.1.1 HI-LO Registers

The DSP Module includes three HI/LO accumulator register pairs (ac1, ac2, and ac3) in addition to the HI/LO register pair (ac0) in the standard MIPS32 architecture. These registers improve the parallelization of independent accumulation routines—for example, filter operations, convolutions, etc. DSP instructions that target the accumulators use two instruction bits to specify the destination accumulator, with the zero value referring to the original accumulator.

### 3.1.2 DSPControl Register

The *DSPControl* register contains control and status information used by DSP instructions. Figure 3.1 illustrates the bits in this register, and Table 3.1 describes their usage.

| 31 | 28 | 27 24 | 23 | 16     | 15 | 14  | 13 | 12     | 7 | 6 | 5   | 0 |
|----|----|-------|----|--------|----|-----|----|--------|---|---|-----|---|
|    | 0  | ccond |    | ouflag | 0  | EFI | c  | scount |   | 0 | pos |   |

| Fields |       |                                                                                                                                                                                                                                                                                                                                                                                                                                                                            | Read / | Reset |            |
|--------|-------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------|-------|------------|
| Name   | Bits  | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                | Write  | State | Compliance |
| 0      | 31:28 | Reserved. Used in the MIPS64 architecture but not used<br>in the MIPS32 architecture. Must be written as zero;<br>returns zero on read.                                                                                                                                                                                                                                                                                                                                    | 0      | 0     | Required   |
| ccond  | 27:24 | Condition code bits set by compare instructions. The compare instruction sets the right-most bits as required by the number of elements in the vector compare. Bits not set by the instruction remain unchanged.                                                                                                                                                                                                                                                           | R/W    | 0     | Required   |
| ouflag | 23:16 | This field is written by hardware when certain instruc-<br>tions overflow or underflow and may have been satu-<br>rated. See Table 3.2 for a full list of which bits are set by<br>what instructions.                                                                                                                                                                                                                                                                      | R/W    | 0     | Required   |
| EFI    | 14    | Extract Fail Indicator. This bit is set to 1 when an EXTP,<br>EXTPV, EXTPDP, or EXTPDP instruction fails. These<br>instructions fail when there are insufficient bits to<br>extract, that is, when the value of pos in <i>DSPControl</i> is<br>less than the value of size specified in the instruction.<br>This bit is not sticky, so each invocation of one of the<br>four instructions will reset the bit depending on whether<br>or not the instruction failed.        | R/W    | 0     | Required   |
| С      | 13    | Carry bit. This bit is set and used by special add instruc-<br>tions that implement a 64-bit add across two GPRs. The<br>ADDSC instruction sets the bit and the ADDWC<br>instruction uses this bit.                                                                                                                                                                                                                                                                        | R/W    | 0     | Required   |
| scount | 12:7  | This field is for use by the INSV instruction. The value of this field is used to specify the size of the bit field to be inserted.                                                                                                                                                                                                                                                                                                                                        | R/W    | 0     | Required   |
| pos    | 5:0   | This field is used by the variable insert instructions<br>INSV to specify the insert position.<br>It is also used to indicate the extract position for the<br>EXTP, EXTPV, EXTPDP, and EXTPDPV instructions.<br>The decrement pos (DP) variants of these instructions on<br>completion will have decremented the value of pos (by<br>the size amount).<br>The MTHLIP instruction will increment the pos value by<br>32 after copying the value of <i>LO</i> to <i>HI</i> . | R/W    | 0     | Required   |
| 0      | 15:13 | Must be written as zero; returns zero on read.                                                                                                                                                                                                                                                                                                                                                                                                                             | 0      | 0     | Reserved   |

# Table 3.1 MIPS® DSP Module Control Register (DSPControl) Field Descriptions

The bits of the overflow flag ouflag field in the *DSPControl* register are set by a number of instructions, as described in Table 3.2. These bits are sticky and can be reset only by an explicit write to these bits in the register (using the WRDSP instruction).

| Bit Number | Description                                                                                                                                                                                                                                       |
|------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 16         | This bit is set when the destination is accumulator ( <i>HI-LO</i> pair) zero, and an operation overflow or underflow occurs. These instructions are: DPAQ_S, DPAQ_SA, DPSQ_S, DPSQ_SA, DPAQX_S, DPAQX_SA, DPSQX_SA, MAQ_SA, MAQ_SA and MULSAQ_S. |
| 17         | Same instructions as above, when the destination is accumulator ( <i>HI-LO</i> pair) one.                                                                                                                                                         |
| 18         | Same instructions as above, when the destination is accumulator ( <i>HI-LO</i> pair) two.                                                                                                                                                         |
| 19         | Same instructions as above, when the destination is accumulator ( <i>HI-LO</i> pair) three.                                                                                                                                                       |
| 20         | Instructions that set this bit on an overflow/underflow: ABSQ_S, ADDQ, ADDQ_S, ADDU, ADDU_S, ADDWC, SUBQ, SUBQ_S, SUBU and SUBU_S.                                                                                                                |
| 21         | Instructions that set this bit on an overflow/underflow: MUL, MUL_S, MULEQ_S, MULEU_S, MULQ_RS, and MULQ_S.                                                                                                                                       |
| 22         | Instructions that set this bit on an overflow/underflow: PRECRQ_RS, SHLL, SHLL_S, SHLLV, and SHLLV_S.                                                                                                                                             |
| 23         | Instructions that set this bit on an overflow/underflow: EXTR, EXTR_S, EXTR_RS, EXTRV, and EXTRV_RS.                                                                                                                                              |

# 3.2 Software Detection of the DSP Module Revision 2

The presence of the MIPS DSP Module in the microAptiv UP is indicated by two static bits in the *Config3* register: the *DSPP* (*DSP Present*) bit indicates the presence of the DSP Module, and the *DSP2P* (*DSP Rev2 Present*) bit indicates the presence of the MIPS DSP Module Rev2. Because the DSP Module is configurable in the microAptiv UP processor core, and it always comes with the DSP Module Rev2 if the DSP Module is configured, therefore the *DSPP* and *DSP2P* are always preset to 0's or 1's.

The *MX* (*DSP Module Enable*) read/write bit in the CP0 *Status* register must be set to enable access to the additional instructions defined by the DSP Module, as well as to the MTLO/HI, MFLO/HI instructions that access accumulators ac1, ac2, and ac3. Executing a DSP Module instruction or the MTLO/HI, MFLO/HI instructions with this bit set to zero causes a DSP State Disabled Exception (exception code 26 in the CP0 *Cause* register). This exception can be used by system software to do lazy context switching.

Chapter 4

# Memory Management of the microAptiv<sup>™</sup> UP Core

The microAptiv<sup>™</sup> UP processor core includes a Memory Management Unit (MMU) that interfaces between the execution unit and the cache controller. The microAptiv UP core contains either a Translation Lookaside Buffer (TLB) or a simpler Fixed Mapping Translation (FMT) style MMU, specified as a build-time option when the core is implemented.

This chapter contains the following sections:

- Section 4.1 "Introduction"
- Section 4.2 "Modes of Operation"
- Section 4.3 "Translation Lookaside Buffer"
- Section 4.4 "Virtual-to-Physical Address Translation"
- Section 4.5 "Fixed Mapping MMU"
- Section 4.6 "System Control Coprocessor"

# 4.1 Introduction

The MMU in a microAptiv UP processor core translates a virtual address to a physical address before the request is sent to the cache controllers for tag comparison or to the bus interface unit for an external memory reference. This translation is a very useful feature for operating systems when trying to manage physical memory to accommodate multiple tasks active in the same memory, possibly on the same virtual address space but in different locations in physical memory. Other features handled by the MMU are protection of memory areas and defining cache protocols.

By default, the MMU is TLB based. The TLB consists of three address translation buffers: a 16 or 32 dual-entry fully associative Joint TLB (JTLB), a 4-entry instruction micro TLB (ITLB), and a 4-entry data micro TLB (DTLB). When an address is translated, the appropriate micro TLB (ITLB or DTLB) is accessed first. If the translation is not found in the micro TLB, the JTLB is accessed. If there is a miss in the JTLB, an exception is taken.

Optionally, the MMU can be based on a simple algorithm to translate virtual addresses to physical addresses via a Fixed Mapping Translation (FMT) mechanism. These translations are different for various regions of the virtual address space (useg/kuseg, kseg0, kseg1, kseg2/3).

# 4.1.1 Memory Management Unit (MMU)

The microAptiv UP core offers one of the two choices of MMU that interfaces between the execution unit and the cache controller, namely a Translation Lookaside Buffer (TLB) and Fixed Mapping Translation (FMT).

### 4.1.1.1 Fixed Mapping Translation (FMT)

An FMT is smaller and simpler than a TLB. Like a TLB, the FMT performs virtual-to-physical address translation and provides attributes for the different segments. Those segments that are unmapped in a TLB implementation (kseg0 and kseg1) are translated identically by the FMT.

### 4.1.1.2 Translation Lookaside Buffer (TLB)

A TLB-based MMU consists of three translation buffers: a 16 or 32 dual-entry fully associative Joint TLB (JTLB), a 4-entry fully associative Instruction TLB (ITLB), and a 4-entry fully associative data TLB (DTLB).

When an instruction address is calculated, the virtual address is compared to the contents of the 4-entry ITLB. If the address is not found in the ITLB, the JTLB is accessed. If the entry is found in the JTLB, that entry is then written into the ITLB. If the address is not found in the JTLB, a TLB refill exception is taken.

When a data address is calculated, the virtual address is compared to both the 4-entry DTLB and the JTLB. If the address is not found in the DTLB, but is found in the JTLB, that address is immediately written to the DTLB. If the address is not found in the JTLB, a TLB refill exception is taken.

The microAptiv UP core TLB allows pages to be protected by a read-inhibit and an execute-inhibit attribute in addition to the write-protection attribute defined by the MIPS32 PRA.

Figure 4.1 shows how the TLB based memory management unit interacts with cache accesses in the microAptiv UP core, while Figure 4.2 shows how the FMT based memory management unit interacts.



Figure 4.1 Address Translation During a Cache Access with TLB MMU



Figure 4.2 Address Translation During a Cache Access with FMT MMU

# 4.2 Modes of Operation

The microAptiv UP core implements three modes of operation:

- *User mode* is most often used for applications programs.
- Kernel mode is typically used for handling exceptions and operating-system kernel functions, including CP0 management and I/O device accesses.
- *Debug mode* is used during system bring-up and software development. Refer to the EJTAG section for more information on debug mode.

User mode is most often used for application programs. Kernel mode is typically used for handling exceptions and privileged operating system functions, including CP0 management and I/O device accesses. Debug mode is used for software debugging and most likely occurs within a software development tool.

The address translation performed by the MMU depends on the mode in which the processor is operating.

### 4.2.1 Virtual Memory Segments

The Virtual memory segments differ depending on the mode of operation. Figure 4.3 shows the segmentation for the 4 GByte ( $2^{32}$  bytes) virtual memory space addressed by a 32-bit virtual address, for the three modes of operation.

The core enters Kernel mode both at reset and when an exception is recognized. While in Kernel mode, software has access to the entire address space, as well as all CP0 registers. User mode accesses are limited to a subset of the virtual address space (0x0000\_0000 to 0x7FFF\_FFF) and can be inhibited from accessing CP0 functions. In User mode, virtual addresses 0x8000\_0000 to 0xFFFF\_FFFF are invalid and cause an exception if accessed.

Debug mode is entered on a debug exception. While in Debug mode, the debug software has access to the same address space and CP0 registers as for Kernel mode. In addition, while in Debug mode the core has access to the debug segment dseg. This area overlays part of the kernel segment kseg3. dseg access in Debug mode can be turned on or off, allowing full access to the entire kseg3 in Debug mode, if so desired.

| Virtual Address            |                | Kernel Mode                           | Debug Mode |
|----------------------------|----------------|---------------------------------------|------------|
| 0xFFFF_FFFF                |                |                                       | kseg3      |
| 0xFF40_0000                |                |                                       | dseg       |
| 0xFF3F_FFFF                | <del>.</del> . | kseg3                                 | kseg3      |
| 0xFF20_0000                |                |                                       |            |
| 0xFF1F_FFFF                |                | kees?                                 | kee s 2    |
| 0xE000_0000                |                | kseg2                                 | kseg2      |
| 0xDFFF_FFFF                |                |                                       |            |
| 0xC000_0000                |                |                                       |            |
| 0xBFFF FFFF                |                | kseg1                                 | kseg1      |
| _                          |                |                                       |            |
|                            |                |                                       |            |
| 0x9FFF_FFFF                |                | kseg0                                 | kseg0      |
| 0778000 0000               |                | , , , , , , , , , , , , , , , , , , , | J J        |
| 0x8000_0000<br>0x7FFF FFFF | - T            |                                       | -          |
|                            | useg           | kuseg                                 | kuseg      |
| 0x0000_0000                |                |                                       |            |

#### Figure 4.3 microAptiv<sup>™</sup> UP processor core Virtual Memory Map

Each of the segments shown in Figure 4.3 are either mapped or unmapped. The following two sub-sections explain the distinction. Then sections 4.2.2 "User Mode", 4.2.3 "Kernel Mode" and 4.2.4 "Debug Mode" specify which segments are actually mapped and unmapped.

### 4.2.1.1 Unmapped Segments

An unmapped segment does not use the TLB or the FMT to translate from virtual-to-physical addresses. Especially after reset, it is important to have unmapped memory segments, because the TLB is not yet programmed to perform the translation.

Unmapped segments have a fixed simple translation from virtual to physical address. This is much like the translations the FMT provides for the microAptiv UP core, but we will still make the distinction.

Except for kseg0, unmapped segments are always uncached. The cacheability of kseg0 is set in the *K0* field of the CP0 Config register (see 6.2.30 "Config Register (CP0 Register 16, Select 0)").

#### 4.2.1.2 Mapped Segments

A mapped segment does use the TLB or the FMT to translate from virtual-to-physical addresses.

The translation of mapped segments is handled on a per-page basis. Included in this translation is information defining whether the page is cacheable or not, and the protection attributes that apply to the page.

For the microAptiv UP core, the mapped segments have a fixed translation from virtual to physical address. The cacheability of the segment is defined in the CP0 *Config* register fields K23 and KU (see 6.2.30 "Config Register (CP0 Register 16, Select 0)"). Write protection of segments is not possible during FMT translation.

### 4.2.2 User Mode

In user mode, a single 2 GByte  $(2^{31}$  bytes) uniform virtual address space called the user segment (useg) is available. Figure 4.4 shows the location of user mode virtual address space.



#### Figure 4.4 User Mode Virtual Address Space

The user segment starts at address 0x0000\_0000 and ends at address 0x7FFF\_FFF. Accesses to all other addresses cause an address error exception.

The processor operates in User mode when the *Status* register contains the following bit values:

- *UM* = 1
- EXL = 0
- ERL = 0

In addition to the above values, the DM bit in the *Debug* register must be 0.

Table 4.1 lists the characteristics of the useg User mode segments.

|                     | Status Register |     |         |      |                            |                                    |
|---------------------|-----------------|-----|---------|------|----------------------------|------------------------------------|
| Address Bit         | Bit Value       |     | Segment |      |                            |                                    |
| Value               | EXL             | ERL | UM      | Name | Address Range              | Segment Size                       |
| 32-bit<br>A(31) = 0 | 0               | 0   | 1       | useg | 0x0000_0000><br>0x7FFF_FFF | 2 GByte<br>(2 <sup>31</sup> bytes) |

 Table 4.1 User Mode Segments

All valid user mode virtual addresses have their most significant bit cleared to 0, indicating that user mode can only access the lower half of the virtual memory map. Any attempt to reference an address with the most significant bit set while in user mode causes an address error exception.

The system maps all references to *useg* through the TLB or FMT. The virtual address is extended with the contents of the 8-bit ASID field to form a unique virtual address before translation. Also for the microAptiv UP core, bit settings within the TLB entry for the page determine the cacheability of a reference.

### 4.2.3 Kernel Mode

The processor operates in Kernel mode when the *DM* bit in the *Debug* register is 0 and the *Status* register contains one or more of the following values:

- UM = 0
- *ERL* = 1
- EXL = 1

When a non-debug exception is detected, *EXL* or *ERL* will be set and the processor will enter Kernel mode. At the end of the exception handler routine, an Exception Return (ERET) instruction is generally executed. The ERET instruction jumps to the Exception PC, clears *ERL*, and clears *EXL* if *ERL*=0. This may return the processor to User mode.

Kernel mode virtual address space is divided into regions differentiated by the high-order bits of the virtual address, as shown in Figure 4.5. Also, Table 4.2 lists the characteristics of the Kernel mode segments.

| 0xFFFF_FFFF<br>0xE000 0000 | Kernel virtual address space<br>Mapped, 512MB             | kseg3 |
|----------------------------|-----------------------------------------------------------|-------|
| 0xDFFF FFFF                |                                                           |       |
| -<br>0xC000_0000           | Kernel virtual address space<br>Mapped, 512MB             | kseg2 |
| 0xBFFF_FFFF                |                                                           |       |
| 0xA000_0000<br>0x9FFF FFFF | Kernel virtual address space<br>Unmapped, Uncached, 512MB | kseg1 |
| -<br>0x8000 0000           | Kernel virtual address space<br>Unmapped, 512MB           | kseg0 |
| —                          |                                                           |       |
| 0x7FFF_FFFF                | Mapped, 2048MB                                            | kuseg |
| 0x0000_0000                |                                                           |       |
|                            |                                                           |       |

## Figure 4.5 Kernel Mode Virtual Address Space

# **Table 4.2 Kernel Mode Segments**

| Address Bit                 |                                 | Register<br>hese Va |       | Segment                               |                                       | Segment |
|-----------------------------|---------------------------------|---------------------|-------|---------------------------------------|---------------------------------------|---------|
| Values                      | UM                              | EXL                 | ERL   | Name                                  | Address Range                         | Size    |
| A(31) = 0                   | (UM = 0<br>or<br>EXL = 1        |                     | kuseg | 0x0000_0000<br>through<br>0x7FFF_FFF  | 2 GBytes (2 <sup>31</sup><br>bytes)   |         |
| A(31:29) = 100 <sub>2</sub> | or<br>ERL = 1)<br>and<br>DM = 0 |                     | kseg0 | 0x8000_0000<br>through<br>0x9FFF_FFFF | 512 MBytes<br>(2 <sup>29</sup> bytes) |         |
| A(31:29) = 101 <sub>2</sub> | DM = 0                          |                     | kseg1 | 0xA000_0000<br>through<br>0xBFFF_FFFF | 512 MBytes<br>(2 <sup>29</sup> bytes) |         |
| A(31:29) = 110 <sub>2</sub> |                                 |                     | kseg2 | 0xC000_0000<br>through<br>0xDFFF_FFFF | 512 MBytes<br>(2 <sup>29</sup> bytes) |         |
| A(31:29) = 111 <sub>2</sub> |                                 |                     | kseg3 | 0xE000_0000<br>through<br>0xFFFF_FFF  | 512 MBytes<br>(2 <sup>29</sup> bytes) |         |

### 4.2.3.1 Kernel Mode, User Space (kuseg)

In Kernel mode, when the most-significant bit of the virtual address (A31) is cleared, the 32-bit kuseg virtual address space is selected and covers the full  $2^{31}$  bytes (2 GBytes) of the current user address space mapped to addresses  $0x0000\_0000 - 0x7FFF\_FFFF$ . The virtual address is extended with the contents of the 8-bit ASID field to form a unique virtual address.

When the *Status* register's ERL = 1, the user address region becomes a  $2^{29}$ -byte unmapped and uncached address space. While in this setting, the kuseg virtual address maps directly to the same physical address and does not include the ASID field.

### 4.2.3.2 Kernel Mode, Kernel Space 0 (kseg0)

In Kernel mode, when the most-significant three bits of the virtual address are 1002, 32-bit kseg0 virtual address

space is selected; it is the  $2^{29}$ -byte (512-MByte) kernel virtual space located at addresses  $0x8000_0000 - 0x9FFF_FFFF$ . References to kseg0 are unmapped; the physical address selected is defined by subtracting  $0x8000_0000$  from the virtual address. The *K0* field of the *Config* register controls cacheability.

### 4.2.3.3 Kernel Mode, Kernel Space 1 (kseg1)

In Kernel mode, when the most-significant three bits of the 32-bit virtual address are 1012, 32-bit kseg1 virtual

address space is selected. kseg1 is the 2<sup>29</sup>-byte (512-MByte) kernel virtual space located at addresses 0xA000\_0000 - 0xBFFF\_FFF. References to kseg1 are unmapped; the physical address selected is defined by subtracting 0xA000\_0000 from the virtual address. Caches are disabled for accesses to these addresses, and physical memory (or memory-mapped I/O device registers) are accessed directly.

### 4.2.3.4 Kernel Mode, Kernel Space 2 (kseg2)

In Kernel mode, when UM = 0, ERL = 1, or EXL = 1 in the *Status* register, and DM = 0 in the *Debug* register, and the most-significant three bits of the 32-bit virtual address are  $110_2$ , 32-bit kseg2 virtual address space is selected. In the

microAptiv UP core, this 2<sup>29</sup>-byte (512-MByte) kernel virtual space is located at physical addresses 0xC000\_0000 - 0xDFFF\_FFF. This space is mapped through the TLB.

#### 4.2.3.5 Kernel Mode, Kernel Space 3 (kseg3)

In Kernel mode, when the most-significant three bits of the 32-bit virtual address are 1112, the kseg3 virtual address

space is selected. In the microAptiv UP core, this 2<sup>29</sup>-byte (512-MByte) kernel virtual space is located at physical addresses 0xE000\_0000 - 0xFFFF\_FFFF. This space is mapped through the TLB.

### 4.2.4 Debug Mode

Debug mode address space is identical to Kernel mode address space with respect to mapped and unmapped areas, except for *kseg3*. In *kseg3*, a debug segment *dseg* co-exists in the virtual address range 0xFF20\_0000 to 0xFF3F\_FFFF. The layout is shown in Figure 4.6.



Figure 4.6 Debug Mode Virtual Address Space

The dseg is sub-divided into the dmseg segment at 0xFF20\_0000 to 0xFF2F\_FFFF which is used when the probe services the memory segment, and the drseg segment at 0xFF30\_0000 to 0xFF3F\_FFFF which is used when memory-mapped debug registers are accessed. The subdivision and attributes for the segments are shown in Table 4.3.

Accesses to memory that would normally cause an exception if tried from kernel mode cause the core to re-enter debug mode via a debug mode exception. This includes accesses usually causing a TLB exception with the result that such accesses are not handled by the usual memory management routines.

The unmapped kseg0 and kseg1 segments from kernel mode address space are available from debug mode, which allows the debug handler to be executed from uncached and unmapped memory.

Table 4.3 Physical Address and Cache Attributes for dseg, dmseg, and drseg Address Spaces

| Segment<br>Name | Sub-Segment<br>Name | Virtual Address                       | Generates Physical Address                                                     | Cache<br>Attribute |
|-----------------|---------------------|---------------------------------------|--------------------------------------------------------------------------------|--------------------|
| dseg            | dmseg               | 0xFF20_0000<br>through<br>0xFF2F_FFFF | dmseg maps to addresses<br>0x0_0000 - 0xF_FFFF in EJTAG<br>probe memory space. | Uncached           |
|                 | drseg               | 0xFF30_0000<br>through<br>0xFF3F_FFFF | drseg maps to the breakpoint reg-<br>isters 0x0_0000 - 0xF_FFFF                |                    |

#### 4.2.4.1 Conditions and Behavior for Access to drseg, EJTAG Registers

The behavior of CPU access to the drseg address range at 0xFF30\_0000 to 0xFF3F\_FFFF is determined as shown in Table 4.4

### Table 4.4 CPU Access to drseg Address Range

| Transaction  | LSNM Bit in Debug<br>Register | Access                            |
|--------------|-------------------------------|-----------------------------------|
| Load / Store | 1                             | Kernel mode address space (kseg3) |
| Fetch        | Don't care                    | drseg, see comments below         |
| Load / Store | 0                             |                                   |

Debug software is expected to read the Debug Control Register (*DCR*) to determine which other memory mapped registers exist in drseg. The value returned in response to a read of any unimplemented memory mapped register is

unpredictable, and writes are ignored to any unimplemented register in the drseg. Refer to Chapter 10, "EJTAG Debug Support in the microAptiv<sup>™</sup> UP Core" on page 230 for more information on the *DCR*.

The allowed access size is limited for the drseg. Only word size transactions are allowed. Operation of the processor is undefined for other transaction sizes.

#### 4.2.4.2 Conditions and Behavior for Access to dmseg, EJTAG Memory

The behavior of CPU access to the dmseg address range at 0xFF20\_0000 to 0xFF2F\_FFFF is determined by the table shown in Table 4.5.

| Transaction  | ProbEn bit in<br>DCR register | LSNM bit in<br>Debug register | Access                            |
|--------------|-------------------------------|-------------------------------|-----------------------------------|
| Load / Store | Don't care                    | 1                             | Kernel mode address space (kseg3) |
| Fetch        | 1                             | Don't care                    | dmseg                             |
| Load / Store | 1                             | 0                             |                                   |
| Fetch        | 0                             | Don't care                    | See comments below                |
| Load / Store | 0                             | 0                             | 1                                 |

Table 4.5 CPU Access to dmseg Address Range

The case with access to the dmseg when the *ProbEn* bit in the *DCR* register is 0 is not expected to happen. Debug software is expected to check the state of the *ProbEn* bit in *DCR* register before attempting to reference dmseg. If such a reference does happen, the reference hangs until it is satisfied by the probe. The probe can not assume that there will never be a reference to dmseg if the *ProbEn* bit in the *DCR* register is 0 because there is an inherent race between the debug software sampling the *ProbEn* bit as 1 and the probe clearing it to 0.

# 4.3 Translation Lookaside Buffer

The following subsections discuss the TLB memory management scheme used in the microAptiv UP processor core. The TLB consists of one joint and two micro address translation buffers:

- 16 or 32 dual-entry fully associative Joint TLB (JTLB)
- 4-entry fully associative Instruction micro TLB (ITLB)
- 4-entry fully associative Data micro TLB (DTLB)

### 4.3.1 Joint TLB

The microAptiv UP core implements a 16 or 32 dual-entry, fully associative Joint TLB that maps 32 or 64 virtual pages to their corresponding physical addresses. The purpose of the TLB is to translate virtual addresses and their corresponding ASID into a physical memory address. The translation is performed by comparing the upper bits of the virtual address (along with the ASID bits) against each of the entries in the *tag* portion of the JTLB structure. Because this structure is used to translate both instruction and data virtual addresses, it is referred to as a "joint" TLB.

The JTLB is organized as pairs of even and odd entries containing descriptions of pages that range in size from 4-KBytes (or 1-KByte) to 256-MBytes<sup>1</sup> into the 4-GByte physical address space. By default, the minimum page size

<sup>&</sup>lt;sup>1</sup> The maximum page size is defined when building the actual core. For further information please see sec.4.4.2.1 "Page Sizes"

is normally 4-KBytes on the microAptiv UP core; as a build-time option, it is possible to specify a minimum page size of 1-KByte.

The JTLB is organized in pairs of page entries to minimize its overall size. Each virtual *tag* entry corresponds to two physical data entries, an even page entry and an odd page entry. The highest order virtual address bit not participating in the tag comparison is used to determine which of the two data entries is used. Since page size can vary on a page-pair basis, the determination of which address bits participate in the comparison and which bit is used to make the even-odd selection must be done dynamically during the TLB lookup.

Figure 4.8 shows the contents of one of the dual-entries in the JTLB. The bit range indication in the figure serves to clarify which address bits are (or may be) affected during the translation process.



#### Figure 4.7 JTLB Entry (Tag and Data)

Figure 4.8 JTLB Entry (Tag and Data)



Table 4.6 and Table 4.7 explain each of the fields in a JTLB entry.

| Field Name      | Description                                                                                                                                                                                                                                                                                           |                                                  |                              |  |  |
|-----------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------|------------------------------|--|--|
| PageMask[28:11] | Page Mask Value. The Page Mask defi<br>VPN2 bits from being involved in a co<br>address bit is used to make the even-od<br>below.                                                                                                                                                                     | mparison. It is also u                           | used to determine which      |  |  |
|                 | PageMask                                                                                                                                                                                                                                                                                              | Page Size                                        | Even/Odd Bank<br>Select Bit  |  |  |
|                 | 00_0000_0000_0000_0000                                                                                                                                                                                                                                                                                | 1KB                                              | VAddr[10]                    |  |  |
|                 | 00_0000_0000_0000_0011                                                                                                                                                                                                                                                                                | 4KB                                              | VAddr[12]                    |  |  |
|                 | 00_0000_0000_0000_1111                                                                                                                                                                                                                                                                                | 16KB                                             | VAddr[14]                    |  |  |
|                 | 00_0000_0000_0011_1111                                                                                                                                                                                                                                                                                | 64KB                                             | VAddr[16]                    |  |  |
|                 | 00_0000_0000_1111_1111                                                                                                                                                                                                                                                                                | 256KB                                            | VAddr[18]                    |  |  |
|                 | 00_0000_0011_1111_1111                                                                                                                                                                                                                                                                                | 1MB                                              | VAddr[20]                    |  |  |
|                 | 00_0000_1111_1111_1111                                                                                                                                                                                                                                                                                | 4MB                                              | VAddr[22]                    |  |  |
|                 | 00_0011_1111_1111_1111                                                                                                                                                                                                                                                                                | 16MB                                             | VAddr[24]                    |  |  |
|                 | The PageMask column above shows al<br>pair of bits can only have the same valu<br>compressed version of the PageMask u<br>software, which will always work with                                                                                                                                       | ue, the physical entry<br>using only 8 bits. Thi | in the JTLB will only save a |  |  |
| VPN2[31:13]     | Virtual Page Number divided by 2. This field contains the upper bits of the virtual page number. Because it represents a pair of TLB pages, it is divided by 2. Bits 31:25 are always included in the TLB lookup comparison. Bits 24:13 are included depending on the page size, defined by PageMask. |                                                  |                              |  |  |
| G               | Global Bit. When set, indicates that this entry is global to all processes and/or threads and thus disables inclusion of the ASID in the comparison.                                                                                                                                                  |                                                  |                              |  |  |
| ASID[7:0]       | Address Space Identifier. Identifies wh with.                                                                                                                                                                                                                                                         | ich process or thread                            | this TLB entry is associated |  |  |

# Table 4.6 TLB Tag Entry Fields

# Table 4.7 TLB Data Entry Fields

| Field Name                                            | Description                                                                                                                                                                                                                                                                                                                                                                                                                                             |
|-------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| PFN0([31:12] or [29:10]),<br>PFN1([31:12] or [29:10]) | Physical Frame Number. Defines the upper bits of the physical address.<br>The [29:10] range illustrates that if 1Kbytes page granularity is enabled, the PFN is<br>shifted to the right, before being appended to the untranslated part of the virtual address.<br>In this mode the upper two physical address bits are not covered by PFN but forced to<br>zero.<br>For page sizes larger than 4 KBytes, only a subset of these bits is actually used. |

| Field Name          | Description                                                                                                                                                                                                                                                   |                                                                                                                                                                    |  |  |  |
|---------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|
| C0[2:0],<br>C1[2:0] | Cacheability. Contains an encoded value of the cacheability attributes and determines whether the page should be placed in the cache or not. The field is encoded as follows:                                                                                 |                                                                                                                                                                    |  |  |  |
|                     | C[2:0]                                                                                                                                                                                                                                                        | Coherency Attribute                                                                                                                                                |  |  |  |
|                     | 000                                                                                                                                                                                                                                                           | Cacheable, noncoherent, write-through, no write-allocate                                                                                                           |  |  |  |
|                     | 001                                                                                                                                                                                                                                                           | Cacheable, noncoherent, write-through, write-allocate                                                                                                              |  |  |  |
|                     | 010                                                                                                                                                                                                                                                           | Uncached                                                                                                                                                           |  |  |  |
|                     | 011                                                                                                                                                                                                                                                           | Cacheable, noncoherent, write-back, write-allocate                                                                                                                 |  |  |  |
|                     | 100                                                                                                                                                                                                                                                           | Maps to entry 011b*                                                                                                                                                |  |  |  |
|                     | 101                                                                                                                                                                                                                                                           | Maps to entry 011b*                                                                                                                                                |  |  |  |
|                     | 110                                                                                                                                                                                                                                                           | Maps to entry 011b*                                                                                                                                                |  |  |  |
|                     | 111                                                                                                                                                                                                                                                           | Maps to entry 010b*                                                                                                                                                |  |  |  |
|                     | but do hav                                                                                                                                                                                                                                                    | pings are not used on the microAptiv UP processor cores<br>re meaning in other MIPS Technologies implementations.<br>ne MIPS32 specification for more information. |  |  |  |
| RIO,<br>RI1         | Read Inhibit bit. Indicates that the page is read protected. If this bit is set, an attempt to read from the page causes a TLB Read Inhibit exception if the PageGrain <sub>IEC</sub> = 1, and even if the $V$ (Valid) bit is set.                            |                                                                                                                                                                    |  |  |  |
| XI0,<br>XI1         | Execute Inhibit bit. Indicates that the page is execution protected. If this bit is set, an attempt to fetch an instruction from the page causes a TLB Execute Inhibit exception if the PageGrain <sub>IEC</sub> = 1, and even if the $V$ (Valid) bit is set. |                                                                                                                                                                    |  |  |  |
| D0,                 | "Dirty" or Write-ena                                                                                                                                                                                                                                          | able Bit. Indicates that the page has been written and/or is writable. If                                                                                          |  |  |  |
| DI                  | this bit is set, stores to the page are permitted. If the bit is cleared, stores to the page cause a TLB Modified exception.                                                                                                                                  |                                                                                                                                                                    |  |  |  |
| V0,<br>V1           | Valid Bit. Indicates that the TLB entry and, thus, the virtual page mapping are valid. If this bit is set, accesses to the page are permitted. If the bit is cleared, accesses to the page cause a TLB Invalid exception.                                     |                                                                                                                                                                    |  |  |  |

### Table 4.7 TLB Data Entry Fields (Continued)

In order to fill an entry in the JTLB, software executes a TLBWI or TLBWR instruction (See 4.4.3 "TLB Instructions" on page 92). Prior to invoking one of these instructions, several CP0 registers must be updated with the information to be written to a TLB entry:

- PageMask is set in the CP0 PageMask register.
- VPN2, VPN2X, and ASID are set in the CP0 EntryHi register.
- *PFN0, C0, D0, V0,* and *G* bits are set in the CP0 *EntryLo0* register.
- PFN1, C1, D1, V1, and G bits are set in the CP0 EntryLo1 register.

Note that the global bit "G" is part of both *EntryLo0* and *EntryLo1*. The resulting "G" bit in the JTLB entry is the logical AND of the two fields in *EntryLo0* and *EntryLo1*. Please refer to 6.2.30 "Config Register (CP0 Register 16, Select 0)" for further details.

The address space identifier (ASID) helps to reduce the frequency of TLB flushing on a context switch. The existence of the ASID allows multiple processes to exist in both the TLB and instruction caches. The ASID value is stored in the *EntryHi* register and is compared to the ASID value of each entry.

# 4.3.2 Instruction TLB

The ITLB is a small 4-entry, fully associative TLB dedicated to perform translations for the instruction stream. The ITLB only maps 4-Kbyte pages/sub-pages or 1-Kbyte pages/sub-pages if  $Config3_{SP}=1$  and  $PageGrain_{ESP}=1$  if  $PageGrain_{Mask}$  is set to "00".

The ITLB is managed by hardware and is transparent to software. If a fetch address cannot be translated by the ITLB, the JTLB is accessed trying to translate it in the following clock cycle. If successful, the translation information is copied into the ITLB. The ITLB is then re-accessed and the address will be successfully translated. This results in an ITLB miss penalty of at least 2 cycles. If the JTLB is busy with other operations, it may take additional cycles.

# 4.3.3 Data TLB

The DTLB is a small 4-entry, fully associative TLB which provides a faster translation for Load/Store addresses than is possible with the JTLB. The DTLB only maps 4-Kbyte pages/sub-pages or 1-Kbyte pages/sub-pages if  $Config3_{SP}=1$  and  $PageGrain_{ESP}=1$ .

Like the ITLB, the DTLB is managed by hardware and is transparent to software. Unlike the ITLB, an access to the DTLB starts a parallel access to the JTLB. If there is a DTLB miss and a JTLB hit, the DTLB can be reloaded that cycle. The DTLB is then re-accessed and the translation will be successful. This parallel access reduces the DTLB miss penalty to 1 cycle.

# 4.4 Virtual-to-Physical Address Translation

Converting a virtual address to a physical address begins by comparing the virtual address from the processor with the virtual addresses in the TLB. There is a match when the VPN of the address is the same as the VPN field of the entry, and either:

- The Global (G) bit of both the even and odd pages of the TLB entry are set, or
- The ASID field of the virtual address is the same as the ASID field of the TLB entry

This match is referred to as a TLB *hit*. If there is no match, a TLB *miss* exception is taken by the processor and software is allowed to refill the TLB from a page table of virtual/physical addresses in memory.

Figure 4.9 shows the logical translation of a virtual address into a physical address.

In this figure the virtual address is extended with an 8-bit ASID, which reduces the frequency of TLB flushing during a context switch. This 8-bit ASID contains the number assigned to that process and is stored in the CP0 *EntryHi* register.

#### Figure 4.9 Overview of a Virtual-to-Physical Address Translation in the microAptiv™ UP Core



If there is a virtual address match in the TLB, the Physical Frame Number (PFN) is output from the TLB and concatenated with the *Offset*, to form the physical address. The *Offset* represents an address within the page frame space. As shown in Figure 4.9, the *Offset* does not pass through the TLB. Figure 4.10 shows a flow diagram of the microAptiv UP core address translation process for two page sizes. The top portion of the figure shows a virtual address for a 4 KByte page size. The width of the *Offset* is defined by the page size. The remaining 20 bits of the address represent the virtual page number (VPN). The bottom portion of Figure 4.10 shows the virtual address for a 16 MByte page size. The remaining 8 bits of the address represent the VPN.



### 4.4.1 Hits, Misses, and Multiple Matches

Each JTLB entry contains a tag and two data fields. If a match is found, the upper bits of the virtual address are replaced with the page frame number (PFN) stored in the corresponding entry in the data array of the JTLB. The granularity of JTLB mappings is defined in terms of TLB pages. The JTLB supports pages of different sizes ranging from 1 KB to 256 MB in powers of 4. If a match is found, but the entry is invalid (i.e., the V bit in the data field is 0), a TLB Invalid exception is taken. If no match occurs (TLB miss), an exception is taken and software refills the TLB from the page table resident in memory. Figure 4.11 shows the translation and exception flow of the TLB.

Software can write over a selected TLB entry or use a hardware mechanism to write into a random entry. The *Random* register selects which TLB entry to use on a TLBWR. This register decrements almost every cycle, wrapping to the maximum when its value is equal to the *Wired* register. Thus, TLB entries below the *Wired* value cannot be replaced by a TLBWR allowing important mappings to be preserved. In order to reduce the possibility for a livelock situation, the *Random* register includes a 10-bit LFSR that introduces a pseudo-random perturbation into the decrement.

The microAptiv UP core implements a TLB write-compare mechanism to ensure that multiple TLB matches do not occur. On the TLB write operation, the VPN2 field to be written is compared with all other entries in the TLB. If a match occurs, the microAptiv UP core takes a machine-check exception, sets the TS bit in the CP0 *Status* register, and aborts the write operation. For further details on exceptions, refer to Chapter 5, "Exceptions and Interrupts in the microAptiv<sup>TM</sup> UP Core" on page 96. There is a hidden bit in each TLB entry that is cleared on a ColdReset. This bit

is set when the TLB entry is written and is included in the match detection. Therefore, uninitialized TLB entries will not cause a TLB shutdown.

Note: This hidden initialization bit leaves the entire JTLB invalid after a ColdReset, eliminating the need to flush the TLB. But, to be compatible with other MIPS processors, it is recommended that software initialize all TLB entries with unique tag values and V bits cleared before the first access to a mapped location.

### 4.4.2 Memory Space

To assist in controlling both the amount of mapped space and the replacement characteristics of various memory regions, the microAptiv UP core provides two mechanisms.

#### 4.4.2.1 Page Sizes

First, the page size can be configured, on a per entry basis, to map different page sizes ranging from 4 KByte to 256 MByte, in multiples of 4 (optionally, the microAptiv UPc core can also support a smaller page size of 1 KByte). The CP0 *PageMask* register is loaded with the desired page size, which is then entered into the TLB when a new entry is written. Thus, operating systems can provide special-purpose maps. For example, a typical frame buffer can be memory mapped with only one TLB entry.

The microAptiv UP core implements the following page sizes:

(optionally 1K), 4K, 16K, 64K, 256K, 1M, 4M, 16M, 64M, 256M.

Software can determine which page sizes are supported by writing all ones to the CP0 *PageMask* register, then reading back the value. For additional information, see 6.2.5 "UserLocal Register (CP0 Register 4, Select 2)".

To enable support of 1 KByte pages in the microAptiv UPc core, a few steps must be taken. First, check that small pages are implemented by reading the CP0 *Config*<sub>SP</sub> bit; if set, small page sizes can be enabled by setting the ESP bit of the CP0 *PageGrain* register. See 6.2.7 "PageGrain Register (CP0 Register 5, Select 1)" on page 144 for more information.

### 4.4.2.2 Replacement Algorithm

The second mechanism controls the replacement algorithm when a TLB miss occurs. To select a TLB entry to be written with a new mapping, the microAptiv UPc core provides a random replacement algorithm. However, the processor also provides a mechanism whereby a programmable number of mappings can be locked into the TLB via the CP0 *Wired* register, thus avoiding random replacement. Please refer to 6.2.8 "Wired Register (CP0 Register 6, Select 0)" on page 145 for further details.



Figure 4.11 TLB Address Translation Flow in the microAptiv™ UP Processor Core

# 4.4.3 TLB Instructions

Table 4.8 lists the microAptiv UP core's TLB-related instructions. Refer to Chapter 12, "microAptiv™ UP Processor Core Instructions" on page 310 for more information on these instructions.

| Op Code | Description of Instruction         |  |  |  |
|---------|------------------------------------|--|--|--|
| TLBP    | Translation Lookaside Buffer Probe |  |  |  |
| TLBR    | Translation Lookaside Buffer Read  |  |  |  |

| Op Code | Description of Instruction                |  |  |
|---------|-------------------------------------------|--|--|
| TLBWI   | Translation Lookaside Buffer Write Index  |  |  |
| TLBWR   | Translation Lookaside Buffer Write Random |  |  |

### Table 4.8 TLB Instructions (Continued)

# 4.5 Fixed Mapping MMU

The microAptiv UP core optionally implements a simple Fixed Mapping (FM) memory management unit that is smaller than the a full translation lookaside buffer (TLB) and more easily synthesized. Like a TLB, the FMT performs virtual-to-physical address translation and provides attributes for the different memory segments. Those memory segments which are unmapped in a TLB implementation (kseg0 and kseg1) are translated identically by the FMT MMU.

The FMT also determines the cacheability of each segment. These attributes are controlled via bits in the *Config* register. Table 4.10 shows the encoding for the K23 (bits 30:28), KU (bits 27:25) and K0 (bits 2:0) of the *Config* register.

| Config Register Fields<br>K23, KU, and K0 | Cache Coherency Attribute                                |
|-------------------------------------------|----------------------------------------------------------|
| 0                                         | Cacheable, noncoherent, write-through, no write-allocate |
| 1                                         | Cacheable, noncoherent, write-through, write-allocate    |
| 3, 4, 5, 6                                | Cacheable, noncoherent, write-back, write-allocate       |
| 2, 7                                      | Uncached                                                 |

**Table 4.9 Cache Coherency Attributes** 

In the microAptiv UP core, no translation exceptions can be taken, although address errors are still possible.

Table 4.10 Cacheability of Segments with Block Address Translation

| Segment    | Virtual Address<br>Range    | Cacheability                                                                                                 |
|------------|-----------------------------|--------------------------------------------------------------------------------------------------------------|
| useg/kuseg | 0x0000_0000-<br>0x7FFF_FFFF | Controlled by the KU field (bits 27:25) of the <i>Config</i> register. Refer to Table 4.9 for the encoding.  |
| kseg0      | 0x8000_0000-<br>0x9FFF_FFFF | Controlled by the K0 field (bits 2:0) of the <i>Config</i> register. See Table 4.9 for the encoding.         |
| kseg1      | 0xA000_0000-<br>0xBFFF_FFFF | Always uncacheable.                                                                                          |
| kseg2      | 0xC000_0000-<br>0xDFFF_FFFF | Controlled by the K23 field (bits 30:28) of the <i>Config</i> register. Refer to Table 4.9 for the encoding. |
| kseg3      | 0xE000_0000-<br>0xFFFF_FFFF | Controlled by K23 field (bits 30:28) of the <i>Config</i> register. Refer to Table 4.9 for the encoding.     |

The FMT performs a simple translation to map from virtual addresses to physical addresses. This mapping is shown in Figure 4.12. When *ERL*=1, useg and kuseg become unmapped and uncached. The *ERL* behavior is the same as if there was a TLB. The *ERL* mapping is shown in Figure 4.13.

The *ERL* bit is usually never asserted by software. It is asserted by hardware after a Reset, SoftReset or NMI. See 5.8 "Exception Descriptions" on page 115 for further information on exceptions.



Figure 4.12 FMT Memory Map (ERL=0) in the microAptiv™ UP Processor Core



Figure 4.13 FMT Memory Map (ERL=1) in the microAptiv™ UP Processor Core

# 4.6 System Control Coprocessor

The System Control Coprocessor (CP0) is implemented as an integral part of microAptiv UP processor core and supports memory management, address translation, exception handling, and other privileged operations. Certain CP0 registers are used to support memory management. Refer to Chapter 6, "CP0 Registers of the microAptiv<sup>™</sup> UP Core" on page 135 for more information on the CP0 register set.

### Chapter 5

# Exceptions and Interrupts in the microAptiv<sup>™</sup> UP Core

The microAptiv<sup>™</sup> UP processor core receives exceptions from a number of sources, including misses in the translation lookaside buffer (TLB), arithmetic overflows, I/O interrupts, and system calls. When the CPU detects one of these exceptions, the normal sequence of instruction execution is suspended and the processor enters kernel mode.

In kernel mode the core disables interrupts and forces execution of a software exception processor (called a handler) located at a specific address. The handler saves the context of the processor, including the contents of the program counter, the current operating mode, and the status of the interrupts (enabled or disabled). This context is saved so it can be restored when the exception has been serviced.

When an exception occurs, the core loads the *Exception Program Counter (EPC)* register with a location where execution can restart after the exception has been serviced. Most exceptions are *precise*, which mean that *EPC* can be used to identify the instruction that caused the exception. For precise exceptions, the restart location in the *EPC* register is the address of the instruction that caused the exception or, if the instruction was executing in a branch delay slot, the address of the branch instruction immediately preceding the delay slot. To distinguish between the two, software must read the BD bit in the CP0 *Cause* register. Bus error exceptions and CP2 exceptions may be imprecise. For imprecise exceptions the instruction that caused the exception cannot be identified.

This chapter contains the following sections:

- Section 5.1 "Exception Conditions"
- Section 5.2 "Exception Priority"
- Section 5.3 "Interrupts"
- Section 5.4 "GPR Shadow Registers"
- Section 5.5 "Exception Vector Locations"
- Section 5.6 "General Exception Processing"
- Section 5.7 "Debug Exception Processing"
- Section 5.8 "Exception Descriptions"
- Section 5.9 "Exception Handling and Servicing Flowcharts"

# 5.1 Exception Conditions

When an exception condition occurs, the instruction causing the exception and all those that follow it in the pipeline are cancelled ("flushed"). Accordingly, any stall conditions and any later exception conditions that might have referenced this instruction are inhibited—obviously there is no benefit in servicing stalls for a cancelled instruction.

When an exception condition is detected on an instruction fetch, the core aborts that instruction and all instructions that follow. When this instruction reaches the W stage, various CP0 registers are written with the exception state, change the current program counter (PC) to the appropriate exception vector address, and clearing the exception bits of earlier pipeline stages.

This implementation allows all preceding instructions to complete execution and prevents all subsequent instructions from completing. Thus, the value in the *EPC* (*ErrorEPC* for errors, or *DEPC* for debug exceptions) is sufficient to restart execution. It also ensures that exceptions are taken in the order of execution; an instruction taking an exception may itself be killed by an instruction further down the pipeline that takes an exception in a later cycle.

# **5.2 Exception Priority**

Table 5.1 contains a list and a brief description of all exception conditions, The exceptions are listed in the order of their relative priority, from highest priority (Reset) to lowest priority. When several exceptions occur simultaneously, the exception with the highest priority is taken.

| Exception                       | Description                                                                                                                                                                                                                                                                                                 |  |  |  |
|---------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|
| Reset                           | Assertion of SI_ColdReset signal.                                                                                                                                                                                                                                                                           |  |  |  |
| Soft Reset                      | Assertion of SI_Reset signal.                                                                                                                                                                                                                                                                               |  |  |  |
| DSS                             | EJTAG Debug Single Step.                                                                                                                                                                                                                                                                                    |  |  |  |
| DINT                            | EJTAG Debug Interrupt. Caused by the assertion of the external <i>EJ_DINT</i> input, or by setting the <i>EjtagBrk</i> bit in the <i>ECR</i> register.                                                                                                                                                      |  |  |  |
| NMI                             | Asserting edge of SI_NMI signal.                                                                                                                                                                                                                                                                            |  |  |  |
| Machine Check                   | TLB write that conflicts with an existing entry.                                                                                                                                                                                                                                                            |  |  |  |
| Interrupt                       | Assertion of unmasked hardware or software interrupt signal.                                                                                                                                                                                                                                                |  |  |  |
| Deferred Watch                  | Deferred Watch (unmasked by K DM->!(K DM) transition).                                                                                                                                                                                                                                                      |  |  |  |
| DIB                             | EJTAG debug hardware instruction break matched.                                                                                                                                                                                                                                                             |  |  |  |
| WATCH                           | A reference to an address in one of the watch registers (fetch).                                                                                                                                                                                                                                            |  |  |  |
| AdEL                            | Fetch address alignment error.<br>User-mode fetch reference to kernel address.                                                                                                                                                                                                                              |  |  |  |
| TLBL                            | Fetch TLB miss.<br>Fetch TLB hit to page with V=0.                                                                                                                                                                                                                                                          |  |  |  |
| TLB Execute-Inhibit             | An instruction fetch matched a valid TLB entry that had the XI bit set.                                                                                                                                                                                                                                     |  |  |  |
| ICache/SPRAM Parity Error       | Parity error on I-Cache/I-SPRAM access                                                                                                                                                                                                                                                                      |  |  |  |
| IBE                             | Instruction fetch bus error.                                                                                                                                                                                                                                                                                |  |  |  |
| Instruction Validity Exceptions | An instruction could not be completed because it was not allowed access to the required resources (Coprocessor Unusable) or was illegal (Reserved Instruc-<br>tion). If exceptions occur on the same instruction, the Coprocessor Unusable Exception take priority over the Reserved Instruction Exception. |  |  |  |
| Tr                              | Execution of a trap (when trap condition is true).                                                                                                                                                                                                                                                          |  |  |  |
| DDBL / DDBS                     | EJTAG Data Address Break (address only) or EJTAG Data Value Break on Store (address and value).                                                                                                                                                                                                             |  |  |  |
| WATCH                           | A reference to an address in one of the watch registers (data).                                                                                                                                                                                                                                             |  |  |  |

### **Table 5.1 Priority of Exceptions**

| Exception                 | Description                                                                  |
|---------------------------|------------------------------------------------------------------------------|
| AdEL                      | Load address alignment error.<br>User mode load reference to kernel address. |
| AdES                      | Store address alignment error.<br>User mode store to kernel address.         |
| TLBL                      | Load TLB miss (TLB-based MMU).<br>Load TLB hit to page with V=0              |
| TLBS                      | Store TLB miss.<br>Store TLB hit to page with V=0.                           |
| TLB Read-Inhibit          | A data read access matched a valid TLB entry whose RI bit is set.            |
| TLB Mod                   | Store to TLB page with D=0.                                                  |
| DCache/SPRAM Parity Error | Parity error on D-Cache/D-SPRAM access.                                      |
| DBE                       | Load or store bus error.                                                     |
| DDBL                      | EJTAG data hardware breakpoint matched in load data compare.                 |
| CBrk                      | EJTAG complex breakpoint.                                                    |

### Table 5.1 Priority of Exceptions (Continued)

# 5.3 Interrupts

In the MIPS32® Release 1 architecture, support for exceptions included two software interrupts, six hardware interrupts, and a special-purpose timer interrupt. The timer interrupt was provided external to the core and was typically combined with hardware interrupt 5 in a system-dependent manner. Interrupts were handled either through the general exception vector (offset 0x180) or the special interrupt vector (0x200), based on the value of *CauseIV*. Software was required to prioritize interrupts as a function of the *CauseIV* bits in the interrupt handler prologue.

Release 2 of the Architecture, implemented by the microAptiv UP core, adds a number of upward-compatible extensions to the Release 1 interrupt architecture, including support for vectored interrupts and the implementation of a new interrupt mode that permits the use of an external interrupt controller.

The microAptiv UP core also includes the Microcontroller Application-Specific Extension (MCU ASE) that provides enhanced interrupt delivery and interrupt-latency reduction.

# 5.3.1 Interrupt Modes

The microAptiv UP core includes support for three interrupt modes, as defined by Release 2 of the Architecture:

- Interrupt Compatibility mode, in which the behavior of the microAptiv UP is identical to the behavior of a Release 1 implementations.
- Vectored Interrupt (VI) mode, which adds the ability to prioritize and vector interrupts to a handler dedicated to that interrupt, and to assign a GPR shadow set for use during interrupt processing. The presence of this mode is denoted by the *VInt* bit in the *Config3* register. Although this mode is architecturally optional, it is always present on the microAptiv UP processor, so the *VInt* bit will always read as a 1.
- External Interrupt Controller (EIC) mode, which redefines the way interrupts are handled to provide full support for an external interrupt controller that handles prioritization and vectoring of interrupts. As with VI mode, this mode is architecturally optional. The presence of this mode is denoted by the *VEIC* bit in the *Config3* register. On

the microAptiv UP core, the VEIC bit is set externally by the static input, SI\_EICPresent, to allow system logic to indicate the presence of an external interrupt controller.

Following reset, the microAptiv UP processor defaults to Compatibility mode, which is fully compatible with all implementations of Release 1 of the Architecture.

Table 5.2 shows the current interrupt mode of the processor as a function of the Coprocessor 0 register fields that can affect the mode.

| Status <sub>BEV</sub> | Cause <sub>IV</sub> | IntCtI <sub>VS</sub> | Config3 <sub>VINT</sub> | Config3 <sub>VEIC</sub> | Interrupt Mode                                                                                                                                     |
|-----------------------|---------------------|----------------------|-------------------------|-------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------|
| 1                     | х                   | Х                    | х                       | х                       | Compatibly                                                                                                                                         |
| х                     | 0                   | х                    | х                       | х                       | Compatibility                                                                                                                                      |
| х                     | х                   | =0                   | х                       | х                       | Compatibility                                                                                                                                      |
| 0                     | 1                   | ≠0                   | 1                       | 0                       | Vectored Interrupt                                                                                                                                 |
| 0                     | 1                   | ≠0                   | х                       | 1                       | External Interrupt Controller                                                                                                                      |
| 0                     | 1                   | ≠0                   | 0                       | 0                       | Can't happen - IntCtl <sub>VS</sub> can not be non-zero if neither<br>Vectored Interrupt nor External Interrupt Controller mode<br>is implemented. |
| "x'                   | ' den               | otes de              | on't c                  | eare                    |                                                                                                                                                    |

Table 5.2 Interrupt Modes

5.3.1.1 Interrupt Compatibility Mode

This is the default interrupt mode for the processor and is entered when a Reset exception occurs. In this mode, interrupts are non-vectored and dispatched though exception vector offset 16#180 (if  $Cause_{IV} = 0$ ) or vector offset 16#200 (if  $Cause_{IV} = 1$ ). This mode is in effect if any of the following conditions are true:

- Cause<sub>IV</sub> = 0
- Status<sub>BEV</sub> = 1
- $IntCtl_{VS} = 0$ , which would be the case if vectored interrupts are not implemented, or have been disabled.

Here is a typical software handler for interrupt compatibility mode:

```
/*
 * Assumptions:
 * - Cause<sub>IV</sub> = 1 (if it were zero, the interrupt exception would have to
 * be isolated from the general exception vector before getting
 * here)
 * - GPRs k0 and k1 are available (no shadow register switches invoked in
 * compatibility mode)
 * - The software priority is IP9..IP0 (HW7..HW0, SW1..SW0)
 *
 * Location: Offset 0x200 from exception base
 */
```

```
IVexception:
                                 /* Read Cause register for IP bits */
   mfc0 k0, C0 Cause
   mfc0 k1, C0_Status /* and Status register for IM bits */
   andi k0, k0, M CauseIM /* Keep only IP bits from Cause */
   and k0, k0, k1 /\ast and mask with IM bits \ast/
   beq k0, zero, Dismiss /* no bits set - spurious interrupt */
   clz k0, k0 /* Find first bit set, IP9..IP0; k0 = 14..23 */

xori k0, k0, 0x17 /* 14..23 => 9..0 */

sll k0, k0, VS /* Shift to emulate software IntCtl<sub>VS</sub> */

la k1, VectorBase /* Get base of 10 interrupt vectors */

addu k0, k0, k1 /* Compute target from base and offset */

ir k0 /* Jump to specific exception methins */
   jr
         k0
                                 /* Jump to specific exception routine */
   nop
/*
 * Each interrupt processing routine processes a specific interrupt, analogous
 * to those reached in VI or EIC interrupt mode. Since each processing routine
 * is dedicated to a particular interrupt line, it has the context to know
 * which line was asserted. Each processing routine may need to look further
 * to determine the actual source of the interrupt if multiple interrupt requests
 * are ORed together on a single IP line. Once that task is performed, the
 * interrupt may be processed in one of two ways:
 * - Completely at interrupt level (e.g., a simply UART interrupt). The
 *
     SimpleInterrupt routine below is an example of this type.
 * - By saving sufficient state and re-enabling other interrupts. In this
 *
    case the software model determines which interrupts are disabled during
 *
    the processing of this interrupt. Typically, this is either the single
 *
    StatusIM bit that corresponds to the interrupt being processed, or some
 *
    collection of other \text{Status}_{\text{IM}} bits so that "lower" priority interrupts are
     also disabled. The NestedInterrupt routine below is an example of this type.
 */
SimpleInterrupt:
/*
 * Process the device interrupt here and clear the interupt request
* at the device. In order to do this, some registers may need to be
 * saved and restored. The coprocessor 0 state is such that an ERET
 * will simple return to the interrupted code.
 */
   eret
                                 /* Return to interrupted code */
NestedException:
/*
 * Nested exceptions typically require saving the EPC and Status registers,
 * any GPRs that may be modified by the nested exception routine, disabling
 * the appropriate IM bits in Status to prevent an interrupt loop, putting
 * the processor in kernel mode, and re-enabling interrupts. The sample code
 * below can not cover all nuances of this processing and is intended only
 * to demonstrate the concepts.
 */
    /* Save GPRs here, and setup software context */
   mfc0k0, C0_EPC/* Get restart address */sk0, EPCSave/* Save in memory */
   s k0, EPCSave /* Save in memory */

mfc0 k0, C0_Status /* Get Status value */

sw k0, StatusSave /* Save in memory */

li k1, ~IMbitsToClear /* Get Im bits to clear for this interrupt */
```

```
/*
                                  this must include at least the IM bit */
                             /*
                                  for the current interrupt, and may include */
                             /*
                                 others */
   and
         k0, k0, k1
                                /* Clear bits in copy of Status */
   ins
         k0, zero, S StatusEXL, (W StatusKSU+W StatusERL+W StatusEXL)
                                /* Clear KSU, ERL, EXL bits in k0 */
                                /* Modify mask, switch to kernel mode, */
  mtc0
        k0, C0 Status
                                /*
                                   re-enable interrupts */
   /*
    * Process interrupt here, including clearing device interrupt.
    * In some environments this may be done with a thread running in
    * kernel or user mode. Such an environment is well beyond the scope of
    * this example.
    */
/*
* To complete interrupt processing, the saved values must be restored
* and the original interrupted code restarted.
*/
   di
                             /* Disable interrupts - may not be required */
   lw
         k0, StatusSave
                             /* Get saved Status (including EXL set) */
         k1, EPCSave
  1
                            /* and EPC */
                            /* Restore the original value */
  mtc0 k0, C0_Status
                             /* and EPC */
  mtc0 k1, C0 EPC
   /* Restore GPRs and software state */
   eret
                             /* Dismiss the interrupt */
```

#### 5.3.1.2 Vectored Interrupt (VI) Mode

In Vectored Interrupt (VI) mode, a priority encoder prioritizes pending interrupts and generates a vector which can be used to direct each interrupt to a dedicated handler routine. This mode also allows each interrupt to be mapped to a GPR shadow register set for use by the interrupt handler. VI mode is in effect when all the following conditions are true:

- Config3<sub>VInt</sub> = 1
- $Config3_{VEIC} = 0$
- $IntCtl_{VS} \neq 0$
- Cause<sub>IV</sub> = 1
- Status<sub>BEV</sub> = 0

In VI interrupt mode, the eight hardware interrupts are interpreted as individual hardware interrupt requests. The timer interrupt is combined in a system-dependent way (external to the core) with the hardware interrupts (the interrupt with which they are combined is indicated by the *PTI* field in *IntCtll*) to provide the appropriate relative priority of the timer interrupt with that of the hardware interrupts. The processor interrupt logic ANDs each of the *Cause*<sub>*IP*</sub> bits with the corresponding *Status*<sub>*IM*</sub> bits. If any of these values is 1, and if interrupts are enabled (*Status*<sub>*IE*</sub> = 1,

 $Status_{EXL} = 0$ , and  $Status_{ERL} = 0$ ), an interrupt is signaled and a priority encoder scans the values in the order shown in Table 5.3.

| Relative<br>Priority | Interrupt<br>Type | Interrupt<br>Source | Interrupt<br>Request<br>Calculated From | Vector Number<br>Generated by<br>Priority Encoder |
|----------------------|-------------------|---------------------|-----------------------------------------|---------------------------------------------------|
| Highest Priority     | Hardware          | HW7                 | IP9 and IM9                             | 9                                                 |
|                      |                   | HW6                 | IP8 and IM8                             | 8                                                 |
|                      |                   | HW5                 | IP7 and IM7                             | 7                                                 |
|                      |                   | HW4                 | IP6 and IM6                             | 6                                                 |
|                      |                   | HW3                 | IP5 and IM5                             | 5                                                 |
|                      |                   | HW2                 | IP4 and IM4                             | 4                                                 |
|                      |                   | HW1                 | IP3 and IM3                             | 3                                                 |
|                      |                   | HW0                 | IP2 and IM2                             | 2                                                 |
|                      | Software          | SW1                 | IP1 and IM1                             | 1                                                 |
| Lowest Priority      |                   | SW0                 | IP0 and IM0                             | 0                                                 |

 Table 5.3 Relative Interrupt Priority for Vectored Interrupt Mode

The priority order places a relative priority on each hardware interrupt and places the software interrupts at a priority lower than all hardware interrupts. When the priority encoder finds the highest priority pending interrupt, it outputs an encoded vector number that is used in the calculation of the handler for that interrupt, as described below. This is shown pictorially in Figure 5.1.



Figure 5.1 Interrupt Generation for Vectored Interrupt Mode

A typical software handler for vectored interrupt mode bypasses the entire sequence of code following the IV exception label shown for the compatibility mode handler above. Instead, the hardware performs the prioritization, dispatching directly to the interrupt processing routine. Unlike the compatibility mode examples, a vectored interrupt handler may take advantage of a dedicated GPR shadow set to avoid saving any registers. As such, the Simple Interrupt code shown above need not save the GPRs.

A nested interrupt is similar to that shown for compatibility mode, but may also take advantage of running the nested exception routine in the GPR shadow set dedicated to the interrupt or in another shadow set. Such a routine might look as follows:

```
NestedException:
/*
 * Nested exceptions typically require saving the EPC, Status and SRSCtl registers,
* setting up the appropriate GPR shadow set for the routine, disabling
* the appropriate IM bits in Status to prevent an interrupt loop, putting
 * the processor in kernel mode, and re-enabling interrupts. The sample code
 * below can not cover all nuances of this processing and is intended only
 * to demonstrate the concepts.
 */
   /* Use the current GPR shadow set, and setup software context */
          k0, C0 EPC
   mfc0
                            /* Get restart address */
          k0, EPCSave
                              /* Save in memory */
   s
          k0, C0 Status
                              /* Get Status value */
   mfc0
          k0, StatusSave
                              /* Save in memory */
   sw
          k0, C0_SRSCtl
                              /* Save SRSCtl if changing shadow sets */
   mfc0
          k0, SRSCtlSave
   sw
```

```
k1, ~IMbitsToClear /* Get Im bits to clear for this interrupt */
   1 i
                             /* this must include at least the IM bit */
                             /* for the current interrupt, and may include */
                             /* others */
   and
         k0, k0, k1
                                /* Clear bits in copy of Status */
   /* If switching shadow sets, write new value to {\tt SRSCtl}_{PSS} here */
         k0, zero, S StatusEXL, (W StatusKSU+W StatusERL+W StatusEXL)
   ins
                                 /* Clear KSU, ERL, EXL bits in k0 */
   mtc0
         k0, C0 Status
                                 /* Modify mask, switch to kernel mode, */
                                 /*
                                      re-enable interrupts */
   /*
    * If switching shadow sets, clear only KSU above, write target
    * address to EPC, and do execute an eret to clear EXL, switch
    * shadow sets, and jump to routine
    */
   /* Process interrupt here, including clearing device interrupt */
/*
* To complete interrupt processing, the saved values must be restored
* and the original interrupted code restarted.
*/
   di
                             /* Disable interrupts - may not be required */
   lw
         k0, StatusSave
                            /* Get saved Status (including EXL set) */
                            /* and EPC */
   1
         k1, EPCSave
   mtc0
         k0, C0 Status
                            /* Restore the original value */
   lw
         k0, SRSCtlSave
                            /* Get saved SRSCtl */
                             /* and EPC */
   mtc0
         k1, CO EPC
         k0, C0_SRSCtl
                             /* Restore shadow sets */
   mtc0
   ehb
                             /* Clear hazard */
   eret
                             /* Dismiss the interrupt */
```

#### 5.3.1.3 External Interrupt Controller Mode

External Internal Interrupt Controller Mode redefines the way that the processor interrupt logic is configured to provide support for an external interrupt controller. The interrupt controller is responsible for prioritizing all interrupts, including hardware, software, timer, and performance counter interrupts, and directly supplying to the processor the priority level and vector number of the highest priority interrupt. EIC interrupt mode is in effect if all of the following conditions are true:

- Config3<sub>VEIC</sub> = 1
- $IntCtl_{VS} \neq 0$
- Cause<sub>IV</sub> = 1
- Status<sub>BEV</sub>=0

In EIC interrupt mode, the processor sends the state of the software interrupt requests ( $Cause_{IP1..IP0}$ ), the timer interrupt request ( $Cause_{TI}$ ), the performance counter interrupt request ( $Cause_{PCI}$ ) and Fast Debug Channel Interrupt ( $Cause_{FDCI}$ ) to the external interrupt controller, where it prioritizes these interrupts in a system-dependent way with other hardware interrupts. The interrupt controller can be a hard-wired logic block, or it can be configurable based on control and status registers. This allows the interrupt controller to be more specific or more general as a function of the system environment and needs.

The external interrupt controller prioritizes its interrupt requests and produces the priority level and the vector number of the highest priority interrupt to be serviced. The priority level, called the Requested Interrupt Priority Level (RIPL), is an 8-bit encoded value in the range 0..255, inclusive. A value of 0 indicates that no interrupt requests are pending. The values 1..255 represent the lowest (1) to highest (255) RIPL for the interrupt to be serviced. The interrupt controller passes this value on the 8 hardware interrupt lines, which are treated as an encoded value in EIC interrupt mode. There are two implementation options available for the vector offset:

- 1. The first option is to send a separate vector number along with the RIPL to the processor.
- 2. A second option is to send an entire vector offset along with the RIPL to the processor. This option is enabled through the core's configuration GUI, and it is not affected by software.

The microAptiv UP core does not support the option to treat the RIPL value as the vector number for the processor.

Status<sub>IPL</sub> (which overlays StatusI<sub>M9..IM2</sub>) is interpreted as the Interrupt Priority Level (IPL) at which the processor is currently operating (with a value of zero indicating that no interrupt is currently being serviced). When the interrupt controller requests service for an interrupt, the processor compares *RIPL* with *Status*<sub>IPL</sub> to determine if the requested interrupt has higher priority than the current *IPL*. If RIPL is strictly greater than *Status*<sub>IPL</sub>, and interrupts are enabled (*Status*<sub>IE</sub> = 1, *Status*<sub>EXL</sub> = 0, and *Status*<sub>ERL</sub> = 0) an interrupt request is signaled to the pipeline. When the processor starts the interrupt exception, it loads RIPL into *Cause*<sub>RIPL</sub> (which overlays *Cause*<sub>IP9..IP2</sub>) and signals the external interrupt controller to notify it that the request is being serviced. Because *Cause*<sub>RIPL</sub> is only loaded by the processor when an interrupt exception is signaled, it is available to software during interrupt processing. The vector number that the EIC passes to the core is combined with the *IntCtl*<sub>VS</sub> to determine where the interrupt service routine is located. The vector number is not stored in any software-visible registers.

In EIC interrupt mode, the external interrupt controller is also responsible for supplying the GPR shadow set number to use when servicing the interrupt. As such, the *SRSMap* register is not used in this mode, and the mapping of the vectored interrupt to a GPR shadow set is done by programming (or designing) the interrupt controller to provide the correct GPR shadow set number when an interrupt is requested. When the processor loads an interrupt request into  $Cause_{RIPL}$ , it also loads the GPR shadow set number into  $SRSCtl_{EICSS}$ , which is copied to  $SRSCtl_{CSS}$  when the interrupt is serviced.

The operation of EIC interrupt mode is shown pictorially in Figure 5.2.



Figure 5.2 Interrupt Generation for External Interrupt Controller Interrupt Mode

A typical software handler for EIC interrupt mode bypasses the entire sequence of code following the IV exception label shown for the compatibility-mode handler above. Instead, the hardware performs the prioritization, dispatching directly to the interrupt processing routine. Unlike the compatibility mode examples, an EIC interrupt handler may take advantage of a dedicated GPR shadow set to avoid saving any registers. As such, the Simple Interrupt code shown above need not save the GPRs.

A nested interrupt is similar to that shown for compatibility mode, but may also take advantage of running the nested exception routine in the GPR shadow set dedicated to the interrupt or in another shadow set. It also need only copy *Cause*<sub>RIPL</sub> to *Status*<sub>IPL</sub> to prevent lower priority interrupts from interrupting the handler. Such a routine might look as follows:

```
NestedException:
/*
 * Nested exceptions typically require saving the EPC, Status, and SRSCtl registers,
 * setting up the appropriate GPR shadow set for the routine, disabling
 * the appropriate IM bits in Status to prevent an interrupt loop, putting
 * the processor in kernel mode, and re-enabling interrupts. The sample code
 * below can not cover all nuances of this processing and is intended only
 *
  to demonstrate the concepts.
 */
   /* Use the current GPR shadow set, and setup software context */
   mfc0 k1, C0 Cause /* Read Cause to get RIPL value */
                             /* Get restart address */
   mfc0
          k0, C0 EPC
          k1, k1, S_CauseRIPL /* Right justify RIPL field */
   srl
          k0, EPCSave
                             /* Save in memory */
   s
   mfc0
          k0, C0 Status
                              /* Get Status value */
```

```
k0, StatusSave
                          /* Save in memory */
   SW
         k0, k1, S_StatusIPL, 6 /* Set IPL to RIPL in copy of Status */
   ins
   mfc0
         k1, C0 SRSCtl /* Save SRSCtl if changing shadow sets */
   SW
         k1, SRSCtlSave
   /* If switching shadow sets, write new value to SRSCtl<sub>PSS</sub> here */
   ins
         k0, zero, S_StatusEXL, (W_StatusKSU+W_StatusERL+W_StatusEXL)
                                 /* Clear KSU, ERL, EXL bits in k0 */
                                 /* Modify IPL, switch to kernel mode, */
   mtc0 k0, C0_Status
                                 /* re-enable interrupts */
   /*
    * If switching shadow sets, clear only KSU above, write target
    * address to EPC, and do execute an eret to clear EXL, switch
    * shadow sets, and jump to routine
    */
   /* Process interrupt here, including clearing device interrupt */
/*
* The interrupt completion code is identical to that shown for VI mode above.
```

## 5.3.2 Generation of Exception Vector Offsets for Vectored Interrupts

For vectored interrupts (in either VI or EIC interrupt mode), a vector number is produced by the interrupt control logic. This number is combined with *IntCtIVS* to create the interrupt offset, which is added to 16#200 to create the exception vector offset. For VI interrupt mode, the vector number is in the range 0..9, inclusive. For EIC interrupt mode, the vector number is in the range 0..63, inclusive. The *IntCtIVS* field specifies the spacing between vector locations. If this value is zero (the default reset state), the vector spacing is zero and the processor reverts to Interrupt Compatibility Mode. A non-zero value enables vectored interrupts, and Table 5.4 shows the exception vector offset for a representative subset of the vector numbers and values of the *IntCtIVS* field.

|               | Value of IntCtl $_{\rm VS}$ Field |         |         |         |         |  |
|---------------|-----------------------------------|---------|---------|---------|---------|--|
| Vector Number | 2#00001                           | 2#00010 | 2#00100 | 2#01000 | 2#10000 |  |
| 0             | 16#0200                           | 16#0200 | 16#0200 | 16#0200 | 16#0200 |  |
| 1             | 16#0220                           | 16#0240 | 16#0280 | 16#0300 | 16#0400 |  |
| 2             | 16#0240                           | 16#0280 | 16#0300 | 16#0400 | 16#0600 |  |
| 3             | 16#0260                           | 16#02C0 | 16#0380 | 16#0500 | 16#0800 |  |
| 4             | 16#0280                           | 16#0300 | 16#0400 | 16#0600 | 16#0A00 |  |
| 5             | 16#02A0                           | 16#0340 | 16#0480 | 16#0700 | 16#0C00 |  |
| 6             | 16#02C0                           | 16#0380 | 16#0500 | 16#0800 | 16#0E00 |  |
| 7             | 16#02E0                           | 16#03C0 | 16#0580 | 16#0900 | 16#1000 |  |
|               |                                   | •       |         |         |         |  |
|               |                                   | •       |         |         |         |  |
| 61            | 16#09A0                           | 16#1140 | 16#2080 | 16#3F00 | 16#7C00 |  |
| 62            | 16#09C0                           | 16#1180 | 16#2100 | 16#4000 | 16#7E00 |  |
| 63            | 16#09E0                           | 16#11C0 | 16#2180 | 16#4100 | 16#8000 |  |

Table 5.4 Exception Vector Offsets for Vectored Interrupts

The general equation for the exception vector offset for a vectored interrupt is:

```
vectorOffset \leftarrow 16#200 + (vectorNumber \times (IntCtl_{VS} \parallel 2#00000))
```

When using large vector spacing and EIC mode, the offset value can overlap with bits that are specified in the EBase register. Software must ensure that any overlapping bits are specified as 0 in EBase. This implementation ORs together the offset and base registers, but it is architecturally undefined and software should not rely on this behavior.

Although there are 255 EIC priority interrupts, only 64 vectors are provided. There is no one-to-one mapping for each EIC interrupt to its interrupt vector. The 255 priority interrupts will share the 64 interrupt vectors as specified by the  $SI\_EICVector[5:0]$  input pins. However, as mentioned in option 2 of Section 5.3.1.3 "External Interrupt Controller Mode", the  $SI\_Offset[17:1]$  input pins can be used to provide each EIC interrupt with a unique interrupt handler location.

### 5.3.3 MCU ASE Enhancement for Interrupt Handling

The MCU ASE extends the MIPS/microMIPS Architecture with a set of new features designed for the microcontroller market. The MCU ASE contains enhancements in two key areas: interrupt delivery and interrupt latency. For more details, refer to the *The MCU Privileged Resource Architecture* chapter of the *MIPS*® *Architecture for Programmers Volume IV-h: The MCU Application-Specific Extension to the MIPS32 Architecture* [10] or *MIPS*® *Architecture for Programmers Volume IV-h: The MCU Application-Specific Extension to the microMIPS32*<sup>TM</sup> *Architecture* [11].

### 5.3.3.1 Interrupt Delivery

The MCU ASE extends the number of hardware interrupt sources from 6 to 8. For legacy and vectored-interrupt mode, this represents 8 external interrupt sources. For EIC mode, the widened *IPL* and *RIPL* fields can now represent 256 external interrupt sources.

#### 5.3.3.2 Interrupt Latency Reduction

The MCU ASE includes a package of extensions to MIPS/microMIPS that decrease the latency of the processor's response to a signalled interrupt.

#### Interrupt Vector Prefetching

Normally on MIPS architecture processors, when an interrupt or exception is signalled, execution pipelines must be flushed before the interrupt/exception handler is fetched. This is necessary to avoid mixing the contexts of the interrupted/faulting program and the exception handler. The MCU ASE introduces a hardware mechanism in which the interrupt exception vector is prefetched whenever the interrupt signals change. The prefetch memory transaction occurs in parallel with the pipeline flush and exception prioritization. This decreases the overall latency of the execution of the interrupt handler's first instruction.

#### Automated Interrupt Prologue

The use of Shadow Register Sets avoids the software steps of having to save general-purpose registers before handling an interrupt.

The MCU ASE adds additional hardware logic that automatically saves some of the COP0 state in the stack and automatically updates some of the COP0 registers in preparation for interrupt handling.

#### Automated Interrupt Epilogue

A mirror to the Automated Prologue, this features automates the restoration of some of the COP0 registers from the stack and the preparation of some of the COP0 registers for returning to non-exception mode. This feature is implemented within the IRET instruction, which is introduced in this ASE.

#### Interrupt Chaining

An optional feature of the Automated Interrupt Epilogue, this feature allows handling a second interrupt after a primary interrupt is handled, without returning to non-exception mode (and the related pipeline flushes that would normally be necessary).

# 5.4 GPR Shadow Registers

Release 2 of the Architecture optionally removes the need to save and restore GPRs on entry to high priority interrupts or exceptions, and to provide specified processor modes with the same capability. This is done by introducing multiple copies of the GPRs, called *shadow sets*, and allowing privileged software to associate a shadow set with entry to kernel mode via an interrupt vector or exception. The normal GPRs are logically considered shadow set zero.

The number of GPR shadow sets is a build-time option on the microAptiv UP core. Although Release 2 of the Architecture defines a maximum of 16 shadow sets, the core allows one (the normal GPRs), two, four, eight or sixteen shadow sets. The highest number actually implemented is indicated by the *SRSCtl<sub>HSS</sub>* field. If this field is zero, only the normal GPRs are implemented.

Shadow sets are new copies of the GPRs that can be substituted for the normal GPRs on entry to kernel mode via an interrupt or exception. When a shadow set is bound to a kernel mode entry condition, reference to GPRs work exactly as one would expect, but they are redirected to registers that are dedicated to that condition. Privileged software may need to reference all GPRs in the register file, even specific shadow registers that are not visible in the current mode. The RDPGPR and WRPGPR instructions are used for this purpose. The *CSS* field of the *SRSCtl* register provides the number of the current shadow register set, and the *PSS* field of the *SRSCtl* register provides the number of the previous shadow register set (that which was current before the last exception or interrupt occurred).

If the processor is operating in VI interrupt mode, binding of a vectored interrupt to a shadow set is done by writing to the *SRSMap* register. If the processor is operating in EIC interrupt mode, the binding of the interrupt to a specific shadow set is provided by the external interrupt controller, and is configured in an implementation-dependent way. Binding of an exception or non-vectored interrupt to a shadow set is done by writing to the *ESS* field of the *SRSCtl* register. When an exception or interrupt occurs, the value of *SRSCtl<sub>CSS</sub>* is copied to *SRSCtl<sub>PSS</sub>*, and *SRSCtl<sub>CSS</sub>* is set to the value taken from the appropriate source. On an ERET, the value of *SRSCtl<sub>PSS</sub>* is copied back into *SRSCtl<sub>CSS</sub>* to restore the shadow set of the mode to which control returns. More precisely, the rules for updating the fields in the *SRSCtl* register on an interrupt or exception are as follows:

- 1. No field in the *SRSCtl* register is updated if any of the following conditions is true. In this case, steps 2 and 3 are skipped.
  - The exception is one that sets *Status<sub>ERL</sub>*: Reset, Soft Reset, or NMI.
  - The exception causes entry into EJTAG Debug Mode.
  - Status<sub>BEV</sub> = 1
  - Status<sub>EXL</sub> = 1

- 2. SRSCtl<sub>CSS</sub> is copied to SRSCtl<sub>PSS</sub>.
- 3. *SRSCtl<sub>CSS</sub>* is updated from one of the following sources:
  - The appropriate field of the *SRSMap* register, based on IPL, if the exception is an interrupt,  $Cause_{IV} = 1$ ,  $Config3_{VEIC} = 0$ , and  $Config3_{VInt} = 1$ . These are the conditions for a vectored interrupt.
  - The *EICSS* field of the *SRSCtl* register if the exception is an interrupt,  $Cause_{IV} = 1$ , and  $Config3_{VEIC} = 1$ . These are the conditions for a vectored EIC interrupt.
  - The *ESS* field of the *SRSCt*/register in any other case. This is the condition for a non-interrupt exception, or a non-vectored interrupt.

Similarly, the rules for updating the fields in the SRSCt/ register at the end of an exception or interrupt are as follows:

- 1. No field in the *SRSCtl* register is updated if any of the following conditions is true. In this case, step 2 is skipped.
  - A DERET is executed.
  - An ERET is executed with  $Status_{ERL} = 1$ .
- 2. SRSCtl<sub>PSS</sub> is copied to SRSCtl<sub>CSS</sub>.

These rules have the effect of preserving the *SRSCt*/register in any case of a nested exception or one which occurs before the processor has been fully initialize (*Status*<sub>BEV</sub> = 1).

Privileged software may switch the current shadow set by writing a new value into *SRSCtl<sub>PSS</sub>*, loading *EPC* with a target address, and doing an ERET.

# 5.5 Exception Vector Locations

The Reset, Soft Reset, and NMI exceptions are always vectored to location 16#BFC0.0000. EJTAG Debug exceptions are vectored to location 16#BFC0.0480, or to location 16#FF20.0200 if the ProbTrap bit is zero or one, respectively, in the *EJTAG\_Control\_register*. Addresses for all other exceptions are a combination of a vector offset and a vector base address. In Release 1 of the architecture, the vector base address was fixed. In Release 2 of the architecture, software is allowed to specify the vector base address via the *EBase* register for exceptions that occur when *Status*<sub>BEV</sub> equals 0. Table 5.5 gives the vector base address as a function of the exception. Note that the *IV* bit in the *Cause* register causes Interrupts to use a dedicated exception vector offset, rather than the general exception vector. For implementations of Release 2 of the Architecture,

Table 5.4 shows the offset from the base address in the case where  $Status_{BEV} = 0$  and  $Cause_{IV} = 1$ . For implementations of Release 1 of the architecture in which  $Cause_{IV} = 1$ , the vector offset is as if  $IntCt_{IVS}$  were 0. Table 5.7 combines these two tables into one that contains all possible vector addresses as a function of the state that can affect the

vector selection. To avoid complexity in the table, the vector address value assumes that the *EBase* register, as implemented in Release 2 devices, is not changed from its reset state and that  $IntCt_{IVS}$  is 0.

|                                                                            | Statu                                                                                                                                                                                            | JS <sub>BEV</sub> |
|----------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------|
| Exception                                                                  | 0                                                                                                                                                                                                | 1                 |
| Reset, Soft Reset, NMI                                                     | 16#BFC                                                                                                                                                                                           | 20.0000           |
| EJTAG Debug (with <i>ProbEn</i> = 0 in the <i>EJTAG Control Register</i> ) | 16#BFC0.0480                                                                                                                                                                                     |                   |
| EJTAG Debug (with <i>ProbEn</i> = 1 in the <i>EJTAG Control Register</i> ) | 16#FF2                                                                                                                                                                                           | 20.0200           |
| Cache/SPRAM Parity Error                                                   | $\begin{array}{c} \text{EBase}_{3130} \parallel 1 \parallel \\ \text{EBase}_{2812} \parallel 16\#000 \\ \text{Note that EBase}_{3130} \text{ have the} \\ \text{fixed value } 2\#10 \end{array}$ | 16#BFC0.0300      |
| Other                                                                      | For Release 1 of the architecture:<br>16#8000.0000<br>For Release 2 of the architecture:<br>EBase <sub>31.12</sub>    16#000<br>Note that EBase <sub>31.30</sub> have the<br>fixed value 2#10    | 16#BFC0.0200      |

Table 5.5 Exception Vector Base Addresses

## **Table 5.6 Exception Vector Offsets**

| Exception                                | Vector Offset                                                                                                            |
|------------------------------------------|--------------------------------------------------------------------------------------------------------------------------|
| TLB Refill, EXL = 0                      | 16#000                                                                                                                   |
| General Exception                        | 16#180                                                                                                                   |
| Interrupt, <i>Cause</i> <sub>IV</sub> =1 | 16#200 (In Release 2 implementa-<br>tions, this is the base of the vectored<br>interrupt table when $Status_{BEV} = 0$ ) |
| Reset, Soft Reset, NMI                   | None (Uses Reset Base Address)                                                                                           |

## **Table 5.7 Exception Vectors**

|                        |                       |                       |                     |                 | Vector                                                                                               |
|------------------------|-----------------------|-----------------------|---------------------|-----------------|------------------------------------------------------------------------------------------------------|
| Exception              | Status <sub>BEV</sub> | Status <sub>EXL</sub> | Cause <sub>IV</sub> | EJTAG<br>ProbEn | For Release 2 Implementations, assumes that EBase retains its reset state and that $IntCtl_{VS} = 0$ |
| Reset, Soft Reset, NMI | х                     | Х                     | х                   | х               | 16#BFC0.0000                                                                                         |
| EJTAG Debug            | Х                     | Х                     | х                   | 0               | 16#BFC0.0480                                                                                         |
| EJTAG Debug            | Х                     | Х                     | х                   | 1               | 16#FF20.0200                                                                                         |
| TLB Refill             | 0                     | 0                     | х                   | х               | 16#8000.0000                                                                                         |
| TLB Refill             | 0                     | 1                     | х                   | Х               | 16#8000.0180                                                                                         |
| TLB Refill             | 1                     | 0                     | х                   | х               | 16#BFC0.0200                                                                                         |
| TLB Refill             | 1                     | 1                     | х                   | х               | 16#BFC0.0380                                                                                         |

|                             |                       |                       |                     |                 | Vector                                                                                                               |
|-----------------------------|-----------------------|-----------------------|---------------------|-----------------|----------------------------------------------------------------------------------------------------------------------|
| Exception                   | Status <sub>BEV</sub> | Status <sub>EXL</sub> | Cause <sub>IV</sub> | EJTAG<br>ProbEn | For Release 2<br>Implementations, assumes<br>that EBase retains its reset<br>state and that IntCtl <sub>VS</sub> = 0 |
| Cache/SPRAM Parity<br>Error | 0                     | X                     | X                   | X               | 16#EBase[31:30]    2#1<br>   EBase[28:12]   <br>16#100                                                               |
| Cache/SPRAM Parity<br>Error | 1                     | Х                     | Х                   | Х               | 16#BFC0.0300                                                                                                         |
| Interrupt                   | 0                     | 0                     | 0                   | Х               | 16#8000.0180                                                                                                         |
| Interrupt                   | 0                     | 0                     | 1                   | Х               | 16#8000.0200                                                                                                         |
| Interrupt                   | 1                     | 0                     | 0                   | Х               | 16#BFC0.0380                                                                                                         |
| Interrupt                   | 1                     | 0                     | 1                   | Х               | 16#BFC0.0400                                                                                                         |
| All others                  | 0                     | Х                     | х                   | Х               | 16#8000.0180                                                                                                         |
| All others                  | 1                     | Х                     | Х                   | Х               | 16#BFC0.0380                                                                                                         |
|                             |                       | 'x' deno              | tes don't care      | ;<br>;          |                                                                                                                      |

Table 5.7 Exception Vectors (Continued)

# 5.6 General Exception Processing

With the exception of Reset, Soft Reset, NMI, cache error, and EJTAG Debug exceptions, which have their own special processing as described below, exceptions have the same basic processing flow:

• If the *EXL* bit in the *Status* register is zero, the *EPC* register is loaded with the PC at which execution will be restarted and the *BD* bit is set appropriately in the *Cause* register (see Table 6.27). The value loaded into the *EPC* register is dependent on whether the processor implements microMIPS, and whether the instruction is in the delay slot of a branch or jump which has delay slots. Table 5.8 shows the value stored in each of the CP0 PC registers, including *EPC*. For implementations of Release 2 of the Architecture if *Status<sub>BEV</sub>* = 0, the *CSS* field in the *SRSCtl* register is copied to the *PSS* field, and the *CSS* value is loaded from the appropriate source.

If the *EXL* bit in the *Status* register is set, the *EPC* register is not loaded and the *BD* bit is not changed in the *Cause* register. For implementations of Release 2 of the Architecture, the *SRSCtl* register is not changed.

| microMIPS<br>Implemented? | In Branch/Jump<br>Delay Slot? | Value stored in EPC/ErrorEPC/DEPC                                                                                                                                                                  |
|---------------------------|-------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| No                        | No                            | Address of the instruction                                                                                                                                                                         |
| No                        | Yes                           | Address of the branch or jump instruction (PC-4)                                                                                                                                                   |
| Yes                       | No                            | Upper bits of the address of the instruction, combined with the <i>ISA Mode</i> bit                                                                                                                |
| Yes                       | Yes                           | Upper bits of the branch or jump instruction (PC-2 or PC-4 depending on size of the instruction in the micro-MIPS ISA Mode and PC-4 in the 32-bit ISA Mode), combined with the <i>ISA Mode</i> bit |

 Table 5.8 Value Stored in EPC, ErrorEPC, or DEPC on an Exception

- The *CE* and *ExcCode* fields of the *Cause* registers are loaded with the values appropriate to the exception. The *CE* field is loaded, but not defined, for any exception type other than a coprocessor unusable exception.
- The EXL bit is set in the Status register.
- The processor is started at the exception vector.

The value loaded into *EPC* represents the restart address for the exception and need not be modified by exception handler software in the normal case. Software need not look at the *BD* bit in the *Cause* register unless it wishes to identify the address of the instruction that actually caused the exception.

Note that individual exception types may load additional information into other registers. This is noted in the description of each exception type below.

#### **Operation:**

```
/* If Status<sub>EXL</sub> is 1, all exceptions go through the general exception vector */
/* and neither EPC nor \texttt{Cause}_{BD} nor <code>SRSCtl</code> are modified */
if Status_{EXL} = 1 then
    vectorOffset \leftarrow 16#180
else
    if InstructionInBranchDelaySlot then
        EPC ← restartPC/* PC of branch/jump */
        \texttt{Cause}_{\texttt{BD}} \ \leftarrow \ \texttt{1}
    else
        EPC \leftarrow restartPC
                                            /* PC of instruction */
        Cause_{BD} \leftarrow 0
    endif
    /* Compute vector offsets as a function of the type of exception */
    NewShadowSet \leftarrow SRSCtl_{ESS} /* Assume exception, Release 2 only */
    if ExceptionType = TLBRefill then
        vectorOffset \leftarrow 16#000
    elseif (ExceptionType = Interrupt) then
        if (Cause_{IV} = 0) then
            vectorOffset ← 16#180
        else
            if (\text{Status}_{\text{BEV}} = 1) or (\text{IntCtl}_{\text{VS}} = 0) then
                vectorOffset \leftarrow 16#200
            else
                if Config3_{VEIC} = 1 then
                    VecNum \leftarrow Cause_{RIPL}
                    NewShadowSet \leftarrow SRSCtl<sub>EICSS</sub>
                else
                    NewShadowSet \leftarrow SRSMap<sub>IPL</sub>×<sub>4+3.IPL</sub>×<sub>4</sub>
                endif
                vectorOffset \leftarrow 16#200 + (VecNum × (IntCtl<sub>VS</sub> || 2#00000))
            endif /* if (Status_{\rm BEV} = 1) or (IntCtl_{\rm VS} = 0) then */
        endif /* if (Cause_{IV} = 0) then */
    endif /* elseif (ExceptionType = Interrupt) then */
    /* Update the shadow set information for an implementation of */
    /* Release 2 of the architecture */
    if ((ArchitectureRevision \geq 2) and (SRSCtl_{HSS} > 0) and (Status_{BEV} = 0) and
        (Status_{ERL} = 0)) then
        SRSCtl_{PSS} \leftarrow SRSCtl_{CSS}
```

```
SRSCtl_{CSS} \leftarrow NewShadowSet
    endif
endif /* if Status<sub>EXL</sub> = 1 then */
Cause_{CE} \leftarrow FaultingCoprocessorNumber
Cause_{ExcCode} \leftarrow ExceptionType
Status_{EXL} \leftarrow 1
/* Calculate the vector base address */
if Status_{BEV} = 1 then
    vectorBase \leftarrow 16#BFC0.0200
else
    if ArchitectureRevision \geq 2 then
        /* The fixed value of EBase_{1,30} forces the base to be in kseg0 or kseg1 */
        vectorBase \leftarrow EBase<sub>31.12</sub> || 16#000
    else
        vectorBase ← 16#8000.0000
    endif
endif
/* Exception PC is the sum of vectorBase and vectorOffset */
PC \leftarrow vectorBase_{.30} \parallel (vectorBase_{29.0} + vectorOffset_{29.0})
                                 /* No carry between bits 29 and 30 */
```

# 5.7 Debug Exception Processing

All debug exceptions have the same basic processing flow:

- The *DEPC* register is loaded with the program counter (PC) value at which execution will be restarted and the *DBD* bit is set appropriately in the *Debug* register. The value loaded into the *DEPC* register is the current PC if the instruction is not in the delay slot of a branch, or the PC-4 of the branch if the instruction is in the delay slot of a branch.
- The DSS, DBp, DDBL, DDBS, DIB, DINT, DIBImpr, DDBLImpr, and DDBSImpr bits in the Debug register are updated appropriately depending on the debug exception type.
- The Debug2 register is updated with additional information for complex breakpoints.
- Halt and Doze bits in the Debug register are updated appropriately.
- *DM* bit in the *Debug* register is set to 1.
- The processor is started at the debug exception vector.

The value loaded into *DEPC* represents the restart address for the debug exception and need not be modified by the debug exception handler software in the usual case. Debug software need not look at the *DBD* bit in the *Debug* register unless it wishes to identify the address of the instruction that actually caused the debug exception.

A unique debug exception is indicated through the DSS, DBp, DDBL, DDBS, DIB, DINT, DIBImpr, DDBLImpr, and DDBSImpr bits in the Debug register.

No other CP0 registers or fields are changed due to the debug exception, thus no additional state is saved.

#### **Operation:**

```
if InstructionInBranchDelaySlot then
     DEPC \leftarrow PC-4
     \text{Debug}_{\text{DBD}} \leftarrow 1
else
     DEPC \leftarrow PC
     Debug_{DBD} \leftarrow 0
endif
\text{Debug}_{\text{D* bits}} \leftarrow \text{DebugExceptionType}
Debug_{Halt} \leftarrow HaltStatusAtDebugException
Debug_{Doze} \leftarrow DozeStatusAtDebugException
\text{Debug}_{\text{DM}} \leftarrow 1
if EJTAGControlRegister_ProbTrap = 1 then
     PC \leftarrow 0xFF20_0200
else
     PC \leftarrow 0xBFC0 0480
endif
```

The same debug exception vector location is used for all debug exceptions. The location is determined by the Prob-Trap bit in the EJTAG Control register (ECR), as shown in Table 5.9.

#### Table 5.9 Debug Exception Vector Addresses

| ProbTrap bit in ECR<br>Register | Debug Exception Vector Address |
|---------------------------------|--------------------------------|
| 0                               | 0xBFC0_0480                    |
| 1                               | 0xFF20_0200 in dmseg           |

# 5.8 Exception Descriptions

The following subsections describe each of the exceptions listed in the same sequence as shown in Table 5.1.

# 5.8.1 Reset/SoftReset Exception

A reset exception occurs when the *SI\_ColdReset* signal is asserted to the processor; a soft reset occurs when the *SI\_Reset* signal is asserted. These exceptions are not maskable. When one of these exceptions occurs, the processor performs a full reset initialization, including aborting state machines, establishing critical state, and generally placing the processor in a state in which it can execute instructions from uncached, unmapped address space. On a Reset/Soft-Reset exception, the state of the processor is not defined, with the following exceptions:

- The Random register is initialized to the number of TLB entries 1.
- The Wired register is initialized to zero.
- The Config register is initialized with its boot state.
- The RP, BEV, TS, SR, NMI, and ERL fields of the Status register are initialized to a specified state.
- The I, R, and W fields of the WatchLo register are initialized to 0.

- The *ErrorEPC* register is loaded with PC-4 if the state of the processor indicates that it was executing an instruction in the delay slot of a branch. Otherwise, the *ErrorEPC* register is loaded with PC. Note that this value may or may not be predictable.
- PC is loaded with 0xBFC0\_0000.

#### **Cause Register ExcCode Value:**

None

#### **Additional State Saved:**

None

#### **Entry Vector Used:**

Reset (0xBFC0\_0000)

#### **Operation:**

```
Wired \leftarrow 0
Config ← ConfigurationState
Status_{RP} \leftarrow 0
\text{Status}_{\text{BEV}} \leftarrow 1
Status_{TS} \leftarrow 0
Status_{SR} \leftarrow 0/1 (depending on Reset or SoftReset)
Status_{NMT} \leftarrow 0
Status_{ERL} \leftarrow 1
WatchLo<sub>I</sub> \leftarrow 0
WatchLo<sub>R</sub> \leftarrow 0
WatchLo<sub>w</sub> \leftarrow 0
if InstructionInBranchDelaySlot then
    ErrorEPC \leftarrow PC - 4
else
    ErrorEPC \leftarrow PC
endif
PC \leftarrow 0xBFC0 0000
```

# 5.8.2 Debug Single Step Exception

A debug single step exception occurs after the CPU has executed one/two instructions in non-debug mode, when returning to non-debug mode after debug mode. One instruction is allowed to execute when returning to a non jump/branch instruction, otherwise two instructions are allowed to execute since the jump/branch and the instruction in the delay slot are executed as one step. Debug single step exceptions are enabled by the SSt bit in the Debug register, and are always disabled for the first one/two instructions after a DERET.

The DEPC register points to the instruction on which the debug single step exception occurred, which is also the next instruction to single step or execute when returning from debug mode. So the DEPC will not point to the instruction which has just been single stepped, but rather the following instruction. The DBD bit in the Debug register is never set for a debug single step exception, since the jump/branch and the instruction in the delay slot is executed in one step.

Exceptions occurring on the instruction(s) executed with debug single step exception enabled are taken even though debug single step was enabled. For a normal exception (other than reset), a debug single step exception is then taken on the first instruction in the normal exception handler. Debug exceptions are unaffected by single step mode, e.g. returning to a SDBBP instruction with debug single step exceptions enabled causes a debug software breakpoint

exception, and *DEPC* points to the SDBBP instruction. However, returning to an instruction (not jump/branch) just before the SDBBP instruction, causes a debug single step exception with the *DEPC* pointing to the SDBBP instruction.

To ensure proper functionality of single step, the debug single step exception has priority over all other exceptions, except reset and soft reset.

#### Debug Register Debug Status Bit Set

DSS

#### **Additional State Saved**

None

#### **Entry Vector Used**

Debug exception vector

# 5.8.3 Debug Interrupt Exception

A debug interrupt exception is either caused by the *EjtagBrk* bit in the *EJTAG Control* register (controlled through the TAP), or caused by the debug interrupt request signal to the CPU.

The debug interrupt exception is an asynchronous debug exception which is taken as soon as possible, but with no specific relation to the executed instructions. The *DEPC* register is set to the instruction where execution should continue after the debug handler is through. The *DBD* bit is set based on whether the interrupted instruction was executing in the delay slot of a branch.

#### Debug Register Debug Status Bit Set

DINT

**Additional State Saved** 

None

#### **Entry Vector Used**

Debug exception vector

# 5.8.4 Non-Maskable Interrupt (NMI) Exception

A non maskable interrupt exception occurs when the *SI\_NMI* signal is asserted to the processor. *SI\_NMI* is an edge sensitive signal - only one NMI exception will be taken each time it is asserted. An NMI exception occurs only at instruction boundaries, so it does not cause any reset or other hardware initialization. The state of the cache, memory, and other processor states are consistent and all registers are preserved, with the following exceptions:

- The BEV, TS, SR, NMI, and ERL fields of the Status register are initialized to a specified state.
- The *ErrorEPC* register is loaded with PC-4 if the state of the processor indicates that it was executing an instruction in the delay slot of a branch. Otherwise, the *ErrorEPC* register is loaded with PC.
- PC is loaded with 0xBFC0\_0000.

#### **Cause Register ExcCode Value:**

None

None

**Entry Vector Used:** 

Reset (0xBFC0 0000)

#### **Operation:**

# 5.8.5 Machine Check Exception

A machine check exception occurs when the processor detects an internal inconsistency. The following condition causes a machine check exception:

• The detection of multiple matching entries in the TLB (TLB-based MMU only). The core detects this condition on a TLB write and prevents the write from being completed. The TS bit in the *Status* register is set to indicate this condition. This bit is only a status flag and does not affect the operation of the device. Software clears this bit at the appropriate time. This condition is resolved by flushing the conflicting TLB entries. The TLB write can then be completed.

#### **Cause Register ExcCode Value:**

MCheck

#### **Additional State Saved:**

None

#### **Entry Vector Used:**

General exception vector (offset 0x180)

### 5.8.6 Interrupt Exception

The interrupt exception occurs when one or more of the eight hardware, two software, or timer interrupt requests is enabled by the *Status* register, and the interrupt input is asserted. See 5.3 "Interrupts" on page 98 for more details about the processing of interrupts.

#### **Register ExcCode Value:**

Int

| Register State | Value                                      |
|----------------|--------------------------------------------|
| CauseIP        | indicates the interrupts that are pending. |

#### Table 5.10 Register States an Interrupt Exception

#### **Entry Vector Used:**

See 5.3.2 "Generation of Exception Vector Offsets for Vectored Interrupts" on page 107 for the entry vector used, depending on the interrupt mode the processor is operating in.

## 5.8.7 Debug Instruction Break Exception

A debug instruction break exception occurs when an instruction hardware breakpoint matches an executed instruction. The *DEPC* register and *DBD* bit in the *Debug* register indicate the instruction that caused the instruction hardware breakpoint to match. This exception can only occur if instruction hardware breakpoints are implemented.

#### Debug Register Debug Status Bit Set:

DIB

#### **Additional State Saved:**

None

#### **Entry Vector Used:**

Debug exception vector

# 5.8.8 Watch Exception — Instruction Fetch or Data Access

The Watch facility provides a software debugging vehicle by initiating a watch exception when an instruction or data reference matches the address information stored in the *WatchHi* and *WatchLo* registers. A Watch exception is taken immediately if the *EXL* and *ERL* bits of the *Status* register are both zero and the *DM* bit of the *Debug* is also zero. If any of those bits is a one at the time that a watch exception would normally be taken, then the WP bit in the *Cause* register is set, and the exception is deferred until all three bits are zero. Software may use the WP bit in the *Cause* register to determine if the *EPC* register points at the instruction that caused the watch exception, or if the exception actually occurred while in kernel mode.

The Watch exception can occur on either an instruction fetch or a data access. Watch exceptions that occur on an instruction fetch have a higher priority than watch exceptions that occur on a data access.

#### **Register ExcCode Value:**

WATCH

| Register State      | Value                                                                                                                                                                                                                                                                                                                 |
|---------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Cause <sub>WP</sub> | Indicates that the watch exception was deferred until after $Status_{EXL}$ , $Status_{ERL}$ , and $Debug_{DM}$ were zero. This bit directly causes a watch exception, so software must clear this bit as part of the exception handler to prevent a watch exception loop at the end of the current handler execution. |

Table 5.11 Register States on a Watch Exception

#### **Entry Vector Used:**

General exception vector (offset 0x180)

# 5.8.9 Address Error Exception — Instruction Fetch/Data Access

An address error exception occurs on an instruction or data access when an attempt is made to execute one of the following:

- Fetch an instruction, load a word, or store a word that is not aligned on a word boundary
- Load or store a halfword that is not aligned on a halfword boundary
- Reference the kernel address space from user mode

Note that in the case of an instruction fetch that is not aligned on a word boundary, PC is updated before the condition is detected. Therefore, both *EPC* and *BadVAddr* point to the unaligned instruction address. In the case of a data access the exception is taken if either an unaligned address or an address that was inaccessible in the current processor mode was referenced by a load or store instruction.

#### **Cause Register ExcCode Value:**

AdEL: Reference was a load or an instruction fetch

AdES: Reference was a store

Additional State Saved:

#### Table 5.12 CP0 Register States on an Address Exception Error

| Register State          | Value           |
|-------------------------|-----------------|
| BadVAddr                | Failing address |
| Context <sub>VPN2</sub> | UNPREDICTABLE   |
| EntryHi <sub>VPN2</sub> | UNPREDICTABLE   |
| EntryLo0                | UNPREDICTABLE   |
| EntryLo1                | UNPREDICTABLE   |

#### **Entry Vector Used:**

General exception vector (offset 0x180)

# 5.8.10 TLB Refill Exception — Instruction Fetch or Data Access

During an instruction fetch or data access, a TLB refill exception occurs when no TLB entry matches a reference to a mapped address space and the *EXL* bit is 0 in the *Status* register. Note that this is distinct from the case in which an entry matches but has the valid bit off. In that case, a TLB Invalid exception occurs.

#### **Cause Register ExcCode Value:**

TLBL: Reference was a load or an instruction fetch

TLBS: Reference was a store

#### Additional State Saved:

| Register State | Value                                                                                                                             |
|----------------|-----------------------------------------------------------------------------------------------------------------------------------|
| BadVAddr       | failing address.                                                                                                                  |
| Context        | The BadVPN2 field contains $VA_{31:13}$ of the failing address.                                                                   |
| EntryHi        | The VPN2 field contains $VA_{31:13}$ of the failing address;<br>the ASID field contains the ASID of the reference that<br>missed. |
| EntryLo0       | UNPREDICTABLE                                                                                                                     |
| EntryLo1       | UNPREDICTABLE                                                                                                                     |

### Table 5.13 CP0 Register States on a TLB Refill Exception

#### Entry Vector Used:

TLB refill vector (offset 0x000) if  $Status_{EXL} = 0$  at the time of exception;

general exception vector (offset 0x180) if *Status<sub>EXL</sub>* = 1 at the time of exception

# 5.8.11 TLB Invalid Exception — Instruction Fetch or Data Access

During an instruction fetch or data access, a TLB invalid exception occurs in one of the following cases:

- No TLB entry matches a reference to a mapped address space; and the EXL bit is 1 in the Status register.
- A TLB entry matches a reference to a mapped address space, but the matched entry has the valid bit off.
- A TLB entry matches a reference to a mapped address space, but the reference is an instruction fetch and the matched entry has the execute inhibit (*XI*) bit on.
- A TLB entry matches a reference to a mapped address space, but the reference is a data load and the matched entry has the read inhibit (*RI*) bit on.
- The virtual address is greater than or equal to the bounds address in a FM-based MMU.

#### **Cause Register ExcCode Value:**

TLBL: Reference was a load or an instruction fetch

TLBS: Reference was a store

| Register State | Value                                                                                                                                     |
|----------------|-------------------------------------------------------------------------------------------------------------------------------------------|
| BadVAddr       | Failing address                                                                                                                           |
| Context        | The <i>BadVPN2</i> field contains $VA_{31:13}$ of the failing address.                                                                    |
| EntryHi        | The <i>VPN2</i> field contains $VA_{31:13}$ of the failing address; the <i>ASID</i> field contains the ASID of the reference that missed. |
| EntryLo0       | UNPREDICTABLE                                                                                                                             |
| EntryLo1       | UNPREDICTABLE                                                                                                                             |

 Table 5.14 CP0 Register States on a TLB Invalid Exception

#### **Entry Vector Used:**

General exception vector (offset 0x180)

# 5.8.12 Execute-Inhibit Exception

An Execute-Inhibit exception occurs when the virtual address of an instruction fetch matches a TLB entry whose XI bit is set. This exception type can only occur if the XI bit is implemented within the TLB and is enabled, which is denoted by the *PageGrain*<sub>XIE</sub> bit.

#### **Cause Register ExcCode Value**

if *PageGrain<sub>IEC</sub>* == 0 TLBL

if *PageGrain<sub>IEC</sub>* == 1TLBXI

#### Additional State Saved

| Table 5.15 CP0 Register States on a Execute-Inhibit Exception |
|---------------------------------------------------------------|
|---------------------------------------------------------------|

| Register State | Value                                                                                                                                                                                                                                                                                                                                                                              |
|----------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| BadVAddr       | Failing address                                                                                                                                                                                                                                                                                                                                                                    |
| Context        | If $Config3_{CTXTC}$ bit is set, then the bits of the <i>Context</i> register corresponding to the set bits of the <i>Virtual Index</i> field of the <i>ContextConfig</i> register are loaded with the high-order bits of the virtual address that missed.<br>If $Config3_{CTXTC}$ bit is clear, then the <i>BadVPN2</i> field contains VA <sub>31.13</sub> of the failing address |
| Entry Hi       | The <i>VPN2</i> field contains $VA_{3113}$ of the failing address; the <i>ASID</i> field contains the ASID of the reference that missed.                                                                                                                                                                                                                                           |
| Entry Lo0      | UNPREDICTABLE                                                                                                                                                                                                                                                                                                                                                                      |
| Entry Lo1      | UNPREDICTABLE                                                                                                                                                                                                                                                                                                                                                                      |

#### **Entry Vector Used**

General exception vector (offset 0#180)

# 5.8.13 Read-Inhibit Exception

A Read-Inhibit exception occurs when the virtual address of a memory load reference matches a TLB entry whose RI bit is set. This exception type can only occur if the RI bit is implemented within the TLB and is enabled, which is denoted by the *PageGrain<sub>XIE</sub>* bit. MIPS16 PC-relative loads are a special case and are not affected by the RI bit.

#### Cause Register ExcCode Value

if  $PageGrain_{IEC} == 0$  TLBL

if PageGrain<sub>IEC</sub> == 1TLBXI

#### **Additional State Saved**

| Register State | Value                                                                                                                                                                                                                                                                                                                                                                              |
|----------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| BadVAddr       | Failing address                                                                                                                                                                                                                                                                                                                                                                    |
| Context        | If $Config3_{CTXTC}$ bit is set, then the bits of the <i>Context</i> register corresponding to the set bits of the <i>Virtual Index</i> field of the <i>ContextConfig</i> register are loaded with the high-order bits of the virtual address that missed.<br>If $Config3_{CTXTC}$ bit is clear, then the <i>BadVPN2</i> field contains VA <sub>31.13</sub> of the failing address |
| Entry Hi       | The <i>VPN2</i> field contains $VA_{3113}$ of the failing address; the <i>ASID</i> field contains the ASID of the reference that missed.                                                                                                                                                                                                                                           |
| Entry Lo0      | UNPREDICTABLE                                                                                                                                                                                                                                                                                                                                                                      |
| Entry Lo1      | UNPREDICTABLE                                                                                                                                                                                                                                                                                                                                                                      |

#### Table 5.16 CP0 Register States on a Read-Inhibit Exception

#### **Entry Vector Used**

General exception vector (offset 0#180)

# 5.8.14 Cache/SPRAM Parity Error Exception

A cache/SPRAM error exception occurs when an instruction or data reference detects a cache tag or data error. This exception is not maskable. To avoid disturbing the error in the cache array the exception vector is to an unmapped, uncached address. This exception is precise.

#### Cause Register ExcCode Value

N/A

| Register State | Value       |
|----------------|-------------|
| CacheErr       | Error state |
| ErrorEPC       | Restart PC  |

#### Table 5.17 CP0 Register States on a Cache/SPRAM Parity Error Exception

#### **Entry Vector Used**

Cache error vector (offset 16#100)

# 5.8.15 Bus Error Exception — Instruction Fetch or Data Access

A bus error exception occurs when an instruction or data access makes a bus request (due to a cache miss or an uncacheable reference) and that request terminates in an error. The bus error exception can occur on either an instruction fetch or a data access. Bus error exceptions that occur on an instruction fetchhave a higher priority than bus error exceptions that occur on a data access.

Bus errors taken on the requested (critical) word of an instruction fetch or data load are precise. Other bus errors, such as stores or non-critical words of a burst read, can be imprecise. These errors are taken when an AHB slave responds with an error code. The microAptiv UP family core does not support imprecise instruction fetch bus error exceptions. Thus, when a slave signals a bus error, this notification must be repeated for all similar instruction fetches until the core has taken the exception.

#### **Cause Register ExcCode Value:**

IBE: Error on an instruction reference

DBE: Error on a data reference

#### **Additional State Saved:**

None

#### **Entry Vector Used:**

General exception vector (offset 0x180)

# 5.8.16 Debug Software Breakpoint Exception

A debug software breakpoint exception occurs when an SDBBP instruction is executed. The *DEPC* register and *DBD* bit in the *Debug* register will indicate the SDBBP instruction that caused the debug exception.

#### **Debug Register Debug Status Bit Set:**

DBp

**Additional State Saved:** 

None

Entry Vector Used:

Debug exception vector

# 5.8.17 Execution Exception — System Call

The system call exception is one of the execution exceptions. All of these exceptions have the same priority. A system call exception occurs when a SYSCALL instruction is executed.

**Cause Register ExcCode Value:** 

Sys

**Additional State Saved:** 

None

**Entry Vector Used:** 

General exception vector (offset 0x180)

### 5.8.18 Execution Exception — Breakpoint

The breakpoint exception is one of the execution exceptions. All of these exceptions have the same priority. A breakpoint exception occurs when a BREAK instruction is executed.

**Cause Register ExcCode Value:** 

Bp

Additional State Saved:

None

#### **Entry Vector Used:**

General exception vector (offset 0x180)

### 5.8.19 Execution Exception — Reserved Instruction

The reserved instruction exception is one of the execution exceptions. All of these exceptions have the same priority. A reserved instruction exception occurs when a reserved or undefined major opcode or function field is executed. This includes Coprocessor 2 instructions which are decoded reserved in the Coprocessor 2.

#### **Cause Register ExcCode Value:**

RI

**Additional State Saved:** 

None

#### **Entry Vector Used:**

General exception vector (offset 0x180)

### 5.8.20 Execution Exception — Coprocessor Unusable

The coprocessor unusable exception is one of the execution exceptions. All of these exceptions have the same priority. A coprocessor unusable exception occurs when an attempt is made to execute a coprocessor instruction for one of the following:

- a corresponding coprocessor unit that has not been marked usable by setting its CU bit in the Status register
- CP0 instructions, when the unit has not been marked usable, and the processor is executing in user mode

#### **Cause Register ExcCode Value:**

CpU

Additional State Saved:

#### Table 5.18 Register States on a Coprocessor Unusable Exception

| Register State      | Value                                           |
|---------------------|-------------------------------------------------|
| Cause <sub>CE</sub> | Unit number of the coprocessor being referenced |

#### **Entry Vector Used:**

General exception vector (offset 0x180)

# 5.8.21 Execution Exception — CorExtend Unusable

The CorExtend unusable exception is one of the execution exceptions. All of these exceptions have the same priority. A CorExtend Unusable exception occurs when an attempt is made to execute a CorExtend instruction when  $Status_{CEE}$  is cleared. It is implementation-dependent whether this functionality is supported. Generally, the functionality will only be supported if a CorExtend block contains local destination registers

#### **Cause Register ExcCode Value:**

CEU

Additional State Saved:

None

**Entry Vector Used:** 

General exception vector (offset 0x180)

# 5.8.22 Execution Exception — DSP Module State Disabled

The DSP Module State Disabled exception is an execution exception. It occurs when an attempt is made to execute a DSP Module instruction when the MX bit in the *Status* register is not set. This allows an OS to do "lazy" context switching.

#### **Cause Register ExcCode Value:**

DSPDis

#### Additional State Saved:

None

#### Entry Vector Used:

General exception vector (offset 0x180)

# 5.8.23 Execution Exception — Coprocessor 2 Exception

The Coprocessor 2 exception is one of the execution exceptions. All of these exceptions have the same priority. A Coprocessor 2 exception occurs when a valid Coprocessor 2 instruction cause a general exception in the Coprocessor 2.

#### **Cause Register ExcCode Value:**

C2E

#### Additional State Saved:

Depending on the Coprocessor 2 implementation, additional state information of the exception can be saved in a Coprocessor 2 control register.

#### **Entry Vector Used:**

General exception vector (offset 0x180)

# 5.8.24 Execution Exception — Implementation-Specific 1 Exception

The Implementation-Specific 1 exception is one of the execution exceptions. All of these exceptions have the same priority. An implementation-specific 1 exception occurs when a valid coprocessor 2 instruction cause an implementation-specific 1 exception in the Coprocessor 2.

#### **Cause Register ExcCode Value:**

IS1

#### **Additional State Saved:**

Depending on the coprocessor 2 implementation, additional state information of the exception can be saved in a coprocessor 2 control register.

#### **Entry Vector Used:**

General exception vector (offset 0x180)

## 5.8.25 Execution Exception — Integer Overflow

The integer overflow exception is one of the execution exceptions. All of these exceptions have the same priority. An integer overflow exception occurs when selected integer instructions result in a 2's complement overflow.

#### **Cause Register ExcCode Value:**

Ov

#### **Additional State Saved:**

None

**Entry Vector Used:** General exception vector (offset 0x180)

# 5.8.26 Execution Exception — Trap

The trap exception is one of the execution exceptions. All of these exceptions have the same priority. A trap exception occurs when a trap instruction results in a TRUE value.

#### **Cause Register ExcCode Value:**

Tr

#### Additional State Saved:

None

#### **Entry Vector Used:**

General exception vector (offset 0x180)

# 5.8.27 Debug Data Break Exception

A debug data break exception occurs when a data hardware breakpoint matches the load/store transaction of an executed load/store instruction. The *DEPC* register and DBD bit in the *Debug* register will indicate the load/store instruction that caused the data hardware breakpoint to match. The load/store instruction that caused the debug exception has not completed e.g. not updated the register file, and the instruction can be re-executed after returning from the debug handler.

#### Debug Register Debug Status Bit Set:

DDBL for a load instruction or DDBS for a store instruction

#### Additional State Saved:

None

#### **Entry Vector Used:**

Debug exception vector

# 5.8.28 Complex Break Exception

A complex data break exception occurs when the complex hardware breakpoint detects an enabled breakpoint. Complex breaks are taken imprecisely—the instruction that actually caused the exception is allowed to complete and the *DEPC* register and DBD bit in the *Debug* register point to a following instruction.

#### Debug Register Debug Status Bit Set:

DIBImpr, DDBLImpr, and/or DDBSImpr

#### Additional State Saved:

Debug2 fields indicate which type(s) of complex breakpoints were detected.

#### Entry Vector Used:

Debug exception vector

# 5.8.29 TLB Modified Exception — Data Access

During a data access, a TLB modified exception occurs on a store reference to a mapped address if the following condition is true:

• The matching TLB entry is valid, but not dirty.

#### **Cause Register ExcCode Value:**

Mod

#### Additional State Saved:

| Register State | Value                                                                                                                             |
|----------------|-----------------------------------------------------------------------------------------------------------------------------------|
| BadVAddr       | failing address                                                                                                                   |
| Context        | The BadVPN2 field contains VA <sub>31:13</sub> of the failing address.                                                            |
| EntryHi        | The VPN2 field contains $VA_{31:13}$ of the failing address;<br>the ASID field contains the ASID of the reference that<br>missed. |
| EntryLo0       | UNPREDICTABLE                                                                                                                     |
| EntryLo1       | UNPREDICTABLE                                                                                                                     |

Table 5.19 Register States on a TLB Modified Exception

#### **Entry Vector Used:**

General exception vector (offset 0x180)

# 5.9 Exception Handling and Servicing Flowcharts

The remainder of this chapter contains flowcharts for the following exceptions and guidelines for their handlers:

- General exceptions and their exception handler
- TLB miss exception and their exception handler
- Reset, soft reset and NMI exceptions, and a guideline to their handler
- Debug exceptions

#### Figure 5.3 General Exception Handler (HW)

Exceptions other than Reset, Soft Reset, NMI, EJTag Debug and cache error, or first-level TLB miss. Note: Interrupts can be masked by IE or IMs and Watch is masked if EXL = 1





#### Figure 5.4 General Exception Servicing Guidelines (SW)

#### Comments



Figure 5.5 TLB Miss Exception Handler (HW)



#### Figure 5.6 TLB Exception Servicing Guidelines (SW)

#### Comments



Figure 5.7 Reset, Soft Reset and NMI Exception Handling and Servicing Guidelines

Chapter 6

# CP0 Registers of the microAptiv<sup>™</sup> UP Core

The System Control Coprocessor (CP0) provides the register interface to the microAptiv UP processor core for the support of memory management, address translation, exception handling, and other privileged operations. Each CP0 register is identified by a *Register Number*, from 0 through 31, and a *Select Number* that is used as the value in the *sel* field of the MFC0 and MTC0 instructions. For instance, the *EBase* register is Register Number 15, Select 1.

After updating a CP0 register, there is a hazard period of zero or more instructions from the update by the MTC0 instruction until the update has taken effect in the core. For a detailed description of CP0 hazards, refer to Section 2.14 "Hazards".

This chapter contains the following sections:

- Section 6.1 "CP0 Register Summary"
- Section 6.2 "CP0 Register Descriptions"

The EJTAG registers are described in Chapter 10, "EJTAG Debug Support in the microAptiv<sup>™</sup> UP Core" on page 230.

# 6.1 CP0 Register Summary

Table 6.1 lists the CP0 registers in numerical order. Individual registers are described in Section 6.2 "CP0 Register Descriptions".

| Register<br>Number | Select<br>Number | Register Name                               | Function                                                                                                                                      |  |  |  |  |
|--------------------|------------------|---------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|--|
| 0                  | 0                | Index <sup>3</sup> Index into the TLB array |                                                                                                                                               |  |  |  |  |
| 1                  | 0                | Random <sup>3</sup>                         | Randomly generated index into the TLB array                                                                                                   |  |  |  |  |
| 2                  | 0                | EntryLo0 <sup>3</sup>                       | Low-order portion of the TLB entry for even-numbered virtual pages                                                                            |  |  |  |  |
| 3                  | 0                | EntryLo1 <sup>3</sup>                       | Low-order portion of the TLB entry for odd-numbered virtual pages                                                                             |  |  |  |  |
| 4                  | 0<br>2           | Context/<br>UserLocal                       | Pointer to page table entry in memory<br>User information that can be written by privileged software and<br>read via <i>RDHWR</i> register 29 |  |  |  |  |
| 5                  | 0<br>1           | PageMask<br>PageGrain <sup>3</sup>          | PageMask controls the variable page sizes in TLB entries.<br>PageGrain enables support of 1KB pages in the TLB.                               |  |  |  |  |
| 6                  | 0                | Wired3                                      | Controls the number of fixed ("wired") TLB entries                                                                                            |  |  |  |  |

### Table 6.1 CP0 Registers

| Register<br>Number | Select<br>Number                | Register Name                                                                               | Function                                                                                                                                                                                                                 |
|--------------------|---------------------------------|---------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 7                  | 0                               | HWREna                                                                                      | Enables access via the RDHWR instruction to selected hardware registers in non-privileged mode                                                                                                                           |
| 8                  | 0<br>1<br>2                     | BadVAddr <sup>1</sup><br>BadInstr<br>BadInstrP                                              | Reports the address for the most recent address-related exception<br>Reports the instruction that caused the most recent exception<br>Reports the branch instruction if a delay slot caused the most<br>recent exception |
| 9                  | 0                               | Count <sup>1</sup>                                                                          | Processor cycle count                                                                                                                                                                                                    |
| 10                 | 0                               | EntryHi <sup>3</sup>                                                                        | High-order portion of the TLB entry                                                                                                                                                                                      |
| 11                 | 0                               | Compare <sup>1</sup>                                                                        | Timer interrupt control                                                                                                                                                                                                  |
| 12                 | 0<br>1<br>2<br>3<br>4<br>5      | Status<br>IntCtl<br>SRSCtl<br>SRSMap1<br>View_IPL<br>SRSMAP2                                | Processor status and control<br>Interrupt system status and control<br>Shadow Register Sets status and control<br>Shadow set IPL mapping<br>Contiguous view of IM and IPL fields<br>Shadow set IPL mapping               |
| 13                 | 0<br>4<br>5                     | Cause <sup>1</sup><br>View_RIPL<br>NestedExc                                                | Cause of last exception                                                                                                                                                                                                  |
| 14                 | 0<br>2                          | EPC <sup>1</sup><br>NestedEPc                                                               | Program counter at last exception                                                                                                                                                                                        |
| 15                 | 0<br>1<br>2                     | PRId/<br>EBase<br>CDMMBase                                                                  | Processor identification and revision; exception base address;<br>Common Device Memory Map Base register                                                                                                                 |
| 16                 | 0<br>1<br>2<br>3<br>4<br>5<br>7 | Config<br>Config1<br>Config2<br>Config3<br>Config4<br>Config5<br>Config7                    | Configuration registers                                                                                                                                                                                                  |
| 17                 | 0                               | LLAddr                                                                                      | Load linked address                                                                                                                                                                                                      |
| 18                 | 0-7                             | WatchLo <sup>1</sup>                                                                        | Low-order watchpoint address                                                                                                                                                                                             |
| 19                 | 0-7                             | WatchHi <sup>1</sup>                                                                        | High-order watchpoint address                                                                                                                                                                                            |
| 20 - 22            |                                 | Reserved                                                                                    | Reserved                                                                                                                                                                                                                 |
| 23                 | 0<br>1<br>2<br>3<br>4<br>6      | Debug<br>TraceControl<br>TraceControl2<br>UserTraceData1<br>TraceBPC <sup>2</sup><br>Debug2 | EJTAG Debug register<br>EJTAG Trace Control register<br>EJTAG Trace Control register2<br>EJTAG User Trace Data1 register<br>EJTAG Trace Breakpoint Register<br>EJTAG Debug register 2                                    |

| Register<br>Number                                                      | Select<br>Number | Register Name         | Function                                                   |  |
|-------------------------------------------------------------------------|------------------|-----------------------|------------------------------------------------------------|--|
| 24                                                                      | 0                | DEPC <sup>2</sup>     | Program counter at last debug exception                    |  |
|                                                                         | 3                | UserTraceData2        | EJTAG User Trace Data2 register                            |  |
| 25                                                                      | 0                | PerfCtl0              | Performance counter 0 control                              |  |
|                                                                         | 1                | PerfCnt0              | Performance counter 0                                      |  |
|                                                                         | 2                | PerfCtl1              | Performance counter 1 control                              |  |
|                                                                         | 3                | PerfCnt1              | Performance counter 1                                      |  |
| 26                                                                      | 0                | ErrCtl                | Software test enable of way-select and Data RAM arrays for |  |
|                                                                         |                  |                       | I-Cache and D-Cache                                        |  |
| 27                                                                      | 0                | CacheErr              | Records information about cache/SPRAM parity errors        |  |
| 28                                                                      | 0                | TagLo                 | Low-order portion of cache tag interface                   |  |
|                                                                         | 1                | DataLo                |                                                            |  |
| 29                                                                      |                  | Reserved              | Reserved                                                   |  |
| 30                                                                      | 0                | ErrorEPC <sup>1</sup> | Program counter at last error                              |  |
| 31                                                                      | 0                | DeSAVE <sup>2</sup>   | Debug handler scratchpad register                          |  |
|                                                                         | 2                | KScratch1             | Scratch Register for Kernel Mode                           |  |
|                                                                         | 3                | Kscratch2             | Scratch Register for Kernel Mode                           |  |
| 1. Registers used in exception processing<br>2. Registers used in debug |                  |                       |                                                            |  |

Table 6.1 CP0 Registers (Continued)

3. Registers used in memory management

# 6.2 CP0 Register Descriptions

This section contains descriptions of each CP0 register. The registers are listed in numerical order, first by Register Number, then by Select Number.

For each register described below, field descriptions include the read/write properties of the field (shown in Table 6.2) and the reset state of the field.

| Read/Write<br>Notation | Hardware Interpretation                                                                                                                                                                                                                                                                     | Software Interpretation                                                                                  |
|------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------|
| R/W                    | A field in which all bits are readable and writable<br>Hardware updates of this field are visible by soft<br>ible by hardware reads.<br>If the reset state of this field is "Undefined," eithe<br>before the first read will return a predictable valu<br>definition of UNDEFINED behavior. | ware reads. Software updates of this field are vis-<br>er software or hardware must initialize the value |

Table 6.2 CP0 Register R/W Field Types

| Read/Write<br>Notation | Hardware Interpretation                                                                                                                                                                                                                                                                                                                                                                                   | Software Interpretation                                                                                                                                                                                                                                                                                                                                                                                                                                                        |  |  |  |
|------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|
| R                      | A field that is either static or is updated only by<br>hardware.<br>If the Reset State of this field is either "0" or<br>"Preset", hardware initializes this field to zero<br>or to the appropriate state, respectively, on pow-<br>erup.<br>If the Reset State of this field is "Undefined",<br>hardware updates this field only under those<br>conditions specified in the description of the<br>field. | A field to which the value written by software is<br>ignored by hardware. Software may write any<br>value to this field without affecting hardware<br>behavior. Software reads of this field return the<br>last value updated by hardware.<br>If the Reset State of this field is "Undefined,"<br>software reads of this field result in an UNPRE-<br>DICTABLE value except after a hardware<br>update done under the conditions specified in<br>the description of the field. |  |  |  |
| W                      | A field that can be written by software but which can not be read by software.<br>Software reads of this field will return an UNDEFINED value.                                                                                                                                                                                                                                                            |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |  |  |  |
| 0                      | A field that hardware does not update, and for<br>which hardware can assume a zero value.                                                                                                                                                                                                                                                                                                                 | A field to which the value written by software<br>must be zero. Software writes of non-zero val-<br>ues to this field may result in UNDEFINED<br>behavior of the hardware. Software reads of this<br>field return zero as long as all previous software<br>writes are zero.<br>If the Reset State of this field is "Undefined,"<br>software must write this field with zero before it<br>is guaranteed to read as zero.                                                        |  |  |  |

# Table 6.2 CP0 Register R/W Field Types (Continued)

# 6.2.1 Index Register (CP0 Register 0, Select 0)

The *Index* register is a 32-bit read/write register that contains the index used to access the TLB for TLBP, TLBR, and TLBWI instructions. The width of the index field is implementation-dependent as a function of the number of TLB entries that are implemented. The minimum value for TLB-based MMUs is  $Ceiling(Log_2(TLBEntries))$ .

The operation of the processor is UNDEFINED if a value greater than or equal to the number of TLB entries is written to the *Index* register.

### Figure 6.1 Index Register Format

| 31 | 30 | 5 | 4 |       | 0 |
|----|----|---|---|-------|---|
| Р  | 0  |   |   | Index |   |

| Fie   | lds    |                                                                                                          |            |             |
|-------|--------|----------------------------------------------------------------------------------------------------------|------------|-------------|
| Name  | Bit(s) | Description                                                                                              | Read/Write | Reset State |
| Р     | 31     | Probe Failure. Set to 1 when the previous TLBProbe (TLBP) instruction failed to find a match in the TLB. | R          | Undefined   |
| 0     | 30:5   | Must be written as zeros; returns zeros on reads.                                                        | 0          | 0           |
| Index | 4:0    | Index to the TLB entry affected by the TLBRead and TLB-Write instructions.                               | R/W        | Undefined   |

# Table 6.3 Index Register Field Descriptions

# 6.2.2 Random Register (CP0 Register 1, Select 0)

The *Random* register is a read-only register whose value is used to index the TLB during a TLBWR instruction. The width of the Random field is calculated in the same manner as that described for the *Index* register above.

The value of the register varies between an upper and lower bound as follow:

- A lower bound is set by the number of TLB entries reserved for exclusive use by the operating system (the contents of the *Wired* register). The entry indexed by the *Wired* register is the first entry available to be written by a TLB Write Random operation.
- An upper bound is set by the total number of TLB entries minus 1.

The *Random* register is decremented by one almost every clock, wrapping after the value in the *Wired* register is reached. To enhance the level of randomness and reduce the possibility of a live lock condition, an LFSR register is used that prevents the decrement pseudo-randomly.

The processor initializes the *Random* register to the upper bound on a Reset exception and when the *Wired* register is written.

#### Figure 6.2 Random Register Format

| 31 | 5 4 | 4 0    |
|----|-----|--------|
|    | 0   | Random |

#### Table 6.4 Random Register Field Descriptions

| Fiel   | ds     |                                                 |            |                 |
|--------|--------|-------------------------------------------------|------------|-----------------|
| Name   | Bit(s) | Description                                     | Read/Write | Reset State     |
| 0      | 31:5   | Must be written as zero; returns zero on reads. | 0          | 0               |
| Random | 4:0    | TLB Random Index                                | R          | TLB Entries - 1 |

### 6.2.3 EntryLo0 and EntryLo1 Registers (CP0 Registers 2 and 3, Select 0)

The pair of *EntryLo* registers act as the interface between the TLB and the TLBR, TLBWI, and TLBWR instructions. For a TLB-based MMU, *EntryLo0* holds the entries for even pages and *EntryLo1* holds the entries for odd pages.

The contents of the *EntryLo0* and *EntryLo1* registers are undefined after an address error, TLB invalid, TLB modified, or TLB refill exception.

#### Figure 6.3 EntryLo0, EntryLo1 Register Format

| 31 30 29 26 | 25 6 | 5 | 3 | 2 | 1 | 0 |
|-------------|------|---|---|---|---|---|
| RI XI 0     | PFN  | C |   | D | V | G |

| Fiel | lds    |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |            |             |
|------|--------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name | Bit(s) | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        | Read/Write | Reset State |
| RI   | 31     | Read Inhibit. If this bit is set, an attempt to read data from<br>the page causes a TLB Invalid exception, even if the V<br>(Valid) bit is set. The <i>RI</i> bit is enabled only if the <i>RIE</i> bit of<br>the <i>PageGrain</i> register is set. If the RIE bit of <i>PageGrain</i><br>is not set, the RI bit of <i>EntryLoO/EntryLo1</i> is a reserved 0<br>bit as per the MIPS32 specification.                                                                                                                                                                                                               | R/W        | 0           |
| XI   | 30     | Execute Inhibit. If this bit is set, an attempt to fetch from<br>the page causes a TLB Invalid exception, even if the V<br>(Valid) bit is set. The XI bit is enabled only if the XIE bit of<br>the PageGrain register is set. If the XIE bit of PageGrain<br>is not set, the XI bit of EntryLo0/EntryLo1 is a reserved 0<br>bit as per the MIPS32 specification.                                                                                                                                                                                                                                                   | R/W        | 0           |
| 0    | 29:26  | These 4 bits are normally part of the PFN; however,<br>because the core supports only 32 bits of physical address,<br>the PFN is only 20 bits wide. Therefore, bits 29:26 of this<br>register must be written with zeros.                                                                                                                                                                                                                                                                                                                                                                                          | R/W        | 0           |
| PFN  | 25:6   | Page Frame Number. Contributes to the definition of the high-order bits of the physical address.<br>If the processor is enabled to support 1KB pages ( <i>Config3<sub>SP</sub></i> = 1 and <i>PageGrain<sub>ESP</sub></i> = 1), the PFN field corresponds to bits 29:10 of the physical address (the field is shifted left by 2 bits relative to the Release 1 definition to make room for PA <sub>1110</sub> ).<br>If the processor is not enabled to support 1KB pages ( <i>Config3<sub>SP</sub></i> = 0 or <i>PageGrain<sub>ESP</sub></i> = 0), the PFN field corresponds to bits 3112 of the physical address. | R/W        | Undefined   |
| С    | 5:3    | Coherency attribute of the page. See Table 6.6.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | R/W        | Undefined   |
| D    | 2      | "Dirty" or write-enable bit, indicating that the page has<br>been written, and/or is writable. If this bit is a one, then<br>stores to the page are permitted. If this bit is a zero, then<br>stores to the page cause a TLB Modified exception.                                                                                                                                                                                                                                                                                                                                                                   | R/W        | Undefined   |
| V    | 1      | Valid bit, indicating that the TLB entry, and thus the virtual page mapping, are valid. If this bit is a one, then accesses to the page are permitted. If this bit is a zero, then accesses to the page cause a TLB Invalid exception.                                                                                                                                                                                                                                                                                                                                                                             | R/W        | Undefined   |
| G    | 0      | Global bit. On a TLB write, the logical AND of the G bits<br>in both the <i>EntryLo0</i> and <i>EntryLo1</i> register becomes the G<br>bit in the TLB entry. If the TLB entry G bit is a one, then<br>the ASID comparisons are ignored during TLB matches.<br>On a read from a TLB entry, the G bits of both <i>EntryLo0</i><br>and <i>EntryLo1</i> reflect the state of the TLB G bit.                                                                                                                                                                                                                            | R/W        | Undefined   |

# Table 6.5 EntryLo0, EntryLo1 Register Field Descriptions

Table 6.6 lists the encoding of the C field of the *EntryLo0* and *EntryLo1* registers and the K0 field of the *Config* register.

| C[5:3] Value                                   | Cache Coherency Attribute                                                                                                                                                                                                                                                                            |
|------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 0                                              | Cacheable, noncoherent, write-through, no write allocate                                                                                                                                                                                                                                             |
| 1                                              | Cacheable, noncoherent, write-through, write allocate                                                                                                                                                                                                                                                |
| 3*, 4, 5, 6                                    | Cacheable, noncoherent, write-back, write allocate                                                                                                                                                                                                                                                   |
| 2*, 7                                          | Uncached                                                                                                                                                                                                                                                                                             |
| core. For example, va<br>Note that these value | required by the MIPS32 architecture. Only values 0, 1, 2 and 3 are used in an microAptiv UP lues 4, 5 and 6 are not used and are mapped to 3. The value 7 is not used and is mapped to 2. If a base meaning in other MIPS Technologies processor implementations. Refer to the for more information. |

#### Table 6.6 Cache Coherency Attributes

# 6.2.4 Context Register (CP0 Register 4, Select 0)

The *Context* register is a read/write register containing a pointer to an entry in the page table entry (PTE) array. This array is an operating system data structure that stores virtual-to-physical translations. During a TLB miss, the operating system loads the TLB with the missing translation from the PTE array. The *Context* register duplicates some of the information provided in the *BadVAddr* register but is organized in such a way that the operating system can directly reference an 8-byte page table entry (PTE) in memory.

A TLB exception (TLB Refill, TLB Invalid, or TLB Modified) causes bits  $VA_{31:13}$  of the virtual address to be written into the *BadVPN2* field of the *Context* register. The *PTEBase* field is written and used by the operating system.

The BadVPN2 field of the Context register is not defined after an address error exception.

#### Figure 6.4 Context Register Format

| PTEBase | BadVPN2 | 0 |
|---------|---------|---|
|         |         |   |

### Table 6.7 Context Register Field Descriptions

| Fie     | lds    |                                                                                                                                                                                                                      |            |             |
|---------|--------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name    | Bit(s) | Description                                                                                                                                                                                                          | Read/Write | Reset State |
| PTEBase | 31:23  | This field is for use by the operating system and is nor-<br>mally written with a value that allows the operating system<br>to use the <i>Context</i> Register as a pointer into the current<br>PTE array in memory. | R/W        | Undefined   |
| BadVPN2 | 22:4   | This field is written by hardware on a TLB miss. It con-<br>tains bits $VA_{31:13}$ of the virtual address that missed.                                                                                              | R          | Undefined   |
| 0       | 3:0    | Must be written as zero; returns zero on reads.                                                                                                                                                                      | 0          | 0           |

# 6.2.5 UserLocal Register (CP0 Register 4, Select 2)

The UserLocal register is a read-write register that is not interpreted by the hardware and conditionally readable via the RDHWR instruction.

Figure 6.5 shows the format of the UserLocal register; Table 6.8 describes the UserLocal register fields.

#### Figure 6.5 UserLocal Register Format

| 31 |           | 0 |
|----|-----------|---|
|    | UserLocal |   |

#### Table 6.8 UserLocal Register Field Descriptions

| Fie       | lds    |                                                                               | Read / |             |
|-----------|--------|-------------------------------------------------------------------------------|--------|-------------|
| Name      | Bit(s) | Description                                                                   | Write  | Reset State |
| UserLocal | 31:0   | This field contains software information that is not interpreted by hardware. | R/W    | Undefined   |

#### **Programming Notes**

Privileged software may write this register with arbitrary information and make it accessible to unprivileged software via register 29 (ULR) of the RDHWR instruction. To do so, bit 29 of the HWREna register must be set to a 1 to enable unprivileged access to the register. In some operating environments, the UserLocal register contains a pointer to a thread-specific storage block that is obtained via the RDHWR register.

# 6.2.6 PageMask Register (CP0 Register 5, Select 0)

The *PageMask* register is a read/write register used for reading from and writing to the TLB. It holds a comparison mask that sets the variable page size for each TLB entry, as shown in Table 6.10. Figure 6.6 shows the format of the PageMask register; Table 6.9 describes the PageMask register fields.

#### Figure 6.6 PageMask Register Format

| 31 | 29 | 28   | 13 12 11 10 0 |
|----|----|------|---------------|
|    | 0  | Mask | MaskX 0       |

| Table 6.9 PageMask Register Field Descriptions |
|------------------------------------------------|
|------------------------------------------------|

| Fields    |       |                                                                                                                                                             |            |             |  |
|-----------|-------|-------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|--|
| Name Bits |       | Description                                                                                                                                                 | Read/Write | Reset State |  |
| Mask      | 28:13 | The <i>Mask</i> field is a bit mask in which a "1" bit indicates that the corresponding bit of the virtual address should not participate in the TLB match. | R/W        | Undefined   |  |

| Fields |             |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |            |                             |  |
|--------|-------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-----------------------------|--|
| Name   | Bits        | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | Read/Write | Reset State                 |  |
| MaskX  | 12:11       | In Release 2 of the Architecture, the <i>MaskX</i> field is an extension to the <i>Mask</i> field to support 1KB pages with definition and action analogous to that of the <i>Mask</i> field, defined above.<br>If 1KB pages are enabled ( <i>Config3<sub>SP</sub></i> = 1 and <i>PageGrain<sub>ESP</sub></i> = 1), these bits are writable and readable, and their values are copied to and from the TLB entry on a TLB write or read, respectively.<br>If 1KB pages are not enabled ( <i>Config3<sub>SP</sub></i> = 0 or <i>PageGrain<sub>ESP</sub></i> = 0), these bits are not writable, return zero on read, and the effect on the TLB entry on a write is as if they were written with the value 2#11.<br>In Release 1 of the Architecture, these bits must be written as zero, return zero on read, and have no effect on the virtual address translation. | R/W        | 0<br>(See Descrip-<br>tion) |  |
| 0      | 31:29, 10:0 | Ignored on writes; returns zero on reads.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | R          | 0                           |  |

#### Table 6.9 PageMask Register Field Descriptions

Table 6.10 Values for the Mask and MaskX<sup>1</sup> Fields of the PageMask Register

|            | Bit |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |                 |                 |
|------------|-----|----|----|----|----|----|----|----|----|----|----|----|----|----|----|----|-----------------|-----------------|
| Page Size  | 28  | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20 | 19 | 18 | 17 | 16 | 15 | 14 | 13 | 12 <sup>1</sup> | 11 <sup>1</sup> |
| 4 KBytes   | 0   | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 1               | 1               |
| 16 KBytes  | 0   | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 1  | 1  | 1               | 1               |
| 64 KBytes  | 0   | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 1  | 1  | 1  | 1  | 1               | 1               |
| 256 KBytes | 0   | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 1  | 1  | 1  | 1  | 1  | 1  | 1               | 1               |
| 1 MByte    | 0   | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1               | 1               |
| 4 MByte    | 0   | 0  | 0  | 0  | 0  | 0  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1               | 1               |
| 16 MByte   | 0   | 0  | 0  | 0  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1               | 1               |
| 64 MByte   | 0   | 0  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1               | 1               |
| 256 MByte  | 1   | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1  | 1               | 1               |

1.  $PageMask_{12..11} = PageMask_{MaskX}$  exists only on implementations of Release 2 of the architecture and are treated as if they had the value 2#11 if 1K pages are not enabled ( $Config3_{SP} = 0$  or  $PageGrain_{ESP} = 0$ ).

It is implementation-dependent how many of the encodings described in Table 6.10 are implemented. All processors must implement the 4KB page size. If a particular page size encoding is not implemented by a processor, a read of the *PageMask* register must return zeros in all bits that correspond to encodings that are not implemented, thereby potentially returning a value different than that written by software.

Software may determine which page sizes are supported by writing all ones to the *PageMask* register, then reading the value back. If a pair of bits reads back as ones, the processor implements that page size. The operation of the processor is **UNDEFINED** if software loads the *Mask* field with a value other than one of those listed in Table 6.10,

even if the hardware returns a different value on a read. Hardware may depend on this requirement in implementing hardware structures.

# 6.2.7 PageGrain Register (CP0 Register 5, Select 1)

The *PageGrain* register is a read/write register used for enabling 1KB smaller than 4KB-page support. It is used for reading from and writing to the TLB.

The contents of the *PageGrain* register are not reflected in the contents of the TLB; therefore, the TLB must be flushed before any change to the *PageGrain* register is made. Behavior is UNDEFINED if a value other than those listed is used.

#### Figure 6.7 PageGrain Register Format

| 31 30 29 28 27 26 | 0 |
|-------------------|---|
| RIE XIE 0 ESPIEC  | 0 |

| Fields |        |                 |                                                                                                           |            |             |  |
|--------|--------|-----------------|-----------------------------------------------------------------------------------------------------------|------------|-------------|--|
| Name   | Bit(s) | _               | Description                                                                                               | Read/Write | Reset State |  |
| RIE    | 31     | Read Inhibit En | able.                                                                                                     | R/W        | 0           |  |
|        |        | Encoding        | Meaning                                                                                                   |            |             |  |
|        |        | 0               | <i>RI</i> bit of <i>EntryLo0</i> and <i>EntryLo1</i> registers is disabled and not writeable by software. |            |             |  |
|        |        | 1               | <i>RI</i> bit of <i>EntryLo0</i> and <i>EntryLo1</i> registers is enabled.                                |            |             |  |
| XIE    | 30     | Execute Inhibit | Enable.                                                                                                   | R/W        | 0           |  |
|        |        | Encoding        | Meaning                                                                                                   |            |             |  |
|        |        | 0               | <i>XI</i> bit of <i>EntryLo0</i> and <i>EntryLo1</i> registers is disabled and not writeable by software. |            |             |  |
|        |        | 1               | XI bit of EntryLo0 and EntryLo1 reg-<br>isters is enabled                                                 |            |             |  |
| 0      | 29     | Reserved. Must  | be written as zero; returns zero on read.                                                                 | 0          | 0           |  |

#### Table 6.11 PageGrain Register Field Descriptions

| Field | ds     |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |            |             |
|-------|--------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name  | Bit(s) | -                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | Read/Write | Reset State |
| ESP   | 28     | Enables support                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | for 1KB pages.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | R/W        | 0           |
|       |        | Encoding                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |            |             |
|       |        | 0                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | 1KB page support is not enabled                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |            |             |
|       |        | 1                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | 1KB page support is enabled                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |            |             |
|       |        | <ul> <li>sor 0 registers:</li> <li>The <i>PFN</i> fiel ters holds the is shifted left</li> <li>The <i>MaskX</i> fi able and is conto form the "of the <i>VPN2X</i> 12:11 of the virtual according to the similar field of the similar for the</li></ul> | the following changes occur to coproces-<br>d of the <i>EntryLo0</i> and <i>EntryLo1</i> regis-<br>physical address down to bit 10 (the field<br>by 2 bits from the Release 1 definition)<br>field of the <i>PageMask</i> register is writ-<br>ncatenated to the right of the <i>Mask</i> field<br>don't care" mask for the TLB entry.<br>field of the <i>EntryHi</i> register and bits<br>virtual address are writable.<br>ddress translation algorithm is modified to<br>aller page size.<br>0, 1KB pages are not implemented, and<br>ed on writes and returns zero on reads. |            |             |
| IEC   | 27     | Enables unique<br>Execute-Inhibit                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | exception codes for the Read-Inhibit and exceptions.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            | R/W        | 0           |
|       |        | Encoding                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |            |             |
|       |        |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | Read-Inhibit and Execute-Inhibit<br>exceptions both use the TLBL excep-<br>tion code.<br>Read-Inhibit exceptions use the                                                                                                                                                                                                                                                                                                                                                                                                                                                        |            |             |
|       |        |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | TLBRI exception code.<br>Execute-Inhibit exceptions use the<br>TLBXI exception code.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |            |             |
| 0     | 26:0   | Must be written                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | as zero; returns zero on reads.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | 0          | 0           |

Table 6.11 PageGrain Register Field Descriptions (Continued)

## 6.2.8 Wired Register (CP0 Register 6, Select 0)

The *Wired* register is a read/write register that specifies the boundary between the wired and random entries in the TLB as shown in Figure 6.8. The width of the *Wired* field is calculated in the same manner as that described for the *Index* register above. Wired entries are fixed, non-replaceable entries that are not overwritten by a TLBWR instruction. Wired entries can be overwritten by a TLBWI instruction.

The *Wired* register is reset to zero by a Reset exception. Writing the *Wired* register causes the *Random* register to reset to its upper bound.

The operation of the processor is undefined if a value greater than or equal to the number of TLB entries is written to the *Wired* register.

This register is only valid with a TLB-based MMU cores. It is reserved for a FM based MMU core.





#### Figure 6.9 Wired Register Format



#### **Table 6.12 Wired Register Field Descriptions**

| Fields |        |                                                 |            |             |
|--------|--------|-------------------------------------------------|------------|-------------|
| Name   | Bit(s) | Description                                     | Read/Write | Reset State |
| 0      | 31:5   | Must be written as zero; returns zero on reads. | 0          | 0           |
| Wired  | 4:0    | TLB wired boundary.                             | R/W        | 0           |

## 6.2.9 HWREna Register (CP0 Register 7, Select 0)

The *HWREna* register contains a bit mask that determines which hardware registers are accessible via the RDHWR instruction.

Figure 6.10 shows the format of the HWREna Register; Table 6.13 describes the HWREna register fields.

#### Figure 6.10 HWREna Register Format

| 31 30 | 29  | 28 | 4 | 3    | 0 |
|-------|-----|----|---|------|---|
| 0     | ULR | (  | ) | Mask |   |

| Fields |       |                                                                                                                                                                                                                                                                                                                                                                 |            |             |
|--------|-------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name   | Bits  | Description                                                                                                                                                                                                                                                                                                                                                     | Read/Write | Reset State |
| 0      | 31:30 | Must be written with zero; returns zero on read                                                                                                                                                                                                                                                                                                                 | 0          | 0           |
| 0      | 28:4  | Must be written with zero; returns zero on read                                                                                                                                                                                                                                                                                                                 | 0          | 0           |
| ULR    | 29    | User Local Register. This register provides read access to the coprocessor 0 <i>UserLocal</i> register. In some operating environments, the <i>UserLocal</i> register is a pointer to a thread-specific storage block.                                                                                                                                          | R/W        | 0           |
| Mask   | 3:0   | Each bit in this field enables access by the RDHWR<br>instruction to a particular hardware register (which<br>may not be an actual register). If bit 'n' in this field is<br>a 1, access is enabled to hardware register 'n'. If bit<br>'n' of this field is a 0, access is disabled.<br>See the RDHWR instruction for a list of valid hard-<br>ware registers. | R/W        | 0           |

#### Table 6.13 HWREna Register Field Descriptions

Privileged software may determine which of the hardware registers are accessible by the RDHWR instruction. In doing so, a register may be virtualized at the cost of handling a Reserved Instruction Exception, interpreting the instruction, and returning the virtualized value. For example, if it is not desirable to provide direct access to the *Count* register, access to that register may be individually disabled and the return value can be virtualized by the operating system.

## 6.2.10 BadVAddr Register (CP0 Register 8, Select 0)

The *BadVAddr* register is a read-only register that captures the most recent virtual address that caused one of the following exceptions:

- Address error (AdEL or AdES)
- TLB Refill
- TLB Invalid
- TLB Modified

The *BadVAddr* register does not capture address information for cache or bus errors, because they are not addressing errors.

#### Figure 6.11 BadVAddr Register Format

31

BadVAddr

| Field    | ds   |                      |            |             |
|----------|------|----------------------|------------|-------------|
| Name     | Bits | Description          | Read/Write | Reset State |
| BadVAddr | 31:0 | Bad virtual address. | R          | Undefined   |

#### Table 6.14 BadVAddr Register Field Description

## 6.2.11 BadInstr Register (CP0 Register 8, Select 1)

The *BadInstr* register is an optional read-only register that captures the most recent instruction that caused one of the following exceptions:

Instruction Validity

Coprocessor Unusable, Reserved Instruction

• Execution Exception

Integer Overflow, Trap, System Call, Breakpoint, Floating-point, Coprocessor 2 exception

Addressing

Address Error, TLB Refill, TLB Invalid, TLB Read Inhibit, TLB Execute Inhibit, TLB Modified

The *Badlnstr* register is provided to allow acceleration of instruction emulation. The *Badlnstr* register is only set by exceptions that are synchronous to an instruction. The *Badlnstr* register is not set by Interrupts or by NMI, Machine check, Bus Error, Cache Error, Watch, or EJTAG exceptions.

When a synchronous exception occurs for which there is no valid instruction word (for example TLB Refill - Instruction Fetch), the value stored in *BadInstr* is **UNPREDICTABLE**.

Presence of the *Badlnstr* register is indicated by the *Config3*<sub>Bl</sub> bit. The *Badlnstr* register is instantiated per-VPE in an MT ASE processor.

Figure 6.12 shows the proposed format of the BadInstr register; Table 6.15 describes the BadInstr register fields.

#### Figure 6.12 BadInstr Register Format

| 2 | -1 |
|---|----|
| J |    |
|   |    |

BadInstr

#### Table 6.15 BadInstr Register Field Descriptions

| Field    | S    |                                                                                                                                   | Read / | Reset     |
|----------|------|-----------------------------------------------------------------------------------------------------------------------------------|--------|-----------|
| Name     | Bits | Description                                                                                                                       | Write  | State     |
| BadInstr | 31:0 | Faulting instruction word.<br>Instruction words smaller than 32 bits are placed in bits<br>15:0, with bits 31:16 containing zero. | R      | Undefined |

## 6.2.12 BadInstrP Register (CP0 Register 8, Select 2)

The *BadInstrP* register is an optional register that is used in conjunction with the *BadInstr* register. The *BadInstrP* register contains the prior branch instruction when the faulting instruction is in a branch delay slot.

The BadInstrP register is updated for these exceptions:

Instruction Validity

Coprocessor Unusable, Reserved Instruction

Execution Exception

Integer Overflow, Trap, System Call, Breakpoint, Floating-point, Coprocessor 2 exception

Addressing

Address Error, TLB Refill, TLB Invalid, TLB Read Inhibit, TLB Execute Inhibit, TLB Modified

The *BadInstrP* register is provided to allow acceleration of instruction emulation. The *BadInstrP* register is only set by exceptions that are synchronous to an instruction. The *BadInstrP* register is not set by Interrupts or by NMI, Machine check, Bus Error, Cache Error, Watch, or EJTAG exceptions. When a synchronous exception occurs, and the faulting instruction is not in a branch delay slot, then the value stored in *BadInstrP* is **UNPREDICTABLE**.

Presence of the *BadInstrP* register is indicated by the *Config3<sub>BP</sub>* bit. The *BadInstrP* register is instantiated per-VPE in an MT ASE processor.

Figure 6.13 shows the proposed format of the BadInstrP register; Table 6.16 describes the BadInstrP register fields.

#### Figure 6.13 BadInstrP Register Format

31

BadInstrP

#### Table 6.16 BadInstrP Register Field Descriptions

| Fields    |      |                                                                                                                                  | Read / | Reset     |
|-----------|------|----------------------------------------------------------------------------------------------------------------------------------|--------|-----------|
| Name      | Bits | Description                                                                                                                      | Write  | State     |
| BadInstrP | 31:0 | Prior branch instruction.<br>Instruction words smaller than 32 bits are placed in bits<br>15:0, with bits 31:16 containing zero. | R      | Undefined |

## 6.2.13 Count Register (CP0 Register 9, Select 0)

The *Count* register acts as a timer, incrementing at a constant rate, whether or not an instruction is executed, retired, or any forward progress is made through the pipeline. The counter increments every other clock if the *DC* bit in the *Cause* register is 0.

The *Count* register can be written for functional or diagnostic purposes, including at reset or to synchronize processors.

By writing the *CountDM* bit in the *Debug* register, it is possible to control whether the *Count* register continues incrementing while the processor is in debug mode.

#### Figure 6.14 Count Register Format



### Table 6.17 Count Register Field Description

| Fiel  | ds   |                   |            |             |
|-------|------|-------------------|------------|-------------|
| Name  | Bits | Description       | Read/Write | Reset State |
| Count | 31:0 | Interval counter. | R/W        | Undefined   |

## 6.2.14 EntryHi Register (CP0 Register 10, Select 0)

The EntryHi register contains the virtual address match information used for TLB read, write, and access operations.

A TLB exception (TLB Refill, TLB Invalid, or TLB Modified) causes bits  $VA_{31..13}$  of the virtual address to be written into the *VPN2* field of the *EntryHi* register. An implementation of Release 2 of the Architecture which supports 1KB pages also writes  $VA_{12..11}$  into the *VPN2X* field of the *EntryHi* register. A TLBR instruction writes the *EntryHi* register with the corresponding fields from the selected TLB entry. The *ASID* field is written by software with the current address space identifier value and is used during the TLB comparison process to determine TLB match.

Because the *ASID* field is overwritten by a TLBR instruction, software must save and restore the value of ASID around use of the TLBR. This is especially important in TLB Invalid and TLB Modified exceptions, and in other memory management software.

The VPNX2 and VPN2 fields of the EntryHi register are not defined after an address error exception and these fields may be modified by hardware during the address error exception sequence. Software writes of the EntryHi register (via MTC0) do not cause the implicit write of address-related fields in the BadVAddr, Context registers.

#### Figure 6.15 EntryHi Register Format

| 31   | 13 12 11 10 8 | 7 0  |
|------|---------------|------|
| VPN2 | VPN2X 0       | ASID |

#### Table 6.18 EntryHi Register Field Descriptions

| VPN2 | 31:13 | $VA_{3113}$ of the virtual address (virtual page number / 2).                                                     | R/W | Undefined |
|------|-------|-------------------------------------------------------------------------------------------------------------------|-----|-----------|
|      |       | This field is written by hardware on a TLB exception or<br>on a TLB read, and is written by software before a TLB |     |           |
|      |       | write.                                                                                                            |     |           |

| VPN2X | 12:11 | In Release 2 of the Architecture, the VPN2X field is an extension to the VPN2 field to support 1KB pages. These bits are not writable by either hardware or software unless $Config3_{SP} = 1$ and $PageGrain_{ESP} = 1$ . If enabled for write, this field contains $VA_{1211}$ of the virtual address and is written by hardware on a TLB exception or on a TLB read, and is by software before a TLB write. If writes are not enabled, and in implementations of Release 1 of the Architecture, this field must be written with zero and returns zeros on read. | R/W | 0         |
|-------|-------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----|-----------|
| 0     | 10:8  | Must be written as zero; returns zero on read.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | 0   | 0         |
| ASID  | 7:0   | Address space identifier. This field is written by hardware<br>on a TLB read and by software to establish the current<br>ASID value for TLB write and against which TLB refer-<br>ences match each entry's TLB <i>ASID</i> field.                                                                                                                                                                                                                                                                                                                                  | R/W | Undefined |

| Table 6.18 | EntryHi Reg | gister Field | Descriptions | (Continued) |
|------------|-------------|--------------|--------------|-------------|
|------------|-------------|--------------|--------------|-------------|

## 6.2.15 Compare Register (CP0 Register 11, Select 0)

The *Compare* register acts in conjunction with the *Count* register to implement a timer and timer interrupt function. The timer interrupt is an output of the cores. The *Compare* register maintains a stable value and does not change on its own.

When the value of the *Count* register equals the value of the *Compare* register, the *SI\_TimerInt* pin is asserted. This pin will remain asserted until the *Compare* register is written. The *SI\_TimerInt* pin can be fed back into the core on one of the interrupt pins to generate an interrupt. However, this is no longer needed as the core will internally route the interrupt to the IP number set by the IntCt1.IPTI field.

For diagnostic purposes, the *Compare* register is a read/write register. In normal use, however, the *Compare* register is write-only. Writing a value to the *Compare* register, as a side effect, clears the timer interrupt.

## Figure 6.16 Compare Register Format

| 31 |         | 0 |
|----|---------|---|
|    | Compare |   |

#### **Table 6.19 Compare Register Field Description**

| Fiel    | ds     |                               |            |             |
|---------|--------|-------------------------------|------------|-------------|
| Name    | Bit(s) | Description                   | Read/Write | Reset State |
| Compare | 31:0   | Interval count compare value. | R/W        | Undefined   |

## 6.2.16 Status Register (CP0 Register 12, Select 0)

The *Status* register is a read/write register that contains the operating mode, interrupt enabling, and the diagnostic states of the processor. Fields of this register combine to create operating modes for the processor. Refer to

4.2 "Modes of Operation" on page 77 for a discussion of operating modes, and 5.3 "Interrupts" on page 98 for a discussion of interrupt modes.

Interrupt Enable: Interrupts are enabled when all of the following conditions are true:

- *IE* = 1
- EXL = 0
- ERL = 0
- DM = 0

If these conditions are met, then the settings of the *IM* and *IE* bits enable the interrupts.

**Operating Modes**: If the *DM* bit in the *Debug* register is 1, then the processor is in debug mode; otherwise the processor is in either kernel or user mode. The following CPU *Status* register bit settings determine user or kernel mode:

- User mode: UM = 1, EXL = 0, and ERL = 0
- Kernel mode: UM = 0, or EXL = 1, or ERL = 1

Coprocessor Accessibility: The *Status* register *CU* bits control coprocessor accessibility. If any coprocessor is unusable, then an instruction that accesses it generates an exception.

Figure 6.17 shows the format of the Status register; Table 6.20 describes the Status register fields.

#### Figure 6.17 Status Register Format 31 28 27 26 25 22 21 20 19 18 10 9 7 24 23 17 16 8 6 5 4 3 2 1 0 CU3..CU0 RP FR RE MX R BEV TS SR NMI IM9 CEE IM8..IM2 IM1..IM0 R UM R ERL EXL IE IPL **IPL**

| Field | ds   |                                                                                                                                                                                                                         |            |             |
|-------|------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name  | Bits | Description                                                                                                                                                                                                             | Read/Write | Reset State |
| CU3   | 31   | Controls access to coprocessor 3. COP3 is not supported.<br>This bit cannot be written and will read as 0.                                                                                                              | R          | 0           |
| CU2   | 30   | Controls access to coprocessor 2. This bit can only be writ-<br>ten if coprocessor is attached to the COP2 interface. ( <i>C2</i><br>bit in Config1 is set). This bit will read as 0 if no coproces-<br>sor is present. |            | 0           |
| CU1   | 29   | Controls access to coprocessor 1. This bit can only be written if the FPU is configured. This bit will read as 0 if the FPU is not present.                                                                             | R/W        | 0           |

## **Table 6.20 Status Register Field Descriptions**

| Field | ls   |                              |                                                                                                                        |            |                           |
|-------|------|------------------------------|------------------------------------------------------------------------------------------------------------------------|------------|---------------------------|
| Name  | Bits | -                            | Description                                                                                                            | Read/Write | Reset State               |
| CU0   | 28   | Controls access to           | o coprocessor 0:                                                                                                       | R/W        | Undefined                 |
|       |      | Encoding                     | Meaning                                                                                                                |            |                           |
|       |      | 0                            | Access not allowed                                                                                                     |            |                           |
|       |      | 1                            | Access allowed                                                                                                         |            |                           |
|       |      |                              | always usable when the processor is run-<br>ode, independent of the state of the <i>CU0</i>                            |            |                           |
| RP    | 27   |                              | power mode. The state of the <i>RP</i> bit is external core interface as the <i>SI_RP</i> sig-                         | R/W        | 0 for Cold<br>Reset only. |
| FR    | 26   | microAptiv UP c              | to floating-point registers. Because the ore does not contain a floating-point unit, I on writes and reads as zero.    | R/W        | 0                         |
| RE    | 25   |                              | everse-endian memory references while unning in user mode:                                                             | R/W        | Undefined                 |
|       |      | Encoding                     | Meaning                                                                                                                |            |                           |
|       |      | 0                            | User mode uses configured endianness                                                                                   |            |                           |
|       |      | 1                            | User mode uses reversed endianness                                                                                     |            |                           |
|       |      |                              | lode nor Kernel Mode nor Supervisor are affected by the state of this bit.                                             |            |                           |
| MX    | 24   | MIPS DSP Exten<br>resources: | ision. Enables access to DSP Module                                                                                    | R/W        | 0                         |
|       |      | Encoding                     | Meaning                                                                                                                |            |                           |
|       |      | 0                            | Access not allowed                                                                                                     |            |                           |
|       |      | 1                            | Access allowed                                                                                                         |            |                           |
|       |      | this bit has been s          | ecute any DSP Module instruction before<br>set to 1 will cause a DSP State Disabled<br>ate of this bit is reflected in |            |                           |
| R     | 23   | Reserved. This fi            | eld is ignored on writes and reads as 0.                                                                               | R          | 0                         |
| BEV   | 22   | Controls the loca            | tion of exception vectors:                                                                                             | R/W        | 1                         |
|       |      | Encoding                     | Meaning                                                                                                                |            |                           |
|       |      | 0                            | Normal                                                                                                                 |            |                           |
|       |      | 1                            | Bootstrap                                                                                                              |            |                           |

| Field   | ls           |                                                                                                                   |                                                                                                                                                             |            |                               |
|---------|--------------|-------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------------------------|
| Name    | Bits         |                                                                                                                   | Description                                                                                                                                                 | Read/Write | Reset State                   |
| TS      | 21           | match on multiple<br>TLBWR instruction<br>down condition in<br>exception is also<br>TLB-based MMU<br>FM-based MMU | write a 0 to this bit to clear it and cannot                                                                                                                | R/W        | 0                             |
| SR      | 20           | Indicates that the was due to a Soft                                                                              | entry through the reset exception vector Reset:                                                                                                             | R/W        | 1 for Soft<br>Reset; 0 other- |
|         |              | Encoding                                                                                                          | Meaning                                                                                                                                                     |            | wise                          |
|         |              | 0                                                                                                                 | Not Soft Reset (NMI or Reset)                                                                                                                               |            |                               |
|         |              | 1                                                                                                                 | Soft Reset                                                                                                                                                  |            |                               |
|         |              | Software can only force a 0-1 transit                                                                             | y write a 0 to this bit to clear it and cannot tion.                                                                                                        |            |                               |
| NMI     | 19           | Indicates that the was due to an NM                                                                               | entry through the reset exception vector II:                                                                                                                | R/W        | 1 for NMI; 0<br>otherwise     |
|         |              | Encodin                                                                                                           | g Meaning                                                                                                                                                   |            |                               |
|         |              | 0                                                                                                                 | Not NMI (Soft Reset or Reset)                                                                                                                               |            |                               |
|         |              | 1                                                                                                                 | NMI                                                                                                                                                         |            |                               |
|         |              | Software can only force a 0-1 transit                                                                             | y write a 0 to this bit to clear it and cannot tion.                                                                                                        |            |                               |
| CEE     | 17           |                                                                                                                   | e: Implementation-dependent. If CorEx-                                                                                                                      | R/W        | Undefined                     |
|         |              | attempt to execut<br>cleared will resul                                                                           | tes that this bit should be used, any<br>e a CorExtend instruction with this bit<br>t in a CorExtend Unusable exception.<br>ed if CorExtend is not present. |            |                               |
| IM9:IM2 | 18,<br>16:10 | ware interrupts. F                                                                                                | Controls the enabling of each of the hard-<br>Refer to 5.3 "Interrupts" on page 98 for a ion of enabled interrupts.                                         | R/W        | Undefined for<br>IM7:IM2      |
|         |              | Encoding                                                                                                          | Meaning                                                                                                                                                     |            | 0 for IM9:IM8                 |
|         |              | 0                                                                                                                 | Interrupt request disabled                                                                                                                                  |            |                               |
|         |              | 1                                                                                                                 | Interrupt request enabled                                                                                                                                   |            |                               |
|         |              | which EIC interru                                                                                                 | ns of Release 2 of the Architecture in upt mode is enabled ( $Config3_{VEIC} = 1$ ),                                                                        |            |                               |
|         |              | these bits have a the <i>IPL</i> field, des                                                                       | different meaning and are interpreted as cribed below.                                                                                                      |            |                               |

Table 6.20 Status Register Field Descriptions (Continued)

| Fields  |              |                                                                                                                                   |                                                                                                                                                                                 |            |                                                      |
|---------|--------------|-----------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|------------------------------------------------------|
| Name    | Bits         |                                                                                                                                   | Description                                                                                                                                                                     | Read/Write | Reset State                                          |
| IPL     | 18,<br>16:10 | which EIC interru<br>this field is the en<br>An interrupt will<br>higher than this v<br>If EIC interrupt n<br>these bits have a o | ns of Release 2 of the Architecture in<br>upt mode is enabled ( $Config3_{VEIC} = 1$ ),<br>ncoded (0:255) value of the current IPL.<br>be signaled only if the requested IPL is | R/W        | Undefined for<br>IPL15:IPL10<br>0 for<br>IPL18:IPL17 |
| IM1:IM0 | 9:8          | ware interrupts. R                                                                                                                | Controls the enabling of each of the soft-<br>Refer to Section 5.3 "Interrupts" for a<br>ion of enabled interrupts.                                                             | R/W        | Undefined                                            |
|         |              | Encoding                                                                                                                          | Meaning                                                                                                                                                                         |            |                                                      |
|         |              | 0                                                                                                                                 | Interrupt request disabled                                                                                                                                                      |            |                                                      |
|         |              | 1                                                                                                                                 | Interrupt request enabled                                                                                                                                                       |            |                                                      |
|         |              | which EIC interru                                                                                                                 | ns of Release 2 of the Architecture in<br>apt mode is enabled, these bits are writ-<br>effect on the interrupt system.                                                          |            |                                                      |
| R       | 7:5          | Reserved. This fi                                                                                                                 | eld is ignored on writes and reads as 0.                                                                                                                                        | R          | 0                                                    |
| UM      | 4            | See Section 4.2 "<br>of operating mod                                                                                             | he base operating mode of the processor.<br>Modes of Operation" for a full discussion<br>es. The encoding of this bit is:                                                       | R/W        | Undefined                                            |
|         |              | Encoding                                                                                                                          | Meaning                                                                                                                                                                         |            |                                                      |
|         |              | 0                                                                                                                                 | Base mode is Kernel Mode                                                                                                                                                        |            |                                                      |
|         |              | 1                                                                                                                                 | Base mode is User Mode                                                                                                                                                          |            |                                                      |
|         |              |                                                                                                                                   | cessor can also be in kernel mode if <i>ERL</i> gardless of the state of the <i>UM</i> bit.                                                                                     |            |                                                      |
| R       | 3            | This bit is reserve<br>as zero.                                                                                                   | ed. This bit is ignored on writes and reads                                                                                                                                     | R          | 0                                                    |

Table 6.20 Status Register Field Descriptions (Continued)

| Fields |      |                                                                                                                                                                                                            |                                                                                                                                                                                                                |            |             |
|--------|------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name   | Bits | -                                                                                                                                                                                                          | Description                                                                                                                                                                                                    | Read/Write | Reset State |
| ERL    | 2    |                                                                                                                                                                                                            | by the processor when a Reset, Soft Reset, ror exception are taken.                                                                                                                                            | R/W        | 1           |
|        |      | Encoding                                                                                                                                                                                                   | Meaning                                                                                                                                                                                                        |            |             |
|        |      | 0                                                                                                                                                                                                          | Normal level                                                                                                                                                                                                   |            |             |
|        |      | 1                                                                                                                                                                                                          | Error level                                                                                                                                                                                                    |            |             |
|        |      | <ul> <li>Interrupts are d</li> <li>The ERET inst<br/>ErrorEPC inst</li> <li>The lower 2<sup>29</sup> t<br/>and uncached r<br/>Management o<br/>This allows ma<br/>of cache errors<br/>UNDEFINED</li> </ul> | is running in kernel mode<br>lisabled<br>ruction will use the return address held in                                                                                                                           |            |             |
| EXL    | 1    | other than Reset,                                                                                                                                                                                          | Set by the processor when any exception<br>Soft Reset, or NMI exceptions is taken.                                                                                                                             | R/W        | Undefined   |
|        |      | Encoding                                                                                                                                                                                                   | Meaning                                                                                                                                                                                                        |            |             |
|        |      | 0                                                                                                                                                                                                          | Normal level                                                                                                                                                                                                   |            |             |
|        |      | 1                                                                                                                                                                                                          | Exception level                                                                                                                                                                                                |            |             |
|        |      | <ul> <li>Interrupts are d</li> <li>TLB Refill exc<br/>instead of the T</li> <li>EPC, Cause<sub>B</sub></li> </ul>                                                                                          | is running in Kernel Mode<br>lisabled.<br>eptions use the general exception vector<br>ILB Refill vectors.<br><sub>D</sub> and <i>SRSCtl</i> (implementations of<br>e Architecture only) will not be updated if |            |             |
| IE     | 0    | Interrupt Enable:<br>and hardware inte                                                                                                                                                                     | Acts as the master enable for software errupts:                                                                                                                                                                | R/W        | Undefined   |
|        |      | Encoding                                                                                                                                                                                                   | Meaning                                                                                                                                                                                                        |            |             |
|        |      | 0                                                                                                                                                                                                          | Interrupts are disabled                                                                                                                                                                                        |            |             |
|        |      | 1                                                                                                                                                                                                          | Interrupts are enabled                                                                                                                                                                                         |            |             |
|        |      |                                                                                                                                                                                                            | e Architecture, this bit may be modified<br>DI and EI instructions.                                                                                                                                            |            |             |

Table 6.20 Status Register Field Descriptions (Continued)

## 6.2.17 IntCtl Register (CP0 Register 12, Select 1)

The *IntCtl* register controls the expanded interrupt capability added in Release 2 of the Architecture, including vectored interrupts and support for an external interrupt controller. This register does not exist in implementations of Release 1 of the Architecture.

Figure 6.18 shows the format of the IntCtl register; Table 6.21 describes the IntCtl register fields.

|    |    |    |     |     |    |    |     | Figure 6.1 | 8 Ir | ntCtl F    | Registe | er Forr     | nat   |   |    |   |   |   |
|----|----|----|-----|-----|----|----|-----|------------|------|------------|---------|-------------|-------|---|----|---|---|---|
| 31 | 29 | 28 | 26  | 25  | 23 | 22 | 21  | 20         | 16   | 15         | 14      | 13          | 12 10 | 9 | 5  | 4 |   | 0 |
| IP | ΓI | IP | PCI | IPF | DC | PF | ICE | StkDec     |      | Clr<br>EXL | APE     | Use<br>KStk | 000   |   | VS |   | 0 |   |

## Table 6.21 IntCtl Register Field Descriptions

| Fiel | ds    |                                                                                            |                                                                                              |                                                                                                                                                                                                 |       | Reset             |
|------|-------|--------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------|-------------------|
| Name | Bits  | 1                                                                                          | Desc                                                                                         | Read/Write                                                                                                                                                                                      | State |                   |
| IPTI | 31:29 | modes, this fiel<br>Timer Interrupt                                                        | ld specifies<br>t request is i                                                               | y and Vectored Interrupt<br>the IP number to which the<br>nerged, and allows software<br>nsider <i>Cause<sub>TI</sub></i> for a poten-                                                          | R     | Externally<br>Set |
|      |       | Encoding                                                                                   | IP bit                                                                                       | Hardware Interrupt<br>Source                                                                                                                                                                    |       |                   |
|      |       | 2                                                                                          | 2                                                                                            | HW0                                                                                                                                                                                             |       |                   |
|      |       | 3                                                                                          | 3                                                                                            | HW1                                                                                                                                                                                             |       |                   |
|      |       | 4                                                                                          | 4                                                                                            | HW2                                                                                                                                                                                             |       |                   |
|      |       | 5                                                                                          | 5                                                                                            | HW3                                                                                                                                                                                             |       |                   |
|      |       | 6                                                                                          | 6                                                                                            | HW4                                                                                                                                                                                             |       |                   |
|      |       | 7                                                                                          | 7                                                                                            | HW5                                                                                                                                                                                             |       |                   |
|      |       | SI_IPTI[2:0].<br>cate the specifi<br>the SI_Timerli<br>The value of the<br>Interrupt Contr | This allows<br>c SI_Int has<br>nt signal is a<br>is field is no<br>oller Mode<br>is expected | by the static input,<br>external logic to communi-<br>rdware interrupt pin to which<br>attached.<br>bt meaningful if External<br>is enabled. The external inter-<br>to provide this information |       |                   |

| Fiel  | ds    |                                                                                                                                             |                                                                                                     |                                                                                                                                                                                                                             |            | Deast                          |
|-------|-------|---------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|--------------------------------|
| Name  | Bits  | -                                                                                                                                           | Desc                                                                                                | cription                                                                                                                                                                                                                    | Read/Write | Reset<br>State                 |
| IPPCI | 28:26 | modes, this field<br>Performance Co                                                                                                         | d specifies<br>ounter Inter<br>to determi                                                           | and Vectored Interrupt<br>the IP number to which the<br>rupt request is merged, and<br>ne whether to consider<br>nterrupt.                                                                                                  | R          | 0                              |
|       |       | Encoding                                                                                                                                    | IP bit                                                                                              | Hardware Interrupt<br>Source                                                                                                                                                                                                |            |                                |
|       |       | 2                                                                                                                                           | 2                                                                                                   | HW0                                                                                                                                                                                                                         |            |                                |
|       |       | 3                                                                                                                                           | 3                                                                                                   | HW1                                                                                                                                                                                                                         |            |                                |
|       |       | 4                                                                                                                                           | 4                                                                                                   | HW2                                                                                                                                                                                                                         |            |                                |
|       |       | 5                                                                                                                                           | 5                                                                                                   | HW3                                                                                                                                                                                                                         |            |                                |
|       |       | 6                                                                                                                                           | 6                                                                                                   | HW4                                                                                                                                                                                                                         |            |                                |
|       |       | 7                                                                                                                                           | 7                                                                                                   | HW5                                                                                                                                                                                                                         |            |                                |
| IPFDC | 25:23 | the SI_PCInt si<br>The value of thi<br>Interrupt Control<br>rupt controller i<br>for that interrup<br>For Interrupt Co<br>modes, this field | gnal is atta<br>s field is no<br>oller Mode<br>s expected<br>t mode.<br>ompatibility<br>d specifies | rdware interrupt pin to which<br>ched.<br>ot meaningful if External<br>is enabled. The external inter-<br>to provide this information<br>and Vectored Interrupt<br>the IP number to which the<br>upt request is merged, and | R          | Preset or<br>Externally<br>Set |
|       |       | allows software<br><i>Cause<sub>FDC</sub></i> for                                                                                           |                                                                                                     |                                                                                                                                                                                                                             |            |                                |
|       |       | Encoding                                                                                                                                    | IP bi                                                                                               | Hardware<br>t Interrupt Source                                                                                                                                                                                              |            |                                |
|       |       | 2                                                                                                                                           | 2                                                                                                   | HW0                                                                                                                                                                                                                         |            |                                |
|       |       | 3                                                                                                                                           | 3                                                                                                   | HW1                                                                                                                                                                                                                         |            |                                |
|       |       | 4                                                                                                                                           | 4                                                                                                   | HW2                                                                                                                                                                                                                         |            |                                |
|       |       | 5                                                                                                                                           | 5                                                                                                   | HW3                                                                                                                                                                                                                         |            |                                |
|       |       | 6                                                                                                                                           | 6                                                                                                   | HW4                                                                                                                                                                                                                         |            |                                |
|       |       | 7                                                                                                                                           | 7                                                                                                   | HW5                                                                                                                                                                                                                         |            |                                |
|       |       | nal Interrupt Co<br>enabled. The ex<br>provide this info                                                                                    | ontroller Mo<br>ternal inter<br>ormation fo                                                         | <b>NPREDICTABLE</b> if Exter-<br>ode is both implemented and<br>rupt controller is expected to<br>or that interrupt mode.<br>emented, this field returns                                                                    |            |                                |

Table 6.21 IntCtl Register Field Descriptions (Continued)

| Field  | ls    |           |           |                |                                                        |                                                       |       |            | Reset |
|--------|-------|-----------|-----------|----------------|--------------------------------------------------------|-------------------------------------------------------|-------|------------|-------|
| Name   | Bits  | _         |           |                | Description                                            |                                                       |       | Read/Write | State |
| PF     | 22    | Enable    | s Vector  | Pref           | etching Feature                                        | •                                                     |       | RW         | 0     |
|        |       | Enc       | oding     |                | Меа                                                    | ning                                                  |       |            |       |
|        |       |           | 0         | Vec            | tor Prefetching                                        | disabled.                                             |       |            |       |
|        |       |           | 1         | Vec            | tor Prefetching                                        | enabled.                                              |       |            |       |
| ICE    | 21    | For IRI   | ET instru | uction         | n. Enables Inter                                       | rupt Chaining.                                        |       | RW         | 0     |
|        |       | Enc       | oding     |                | Меа                                                    | ning                                                  |       |            |       |
|        |       |           | 0         | Inte           | errupt Chaining                                        | disabled                                              |       |            |       |
|        |       |           | 1         | Inte           | errupt Chaining                                        | enabled                                               |       |            |       |
| StkDec | 20:16 |           |           |                | feature. This is the nented from the                   |                                                       |       | RW         | 0x3   |
|        |       |           | Encodi    | ing            | Decrement<br>Amount in<br>Words                        | Decrement<br>Amount in<br>Bytes                       |       |            |       |
|        |       |           | 0-3       |                | 3                                                      | 12                                                    |       |            |       |
|        |       |           | Other     | S              | As encoded,<br>e.g. 0x5<br>means 5<br>words            | 4 * encoded<br>value<br>e.g. 0x5<br>means 20<br>bytes |       |            |       |
| ClrEXL | 15    | If set, d | during A  | uto-F          | feature and IRE<br>Prologue and IR<br>EXL fields are o | ET interrupt cl                                       | nain- | RW         | 0     |
|        |       | Enco      | oding     |                | Mea                                                    | ning                                                  |       |            |       |
|        |       | (         | 0         | Field<br>tions | ds are not cleare                                      | ed by these ope                                       | ra-   |            |       |
|        |       |           | 1         | Field          | ds are cleared b                                       | y these operation                                     | ons.  |            |       |
| APE    | 14    | Enable    | s Auto-F  | Prolo          | gue feature.                                           |                                                       |       | RW         | 0     |
|        |       | Enc       | oding     |                | Меа                                                    | ning                                                  |       |            |       |
|        |       |           | 0         | Aut            | to-Prologue dis                                        | abled                                                 |       |            |       |
|        |       |           | 1         | Aut            | to-Prologue ena                                        | bled                                                  |       |            |       |

## Table 6.21 IntCtl Register Field Descriptions (Continued)

| Fiel    | ds    |                            |                                                                                                |                                                                                                         |            | Reset |
|---------|-------|----------------------------|------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------|------------|-------|
| Name    | Bits  | -                          | Description                                                                                    |                                                                                                         | Read/Write | State |
| UseKStk | 13    | Chooses which<br>Prologue. | Stack to use during                                                                            | Interrupt Automated                                                                                     | RW         | 0     |
|         |       | Encoding                   | Меа                                                                                            | ining                                                                                                   |            |       |
|         |       | 0                          | Copy \$29 of the Pr<br>Current SRS at the                                                      |                                                                                                         |            |       |
|         |       |                            | This is used for Ba<br>ments with only or                                                      |                                                                                                         |            |       |
|         |       | 1                          | nel mode stacks. Ir<br>the SRS used durir                                                      | vironments where<br>User-mode and Ker-<br>n this case, \$29 of<br>ng IAP must be<br>oftware to hold the |            |       |
| 0       | 12:10 | Must be writter            | n as zero; returns zer                                                                         | o on read.                                                                                              | 0          | 0     |
| VS      | 9:5   | (as denoted by             | . If vectored interrup<br><i>Config3<sub>VInt</sub></i> or <i>Con</i><br>pacing between vector | fig3 <sub>VEIC</sub> ), this field                                                                      | R/W        | 0     |
|         |       | Encoding                   | Spacing<br>Between<br>Vectors (hex)                                                            | Spacing<br>Between<br>Vectors<br>(decimal)                                                              |            |       |
|         |       | 16#00                      | 16#000                                                                                         | 0                                                                                                       |            |       |
|         |       | 16#01                      | 16#020                                                                                         | 32                                                                                                      |            |       |
|         |       | 16#02                      | 16#040                                                                                         | 64                                                                                                      |            |       |
|         |       | 16#04                      | 16#080                                                                                         | 128                                                                                                     |            |       |
|         |       | 16#08                      | 16#100                                                                                         | 256                                                                                                     |            |       |
|         |       | 16#10                      | 16#200                                                                                         | 512                                                                                                     |            |       |
|         |       |                            | s are reserved. The c<br>EFINED if a reserve                                                   | peration of the pro-<br>ed value is written to                                                          |            |       |
| 0       | 4:0   | Must be written            |                                                                                                |                                                                                                         | 0          | 0     |

Table 6.21 IntCtl Register Field Descriptions (Continued)

## 6.2.18 SRSCtl Register (CP0 Register 12, Select 2)

The *SRSCtl* register controls the operation of GPR shadow sets in the processor. This register does not exist in implementations of the architecture prior to Release 2.

Figure 6.19 shows the format of the SRSCt/ register; Table 6.22 describes the SRSCt/ register fields.

|    | Figure 6.19 SRSCtl Register Format |    |     |    |    |            |    |       |    |        |        |    |     |    |    |         |   |     |   |        |        |   |     |   |  |
|----|------------------------------------|----|-----|----|----|------------|----|-------|----|--------|--------|----|-----|----|----|---------|---|-----|---|--------|--------|---|-----|---|--|
| 31 | 30                                 | 29 |     | 26 | 25 |            | 22 | 21    | 18 | 17     | 16     | 15 |     | 12 | 11 | 10      | 9 |     | 6 | 5      | 4      | 3 |     | 0 |  |
| 0  | 0                                  |    | HSS |    |    | 0<br>00 00 |    | EICSS |    | (<br>0 | )<br>0 |    | ESS |    |    | 0<br>00 |   | PSS |   | (<br>0 | )<br>0 |   | CSS |   |  |

#### Fields Reset Name Bits Description **Read/Write** State 0 31:30 Must be written as zeros; returns zero on read. 0 0 HSS 29:26 R Highest Shadow Set. This field contains the highest Preset shadow set number that is implemented by this processor. A value of zero in this field indicates that only the normal GPRs are implemented. Possible values of this field for the microAptiv UP processor are: Encoding Meaning 0 One shadow set (normal GPR set) is present. 1 Two shadow sets are present. 3 Four shadow sets are present. 7 Eight shadow sets are present 15 Sixteen shadow sets are present 2, 4-6, 8-14 Reserved The value in this field also represents the highest value that can be written to the ESS, EICSS, PSS, and CSS fields of this register, or to any of the fields of the SRSMap register. The operation of the processor is **UNDEFINED** if a value larger than the one in this field is written to any of these other fields. 0 25:22 0 0 Must be written as zeros; returns zero on read. EICSS 21:18 R Undefined EIC interrupt mode shadow set. If Config3<sub>VEIC</sub> is 1 (EIC interrupt mode is enabled), this field is loaded from the external interrupt controller for each interrupt request and is used in place of the SRSMap register to select the current shadow set for the interrupt. See Section 5.3.1 "Interrupt Modes" for a discussion of EIC interrupt mode. If *Config3*<sub>VFIC</sub> is 0, this field must be written as zero, and returns zero on read. 0 17:16 Must be written as zeros; returns zero on read. 0 0 R/W ESS 15:12 0 Exception Shadow Set. This field specifies the shadow set to use on entry to Kernel Mode caused by any exception other than a vectored interrupt. The operation of the processor is **UNDEFINED** if soft-

ware writes a value into this field that is greater than the

### **Table 6.22 SRSCtl Register Field Descriptions**

value in the HSS field.

| Fiel | ds    |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |            | Reset |
|------|-------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------|
| Name | Bits  | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | Read/Write | State |
| 0    | 11:10 | Must be written as zeros; returns zero on read.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | 0          | 0     |
| PSS  | 9:6   | Previous Shadow Set. If GPR shadow registers are<br>implemented, and with the exclusions noted in the next<br>paragraph, this field is copied from the <i>CSS</i> field when<br>an exception or interrupt occurs. An ERET instruction<br>copies this value back into the <i>CSS</i> field if <i>Status<sub>BEV</sub></i> = 0.<br>This field is not updated on any exception which sets<br><i>Status<sub>ERL</sub></i> to 1 (i.e., Reset, Soft Reset, NMI, cache<br>error), an entry into EJTAG Debug mode, or any excep-<br>tion or interrupt that occurs with <i>Status<sub>EXL</sub></i> = 1, or<br><i>Status<sub>BEV</sub></i> = 1. This field is notupdated on an exception<br>that occurs while <i>Status<sub>ERL</sub></i> = 1.<br>The operation of the processor is <b>UNDEFINED</b> if soft-<br>ware writes a value into this field that is greater than the<br>value in the <i>HSS</i> field.                                                                                                                                                                                           | R/W        | 0     |
| 0    | 5:4   | Must be written as zeros; returns zero on read.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | 0          | 0     |
| CSS  | 3:0   | Current Shadow Set. If GPR shadow registers are imple-<br>mented, this field is the number of the current GPR set.<br>With the exclusions noted in the next paragraph, this<br>field is updated with a new value on any interrupt or<br>exception, and restored from the <i>PSS</i> field on an ERET.<br>Table 6.23 describes the various sources from which the<br><i>CSS</i> field is updated on an exception or interrupt.<br>This field is not updated on any exception which sets<br><i>Status<sub>ERL</sub></i> to 1 (i.e., Reset, Soft Reset, NMI, cache<br>error), an entry into EJTAG Debug mode, or any excep-<br>tion or interrupt that occurs with <i>Status<sub>EXL</sub></i> = 1, or<br><i>Status<sub>BEV</sub></i> = 1. Neither is it updated on an ERET with<br><i>Status<sub>ERL</sub></i> = 1 or <i>Status<sub>BEV</sub></i> = 1. This field is not<br>updated on an exception that occurs while <i>Status<sub>ERL</sub></i> = 1.<br>The value of <i>CSS</i> can be changed directly by software<br>only by writing the <i>PSS</i> field and executing an ERET<br>instruction. | R          | 0     |

## Table 6.22 SRSCtl Register Field Descriptions (Continued)

## 6.2.19 SRSMap Register (CP0 Register 12, Select 3)

| Exception Type              | Condition                                                                | SRSCtl <sub>CSS</sub> Source | Comment                                                         |  |  |  |  |  |
|-----------------------------|--------------------------------------------------------------------------|------------------------------|-----------------------------------------------------------------|--|--|--|--|--|
| Exception                   | All                                                                      | SRSCtl <sub>ESS</sub>        |                                                                 |  |  |  |  |  |
| Non-Vectored<br>Interrupt   | $Cause_{IV} = 0$                                                         | SRSCtl <sub>ESS</sub>        | Treat as exception                                              |  |  |  |  |  |
| Vectored Interrupt          | $Cause_{IV} = 1$ and<br>$Config3_{VEIC} = 0$ and<br>$Config3_{VInt} = 1$ | SRSMap <sub>VECTNUM</sub>    | Source is internal map register.<br>(for VECTNUM see Table 5.3) |  |  |  |  |  |
| Vectored EIC Inter-<br>rupt | Cause <sub>IV</sub> =1 and<br>Config3 <sub>VEIC</sub> =1                 | SRSCtl <sub>EICSS</sub>      | Source is external interrupt con-<br>troller.                   |  |  |  |  |  |

| Table 6.23 Sources f | or new SRSCtl <sub>CSS</sub> on | an Exception or Interrupt |
|----------------------|---------------------------------|---------------------------|
|----------------------|---------------------------------|---------------------------|

The *SRSMap* register contains 8, 4-bit fields that provide the mapping from a vector number to the shadow set number to use when servicing such an interrupt. The values from this register are not used for a non-interrupt exception, or a non-vectored interrupt ( $Cause_{IV} = 0$  or  $IntCt/_{VS} = 0$ ). In such cases, the shadow set number comes from *SRSCtl<sub>ESS</sub>*.

If *SRSCtl<sub>HSS</sub>* is zero, the results of a software read or write of this register are **UNPREDICTABLE**.

The operation of the processor is **UNDEFINED** if a value is written to any field in this register that is greater than the value of *SRSCtl<sub>HSS</sub>*.

The *SRSMap* register contains the shadow register set numbers for vector numbers 7:0. The same shadow set number can be established for multiple interrupt vectors, creating a many-to-one mapping from a vector to a single shadow register set number.

Figure 6.20 shows the format of the SRSMap register; Table 6.24 describes the SRSMap register fields.

| Figure 6.20 | SRSMap Register Format |
|-------------|------------------------|
|             |                        |

| 31 28 | 27 24 | 23 20 | 19 16 | 15 12 | 11 8 | 7 4  | 3 0  |
|-------|-------|-------|-------|-------|------|------|------|
| SSV7  | SSV6  | SSV5  | SSV4  | SSV3  | SSV2 | SSV1 | SSV0 |

#### Table 6.24 SRSMap Register Field Descriptions

| Fiel | ds    |                                                |            |             |
|------|-------|------------------------------------------------|------------|-------------|
| Name | Bits  | Description                                    | Read/Write | Reset State |
| SSV7 | 31:28 | Shadow register set number for Vector Number 7 | R/W        | 0           |
| SSV6 | 27:24 | Shadow register set number for Vector Number 6 | R/W        | 0           |
| SSV5 | 23:20 | Shadow register set number for Vector Number 5 | R/W        | 0           |
| SSV4 | 19:16 | Shadow register set number for Vector Number 4 | R/W        | 0           |
| SSV3 | 15:12 | Shadow register set number for Vector Number 3 | R/W        | 0           |
| SSV2 | 11:8  | Shadow register set number for Vector Number 2 | R/W        | 0           |
| SSV1 | 7:4   | Shadow register set number for Vector Number 1 | R/W        | 0           |
| SSV0 | 3:0   | Shadow register set number for Vector Number 0 | R/W        | 0           |

## 6.2.20 View\_IPL Register (CP0 Register 12, Select 4)

| Figure 6-21 View_IPL Register Format |   |    |     |   |  |  |  |  |  |  |
|--------------------------------------|---|----|-----|---|--|--|--|--|--|--|
| 31                                   |   | 10 | 9   | 0 |  |  |  |  |  |  |
|                                      | 0 |    | IM  |   |  |  |  |  |  |  |
|                                      |   |    | IPL |   |  |  |  |  |  |  |

| Fie  | lds       |                                                                                                                               | Read / |                                           |  |
|------|-----------|-------------------------------------------------------------------------------------------------------------------------------|--------|-------------------------------------------|--|
| Name | Bits      | Description                                                                                                                   | Write  | Reset State                               |  |
| IM   | 9:0       | Interrupt Mask.<br>If EIC interrupt mode is not enabled, controls which inter-<br>rupts are enabled.                          | R/W    | Undefined for<br>IM7:IM2<br>0 for IM9:IM8 |  |
| IPL  | 9:2       | Interrupt Priority Level.<br>If EIC interrupt mode is enabled, this field is the encoded<br>value of the current <i>IPL</i> . | R/W    | Undefined                                 |  |
| 0    | 31:10,1:0 | Must be written as zero; returns zero on read.                                                                                | 0      | 0                                         |  |

## Table 6.25 View\_IPL Register Field Descriptions

This register gives read and write access to the *IM* or *IPL* field that is also available in the *Status* Register. The use of this register allows the Interrupt Mask or the Priority Level to be read/written without extracting/inserting that bit field from/to the *Status* Register.

The *IPL* field might be located in non-contiguous bits within the *Status* Register. All of the *IPL* bits are presented as a contiguous field within this register.

## 6.2.21 SRSMap2 Register (CP0 Register 12, Select 5)

The *SRSMap2* register contains 2 4-bit fields that provide the mapping from an vector number to the shadow set number to use when servicing such an interrupt. The values from this register are not used for a non-interrupt exception, or a non-vectored interrupt ( $Cause_{IV} = 0$  or  $IntCtl_{VS} = 0$ ). In such cases, the shadow set number comes from  $SRSCtl_{ESS}$ .

If SRSCtl<sub>HSS</sub> is zero, the results of a software read or write of this register are UNPREDICTABLE.

The operation of the processor is **UNDEFINED** if a value is written to any field in this register that is greater than the value of  $SRSCtl_{HSS}$ .

The *SRSMap2* register contains the shadow register set numbers for vector numbers 9:8. The same shadow set number can be established for multiple interrupt vectors, creating a many-to-one mapping from a vector to a single shadow register set number.

Figure 6-22 shows the format of the SRSMap2 register; Table 6.26 describes the SRSMap2 register fields.

| 31 | 8 | 7 |      | 4 | 3 |      | 0 |
|----|---|---|------|---|---|------|---|
| 0  |   |   | SSV9 |   |   | SSV8 |   |

Figure 6-22 SRSMan Register Format

# Table 6.26 SRSMap Register Field Descriptions

| Fie  | elds                |                                                | Read / | Reset |
|------|---------------------|------------------------------------------------|--------|-------|
| Name | me Bits Description |                                                | Write  | State |
| 0    | 31:8                | Must be written as zero; returns zero on read. | R      | 0     |
| SSV9 | 7:4                 | Shadow register set number for Vector Number 9 | R/W    | 0     |
| SSV8 | 3:0                 | Shadow register set number for Vector Number 8 | R/W    | 0     |

## 6.2.22 Cause Register (CP0 Register 13, Select 0)

The *Cause* register primarily describes the cause of the most recent exception. In addition, fields also control software interrupt requests and the vector through which interrupts are dispatched. With the exception of the *IP1..0, DC*, *IV*, and *WP* fields, all fields in the *Cause* register are read-only. Release 2 of the Architecture added optional support for an External Interrupt Controller (EIC) interrupt mode, in which *IP7..2* are interpreted as the Requested Interrupt Priority Level (RIPL).

Figure 6.23 shows the format of the Cause register; Table 6.27 describes the Cause register fields.

#### Figure 6.23 Cause Register Format

| 31 30 29 28 27 26 25 24 23 22 21 20 18                                 | 17 10  | 987      | 6 2      | 1 0 |
|------------------------------------------------------------------------|--------|----------|----------|-----|
| BD TI     CE     DC PCI     IC     AP     IV     WP     FD<br>CI     0 | IP9IP2 | IP1IP0 0 | Exc Code | 0   |
|                                                                        | RIPL   |          |          |     |

#### **Table 6.27 Cause Register Field Descriptions**

| Fiel | ds   |                                     |                                                                              |            |             |
|------|------|-------------------------------------|------------------------------------------------------------------------------|------------|-------------|
| Name | Bits |                                     | Description                                                                  | Read/Write | Reset State |
| BD   | 31   | Indicates wheth<br>branch delay slo | er the last exception taken occurred in a ot:                                | R          | Undefined   |
|      |      | Encoding                            | Meaning                                                                      |            |             |
|      |      | 0                                   | Not in delay slot                                                            |            |             |
|      |      | 1                                   | In delay slot                                                                |            |             |
|      |      | The processor u when the excep      | pdates <i>BD</i> only if <i>Status<sub>EXL</sub></i> was zero tion occurred. |            |             |

| Fields |       |                                      |                                                                                                                                                                              |            |             |  |
|--------|-------|--------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|--|
| Name   | Bits  | -                                    | Description                                                                                                                                                                  | Read/Write | Reset State |  |
| TI     | 30    |                                      | t. This bit denotes whether a timer inter-<br>(analogous to the <i>IP</i> bits for other inter-                                                                              | R          | Undefined   |  |
|        |       | Encoding                             | Meaning                                                                                                                                                                      |            |             |  |
|        |       | 0                                    | No timer interrupt is pending                                                                                                                                                |            |             |  |
|        |       | 1                                    | Timer interrupt is pending                                                                                                                                                   |            |             |  |
|        |       |                                      | e <i>TI</i> bit is available on the external core<br>SI_ <i>TimerInt</i> signal                                                                                              |            |             |  |
| CE     | 29:28 | sor Unusable e<br>hardware on ev     | nit number referenced when a Coproces-<br>xception is taken. This field is loaded by<br>very exception, but is <b>UNPREDICT</b> -<br>exceptions except for Coprocessor Unus- | R          | Undefined   |  |
| DC     | 27    | cations, the <i>Cc</i> of meaningful | register. In some power-sensitive appli-<br>ount register is not used and is the source<br>power dissipation. This bit allows the<br>to be stopped in such situations.       | R/W        | 0           |  |
|        |       | Encoding                             | Meaning                                                                                                                                                                      |            |             |  |
|        |       | 0                                    | Enable counting of <i>Count</i> register                                                                                                                                     |            |             |  |
|        |       | 1                                    | Disable counting of <i>Count</i> register                                                                                                                                    |            |             |  |
| PCI    | 26    | Release 2 of th performance co       | ounter Interrupt. In an implementation of<br>e Architecture, this bit denotes whether a<br>punter interrupt is pending (analogous to<br>other interrupt types):              | R          | 0           |  |
|        |       | Encoding                             | Meaning                                                                                                                                                                      |            |             |  |
|        |       | 0                                    | No timer interrupt is pending                                                                                                                                                |            |             |  |
|        |       | 1                                    | Timer interrupt is pending                                                                                                                                                   |            |             |  |
|        |       |                                      | e <i>PCI</i> bit is available on the external<br>interface as the <i>SI_PCInt</i> signal.                                                                                    |            |             |  |
| IC     | 25    | Indicates if Interior Instruction.   | errupt Chaining occurred on the last IRET                                                                                                                                    | R          | Undefined   |  |
|        |       | Encoding                             | Meaning                                                                                                                                                                      |            |             |  |
|        |       | 0                                    | Interrupt Chaining did not happen on last IRET                                                                                                                               |            |             |  |
|        |       | 1                                    | Interrupt Chaining occurred during last IRET                                                                                                                                 |            |             |  |

 Table 6.27 Cause Register Field Descriptions (Continued)

| Name     AP | Bits |                                      |                                                                                                                                               |            |             |
|-------------|------|--------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| AP          |      |                                      |                                                                                                                                               | Read/Write | Reset State |
|             | 24   | Indicates wheth<br>rupt Auto-Prole   | her an exception occurred during Inter-<br>ogue.                                                                                              | R          | Undefined   |
|             |      | Encoding                             | Meaning                                                                                                                                       |            |             |
|             |      | 0                                    | Exception did not occur during<br>Auto-Prologue operation.                                                                                    |            |             |
|             |      | 1                                    | Exception occurred during Auto-Pro-<br>logue operation.                                                                                       |            |             |
| IV          | 23   |                                      | her an interrupt exception uses the gen-<br>vector or a special interrupt vector:                                                             | R/W        | Undefined   |
|             |      | Encoding                             | Meaning                                                                                                                                       |            |             |
|             |      | 0                                    | Use the general exception vector<br>(16#180)                                                                                                  |            |             |
|             |      | 1                                    | Use the special interrupt vector (16#200)                                                                                                     |            |             |
|             |      |                                      | ions of Release 2 of the architecture, if<br>1 and <i>Status<sub>BEV</sub></i> is 0, the special inter-                                       |            |             |
|             |      |                                      | resents the base of the vectored interrupt                                                                                                    |            |             |
|             |      | table.                               |                                                                                                                                               |            |             |
| WP          | 22   | Status <sub>EXL</sub> or S           | watch exception was deferred because<br>Status <sub>ERL</sub> had a value of 1 at the time the<br>n was detected. This bit indicates that the | R/W        | Undefined   |
|             |      |                                      | n was deferred, and it causes the excep-<br>ted when <i>Status<sub>EXL</sub></i> and <i>Status<sub>ERL</sub></i> are                          |            |             |
|             |      | both zero. As so<br>the watch except | uch, software must clear this bit as part of ption handler to prevent a watch excep-                                                          |            |             |
|             |      |                                      | d not write a 1 to this bit when itsvalue is<br>ing a 0-to-1 transition. If such atransition                                                  |            |             |
|             |      | is caused by so                      | ftware, it is UNPREDICTABLE                                                                                                                   |            |             |
|             |      | with no side eff                     | are ignores the write, accepts the write<br>fects, or accepts the write and initiates a                                                       |            |             |
|             |      | watch exceptio<br>both zero.         | n when $Status_{EXL}$ and $Status_{ERL}$ are                                                                                                  |            |             |
| FDCI        | 21   | Fast Debug Ch                        | annel Interrupt. This bit denotes whether<br>t is pending (analogous to the <i>IP</i> bits for<br>types):                                     | R          | Undefined   |
|             |      | Encoding                             | Meaning                                                                                                                                       |            |             |
|             |      | 0                                    | No Fast Debug Channel interrupt is pending                                                                                                    |            |             |
|             |      | 1                                    | Fast Debug Channel interrupt is pend-<br>ing                                                                                                  |            |             |

Table 6.27 Cause Register Field Descriptions (Continued)

| Fiel    | ds               |                                                                                                                                                                  |                                                                                                                         |                                                                                                                                                                                                                                                                                                       |                                                          |            |                                |
|---------|------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------|------------|--------------------------------|
| Name    | Bits             | -                                                                                                                                                                | Description                                                                                                             |                                                                                                                                                                                                                                                                                                       |                                                          | Read/Write | Reset State                    |
| IP9:IP2 | 17:10            | Indicates an                                                                                                                                                     | interrupt i                                                                                                             | s pending:                                                                                                                                                                                                                                                                                            |                                                          | R          | Undefined                      |
|         |                  | Bit                                                                                                                                                              | Name                                                                                                                    | Meaning                                                                                                                                                                                                                                                                                               |                                                          |            | for IP7:IP2                    |
|         |                  | 17                                                                                                                                                               | IP9                                                                                                                     | Hardware Interrupt 7                                                                                                                                                                                                                                                                                  |                                                          |            | 0 for IP9:IP8                  |
|         |                  | 16                                                                                                                                                               | IP8                                                                                                                     | Hardware Interrupt 6                                                                                                                                                                                                                                                                                  |                                                          |            |                                |
|         |                  | 15                                                                                                                                                               | IP7                                                                                                                     | Hardware interrupt 5                                                                                                                                                                                                                                                                                  |                                                          |            |                                |
|         |                  | 14                                                                                                                                                               | IP6                                                                                                                     | Hardware interrupt 4                                                                                                                                                                                                                                                                                  |                                                          |            |                                |
|         |                  | 13                                                                                                                                                               | IP5                                                                                                                     | Hardware interrupt 3                                                                                                                                                                                                                                                                                  |                                                          |            |                                |
|         |                  | 12                                                                                                                                                               | IP4                                                                                                                     | Hardware interrupt 2                                                                                                                                                                                                                                                                                  |                                                          |            |                                |
|         |                  | 11                                                                                                                                                               | IP3                                                                                                                     | Hardware interrupt 1                                                                                                                                                                                                                                                                                  |                                                          |            |                                |
|         |                  | 10                                                                                                                                                               | IP2                                                                                                                     | Hardware interrupt 0                                                                                                                                                                                                                                                                                  |                                                          |            |                                |
| RIPL    | 17:10            | in an implementation interrupt 5. In implementation which EIC is $= 0$ , timer a bined in an inhardware interval ( <i>Config3<sub>VE</sub></i> and are interval) | nentation-or<br>ntations of<br>nterrupt m<br>and perform<br>implementaterrupt. If I<br>$_{1C} = 1$ , the<br>preted as t | counter interrupts are combined ependent way with hardware Release 2 of the Architecture ode is not enabled ( <i>Config2</i> ) nance counter interrupts are ation-dependent way with a EIC interrupt mode is enable ese bits have a different meather <i>RIPL</i> field, described be a tority Level. | re<br>se in<br><i>SVEIC</i><br>com-<br>ny<br>ed<br>ning, | R          | Undefined for                  |
|         |                  | In implemer<br>which EIC i<br>1), this field<br>requested in<br>interrupt is r<br>If EIC interr<br>these bits ha                                                 | ntations of<br>nterrupt m<br>is the enco<br>terrupt. A<br>requested.<br>rupt mode i<br>we a differ                      | Release 2 of the Architectur<br>ode is enabled ( <i>Config3<sub>VEI</sub></i> )<br>oded (0255) value of the<br>value of zero indicates that<br>is not enabled ( <i>Config3<sub>VEIC</sub></i> )<br>ent meaning and are interpre-<br>escribed above.                                                   | $C^{=}$<br>no<br>c = 0),                                 |            | bits 15:10<br>0 for bits 17:16 |
| IP1:IP0 | 9:8              | Controls the                                                                                                                                                     | request fo                                                                                                              | or software interrupts:                                                                                                                                                                                                                                                                               |                                                          | R/W        | Undefined                      |
|         |                  | Bit                                                                                                                                                              | Name                                                                                                                    | Meaning                                                                                                                                                                                                                                                                                               |                                                          |            |                                |
|         |                  | 9                                                                                                                                                                | IP1 R                                                                                                                   | equest software interrupt 1                                                                                                                                                                                                                                                                           |                                                          |            |                                |
|         |                  | 8                                                                                                                                                                | IP0 R                                                                                                                   | equest software interrupt 0                                                                                                                                                                                                                                                                           |                                                          |            |                                |
|         |                  | ler for priori<br>interrupt sou                                                                                                                                  | tization in<br>trces. The                                                                                               | I to an external interrupt con<br>EIC interrupt mode with ot<br>state of these bits is availabl<br>face as the SI_SWInt[1:0] b                                                                                                                                                                        | her<br>le on                                             |            |                                |
| ExcCode | 6:2              | Exception co                                                                                                                                                     | ode - see T                                                                                                             | Cable 6.28                                                                                                                                                                                                                                                                                            |                                                          | R          | Undefined                      |
| 0       | 20:18, 7,<br>1:0 | Must be wri                                                                                                                                                      | tten as zero                                                                                                            | o; returns zero on read.                                                                                                                                                                                                                                                                              |                                                          | 0          | 0                              |

 Table 6.27 Cause Register Field Descriptions (Continued)

| Exception Code Value |             |              |                                                                                                                                                                                                                                                                                                                   |
|----------------------|-------------|--------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Decimal              | Hexadecimal | Mnemonic     | Description                                                                                                                                                                                                                                                                                                       |
| 0                    | 16#00       | Int          | Interrupt                                                                                                                                                                                                                                                                                                         |
| 1                    | 16#01       | MOD          | TLB modified exception                                                                                                                                                                                                                                                                                            |
| 2                    | 16#02       | TLBL         | TLB exception (load or instruction fetch)                                                                                                                                                                                                                                                                         |
| 3                    | 16#03       | TLBS         | TLB exception (store)                                                                                                                                                                                                                                                                                             |
| 4                    | 16#04       | AdEL         | Address error exception (load or instruction fetch)                                                                                                                                                                                                                                                               |
| 5                    | 16#05       | AdES         | Address error exception (store)                                                                                                                                                                                                                                                                                   |
| 6                    | 16#06       | IBE          | Bus error exception (instruction fetch)                                                                                                                                                                                                                                                                           |
| 7                    | 16#07       | DBE          | Bus error exception (data reference: load or store)                                                                                                                                                                                                                                                               |
| 8                    | 16#08       | Sys          | Syscall exception                                                                                                                                                                                                                                                                                                 |
| 9                    | 16#09       | Bp           | Breakpoint exception                                                                                                                                                                                                                                                                                              |
| 10                   | 16#0a       | RI           | Reserved instruction exception                                                                                                                                                                                                                                                                                    |
| 11                   | 16#0b       | CpU          | Coprocessor Unusable exception                                                                                                                                                                                                                                                                                    |
| 12                   | 16#0c       | Ov           | Arithmetic Overflow exception                                                                                                                                                                                                                                                                                     |
| 13                   | 16#0d       | Tr           | Trap exception                                                                                                                                                                                                                                                                                                    |
| 14-15                | 16#0e-16#0f | -            | Reserved                                                                                                                                                                                                                                                                                                          |
| 16                   | 16#10       | IS1          | Implementation-Specific Exception 1 (COP2)                                                                                                                                                                                                                                                                        |
| 17                   | 16#11       | CEU          | CorExtend Unusable                                                                                                                                                                                                                                                                                                |
| 18                   | 16#12       | C2E          | Coprocessor 2 exceptions                                                                                                                                                                                                                                                                                          |
| 19                   | 16#13       | TLBRI        | TLB Read-Inhibit                                                                                                                                                                                                                                                                                                  |
| 20                   | 16#14       | TLBXI        | TLB Execute-Inhibit                                                                                                                                                                                                                                                                                               |
| 21-22                | 16#15-16#16 | -            | Reserved                                                                                                                                                                                                                                                                                                          |
| 23                   | 16#17       | WATCH        | Reference to WatchHi/WatchLo address                                                                                                                                                                                                                                                                              |
| 24                   | 16#18       | MCheck       | Machine check                                                                                                                                                                                                                                                                                                     |
| 25                   | 16#19       | -            | Reserved                                                                                                                                                                                                                                                                                                          |
| 26                   | 16#1a       | DSPDis       | DSP Module State Disabled exception                                                                                                                                                                                                                                                                               |
| 27-29                | 16#1b-16#1d | -            | Reserved                                                                                                                                                                                                                                                                                                          |
| 30                   | 16#1e       | Parity Error | Parity error. In normal mode, a parity error exception has a dedicated vector and the <i>Cause</i> register is not updated. If a parity error occurs while in Debug Mode, this code is written to the <i>Debug<sub>DExcCode</sub></i> field to indicate that re-entry to Debug Mode was caused by a parity error. |
| 31                   | 16#1f       | -            | Reserved                                                                                                                                                                                                                                                                                                          |

## Table 6.28 Cause Register ExcCode Field

## 6.2.23 View\_RIPL Register (CP0 Register 13, Select 4)

This register gives read access to the *IP* or *RIPL* field that is also available in the *Cause* Register. The use of this register allows the Interrupt Pending or the Requested Priority Level to be read without extracting that bit field from the *Cause* Register.

#### Figure 6-24 View\_RIPL Register Format

| 31 |   | 10 9   | 2 1 0   |
|----|---|--------|---------|
|    | 0 | IP9IP2 | IP1 IP0 |
|    |   | RIPL   |         |
|    |   |        |         |

| Fie     | lds   |                                                                                                                                               |              |                                           |
|---------|-------|-----------------------------------------------------------------------------------------------------------------------------------------------|--------------|-------------------------------------------|
| Name    | Bits  | Description                                                                                                                                   | Read / Write | Reset State                               |
| 0       | 31:10 | Must be written as zero; returns zero on read.                                                                                                | 0            | 0                                         |
| IP9:IP2 | 9:2   | HW Interrupt Pending.<br>If EIC interrupt mode is not enabled, indicates which HW<br>interrupts are pending.                                  | R            | Undefined for<br>IP7:IP2<br>0 for IP9:IP8 |
| RIPL    | 9:2   | Interrupt Priority Level.<br>If EIC interrupt mode is enabled, this field indicates the<br>Requested Priority Level of the pending interrupt. | R            | Undefined                                 |
| IP1:IP0 | 1:0   | SW Interrupt Pending.<br>If EIC interrupt mode is not enabled, controls which SW interrupts are pending.                                      | R/W          | Undefined                                 |

## Table 6.29 View\_RIPL Register Field Descriptions

## 6.2.24 NestedExc (CP0 Register 13, Select 5)

The Nested Exception (NestedExc) register is an optional read-only register containing the values of  $Status_{EXL}$  and  $Status_{ERL}$  prior to acceptance of the current exception.

This register is part of the Nested Fault feature. The existence of the register can be determined by reading the *Config5<sub>NFExists</sub>* bit.

Figure 6-25 shows the format of the NestedExc register; Table 6.30 describes the NestedExc register fields.

| rigure 0-25 NesteuLAC Register i offiat |   |     |     |   |  |
|-----------------------------------------|---|-----|-----|---|--|
| 31                                      | 3 | 2   | 1   | 0 |  |
| 0                                       |   | ERL | EXL | 0 |  |

#### Figure 6-25 NestedExc Register Format

| Fields |      |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | Read / | Reset     |
|--------|------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------|-----------|
| Name   | Bits | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | Write  | State     |
| 0      | 31:3 | Reserved, read as 0.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | R0     | 0         |
| ERL    | 2    | Value of <i>Status<sub>ERL</sub></i> prior to acceptance of current exception.<br>Updated by all exceptions that would set either <i>Status<sub>EXL</sub></i> or <i>Status<sub>ERL</sub></i> . Not updated by Debug exceptions.                                                                                                                                                                                                                                                                                                                                                                                | R      | Undefined |
| EXL    | 1    | <ul> <li>Value of <i>Status<sub>EXL</sub></i> prior to acceptance of current exception.</li> <li>Updated by exceptions which would update EPC if <i>Status<sub>EXL</sub></i> is not set (MCheck, Interrupt, Address Error, all TLB exceptions, Bus Error, CopUnusable, Reserved Instruction, Overflow, Trap, Syscall, FPU, etc.) . For these exception types, this register field is updated regardless of the value of <i>Status<sub>EXL</sub></i>.</li> <li>Not updated by exception types which update <i>ErrorEPC</i> - (Reset, Soft Reset, NMI, Cache Error). Not updated by Debug exceptions.</li> </ul> | R      | Undefined |
| 0      | 0    | Reserved, read as 0.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | R0     | 0         |

## 6.2.25 Exception Program Counter (CP0 Register 14, Select 0)

The *Exception Program Counter (EPC)* is a read/write register that contains the address at which processing resumes after an exception has been serviced. All bits of the *EPC* register are significant and must be writable.

For synchronous (precise) exceptions, the EPC contains one of the following:

- The virtual address of the instruction that was the direct cause of the exception
- The virtual address of the immediately preceding branch or jump instruction, when the exception-causing instruction is in a branch delay slot, and the *Branch Delay* bit in the *Cause* register is set.

On new exceptions, the processor does not write to the *EPC* register when the *EXL* bit in the *Status* register is set; however, the register can still be written via the MTC0 instruction.

In processors that implement microMIPS, a read of the *EPC* register (via MFC0) returns the following value in the destination GPR:

 $GPR[rt] \leftarrow ExceptionPC_{31..1} || ISAMode_0$ 

That is, the upper 31 bits of the exception PC are combined with the lower bit of the *ISAMode* field and written to the GPR.

Similarly, a write to the *EPC* register (via MTC0) takes the value from the GPR and distributes that value to the exception PC and the *ISAMode* field, as follows:

ExceptionPC  $\leftarrow$  GPR[rt]<sub>31..1</sub> || 0 ISAMode  $\leftarrow$  2#0 || GPR[rt]<sub>0</sub>

That is, the upper 31 bits of the GPR are written to the upper 31 bits of the exception PC, and the lower bit of the exception PC is cleared. The upper bit of the *ISAMode* field is cleared, and the lower bit is loaded from the lower bit of the GPR.

#### Figure 6.26 EPC Register Format

| 31  | 0 |
|-----|---|
| EPC |   |

#### Table 6.31 EPC Register Field Description

| Fields      |      |                            |            |             |
|-------------|------|----------------------------|------------|-------------|
| Name Bit(s) |      | Description                | Read/Write | Reset State |
| EPC         | 31:0 | Exception Program Counter. | R/W        | Undefined   |

## 6.2.26 NestedEPC (CP0 Register 14, Select 2)

The Nested Exception Program Counter (NestedEPC) is an optional read/write register with the same behavior as the EPC register, except that:

- The NestedEPC register ignores the value of Status<sub>EXL</sub> and is therefore updated on the occurrence of any exception, including nested exceptions.
- The *NestedEPC* register is not used by the ERET/DERET/IRET instructions. To return to the address stored in *NestedEPC*, software must copy the value of the *NestedEPC* register to the *EPC* register.

This register is part of the Nested Fault feature. The existence of the register can be determined by reading the *Config5NFExists* bit.

Figure 6-25 shows the format of the NestedEPC register; Table 6.30 describes the NestedEPC register fields.

#### Figure 6-27 NestedEPC Register Format

NestedEPC

| Fields       |      |                                                                                                                                                                                                                                    | Read / | Reset     |
|--------------|------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------|-----------|
| Name         | Bits | Description                                                                                                                                                                                                                        | Write  | State     |
| NestedEPC :0 |      | Nested Exception Program Counter                                                                                                                                                                                                   | R/W    | Undefined |
|              |      | Updated by exceptions which would update EPC if<br><i>Status<sub>FXI</sub></i> is not set (MCheck, Interrupt, Address Error,                                                                                                       |        |           |
|              |      | all TLB exceptions, Bus Error, CopUnusable, Reserved<br>Instruction, Overflow, Trap, Syscall, FPU, etc.). For<br>these exception types, this register field is updated<br>regardless of the value of <i>Status<sub>EXL</sub></i> . |        |           |
|              |      | Not updated by exception types which update <i>ErrorEPC</i><br>i.e., Reset, Soft Reset, NMI, and Cache Error.<br>Not updated by Debug exceptions.                                                                                  |        |           |

## Table 6.32 NestedEPC Register Field Descriptions

## 6.2.27 Processor Identification (CP0 Register 15, Select 0)

The *Processor Identification (PRId)* register is a 32-bit, read-only register that contains information identifying the manufacturer, manufacturer options, processor identification, and revision level of the processor.

#### Figure 6.28 PRId Register Format

| 31 | 24          | 23 16      | 15 8         | 7 | 5 4      | 2 | 1 | 0 |
|----|-------------|------------|--------------|---|----------|---|---|---|
|    | Company Opt | Company ID | Processor ID |   | Revisior | 1 |   |   |

## Table 6.33 PRId Register Field Descriptions

| Field        | s     |                                                                                                                                                                                                                                                                                 |            |             |
|--------------|-------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name Bit(s)  |       | Description                                                                                                                                                                                                                                                                     | Read/Write | Reset State |
| Company Opt  | 31:24 | Company Option. Whatever name is specified by the SoC builder who synthesizes the microAptiv UP core; refer to your SoC manual. This field should be preset by the config GUI with a number between 0x00 and 0x7F; higher values (0x80-0xFF) are reserved by MIPS Technologies. | R          | Preset      |
| Company ID   | 23:16 | Company Identifier. Identifies the company that designed<br>or manufactured the processor. In the microAptiv UP this<br>field contains a value of 1 to indicate MIPS Technologies,<br>Inc.                                                                                      | R          | 1           |
| Processor ID | 15:8  | Processor Identifier. Identifies the type of processor. This field allows software to distinguish between the various types of MIPS Technologies processors.                                                                                                                    | R          | 0x9E        |

| Field    | ds     |                     |                           |                                                                                                                                    |                      |                 |            |             |
|----------|--------|---------------------|---------------------------|------------------------------------------------------------------------------------------------------------------------------------|----------------------|-----------------|------------|-------------|
| Name     | Bit(s) |                     |                           | Description                                                                                                                        |                      |                 | Read/Write | Reset State |
| Revision | 7:0    | processo<br>between | or. This fie<br>different | n. Specifies the revision<br>ld allows software to<br>revisions of the same<br>the following three s                               | distingu<br>processo | ish<br>or type. | R          | Preset      |
|          |        | Bits                | Name                      | Meaning                                                                                                                            | Read/<br>Write       | Reset           |            |             |
|          |        | 7:5                 | Major<br>Revision         | This number is<br>increased on major<br>revisions of the pro-<br>cessor core.                                                      | R                    | Preset          |            |             |
|          |        | 4:2                 | Minor<br>Revision         | This number is<br>increased on each<br>incremental revi-<br>sion of the proces-<br>sor and reset on<br>each new major<br>revision. | R                    | Preset          |            |             |
|          |        | 1:0                 | Patch<br>Level            | If a patch is made to<br>modify an older<br>revision of the pro-<br>cessor, this field is<br>incremented.                          | R                    | Preset          |            |             |

Table 6.33 PRId Register Field Descriptions (Continued)

## 6.2.28 EBase Register (CP0 Register 15, Select 1)

The *EBase* register is a read/write register containing the base address of the exception vectors used when  $Status_{BEV}$  equals 0, and a read-only CPU number value that may be used by software to distinguish different processors in a multiprocessor system.

The *EBase* register provides the ability for software to identify the specific processor within a multiprocessor system, and allows the exception vectors for each processor to be different, especially in systems composed of heterogeneous processors. Bits 31:12 of the *EBase* register are concatenated with zeros to form the base of the exception vectors when *Status*<sub>BEV</sub> is 0. The exception vector base address comes from the fixed defaults (see Section 5.5 "Exception Vector Locations") when *Status*<sub>BEV</sub> is 1, or for any EJTAG Debug exception. The reset state of bits 31:12 of the *EBase* register to 16#8000.0000, providing backward compatibility with Release 1 implementations.

Bits 31:30 of the *EBase* Register are fixed with the value 2#10 to force the exception base address to be in the kseg0 or kseg1 unmapped virtual address segments.

If the value of the exception base register is to be changed, this must be done with  $Status_{BEV}$  equal 1. The operation of the processor is **UNDEFINED** if the Exception Base field is written with a different value when  $Status_{BEV}$  is 0.

Combining bits 31:12 with the *Exception Base* field allows the base address of the exception vectors to be placed at any 4KByte page boundary. If vectored interrupts are used, a vector offset greater than 4KBytes can be generated. In

this case, bit 12 of the *Exception Base* field must be zero. The operation of the processor is **UNDEFINED** if software writes bit 12 of the *Exception Base* field with a 1 and enables the use of a vectored interrupt whose offset is greater than 4KBytes from the exception base address.

Figure 6.29 shows the format of the EBase Register; Table 6.34 describes the EBase register fields.

#### Figure 6.29 EBase Register Format

| 31 | 30 | 29 12          | 11 10 | 9 0    |
|----|----|----------------|-------|--------|
| 1  | 0  | Exception Base | 0 0   | CPUNum |

### Table 6.34 EBase Register Field Descriptions

| Fields                                     |       |                                                                                                                                                                                                                                                                                                                                      |            |                |
|--------------------------------------------|-------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|----------------|
| Name                                       | Bits  | Description                                                                                                                                                                                                                                                                                                                          | Read/Write | Reset State    |
| 1                                          | 31    | This bit is ignored on writes and returns one on reads.                                                                                                                                                                                                                                                                              | R          | 1              |
| 0                                          | 30    | This bit is ignored on writes and returns zero on reads.                                                                                                                                                                                                                                                                             | R          | 0              |
| 1 3                                        |       | In conjunction with bits 31:30, this field specifies the base address of the exception vectors when <i>Status<sub>BEV</sub></i> is zero.                                                                                                                                                                                             | R/W        | 0              |
| 0                                          | 11:10 | Must be written as zero; returns zero on reads.                                                                                                                                                                                                                                                                                      | 0          | 0              |
| CPUNum 9:0 Th<br>ces<br>par<br>is s<br>cor |       | This field specifies the number of the CPU in a multipro-<br>cessor system and can be used by software to distinguish a<br>particular processor from the others. The value in this field<br>is set by the <i>SI_CPUNum[9:0]</i> static input pins to the<br>core. In a single processor system, this value should be set<br>to zero. | R          | Externally Set |

## 6.2.29 CDMMBase Register (CP0 Register 15, Select 2)

The 36-bit physical base address for the Common Device Memory Map facility is defined by this register. This register only exists if  $Config3_{CDMM}$  is set to one.

Figure 6.30 shows the format of the CDMMBase register, and Table 6.35 describes the register fields.

#### Figure 6.30 CDMMBase Register Format

| 31              | 11 1 | 9    | 8        | 0 |
|-----------------|------|------|----------|---|
| CDMM_UPPER_ADDR | E    | N CI | CDMMSize |   |

#### Table 6.35 CDMMBase Register Field Descriptions

| Fields                                                         |  |                                                                                                                                                                                                                                                 |            |             |
|----------------------------------------------------------------|--|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name Bits                                                      |  | Description                                                                                                                                                                                                                                     | Read/Write | Reset State |
| Name     Bits       CDMM_UP     31:11       PER_ADDR     31:11 |  | Bits 35:15 of the base physical address of the mem-<br>ory mapped registers.<br>The number of implemented physical address bits is<br>implementation-specific. For the unimplemented<br>address bits, writes are ignored and reads return zero. | R/W        | Undefined   |

| Fiel     | Fields                                                                                                                                                                                                           |                  |                                                                                                                                   |            |             |
|----------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------|-----------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name     | Bits                                                                                                                                                                                                             | _                | Description                                                                                                                       | Read/Write | Reset State |
| EN       | EN       10       Enables the CDMM region.         If this bit is cleared, memory requests to this addregion access regular system memory. If this bit set, memory requests to this region access the CDMM logic |                  |                                                                                                                                   |            | 0           |
|          |                                                                                                                                                                                                                  |                  | MeaningCDMM Region is disabled.CDMM Region is enabled.                                                                            |            |             |
| CI       | 9                                                                                                                                                                                                                | Register Block o | dicates that the first 64-byte Device<br>of the CDMM is reserved for addi-<br>nat manage CDMM region behavior<br>evice registers. | R          | 0           |
| CDMMSize | 8:0                                                                                                                                                                                                              |                  | ents the number of 64-byte Device instantiated in the core.                                                                       | R          | Preset      |
|          |                                                                                                                                                                                                                  | Encoding         | Meaning                                                                                                                           |            |             |
|          |                                                                                                                                                                                                                  | 0                | 1 DRB                                                                                                                             |            |             |
|          |                                                                                                                                                                                                                  | 1                | 2 DRBs                                                                                                                            |            |             |
|          |                                                                                                                                                                                                                  | 2                | 3 DRBs                                                                                                                            |            |             |
|          |                                                                                                                                                                                                                  |                  |                                                                                                                                   |            |             |
|          |                                                                                                                                                                                                                  | 511              | 512 DRBs                                                                                                                          |            |             |

## 6.2.30 Config Register (CP0 Register 16, Select 0)

The *Config* register specifies various configuration and capabilities information. Most of the fields in the *Config* register are initialized by hardware during the Reset exception process, or are constant. The *KO*, *KU*, and *K23* fields must be initialized by software in the Reset exception handler, if the reset value is not desired.

Figure 6.31 shows the format of the Config Register Format - Select 0, and Table 6.36 describes the register fields.

| Figure 6.31 | Config Register | Format — Select 0 |
|-------------|-----------------|-------------------|
|-------------|-----------------|-------------------|

| 3 | 31 | 30 | 28 2 | 27 25 | 24  | 23  | 22  | 21 | 20  | 19 | 18 | 17 | 16 | 15 | 14 | 13 | 12 | 10 | 9 7 | 6 | i | 3 | 2 | 0 |  |
|---|----|----|------|-------|-----|-----|-----|----|-----|----|----|----|----|----|----|----|----|----|-----|---|---|---|---|---|--|
| ľ | М  | K2 | 23   | KU    | ISP | DSP | UDI | SB | MDU | WC | M  | М  | BM | BE | А  | Т  | AF | ł  | MT  |   | 0 |   | K | 0 |  |

## Table 6.36 Config Register Field Descriptions

| Fields |        |                                                                                |            |             |
|--------|--------|--------------------------------------------------------------------------------|------------|-------------|
| Name   | Bit(s) | Description                                                                    | Read/Write | Reset State |
| М      | 31     | This bit is hardwired to '1' to indicate the presence of the Config1 register. | R          | 1           |

| Fields |        |                                                                                                                                                                                                                                                                                                                                                  |                     |                     |
|--------|--------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------|---------------------|
| Name   | Bit(s) | Description                                                                                                                                                                                                                                                                                                                                      | Read/Write          | Reset State         |
| K23    | 30:28  | This field controls the cacheability of the kseg2 and kseg3<br>address segments in FM implementations. This field is<br>valid in the FM-based MMU processors and is reserved in<br>the TLB-based MMU processors.<br>Refer to Table 6.37 for the field encoding.                                                                                  | FM: R/W<br>TLB: R   | FM: 010<br>TLB: 000 |
| KU     | 27:25  | FM: R/W<br>TLB: R                                                                                                                                                                                                                                                                                                                                | FM: 010<br>TLB: 000 |                     |
| ISP    | 24     | Indicates whether Instruction ScratchPad RAM is present.<br>Set by the <i>ISP_Present</i> static input pin, if scratchpad was<br>enabled when the core was built.<br>0 = No Instruction ScratchPad is present<br>1 = Instruction ScratchPad is present                                                                                           | R                   | Externally Se       |
| DSP    | 23     | Indicates whether Data ScratchPad RAM is present. Set by<br>the $DSP\_Present$ static input pin, if scratchpad was<br>enabled when the core was built.<br>0 = No Data ScratchPad is present<br>1 = Data ScratchPad is present<br>(Don't confuse this with the MIPS DSP Module, whose<br>presence is indicated by <i>Config3<sub>DSPP</sub></i> ) | R                   | Externally Se       |
| UDI    | 22     | This bit indicates that CorExtend User Defined Instructions<br>have been implemented.<br>0 = No User Defined Instructions are implemented<br>1 = User Defined Instructions are implemented                                                                                                                                                       | R                   | Preset              |
| SB     | 21     | Indicates whether SimpleBE bus mode is enabled.<br>This bit is hardwired to "1" to indicate only simple byte<br>enables allowed on bus interface.                                                                                                                                                                                                | R                   | 1                   |
| MDU    | 20     | This bit indicates the type of Multiply/Divide Unit present.<br>0 = Fast, high-performance MDU<br>1 = Iterative, area-efficient MDU                                                                                                                                                                                                              | R                   | Preset              |
| WC     | 19     | Reserved diagnostic bit. Please refer to MD00213 "Cache Configuration Application Note".                                                                                                                                                                                                                                                         | 0                   | 0                   |
| MM     | 18:17  | This bit indicates whether merging is enabled in the 32 byte collapsing write buffer. Set via <i>Sl_MergeMode[1:0]</i> input pins:<br>00 = No Merging<br>10 = Merging allowed<br>x1 = Reserved                                                                                                                                                   | R                   | Externally Se       |
| BM     | 16     | This bit is hardwired to "0" to indicate burst order is sequential.                                                                                                                                                                                                                                                                              | R                   | 0                   |
| BE     | 15     | Indicates the endian mode in which the processor is run-<br>ning. Set via <i>SI_Endian</i> input pin.<br>0: Little endian<br>1: Big endian                                                                                                                                                                                                       | R                   | Externally Se       |

## Table 6.36 Config Register Field Descriptions (Continued)

| Fiel | ds     |                                                                                                                                                                            |            |             |  |  |
|------|--------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|--|--|
| Name | Bit(s) | Description                                                                                                                                                                | Read/Write | Reset State |  |  |
| AT   | 14:13  | Architecture type implemented by the processor. This field is always 00 to indicate the MIPS32 architecture.                                                               | R          | 00          |  |  |
| AR   | 12:10  | <ul> <li>Architecture revision level. This field is always 001 to indicate MIPS32 Release 2.</li> <li>0: Release 1</li> <li>1: Release 2</li> <li>2-7: Reserved</li> </ul> | R          | 001         |  |  |
| MT   | 9:7    | MMU Type:<br>1: Standard TLB<br>3: Fixed Mapping<br>0,2, 4-7: Reserved                                                                                                     | R          | Preset      |  |  |
| 0    | 6:3    | Must be written as zeros; returns zeros on reads.                                                                                                                          | 0          | 0           |  |  |
| К0   | 2:0    | Kseg0 coherency algorithm. Refer to Table 6.37 for the field encoding.                                                                                                     | R/W        | 010         |  |  |

Table 6.36 Config Register Field Descriptions (Continued)

### Table 6.37 Cache Coherency Attributes

| C(2:0) Value                                 | Cache Coherency Attribute                                                                                                                                                                                                                                                                                        |  |  |  |  |  |  |  |
|----------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|--|--|--|--|
| 0                                            | Cacheable, noncoherent, write-through, no write allocate                                                                                                                                                                                                                                                         |  |  |  |  |  |  |  |
| 1                                            | Cacheable, noncoherent, write-through, write allocate                                                                                                                                                                                                                                                            |  |  |  |  |  |  |  |
| 3*, 4, 5, 6                                  | Cacheable, noncoherent, write-back, write allocate                                                                                                                                                                                                                                                               |  |  |  |  |  |  |  |
| 2*, 7                                        | Uncached                                                                                                                                                                                                                                                                                                         |  |  |  |  |  |  |  |
| 2 and 3 are used. For mapped to 2. Note that | equired by the MIPS32 architecture. In the microAptiv UP processor core, only values 0, 1, example, values 4, 5 and 6 are not used and are mapped to 3. The value 7 is not used and is at these values do have meaning in other MIPS Technologies processor implementations. specification for more information. |  |  |  |  |  |  |  |

## 6.2.31 Config1 Register (CP0 Register 16, Select 1)

The *Config1* register is an adjunct to the *Config* register and encodes additional information about capabilities present on the core. All fields in the *Config1* register are read-only.

The instruction and data cache configuration parameters include encodings for the number of sets per way, the line size, and the associativity. The total cache size for a cache is therefore:

Associativity \* Line Size \* Sets Per Way

If the line size is zero, there is no cache implemented.

| 31 | 30 | 2        | 25 | 24 2 | 22 | 21 | 19 | 18 |    | 16 | 15 | 1 | 3 | 12 | 10 | 9 | 7 | 6  | 5  | 4  | 3  | 2  | 1  | 0  |
|----|----|----------|----|------|----|----|----|----|----|----|----|---|---|----|----|---|---|----|----|----|----|----|----|----|
| М  |    | MMU Size |    | IS   |    | Π  |    |    | IA |    | D  | S |   | DL | ,  | D | A | C2 | MD | PC | WR | CA | EP | FP |

#### Figure 6.32 Config1 Register Format — Select 1

## Table 6.38 Config1 Register Field Descriptions — Select 1

| Fields   |        |                                                                                                                                                                                                                                                  |            |             |  |  |  |  |  |  |
|----------|--------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|--|--|--|--|--|--|
| Name     | Bit(s) | Description                                                                                                                                                                                                                                      | Read/Write | Reset State |  |  |  |  |  |  |
| М        | 31     | This bit is hardwired to '1' to indicate the presence of the <i>Config2</i> register.                                                                                                                                                            | R          | 1           |  |  |  |  |  |  |
| MMU Size | 30:25  | This field contains the number of entries in the TLB minus<br>one. The field is read as 15 or 31 decimal in the microAptiv<br>UPc core. The field is read as 0 decimal in the FM-based<br>MMU core, because no TLB is present.                   | R          | Preset      |  |  |  |  |  |  |
| IS       | 24:22  | This field contains the number of instruction cache sets per<br>way. Five options are available in the microAptiv UP core.<br>All others values are reserved:<br>0x0: 64<br>0x1: 128<br>0x2: 256<br>0x3: 512<br>0x4: 1024<br>0x5 - 0x7: Reserved | R          | Preset      |  |  |  |  |  |  |
| IL       | 21:19  | This field contains the instruction cache line size. If an<br>instruction cache is present, it must contain a fixed line size<br>of 16 bytes.<br>0x0: No I-Cache present<br>0x3: 16 bytes<br>0x1, 0x2, 0x4 - 0x7: Reserved                       | R          | Preset      |  |  |  |  |  |  |
| IA       | 18:16  | This field contains the level of instruction cache associativ-<br>ity.<br>0x0: Direct mapped<br>0x1: 2-way<br>0x2: 3-way<br>0x3: 4-way<br>0x4 - 0x7: Reserved                                                                                    | R          | Preset      |  |  |  |  |  |  |
| DS       | 15:13  | This field contains the number of data cache sets per way.<br>0x0: 64<br>0x1: 128<br>0x2: 256<br>0x3: 512<br>0x4: 1024<br>0x5 - 0x7: Reserved                                                                                                    | R          | Preset      |  |  |  |  |  |  |
| DL       | 12:10  | This field contains the data cache line size. If a data cache is<br>present, then it must contain a line size of 16 bytes.<br>0x0: No D-Cache present<br>0x3: 16 bytes<br>0x1, 0x2, 0x4 - 0x7: Reserved                                          | R          | Preset      |  |  |  |  |  |  |

| Fiel | lds    |                                                                                                                                                                                                                 |                       |             |  |  |
|------|--------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------|-------------|--|--|
| Name | Bit(s) | Description                                                                                                                                                                                                     | Read/Write            | Reset State |  |  |
| DA   | 9:7    | This field contains the type of set associativity for the data<br>cache.<br>0x0: Direct mapped<br>0x1: 2-way<br>0x2: 3-way<br>0x3: 4-way<br>0x4 - 0x7: Reserved                                                 | R                     | Preset      |  |  |
| C2   | 6      | Coprocessor 2 present.<br>0: No coprocessor is attached to the COP2 interface<br>1: A coprocessor is attached to the COP2 interface<br>If the Cop2 interface logic is not implemented, this bit will<br>read 0. | to the COP2 interface |             |  |  |
| MD   | 5      | MDMX implemented. This bit always reads as 0 because MDMX is not supported.                                                                                                                                     | R                     | 0           |  |  |
| РС   | 4      | Performance Counter registers implemented.<br>0: No Performance Counter registers are implemented<br>1: Performance Counter registers are implemented                                                           | R                     | Preset      |  |  |
| WR   | 3      | Watch registers implemented.<br>0: No Watch registers are implemented<br>1: One or more Watch registers are implemented                                                                                         | R                     | Preset      |  |  |
| CA   | 2      | Code compression (MIPS16e) implemented.<br>0: MIPS16e is not implemented<br>1: MIPS16e is implemented                                                                                                           | R                     | 0           |  |  |
| EP   | 1      | EJTAG present: This bit is always set to indicate that the core implements EJTAG.                                                                                                                               | R                     | 1           |  |  |
| FP   | 0      | FPU implemented.<br>0: No FPU<br>1: FPU is implemented                                                                                                                                                          | R                     | 0           |  |  |

Table 6.38 Config1 Register Field Descriptions — Select 1 (Continued)

## 6.2.32 Config2 Register (CP0 Register 16, Select 2)

The *Config2* register is an adjunct to the *Config* register and is reserved to encode additional capabilities information. *Config2* is allocated for showing the configuration of level 2/3 caches. These fields are reset to 0 because L2/L3 caches are not supported by the microAptiv UP core. All fields in the *Config2* register are read-only.

#### Figure 6.33 Config2 Register Format — Select 2



| Fields |        |                                                                                       |            |             |
|--------|--------|---------------------------------------------------------------------------------------|------------|-------------|
| Name   | Bit(s) | Description                                                                           | Read/Write | Reset State |
| М      | 31     | This bit is hardwired to '1' to indicate the presence of the <i>Config3</i> register. | R          | 1           |
| 0      | 30:0   | These bits are reserved.                                                              | R          | 0           |

### Table 6.39 Config2 Register Field Descriptions — Select 1

### 6.2.33 Config3 Register (CP0 Register 16, Select 3)

The Config3 register encodes additional capabilities. All fields in the Config3 register are read-only.

Figure 6-34 shows the format of the Config3 register; Table 6.40 describes the Config3 register fields.

|    |          |    |    |    | J     |     | -                | 0.    |      |     | -                     |                  |   |     |     |                  |                  |    |              |   |    |    |
|----|----------|----|----|----|-------|-----|------------------|-------|------|-----|-----------------------|------------------|---|-----|-----|------------------|------------------|----|--------------|---|----|----|
| 31 | 30       | 23 | 22 | 21 | 20 18 | 17  | 16               | 15 14 | 13   | 12  | 11                    | 10               | 9 | 8   | 7   | 6                | 5                | 4  | 3            | 2 | 1  | 0  |
| М  | 00000000 |    | IP | LW | MMAR  | MCU | ISA<br>On<br>Exc | ISA   | ULRI | RXI | D<br>S<br>P<br>2<br>P | D<br>S<br>P<br>P | 0 | ITL | LPA | V<br>E<br>I<br>C | V<br>I<br>n<br>t | SP | CD<br>M<br>M | 0 | SM | TL |

#### Figure 6-34 Config3 Register Format

### Table 6.40 Config3 Register Field Descriptions

| Fi   | elds      |                                                                              |                                                                                                    |            |             |
|------|-----------|------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------|------------|-------------|
| Name | Bits      |                                                                              | Description                                                                                        | Read/Write | Reset State |
| М    | 31        | This bit is reserved                                                         | to indicate that a Config4 register is present.                                                    | R          | 1           |
| 0    | 30:23,9,2 | Must be written as a                                                         | zeros; returns zeros on read.                                                                      | 0          | 0           |
| IPLW | 22:21     | Width of the Status                                                          | S <sub>IPL</sub> and Cause <sub>RIPL</sub> fields:                                                 | R          | 1           |
|      |           | Encoding                                                                     | Meaning                                                                                            |            |             |
|      |           | 0                                                                            | <i>IPL</i> and <i>RIPL</i> fields are 6-bits in width.                                             |            |             |
|      |           | 1                                                                            | <i>IPL</i> and <i>RIPL</i> fields are 8-bits in width.                                             |            |             |
|      |           | Others                                                                       | Reserved.                                                                                          |            |             |
|      |           | used as the most sig<br>respectively, of that<br>If the <i>RIPL</i> field is | 8-bits in width, bits 17 and 16 of <i>Cause</i> are gnificant bit and second most significant bit, |            |             |

| Fiel     | ds    |                       |                                                                                              |             |                                            |
|----------|-------|-----------------------|----------------------------------------------------------------------------------------------|-------------|--------------------------------------------|
| Name     | Bits  |                       | Read/Write                                                                                   | Reset State |                                            |
| MMAR     | 20:18 | microMIPS Archite     | cture revision level:                                                                        | R           | 0                                          |
|          |       | Encoding              | Meaning                                                                                      |             |                                            |
|          |       | 0                     | Release 1                                                                                    |             |                                            |
|          |       | 1-7                   | Reserved                                                                                     |             |                                            |
| MCU      | 17    | MIPS MCU ASE in       | nplemented.                                                                                  | R           | 1                                          |
|          |       | Encoding              | Meaning                                                                                      |             |                                            |
|          |       | 0                     | MCU ASE is not implemented.                                                                  |             |                                            |
|          |       | 1                     | MCU ASE is implemented.                                                                      |             |                                            |
| ISAOnExc | 16    |                       | tion Set Architecture used when vectoring to<br>ts exceptions whose vectors are offsets from | RW          | Preset, driven<br>by signal<br>external to |
|          |       | Encoding              | Meaning                                                                                      |             | CPU core                                   |
|          |       | 0                     | MIPS32 ISA is used on entrance to an exception vector.                                       |             |                                            |
|          |       | 1                     | microMIPS is used on entrance to an exception vector.                                        |             |                                            |
| ISA      | 15:14 | Indicates Instruction | n Set Availability.                                                                          | R           | Preset, driven<br>by signal                |
|          |       | Encoding              | Meaning                                                                                      |             | external to CPU core                       |
|          |       | 0                     | Only MIPS32 is implemented.                                                                  |             |                                            |
|          |       | 1                     | Only microMIPS is implemented.                                                               |             |                                            |
|          |       | 2                     | Both MIPS32 and microMIPS are<br>implemented. MIPS32 ISA used when<br>coming out of reset.   |             |                                            |
|          |       | 3                     | Both MIPS32 and microMIPS are<br>implemented. microMIPS is used<br>when coming out of reset. |             |                                            |
| ULRI     | 13    |                       | implemented. This bit indicates whether the ssor 0 register is implemented.                  | R           | 1                                          |
|          |       | Encoding              | Meaning                                                                                      |             |                                            |
|          |       | 0                     | UserLocal register is not imple-<br>mented                                                   |             |                                            |
|          |       | 1                     | UserLocal register is implemented                                                            |             |                                            |

### Table 6.40 Config3 Register Field Descriptions (Continued)

| Fie   | lds  |                                           |                                                                                                                                              |            |                |  |
|-------|------|-------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------|------------|----------------|--|
| Name  | Bits |                                           | Description                                                                                                                                  | Read/Write | Reset State    |  |
| RXI   | 12   | Indicates whether t<br>PageGrain register | he <i>RIE</i> and <i>XIE</i> bits exist within the                                                                                           | R          | Preset         |  |
|       |      | Encoding                                  | Meaning                                                                                                                                      |            |                |  |
|       |      | 0                                         | The <i>RIE</i> and <i>XIE</i> bits are not imple-<br>mented within the <i>PageGrain</i> regis-<br>ter.                                       |            |                |  |
|       |      | 1                                         | The <i>RIE</i> and <i>XIE</i> bits are implemented within the <i>PageGrain</i> register                                                      |            |                |  |
| DSP2P | 11   | Reads 1 to indicate implemented           | Reads 1 to indicate that Revision 2 of the MIPS DSP Module is implemented                                                                    |            |                |  |
| DSPP  | 10   | Reads 1 to indicate implemented.          | that the MIPS DSP Module extension is                                                                                                        | R          | Preset         |  |
| ITL   | 8    | Indicates that iFlow                      | Indicates that iFlowtrace hardware is present.                                                                                               |            |                |  |
| LPA   | 7    |                                           | ce of support for large physical addresses on<br>s. Not used by MIPS32 processors and returns                                                | R          | 0              |  |
|       |      | Encoding                                  | Meaning                                                                                                                                      |            |                |  |
|       |      | 0                                         | Large physical address support is not implemented                                                                                            |            |                |  |
|       |      | 1                                         | Large physical address support is implemented                                                                                                |            |                |  |
|       |      | For implementation<br>returns zero on rea | ns of Release 1 of the Architecture, this bit d.                                                                                             |            |                |  |
| VEIC  | 6    | Indicates support f                       | or an external interrupt controller.                                                                                                         | R          | Externally Set |  |
|       |      | Encoding                                  | Meaning                                                                                                                                      |            |                |  |
|       |      | 0                                         | Support for EIC interrupt mode is not implemented                                                                                            |            |                |  |
|       |      | 1                                         | Support for EIC interrupt mode is implemented                                                                                                |            |                |  |
|       |      | This allows externa                       | it is set by the static input, <i>SI_EICPresent</i> .<br>al logic to communicate whether an external<br>is attached to the processor or not. |            |                |  |
| VInt  | 5    | Indicates implement                       | ntation of Vectored interrupts.                                                                                                              | R          | 1              |  |
|       |      | Encoding                                  | Meaning                                                                                                                                      |            |                |  |
|       |      | 0                                         | Vector interrupts are not implemented                                                                                                        |            |                |  |
|       |      | 1                                         | Vectored interrupts are implemented                                                                                                          |            |                |  |
|       |      | On the microAptiv<br>tored interrupts are | UP core, this bit is always a 1, because vec-<br>implemented.                                                                                |            |                |  |

### Table 6.40 Config3 Register Field Descriptions (Continued)

| Fiel | lds  |      |                                  |                                                                                                                                    |            |             |  |
|------|------|------|----------------------------------|------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|--|
| Name | Bits |      |                                  | Description                                                                                                                        | Read/Write | Reset State |  |
| SP   | 4    |      |                                  | that Small (1KByte) page support is imple-<br>ll always read as 0 if no TLB is present.                                            | R          | Preset      |  |
|      |      |      | Encoding                         | Meaning                                                                                                                            |            |             |  |
|      |      |      | 0                                | Small page support is not implemented                                                                                              |            |             |  |
|      |      |      | 1                                | Small page support is implemented                                                                                                  |            |             |  |
| CDMM | 3    |      |                                  | emory Map implemented. This bit indicates<br>I is implemented.                                                                     | R Preset   |             |  |
|      |      |      | Encoding                         | Meaning                                                                                                                            |            |             |  |
|      |      |      | 0                                | CDMM is not implemented                                                                                                            |            |             |  |
|      |      |      | 1                                | CDMM is implemented                                                                                                                |            |             |  |
| SM   | 1    | Smar | rtMIPS ASE is                    | implemented. This bit indicates whether the<br>implemented. Because SmartMIPS is not<br>oAptiv UP core, this bit will always be 0. | R          | 0           |  |
|      |      |      | Encoding                         | Meaning                                                                                                                            |            |             |  |
|      |      |      | 0                                | SmartMIPS ASE is not implemented                                                                                                   |            |             |  |
|      |      |      | 1                                | SmartMIPS ASE is implemented                                                                                                       |            |             |  |
| TL   | TL 0 |      | e Logic implem<br>is implemented | hented. This bit indicates whether PC or data d.                                                                                   | R          | Preset      |  |
|      |      |      | Encoding                         | Meaning                                                                                                                            |            |             |  |
|      |      |      | 0                                | Trace logic is not implemented                                                                                                     |            |             |  |
|      |      |      | 1                                | Trace logic is implemented                                                                                                         |            |             |  |

### Table 6.40 Config3 Register Field Descriptions (Continued)

## 6.2.34 Config4 Register (CP0 Register 16, Select 4)

The *Config4* register encodes additional capabilities. This register is required if any optional feature described by this register is implemented and is otherwise optional.

Figure 6-35 shows the format of the Config4 register; Table 6.41 describes the Config4 register fields.

#### Figure 6-35 Config4 Register Format

| 31 | 30     | 0 |
|----|--------|---|
| Μ  | 000000 |   |

| Fields |      |                                                                             | Read / | Reset |
|--------|------|-----------------------------------------------------------------------------|--------|-------|
| Name   | Bits | Description                                                                 | Write  | State |
| М      | 31   | This bit is reserved to indicate that a <i>Config5</i> register is present. | R      | 1     |
| 0      | 30:0 | Must be written as zeros; returns zeros on read.                            | 0      | 0     |

#### Table 6.41 Config4 Register Field Descriptions

## 6.2.35 Config5 Register (CP0 Register 16, Select 5)

The *Config5* register encodes additional capabilities. This register is required if any optional feature described by this register is implemented and is otherwise optional.

Figure 6-36 shows the format of the Config5 register; Table 6.42 describes the Config5 register fields.

#### Figure 6-36 Config5 Register Format

| 31 | 30     | 3 | 2       | 1 | 0  |
|----|--------|---|---------|---|----|
| М  | 000000 |   | UF<br>R | 0 | NF |

| Fie      | lds    |                                 |                                                                                                              | Read / | Deast          |
|----------|--------|---------------------------------|--------------------------------------------------------------------------------------------------------------|--------|----------------|
| Name     | Bits   |                                 | Description                                                                                                  | Write  | Reset<br>State |
| М        | 31     |                                 | rved. With the current architectural defini-<br>ould always read as a 0.                                     | R      | 0              |
| 0        | 30:3,1 | Must be writte                  | n as zeros; returns zeros on read.                                                                           | 0      | 0              |
| UFR      | 2      | to <i>Status<sub>FR</sub></i> w | ire. This feature allows user mode access ith <i>CTC1</i> and <i>CFC1</i> .                                  | R/W    | 0              |
|          |        | Encoding                        | Meaning                                                                                                      |        |                |
|          |        | 0                               | User mode FR instructions not allowed                                                                        |        |                |
|          |        | 1                               | User mode FR instructions allowed                                                                            |        |                |
| NFExists | 0      | The Nested Fa                   | he Nested Fault feature is present.<br>ult feature allows recognition of faulting<br>n an exception handler. | R      | 1              |

#### Table 6.42 Config5 Register Field Descriptions

### 6.2.36 Config7 Register (CP0 Register 16, Select 7)

The *Config7* register contains implementation specific configuration information. A number of these bits are writeable to disable certain performance enhancing features within the microAptiv UP core.



| Fie  | elds       |                                                                                                                                                                                                                                                                                                                                                                                  | Read / |                        |
|------|------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------|------------------------|
| Name | Bits       | Description                                                                                                                                                                                                                                                                                                                                                                      | Write  | Reset State            |
| WII  | 31         | Wait IE Ignore. Indicates that this processor will allow an interrupt<br>to unblock a WAIT instruction, even if <i>IE</i> is preventing the interrupt<br>from being taken. This avoids problems using the WAIT instruction<br>for 'bottom half' interrupt servicing.<br>In WII mode when <i>IE</i> =0, waking up from Sleep mode will not enter<br>an Interrupt Service Routine. | R      | 1                      |
| 0    | 30:19,17:0 | These bits are unused and should be written as 0.                                                                                                                                                                                                                                                                                                                                | R      | 0                      |
| HCI  | 18         | Hardware Cache Initialization: Indicates that a cache does not<br>require initialization by software.<br>This bit will most likely only be set on simulation-only cache mod-<br>els and not on real hardware.                                                                                                                                                                    | R      | Based on HW<br>present |

#### Table 6.43 Config7 Register Field Descriptions

### 6.2.37 Load Linked Address (CP0 Register 17, Select 0)

The *LLAddr* register contains the physical address read by the most recent Load Linked (LL) instruction. This register is for diagnostic purposes only, and serves no function during normal operation.

#### Figure 6.38 LLAddr Register Format

| 31 | 28 | 27 0        |
|----|----|-------------|
|    | 0  | PAddr[31:4] |

#### Table 6.44 LLAddr Register Field Descriptions

| Field       | ds     |                                                                                          |            |             |
|-------------|--------|------------------------------------------------------------------------------------------|------------|-------------|
| Name        | Bit(s) | Description                                                                              | Read/Write | Reset State |
| 0           | 31:28  | Must be written as zeros; returns zeros on reads.                                        | 0          | 0           |
| PAddr[31:4] | 27:0   | This field encodes the physical address read by the most recent Load Linked instruction. | R          | Undefined   |

### 6.2.38 WatchLo Register (CP0 Register 18, Select 0-7)

The *WatchLo* and *WatchHi* registers together provide the interface to a watchpoint debug facility that initiates a watch exception if an instruction or data access matches the address specified in the registers. As such, they duplicate some functions of the EJTAG debug solution. Watch exceptions are taken only if the *EXL* and *ERL* bits are both zero in the *Status* register. If either bit is a one, the *WP* bit is set in the *Cause* register, and the watch exception is deferred until both the *EXL* and *ERL* bits are zero.

The microAptiv UP core can be configured with 0 to 8 Watch register pairs

The *WatchLo* register specifies the base virtual address and the type of reference (instruction fetch, load, store) to match.

#### Figure 6.39 WatchLo Register Format

| 31    | 3 | 2 | 1 | 0 |
|-------|---|---|---|---|
| VAddr |   | Ι | R | W |

| Fiel  | ds   |                                                                                                                                                      |            |             |
|-------|------|------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name  | Bits | Description                                                                                                                                          | Read/Write | Reset State |
| VAddr | 31:3 | This field specifies the virtual address to match. Note that this is a doubleword address, because bits [2:0] are used to control the type of match. | R/W        | Undefined   |
| Ι     | 2    | If this bit is set, watch exceptions are enabled for instruction fetches that match the address.                                                     | R/W        | 0.          |
| R     | 1    | If this bit is set, watch exceptions are enabled for loads that match the address.                                                                   | R/W        | 0           |
| W     | 0    | If this bit is set, watch exceptions are enabled for stores that match the address.                                                                  | R/W        | 0           |

#### Table 6.45 WatchLo Register Field Descriptions

### 6.2.39 WatchHi Register (CP0 Register 19, Select 0-7)

The *WatchLo* and *WatchHi* registers together provide the interface to a watchpoint debug facility that initiates a watch exception if an instruction or data access matches the address specified in the registers. As such, they duplicate some functions of the EJTAG debug solution. Watch exceptions are taken only if the *EXL* and *ERL* bits are zero in the *Status* register. If either bit is a one, then the *WP* bit is set in the *Cause* register, and the watch exception is deferred until both the *EXL* and *ERL* bits are zero.

The *WatchHi* register contains information that qualifies the virtual address specified in the *WatchLo* register: an ASID, a *Global*(*G*) bit, and an optional address mask. If the *G* bit is 1, then any virtual address reference that matches the specified address will cause a watch exception. If the *G* bit is a 0, only those virtual address references for which the ASID value in the *WatchHi* register matches the ASID value in the *EntryHi* register cause a watch exception. The optional mask field provides address masking to qualify the address specified in *WatchLo*.

#### Figure 6.40 WatchHi Register Format

| 31 30 29 | 24 | 23 16 | 15 12 | 11   | 3 | 2 |   | 0 |
|----------|----|-------|-------|------|---|---|---|---|
| M G      | 0  | ASID  | 0     | Mask |   | Ι | R | W |

#### Table 6.46 WatchHi Register Field Descriptions

| Fie  | lds    |                                                                                                                                                                                                                                                                                                        |            |             |
|------|--------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name | Bit(s) | Description                                                                                                                                                                                                                                                                                            | Read/Write | Reset State |
| М    | 31     | Indicates whether additional Watch register pairs beyond this one are present or not                                                                                                                                                                                                                   | R          | Preset      |
| G    | 30     | If this bit is one, any address that matches that specified in<br>the <i>WatchLo</i> register causes a watch exception. If this bit<br>is zero, the <i>ASID</i> field of the <i>WatchHi</i> register must match<br>the <i>ASID</i> field of the <i>EntryHi</i> register to cause a watch<br>exception. | R/W        | Undefined   |
| 0    | 29:24  | Must be written as zeros; returns zeros on read.                                                                                                                                                                                                                                                       | 0          | 0           |
| ASID | 23:16  | ASID value which is required to match that in the <i>EntryHi</i> register if the $G$ bit is zero in the <i>WatchHi</i> register.                                                                                                                                                                       | R/W        | Undefined   |
| 0    | 15:12  | Must be written as zero; returns zero on read.                                                                                                                                                                                                                                                         | 0          | 0           |
| Mask | 11:3   | Bit mask that qualifies the address in the <i>WatchLo</i> register. Any bit in this field that is a set inhibits the corresponding address bit from participating in the address match.                                                                                                                | R/W        | Undefined   |
| Ι    | 2      | This bit is set by hardware when an instruction fetch con-<br>dition matches the values in this watch register pair. When<br>set, the bit remains set until cleared by software, which is<br>accomplished by writing a 1 to the bit.                                                                   | W1C        | Undefined   |
| R    | 1      | This bit is set by hardware when a load condition matches<br>the values in this watch register pair. When set, the bit<br>remains set until cleared by software, which is accom-<br>plished by writing a 1 to the bit.                                                                                 | W1C        | Undefined   |
| W    | 0      | This bit is set by hardware when a store condition matches<br>the values in this watch register pair. When set, the bit<br>remains set until cleared by software, which is accom-<br>plished by writing a 1 to the bit.                                                                                | W1C        | Undefined   |

### 6.2.40 Debug Register (CP0 Register 23, Select 0)

The *Debug* register is used to control the debug exception and provide information about the cause of the debug exception and also when re-entering at the debug exception vector due to a normal exception in debug mode. The read-only information bits are updated every time the debug exception is taken, or when a normal exception is taken when already in debug mode.

Only the *DM* bit and the *EJTAGver* field are valid when read from non-debug mode; the values of all other bits and fields are UNPREDICTABLE. Operation of the processor is UNDEFINED if the *Debug* register is written in non-debug mode.

Some of the bits and fields are only updated on debug exceptions and/or exceptions in debug mode, as shown below:

- DSS, DBp, DDBL, DDBS, DIB, DINT, DIBImpr, DDBLImpr, DDBSImpr are updated on both debug exceptions and on exceptions in debug modes.
- DExcCode is updated on exceptions in debug mode, and is undefined after a debug exception.
- Halt and Doze are updated on a debug exception, and are undefined after an exception in debug mode.
- *DBD* is updated on both debug and on exceptions in debug modes.

All bits and fields are undefined when read from normal mode, except those explicitly described to be defined, e.g., *EJTAGver* and *DM*.

|             |    |     |       |        | •        |      | •       | •     |      |             |      |     |        |      |     |            |
|-------------|----|-----|-------|--------|----------|------|---------|-------|------|-------------|------|-----|--------|------|-----|------------|
| 31          |    | 30  | 29    | 28     | 27       | 26   | 25      | 2     | 24   | 23          | 22   | 2   | 21     | 20   |     | 19         |
| DBD         |    | DM  | NoDCI | R LSNM | Doze     | Halt | CountDM | I IBu | ısEP | MCheckP     | Cach | eEP | DBusEP | IEXI |     | DB-<br>mpr |
| <br>18      | 17 |     | 15    | 14     |          | 10   | 9       | 8     | 7    | 6           | 5    | 4   | 3      | 2    | 1   | 0          |
| DBLI<br>mpr |    | Ver |       | ]      | DExcCode |      | NoSSt   | SSt   | R    | DIBI<br>mpr | DINT | DIB | DDBS   | DDBL | DBp | DSS        |

Figure 6.41 Debug Register Format

| Field | S      |                    |                                                                            |            |             |
|-------|--------|--------------------|----------------------------------------------------------------------------|------------|-------------|
| Name  | Bit(s) |                    | Description                                                                | Read/Write | Reset State |
| DBD   | 31     |                    | her the last debug exception or exception occurred in a branch delay slot: | R          | Undefined   |
|       |        | Encoding           | Meaning                                                                    |            |             |
|       |        | 0                  | Not in delay slot                                                          |            |             |
|       |        | 1                  | In delay slot                                                              |            |             |
| DM    | 30     | Indicates that the | ne processor is operating in debug mode:                                   | R          | 0           |
|       |        | Encoding           | Meaning                                                                    |            |             |
|       |        | 0                  | Processor is operating in non-debug mode                                   |            |             |
|       |        | 1                  | Processor is operating in debug mode                                       |            |             |
| NoDCR | 29     |                    | her the dseg memory segment is present<br>Control Register is accessible:  | R          | 0           |
|       |        | Encoding           | Meaning                                                                    |            |             |
|       |        | 0                  | dseg is present                                                            |            |             |
|       |        | 1                  | No dseg present                                                            |            |             |

#### Table 6.47 Debug Register Field Descriptions

| Field   | S      |                                                                                                |                                                                                                                                                                                                                                                                                                                             |            |             |
|---------|--------|------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name    | Bit(s) |                                                                                                | Description                                                                                                                                                                                                                                                                                                                 | Read/Write | Reset State |
| LSNM    | 28     | Controls access<br>memory:                                                                     | s of load/store between dseg and main                                                                                                                                                                                                                                                                                       | R/W        | 0           |
|         |        | Encoding                                                                                       | Meaning                                                                                                                                                                                                                                                                                                                     |            |             |
|         |        | 0                                                                                              | Load/stores in dseg address range goes to dseg                                                                                                                                                                                                                                                                              |            |             |
|         |        | 1                                                                                              | Load/stores in dseg address range<br>goes to main memory                                                                                                                                                                                                                                                                    |            |             |
| Doze    | 27     |                                                                                                | ne processor was in any kind of low hen a debug exception occurred:                                                                                                                                                                                                                                                         | R          | Undefined   |
|         |        | Encoding                                                                                       | Meaning                                                                                                                                                                                                                                                                                                                     |            |             |
|         |        | 0                                                                                              | Processor not in low-power mode<br>when debug exception occurred                                                                                                                                                                                                                                                            |            |             |
|         |        | 1                                                                                              | Processor in low-power mode when<br>debug exception occurred                                                                                                                                                                                                                                                                |            |             |
| Halt    | 26     |                                                                                                | ne internal system bus clock was stopped g exception occurred:                                                                                                                                                                                                                                                              | R          | Undefined   |
|         |        | Encoding                                                                                       | Meaning                                                                                                                                                                                                                                                                                                                     |            |             |
|         |        | 0                                                                                              | Internal system bus clock stopped                                                                                                                                                                                                                                                                                           |            |             |
|         |        | 1                                                                                              | Internal system bus clock running                                                                                                                                                                                                                                                                                           |            |             |
| CountDM | 25     | Indicates the C                                                                                | ount register behavior in debug mode:                                                                                                                                                                                                                                                                                       | R/W        | 1           |
|         |        | Encoding                                                                                       | Meaning                                                                                                                                                                                                                                                                                                                     |            |             |
|         |        | 0                                                                                              | Count register stopped in debug mode                                                                                                                                                                                                                                                                                        |            |             |
|         |        | 1                                                                                              | Count register is running in debug mode                                                                                                                                                                                                                                                                                     |            |             |
| IBusEP  | 24     | an instruction f<br>written to the b<br>exception on an<br>sor, and by rese<br>a Bus Error exc | h Bus Error exception Pending. Set when<br>etch bus error event occurs, or if a 1 is<br>it by software. Cleared when a Bus Error<br>a instruction fetch is taken by the proces-<br>et. If <i>IBusEP</i> is set when <i>IEXI</i> is cleared,<br>seption on an instruction fetch is taken by<br>and <i>IBusEP</i> is cleared. | R/W1       | 0           |
| MCheckP | 23     | pending. All M                                                                                 | n imprecise Machine Check exception is<br>achine Check exceptions are precise on<br>UP processor, so this bit will always                                                                                                                                                                                                   | R          | 0           |
| CacheEP | 22     | Cache Errors ca                                                                                | n imprecise Cache Error is pending.<br>annot be taken by the microAptiv UP<br>will always read as 0                                                                                                                                                                                                                         | R          | 0           |

 Table 6.47 Debug Register Field Descriptions (Continued)

| Field    | s      |                                                                                                                      |                                                                                                                                                                                                                                                                                                                                                    |            |             |
|----------|--------|----------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name     | Bit(s) |                                                                                                                      | Description                                                                                                                                                                                                                                                                                                                                        | Read/Write | Reset State |
| DBusEP   | 21     | imprecise bus e                                                                                                      | s Error exception Pending. Covers<br>errors on data access, similar to the<br><i>ISEP</i> for imprecise bus errors on an<br>h.                                                                                                                                                                                                                     | R/W1       | 0           |
| IEXI     | 20     | taken due to im<br>processor takes<br>mode. Cleared<br>otherwise modi<br><i>IEXI</i> is set, the<br>error on an inst | r eXception Inhibit controls exceptions<br>precise error indications. Set when the<br>a debug exception or exception in debug<br>by execution of the DERET instruction;<br>fiable by debug mode software. When<br>imprecise error exception from a bus<br>ruction fetch or data access, cache error,<br>ck is inhibited and deferred until the bit | R/W        | 0           |
| DDBSImpr | 19     |                                                                                                                      | n imprecise Debug Data Break Store<br>aken. Imprecise data breaks only occur<br>akpoints.                                                                                                                                                                                                                                                          | R          | Undefined   |
| DDBLImpr | 18     |                                                                                                                      | n imprecise Debug Data Break Load<br>aken. Imprecise data breaks only occur<br>akpoints.                                                                                                                                                                                                                                                           | R          | Undefined   |
| Ver      | 17:15  | EJTAG version                                                                                                        |                                                                                                                                                                                                                                                                                                                                                    | R          | 101         |
| DExcCode | 14:10  | mode. The field<br><i>Cause</i> register<br>occur in debug                                                           | use of the latest exception in debug<br>l is encoded as the <i>ExcCode</i> field in the<br>for those normal exceptions that may<br>mode.<br>ned after a debug exception.                                                                                                                                                                           | R          | Undefined   |
| NoSST    | 9      |                                                                                                                      | er the single-step feature controllable by vailable in this implementation:                                                                                                                                                                                                                                                                        | R          | 0           |
|          |        | Encoding                                                                                                             | Meaning                                                                                                                                                                                                                                                                                                                                            |            |             |
|          |        | 0                                                                                                                    | Single-step feature available                                                                                                                                                                                                                                                                                                                      |            |             |
|          |        | 1                                                                                                                    | No single-step feature available                                                                                                                                                                                                                                                                                                                   |            |             |
| SSt      | 8      | Controls if deb                                                                                                      | ug single step exception is enabled:                                                                                                                                                                                                                                                                                                               | R/W        | 0           |
|          |        | Encoding                                                                                                             | Meaning                                                                                                                                                                                                                                                                                                                                            |            |             |
|          |        | 0                                                                                                                    | No debug single-step exception<br>enabled                                                                                                                                                                                                                                                                                                          |            |             |
|          |        | 1                                                                                                                    | Debug single step exception enabled                                                                                                                                                                                                                                                                                                                |            |             |
| R        | 7      | Reserved. Must reads.                                                                                                | t be written as zeros; returns zeros on                                                                                                                                                                                                                                                                                                            | R          | 0           |
| DIBImpr  | 6      | exception occur                                                                                                      | n Imprecise debug instruction break<br>rred (due to a complex breakpoint).<br>eption in debug mode.                                                                                                                                                                                                                                                | R          | Undefined   |

### Table 6.47 Debug Register Field Descriptions (Continued)

| Field | ls     |          |                                                                         |            |             |
|-------|--------|----------|-------------------------------------------------------------------------|------------|-------------|
| Name  | Bit(s) | _        | Description                                                             | Read/Write | Reset State |
| DINT  | 5      |          | debug interrupt exception occurred.<br>eption in debug mode.            | R          | Undefined   |
|       |        | Encoding | Meaning                                                                 |            |             |
|       |        | 0        | No debug interrupt exception                                            |            |             |
|       |        | 1        | Debug interrupt exception                                               |            |             |
| DIB   | 4      |          | debug instruction break exception<br>red on exception in debug mode.    | R          | Undefined   |
|       |        | Encoding | Meaning                                                                 |            |             |
|       |        | 0        | No debug instruction exception                                          |            |             |
|       |        | 1        | Debug instruction exception                                             |            |             |
| DDBS  | 3      |          | debug data break exception occurred on<br>l on exception in debug mode. | R          | Undefined   |
|       |        | Encoding | Meaning                                                                 |            |             |
|       |        | 0        | No debug data exception on a store                                      |            |             |
|       |        | 1        | Debug instruction exception on a store                                  |            |             |
| DDBL  | 2      |          | debug data break exception occurred on on exception in debug mode.      | R          | Undefined   |
|       |        | Encoding | Meaning                                                                 |            |             |
|       |        | 0        | No debug data exception on a load                                       |            |             |
|       |        | 1        | Debug instruction exception on a load                                   |            |             |
| DBp   | 1      |          | debug software breakpoint exception ed on exception in debug mode.      | R          | Undefined   |
|       |        | Encoding | Meaning                                                                 |            |             |
|       |        | 0        | No debug software breakpoint excep-<br>tion                             |            |             |
|       |        | 1        | Debug software breakpoint exception                                     |            |             |
| DSS   | 0      |          | debug single-step exception occurred.<br>eption in debug mode.          | R          | Undefined   |
|       |        | Encoding | Meaning                                                                 |            |             |
|       |        | 0        | No debug single-step exception                                          |            |             |
|       |        | 1        | Debug single-step exception                                             |            |             |

### Table 6.47 Debug Register Field Descriptions (Continued)

### 6.2.41 Trace Control Register (CP0 Register 23, Select 1)

The *TraceControl* register configuration is shown below. Note the special behavior of the ASID\_M, ASID, and G fields for the microAptiv UP processor.

This register is only implemented if the EJTAG PDtrace capability is present.

|    |    |    |    |    |    |    |    | Fig | gur | e 6 | .42 | TraceControl Regi | ster Format |   |   |      |    |
|----|----|----|----|----|----|----|----|-----|-----|-----|-----|-------------------|-------------|---|---|------|----|
| 31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23  | 22  | 21  | 20  | 13                | 12          | 5 | 4 | 3 1  | 0  |
| TS | UT | (  | )  | TB | Ю  | D  | Е  | K   | S   | U   |     | ASID_M            | ASID        |   | G | Mode | On |

| Fie  | lds   |                                                                                                                                                                                                                                                                                                                                             |            |             |
|------|-------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name | Bits  | Description                                                                                                                                                                                                                                                                                                                                 | Read/Write | Reset State |
| TS   | 31    | The trace select bit is used to select between the hard-<br>ware and the software trace control bits. A value of<br>zero selects the external hardware trace block signals,<br>and a value of one selects the trace control bits in this<br>software control register.                                                                      | R/W        | 0           |
| UT   | 30    | This bit is used to indicate the type of user-triggered trace record. A value of zero implies a user type 1, and a value of one implies a user type 2.<br>The actual triggering of a user trace record occurs on a write to the <i>UserTraceData</i> register.                                                                              | R/W        | Undefined   |
| 0    | 29:28 | Reserved for future use; Must be written as zero; returns zero on read.                                                                                                                                                                                                                                                                     | 0          | 0           |
| ТВ   | 27    | Trace All Branch. When set to one, this tells the pro-<br>cessor to trace the PC value for all taken branches, not<br>just the ones whose branch target address is statically<br>unpredictable.                                                                                                                                             | R/W        | Undefined   |
| ΙΟ   | 26    | Inhibit Overflow. This signal is used to indicate to the core trace logic that slow but complete tracing is desired. When set to one, the core tracing logic does not allow a FIFO overflow or discard trace data. This is achieved by stalling the pipeline when the FIFO is nearly full, so that no trace records are ever lost.          | R/W        | Undefined   |
| D    | 25    | When set to one, this enables tracing in Debug Mode<br>For trace to be enabled in Debug mode, the On bit<br>must be one, and either the <i>G</i> bit must be one, or the<br>current process ASID must match the <i>ASID</i> field in<br>this register.<br>When set to zero, trace is disabled in Debug Mode,<br>irrespective of other bits. | R/W        | Undefined   |

#### Table 6.48 TraceControl Register Field Descriptions

| Fields    |                                                                                                                                                                                                                                                                                                                                                               |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |            |             |
|-----------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name Bits |                                                                                                                                                                                                                                                                                                                                                               | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | Read/Write | Reset State |
| E         | E 24 When set to one, this enables tracing in Exception<br>Mode. For trace to be enabled in Exception mode, the<br>On bit must be one, and either the <i>G</i> bit must be one,<br>or the current process ASID must match the <i>ASID</i><br>field in this register.<br>When set to zero, trace is disabled in Exception Mode,<br>irrespective of other bits. |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | R/W        | Undefined   |
| K         | K23When set to one, this enables tracing in Kernel Mode.<br>For trace to be enabled in Kernel mode, the On bit<br>must be one, and either the <i>G</i> bit must be one, or the<br>current process ASID must match the <i>ASID</i> field in<br>this register.<br>When set to zero, trace is disabled in Kernel Mode,<br>irrespective of other bits.            |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | R/W        | Undefined   |
| 0         | 22                                                                                                                                                                                                                                                                                                                                                            | This bit is reserved. Must be written as zero; returns zero on read.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | 0          | 0           |
| U         | 21                                                                                                                                                                                                                                                                                                                                                            | When set to one, this enables tracing in User Mode.<br>For trace to be enabled in User mode, the On bit must<br>be one, and either the $G$ bit must be one, or the current<br>process ASID must match the $ASID$ field in this regis-<br>ter.<br>When set to zero, trace is disabled in User Mode, irre-<br>spective of other bits.                                                                                                                                                                                                                                                   | R/W        | Undefined   |
| ASID_M    | 20:13                                                                                                                                                                                                                                                                                                                                                         | This is a mask value applied to the ASID comparison<br>(done when the <i>G</i> bit is zero). A "1" in any bit in this<br>field inhibits the corresponding <i>ASID</i> bit from partici-<br>pating in the match. As such, a value of zero in this<br>field compares all bits of <i>ASID</i> . Note that the ability to<br>mask the ASID value is not available in the hardware<br>signal bit; it is only available via the software control<br>register.<br>In an FM-based MMU core in which ASID is not sup-<br>ported, this field is ignored on writes and returns zero<br>on reads. | R/W        | Undefined   |
| ASID 12:5 |                                                                                                                                                                                                                                                                                                                                                               | The $ASID$ field must match when the $G$ bit is zero.<br>When the $G$ bit is one, this field is ignored.<br>In an FM-based MMU core in which ASID is not supported, this field is ignored on writes and returns zero on reads.                                                                                                                                                                                                                                                                                                                                                        | R/W        | Undefined   |
| G         | 4                                                                                                                                                                                                                                                                                                                                                             | Global bit. When set to one, tracing is to be enabled<br>for all processes, provided that other enabling func-<br>tions (like $U$ , $S$ , etc.,) are also true.<br>In an FM-based MMU core in which ASID is not sup-<br>ported, this field is ignored on writes and returns 1 on<br>reads. This causes all match equations to work cor-<br>rectly in the absence of an ASID.                                                                                                                                                                                                          | R/W        | Undefined   |

Table 6.48 TraceControl Register Field Descriptions (Continued)

| Fields    |     |                         |                                                                                                                                                                                                                         |            |             |
|-----------|-----|-------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name Bits |     |                         | Description                                                                                                                                                                                                             | Read/Write | Reset State |
| Mode      | 3:1 | These the               | ree bits control the trace mode function.                                                                                                                                                                               | R/W        | Undefined   |
|           |     | Mode                    | Trace Mode                                                                                                                                                                                                              |            |             |
|           |     | 000                     | Trace PC                                                                                                                                                                                                                |            |             |
|           |     | 001                     | Trace PC and load address                                                                                                                                                                                               |            |             |
|           |     | 010                     | Trace PC and store address                                                                                                                                                                                              |            |             |
|           |     | 011                     | Trace PC and both load/store addresses                                                                                                                                                                                  |            |             |
|           |     | 100                     | Trace PC and load data                                                                                                                                                                                                  |            |             |
|           |     | 101                     | Trace PC and load address and data                                                                                                                                                                                      |            |             |
|           |     | 110                     | Trace PC and store address and data                                                                                                                                                                                     |            |             |
|           |     | 111                     | Trace PC and both load/store address and data                                                                                                                                                                           |            |             |
|           |     | of these e              | <i>ceControl2</i> <sub>ValidModes</sub> field determines which<br>encodings are supported by the processor. The<br>of the processor is <b>UNPREDICTABLE</b> if<br>is set to a value which is not supported by the<br>r. |            |             |
| On        | 0   | trol. Whe<br>to one, tr | e master trace enable switch in software con-<br>en zero, tracing is always disabled. When set<br>acing is enabled whenever the other enabling<br>are also true.                                                        | R/W        | 0           |

#### Table 6.48 TraceControl Register Field Descriptions (Continued)

## 6.2.42 Trace Control2 Register (CP0 Register 23, Select 2)

The *TraceControl2* register provides additional control and status information. Note that some fields in the *TraceControl2* register are read-only, but have a reset state of "Undefined". This is because these values are loaded from the Trace Control Block (TCB) (see Section 10.8.6 "ITCB Register Interface for Software Configurability"). As such, these fields in the *TraceControl2* register will not have valid values until the TCB asserts these values.

This register is only implemented if the EJTAG PDTrace capability is present.

#### Figure 6.43 TraceControl2 Register Format

| 31 |   | 7 | 65              | 4 | 3   | 2   | 0 |
|----|---|---|-----------------|---|-----|-----|---|
|    | 0 |   | Valid-<br>Modes |   | TBU | SyP |   |

#### Table 6.49 TraceControl2 Register Field Descriptions

| Field | Fields |                                                                         |   |             |
|-------|--------|-------------------------------------------------------------------------|---|-------------|
| Name  | Bits   | s Description                                                           |   | Reset State |
| 0     | 31:5   | Reserved for future use; Must be written as zero; returns zero on read. | 0 | 0           |

| Fields     |                                                        |                                   |                                                                                                                                                                                              |            |                    |
|------------|--------------------------------------------------------|-----------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|--------------------|
| Name       | Bits                                                   | -                                 | Description                                                                                                                                                                                  | Read/Write | Reset State        |
| ValidModes | 6:5                                                    | This field spec<br>by the process | ifies the type of tracing that is supported or.                                                                                                                                              | R          | 10                 |
|            |                                                        | Encoding                          | Meaning                                                                                                                                                                                      |            |                    |
|            |                                                        | 00                                | PC tracing only                                                                                                                                                                              |            |                    |
|            |                                                        | 01                                | PC and load and store address tracing only                                                                                                                                                   |            |                    |
|            | 10 PC, load and store address, and load and store data |                                   |                                                                                                                                                                                              |            |                    |
|            |                                                        | 11                                | Reserved                                                                                                                                                                                     |            |                    |
| TBI        | 4                                                      | This bit indicates mented by the  | tes how many trace buffers are imple-<br>TCB.                                                                                                                                                | R          | Per implementation |
|            |                                                        | Encoding                          | Meaning                                                                                                                                                                                      |            |                    |
|            |                                                        | 0                                 | Only one trace buffer is imple-<br>mented, and the Debug sin-<br>gle-step exception bit of this<br>register indicates which trace<br>buffer is implemented                                   |            |                    |
|            |                                                        | 1                                 | Both on-chip and off-chip trace<br>buffers are implemented by the<br>TCB, and the <i>TBU</i> bit of this<br>register indicates to which trace<br>buffer the trace is currently writ-<br>ten. |            |                    |
| TBU        | 3                                                      | written by the                    | This bit denotes which trace buffer is currently being written by the trace and is used to select the appropriate interpretation of the <i>TraceControl2</i> <sub>SyP</sub> field.           |            | Undefined          |
|            |                                                        | Encoding                          | Meaning                                                                                                                                                                                      |            |                    |
|            |                                                        | 0                                 | Trace data is being sent to an on-chip trace buffer                                                                                                                                          |            |                    |
|            |                                                        | 1                                 | Trace Data is being sent to an off-chip trace buffer                                                                                                                                         |            |                    |

Table 6.49 TraceControl2 Register Field Descriptions (Continued)

| Field | ls   |                                                                                                                                                                                                                                           |                     |                 |   |             |
|-------|------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------|-----------------|---|-------------|
| Name  | Bits |                                                                                                                                                                                                                                           | Description         |                 |   | Reset State |
| SyP   | 2:0  | Used to indicate the synchronization period.<br>The period (in cycles) between which the periodic syn-<br>chronization information is to be sent is defined as<br>shown below, for both when the trace buffer is on-chip<br>and off-chip. |                     |                 | R | Undefined   |
|       |      | SyP                                                                                                                                                                                                                                       | On-chip             | Off-chip        |   |             |
|       |      | 000                                                                                                                                                                                                                                       | 2 <sup>2</sup>      | 27              |   |             |
|       |      | 001                                                                                                                                                                                                                                       | 2 <sup>3</sup>      | 2 <sup>8</sup>  |   |             |
|       |      | 010                                                                                                                                                                                                                                       | $2^{4}$             | 2 <sup>9</sup>  |   |             |
|       |      | 011                                                                                                                                                                                                                                       | 2 <sup>5</sup>      | 2 <sup>10</sup> |   |             |
|       |      | 100                                                                                                                                                                                                                                       | 2 <sup>6</sup>      | 2 <sup>11</sup> |   |             |
|       |      | 101                                                                                                                                                                                                                                       | 2 <sup>7</sup>      | 2 <sup>12</sup> |   |             |
|       |      | 110                                                                                                                                                                                                                                       | 2 <sup>8</sup>      | 2 <sup>13</sup> |   |             |
|       |      | 111                                                                                                                                                                                                                                       | 2 <sup>9</sup>      | 2 <sup>14</sup> |   |             |
|       |      | data is being v<br><i>TraceControl</i><br>column is use                                                                                                                                                                                   | d when the trace da |                 |   |             |

### Table 6.49 TraceControl2 Register Field Descriptions (Continued)

# 6.2.43 User Trace Data1 Register (CP0 Register 23, Select 3)/User Trace Data2 Register (CP0 Register 24, Select 3)

A software write to any bits in the *UserTraceData1* or *UserTraceData2* registers will trigger a trace record to be written indicating a type 1 or type 2 user format respectively. The trace output data is **UNPREDICTABLE** if these registers are written in consecutive cycles.

This register is only implemented if the MIPS iFlowtrace capability is present.

#### Figure 6.44 User Trace Data1/User Trace Data2 Register Format

| : | 31   | 0 |   |
|---|------|---|---|
|   | Data |   | 1 |

#### Table 6.50 UserTraceData1/UserTraceData2 Register Field Descriptions

| Fie  | lds  |                                                                                                                                                                     | Read / |             |
|------|------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------|-------------|
| Name | Bits | Description                                                                                                                                                         | Write  | Reset State |
| Data | 31:0 | Software readable/writable data. When written, this triggers a user format trace record out of the PDtrace interface that transmits the Data field to trace memory. | R/W    | 0           |

### 6.2.44 TraceBPC Register (CP0 Register 23, Select 4)

This register is used to start and stop tracing using an EJTAG Hardware breakpoint. The Hardware breakpoint can then be set as a trigger source and optionally also as a Debug exception breakpoint.

This register is only implemented if hardware breakpoints and the EJTAG PDTrace capability are both present.

#### Figure 6.45 Trace BPC Register Format

| 31 | 30 | 18 17 16 | 5 15 | 14 6 | 5     | 0 |
|----|----|----------|------|------|-------|---|
| DE | 0  | DBPO     | n IE | 0    | IBPOn |   |

| Fields |       |                                                                                                                                                          |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |            |             |
|--------|-------|----------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name   | Bits  |                                                                                                                                                          | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | Read/Write | Reset State |
| DE     | 31    |                                                                                                                                                          | y whether the trigger signal from<br>reakpoint should trigger tracing func-                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | R/W        | 0           |
|        |       | Encoding                                                                                                                                                 | Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |            |             |
|        |       | 0                                                                                                                                                        | Disables trigger signals from data breakpoints                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |            |             |
|        |       | 1                                                                                                                                                        | Enables trigger signals from data breakpoints                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |            |             |
| 0      | 30:18 | Reserved                                                                                                                                                 |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | 0          | 0           |
| DBPOn  | 17:16 | EJTAG hardw<br>mented. For ex<br>data breakpoin<br>EJTAG impler<br>16 and 17. The<br>logic because<br>A value of one<br>the correspond<br>And a value of | Each of the 2 bits corresponds to the 2 possible<br>EJTAG hardware data breakpoints that may be imple-<br>nented. For example, bit 16 corresponds to the first<br>data breakpoint. If 2 data breakpoints are present in the<br>EJTAG implementation, then they correspond to bits<br>16 and 17. The rest are always ignored by the tracing<br>ogic because they will never be triggered.<br>A value of one for each bit implies that a trigger from<br>he corresponding data breakpoint should start tracing.<br>And a value of zero implies that tracing should be<br>urned off with the trigger signal. |            | 0           |
| IE     | 15    |                                                                                                                                                          | y whether the trigger signal from<br>etion breakpoint should trigger tracing<br>ot:                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | R/W        | 0           |
|        |       | Encoding                                                                                                                                                 | Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |            |             |
|        |       | 0                                                                                                                                                        | Disables trigger signals from instruc-<br>tion breakpoints                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |            |             |
|        |       | 1                                                                                                                                                        | Enables trigger signals from instruc-<br>tion breakpoints                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |            |             |
| 0      | 14:6  | Reserved                                                                                                                                                 |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | 0          | 0           |

### Table 6.51 TraceBPC Register Field Descriptions

| Fiel      | ds  |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |            |             |
|-----------|-----|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name Bits |     | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              | Read/Write | Reset State |
| IBPOn     | 5:0 | Each of the 6 bits corresponds to the 6 possible<br>EJTAG hardware instruction breakpoints that may be<br>implemented. Bit 0 corresponds to the first instruction<br>breakpoint, and so on. If only 2 instruction breakpoints<br>are present in the EJTAG implementation, then only<br>bits 0 and 1 <b>a</b> re used. The rest are always ignored by<br>the tracing logic because they will never be triggered.<br>A value of one for each bit implies that a trigger from<br>the corresponding instruction breakpoint should start<br>tracing. And a value of zeroimplies that tracing should<br>be turned off with the trigger signal. | R/W        | 0           |

### Table 6.51 TraceBPC Register Field Descriptions (Continued)

### 6.2.45 Debug2 Register (CP0 Register 23, Select 6)

This register holds additional information about Complex Breakpoint exceptions.

This register is only implemented if complex hardware breakpoints are present.

#### Figure 6.46 Debug2 Register Format

| 3 | 1 4 | 3   | 2  | 1   | 0    |
|---|-----|-----|----|-----|------|
|   | 0   | Prm | DQ | Tup | PaCo |

| Fields |      |                                                                                                                                     |            |             |
|--------|------|-------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name   | Bits | Description                                                                                                                         | Read/Write | Reset State |
| 0      | 31:4 | Reserved                                                                                                                            | 0          | 0           |
| Prm    | 3    | Primed - indicates whether a complex breakpoint with<br>an active priming condition was seen on the last debug<br>exception.        | R          | Undefined   |
| DQ     | 2    | Data Qualified - indicates whether a complex break-<br>point with an active data qualifier was seen on the last<br>debug exception. | R          | Undefined   |
| Tup    | 1    | Tuple - indicates whether a tuple breakpoint was seen<br>on the last debug exception.                                               | R          | Undefined   |
| РаСо   | 0    | Pass Counter - indicates whether a complex breakpoint<br>with an active pass counter was seen on the last debug<br>exception        | R          | Undefined   |

#### Table 6.52 Debug2 Register Field Descriptions

### 6.2.46 Debug Exception Program Counter Register (CP0 Register 24, Select 0)

The Debug Exception Program Counter (*DEPC*) register is a read/write register that contains the address at which processing resumes after a debug exception or debug mode exception has been serviced.

For synchronous (precise) debug and debug mode exceptions, the DEPC contains either:

- The virtual address of the instruction that was the direct cause of the debug exception, or
- The virtual address of the immediately preceding branch or jump instruction, when the debug exception causing instruction is in a branch delay slot, and the *Debug Branch Delay (DBD)* bit in the *Debug* register is set.

For asynchronous debug exceptions (debug interrupt, complex break), the *DEPC* contains the virtual address of the instruction where execution should resume after the debug handler code is executed.

In processors that implement microMIPS, a read of the *DEPC* register (via MFC0) returns the following value in the destination GPR:

 $GPR[rt] \leftarrow DebugExceptionPC_{31,1}$  || ISAMode<sub>0</sub>

That is, the upper 31 bits of the debug exception PC are combined with the lower bit of the *ISAMode* field and written to the GPR.

Similarly, a write to the *DEPC* register (via MTC0) takes the value from the GPR and distributes that value to the debug exception PC and the *ISAMode* field, as follows

```
DebugExceptionPC \leftarrow GPR[rt]<sub>31.1</sub> || 0
ISAMode \leftarrow 2#0 || GPR[rt]<sub>0</sub>
```

That is, the upper 31 bits of the GPR are written to the upper 31 bits of the debug exception PC, and the lower bit of the debug exception PC is cleared. The upper bit of the *ISAMode* field is cleared and the lower bit is loaded from the lower bit of the GPR.

#### Figure 6.47 DEPC Register Format

| 31 |      | 0 |
|----|------|---|
|    | DEPC |   |

#### Table 6.53 DEPC Register Formats

| Fields |        |                                                                                                                                                                                                                                                                                                                                                                                      |            |           |
|--------|--------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-----------|
| Name   | Bit(s) | Description                                                                                                                                                                                                                                                                                                                                                                          | Read/Write | Reset     |
| DEPC   | 31:0   | The <i>DEPC</i> register is updated with the virtual address of<br>the instruction that caused the debug exception. If the<br>instruction is in the branch delay slot, then the virtual<br>address of the immediately preceding branch or jump<br>instruction is placed in this register.<br>Execution of the DERET instruction causes a jump to the<br>address in the <i>DEPC</i> . | R/W        | Undefined |

### 6.2.47 Performance Counter Register (CP0 Register 25, select 0-3)

The microAptiv UP processor defines two performance counters and two associated control registers, which are mapped to CP0 register 25. The select field of the MTC0/MFC0 instructions are used to select the specific register accessed by the instruction, as shown in Table 6.54.

| Select[2:0] | Register           |
|-------------|--------------------|
| 0           | Register 0 Control |
| 1           | Register 0 Count   |
| 2           | Register 1 Control |
| 3           | Register 1 Count   |

**Table 6.54 Performance Counter Register Selects** 

Each counter is a 32-bit read/write register and is incremented by one each time the countable event, specified in its associated control register, occurs. Each counter can independently count one type of event at a time.

Bit 31 of each of the counters are ANDed with an interrupt enable bit, *IE*, of their respective control register to determine if a performance counter interrupt should be signalled. The two values are then ORed together to create the  $SI_PCI$  output. Traditionally, this signal is combined with one of the SI\_Int pins to signal an interrupt to the microAptiv UP. However, this is no longer needed as the core will internally route the interrupt to the IP number set by the *IntCtl.IPPCI* field. Counting is not affected by the interrupt indication. This output is cleared when the counter wraps to zero, and may be cleared in software by writing a value with bit 31 = 0 to the *Performance Counter Count* registers.

NOTE: The performance counter registers are connected to a clock that is stopped when the processor is in sleep mode (if the top-level clock gater is present). Most events would not be active during that time, but others would be, notably the cycle count. This behavior should be considered when analyzing measurements taken on a system. Further, note that FPGA implementations of the core would generally not have the clock gater present and thus would have different behavior than a typical ASIC implementation.

#### Figure 6.48 Performance Counter Control Register

| 31 30 |   | 15 14   | 11 10 |       | 5 4 | 3  | 2 | 1 | 0   |
|-------|---|---------|-------|-------|-----|----|---|---|-----|
| М     | 0 | EventEx | L     | Event | Ι   | EU | 0 | K | EXL |

| Fields   |       |                                                                                                                                                                      |     |             |
|----------|-------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----|-------------|
| Name     | Bits  | ts Description                                                                                                                                                       |     | Reset State |
| М        | 31    | If this bit is one, another pair of <i>Performance Control</i> and <i>Counter</i> registers is implemented at an MTC0 or MFC0 select field value of 'n+2' and 'n+3'. | R   | Preset      |
| EventExt | 14:11 | Event specific to Virtualization Module if supported. Possible events are listed in Table 6.56.                                                                      | R/W | Undefined   |
| Event    | 10:5  | Counter event enabled for this counter. Possible events are listed in Table 6.56.                                                                                    | R/W | Undefined   |
| IE       | 4     | Counter Interrupt Enable. This bit masks bit 31 of the associated count register from the interrupt exception request output.                                        | R/W | 0           |

#### Table 6.55 Performance Counter Control Register Field Descriptions

| Fields    |          |                                                                                                                       |            |             |
|-----------|----------|-----------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name Bits |          | Description                                                                                                           | Read/Write | Reset State |
| U         | 3        | Count in User Mode. When this bit is set, the specified event is counted in User Mode.                                | R/W        | Undefined   |
| K         | 1        | Count in Kernel Mode. When this bit is set, count the event in Kernel Mode when <i>EXL</i> and <i>ERL</i> both are 0. | R/W        | Undefined   |
| EXL       | 0        | Count when $EXL$ . When this bit is set, count the event when $EXL = 1$ and $ERL = 0$ .                               | R/W        | Undefined   |
| 0         | 30:12, 2 | Must be written as zeroes; returns zeroes when read.                                                                  | 0          | 0           |

Table 6.55 Performance Counter Control Register Field Descriptions (Continued)

Table 6.56 describes the events countable with the two performance counters. The mode column indicates whether the event counting is influenced by the mode bits (U,K,EXL). The operation of a counter is **UNPREDICTABLE** for events which are specified as Reserved.

Performance counters never count in debug mode or when ERL = 1.

The performance counter resets to a low-power state, in which none of the counters will start counting events until software has enabled event counting, using an MTC0 instruction to the Performance Counter Control Registers.

| Event Num | Counter 0                      | Mode | Counter 1                         | Mode |
|-----------|--------------------------------|------|-----------------------------------|------|
| 0         | Cycles                         | No   | Cycles                            | No   |
| 1         | Instructions completed         | Yes  | Instructions completed            | Yes  |
| 2         | branch instructions            | Yes  | Reserved                          | NA   |
| 3         | JR r31 (return) instructions   | Yes  | Reserved                          | NA   |
| 4         | JR (not r31) instructions      | Yes  | Reserved                          | NA   |
| 5         | ITLB accesses                  | Yes  | ITLB misses                       | Yes  |
| 6         | DTLB accesses                  | Yes  | DTLB misses                       | Yes  |
| 7         | JTLB instruction accesses      | Yes  | JTLB instruction misses           | Yes  |
| 8         | JTLB data accesses             | Yes  | JTLB data misses                  | Yes  |
| 9         | Instruction Cache accesses     | Yes  | Instruction cache misses          | Yes  |
| 10        | Data cache accesses            | Yes  | Data cache writebacks             | Yes  |
| 11        | Data cache misses              | Yes  | Data cache misses                 | Yes  |
| 12        | Reserved                       | NA   | Reserved                          | NA   |
| 13        | Reserved                       | NA   | Reserved                          | NA   |
| 14        | integer instructions completed | Yes  | Reserved                          | NA   |
| 15        | loads completed                | Yes  | Stores completed                  | Yes  |
| 16        | J/JAL completed                | Yes  | microMIPS instructions completed  | Yes  |
| 17        | no-ops completed               | Yes  | Integer multiply/divide completed | Yes  |
| 18        | Stall cycles                   | No   | Reserved                          | NA   |
| 19        | SC instructions completed      | Yes  | SC instructions failed            | Yes  |

| Event Num | Counter 0                               | Mode         | Counter 1                                      | Mode |
|-----------|-----------------------------------------|--------------|------------------------------------------------|------|
| 20        | Prefetch instructions completed         | Yes          | Prefetch instructions completed with cache hit | Yes  |
| 21        | Reserved                                | NA           | Reserved                                       |      |
| 22        | 2 Reserved                              |              | Reserved                                       | NA   |
| 23        | Exceptions taken                        | Yes          | Reserved                                       | NA   |
| 24        | Cache fixup                             | Yes          | Reserved                                       | NA   |
| 25        | IFU stall cycles                        | No           | ALU stall cycles                               | No   |
| 26        | Reserved                                | NA           | Reserved                                       | NA   |
| 27        | Reserved                                | NA           | Reserved                                       | NA   |
| 28        | Reserved                                | NA           | Implementation-specific CP2 event              | Yes  |
| 29        | Implementation-specific ISPRAM event    | Yes          | Implementation-specific DSPRAM event           | Yes  |
| 30        | Implementation-specific CorExtend event | Yes          | Reserved                                       | NA   |
| 31        | Reserved                                | NA           | Reserved                                       | NA   |
| 32        | Reserved                                | NA           | Reserved                                       | NA   |
| 33        | Uncached Loads                          | Yes          | Uncached Stores                                | Yes  |
| 34        | Reserved                                | NA           | Reserved                                       | NA   |
| 35        | CP2 Arithmetic Instructions Completed   | Yes          | CP2 To/From Instructions completed             | Yes  |
| 36        | Reserved                                | NA           | Reserved                                       | NA   |
| 37        | I-Cache Miss stall cycles               | Yes          | D-Cache miss stall cycles                      | Yes  |
| 38        | Reserved                                | NA           | Reserved                                       | NA   |
| 39        | D-Cache miss cycles                     | No           | Reserved                                       | NA   |
| 40        | Uncached stall cycles                   | Yes          | Reserved                                       | NA   |
| 41        | MDU stall cycles                        | Yes Reserved |                                                | NA   |
| 42        | CP2 stall cycles                        | Yes          | CorExtend stall cycles                         | Yes  |
| 43        | ISPRAM stall cycles                     | Yes          | DSPRAM stall cycles                            | Yes  |
| 44        | CACHE Instn stall cycles                | No           | Reserved                                       | NA   |
| 45        | Load to Use stall cycles                | Yes          | Reserved                                       | NA   |
| 46        | Other interlock stall cycles            | Yes          | Reserved                                       | NA   |
| 47        | Reserved                                | NA           | Reserved                                       | NA   |
| 48        | Reserved                                | NA           | Reserved                                       | NA   |
| 49        | EJTAG Instruction Triggerpoints         | Yes          | EJTAG Data Triggerpoints                       | Yes  |
| 50        | Reserved                                | NA           | Reserved                                       | NA   |
| 51        | Reserved                                | NA           | Reserved                                       | NA   |
| 52        | LDQ < 1/4 full                          | No           | LDQ 1/4-1/2 full                               | No   |
| 53        | LDQ > 1/2 full                          | No           | LDQ full pipeline stall cycles                 | No   |
| 54        | WBB < 1/4 full                          | No           | WBB 1/4-1/2 full                               | No   |
| 55        | WBB > 1/2 full                          | No           | WBB full pipeline stall cycles                 | No   |
| 56-1023   | Reserved                                | NA           | Reserved                                       | NA   |

Table 6.56 Performance Counter Events Sorted by Event Number (Continued)

| Event Name                        | Counter          | Event<br>Number | Description                                                                                                                                                                                                                                                                                                                                                          |
|-----------------------------------|------------------|-----------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Cycles                            | 0/1              | 0               | Total number of cycles.<br>The performance counters are clocked by the top-level gated clock. If<br>the microAptiv UP is built with that clock gater present, none of the<br>counters will increment while the clock is stopped, e.g., due to a<br>WAIT instruction.                                                                                                 |
| Instruction Completion: The follo | wing events indi | cate comple     | tion of various types of instructions                                                                                                                                                                                                                                                                                                                                |
| Instructions                      | 0/1              | 1               | Total number of instructions completed.                                                                                                                                                                                                                                                                                                                              |
| Branch instns                     | 0                | 2               | Counts all branch instructions that completed.                                                                                                                                                                                                                                                                                                                       |
| JR R31 (return) instns            | 0                | 3               | Counts all JR R31 instructions that completed.                                                                                                                                                                                                                                                                                                                       |
| JR (not R31)                      | 0                | 4               | Counts all JR \$xx (not \$31) and JALR instructions (indirect jumps).                                                                                                                                                                                                                                                                                                |
| Integer instns                    | 0                | 14              | Non-floating-point, non-Coprocessor 2 instructions.                                                                                                                                                                                                                                                                                                                  |
| Loads                             | 0                | 15              | Includes both integer and coprocessor loads.                                                                                                                                                                                                                                                                                                                         |
| Stores                            | 1                | 15              | Includes both integer and coprocessor stores.                                                                                                                                                                                                                                                                                                                        |
| J/JAL                             | 0                | 16              | Direct Jump (And Link) instruction.                                                                                                                                                                                                                                                                                                                                  |
| microMIPS                         | 1                | 16              | All microMIPS instructions.                                                                                                                                                                                                                                                                                                                                          |
| no-ops                            | 0                | 17              | This includes all instructions that normally write to a GPR, but where the destination register was set to r0.                                                                                                                                                                                                                                                       |
| Integer Multiply/Divide           | 1                | 17              | Counts all Integer Multiply/Divide instructions (MULxx, DIVx, MADDx, MSUBx).                                                                                                                                                                                                                                                                                         |
| SC                                | 0                | 19              | Counts conditional stores regardless of whether they succeeded.                                                                                                                                                                                                                                                                                                      |
| PREF                              | 0                | 20              | Note that this only counts PREFs that are actually attempted. PREFs to uncached addresses or ones with translation errors are not counted                                                                                                                                                                                                                            |
| Uncached Loads                    | 0                | 33              | Includes both Uncached and Uncached Accelerated CCAs.                                                                                                                                                                                                                                                                                                                |
| Uncached Stores                   | 1                | 33              |                                                                                                                                                                                                                                                                                                                                                                      |
| Cp2 Arithmetic instns             | 0                | 35              | Counts Coprocessor 2 register-to-register instructions.                                                                                                                                                                                                                                                                                                              |
| Cp2 To/From instns                | 1                | 35              | Includes move to/from, control to/from, and cop2 loads and stores.                                                                                                                                                                                                                                                                                                   |
| Instruction execution events      |                  |                 | 1                                                                                                                                                                                                                                                                                                                                                                    |
| ITLB accesses                     | 0                | 5               | Counts ITLB accesses that are due to fetches in IF stage of the pipe<br>that do not use fixed mapping or are not in unmapped space. If an<br>address is fetched twice down the pipe (as in the case of a cache miss),<br>that instruction will count 2 ITLB accesses. Also, because each fetch<br>gets us 2 instructions, there is one access marked per doubleword. |
| ITLB misses                       | 1                | 5               | Counts all misses in ITLB except ones that are on the back of another<br>miss. We cannot process back-to-back misses, and thus those are<br>ignored for this purpose. Also ignored if there is some form of address<br>error.                                                                                                                                        |
| DTLB accesses                     | 0                | 6               | Counts DTLB access including those in unmapped address spaces.                                                                                                                                                                                                                                                                                                       |
| DTLB misses                       | 1                | 6               | Counts DTLB misses. Back-to-back misses that result in only one DTLB entry getting refilled are counted as a single miss.                                                                                                                                                                                                                                            |

### Table 6.57 Performance Counter Event Descriptions Sorted by Event Type (Continued)

| Event Name                    | Counter | Event<br>Number | Description                                                                                                                                                                                                                                                                                                                      |
|-------------------------------|---------|-----------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| JTLB instruction accesses     | 0       | 7               | Instruction JTLB accesses are counted exactly the same as ITLB misses.                                                                                                                                                                                                                                                           |
| JTLB instruction misses       | 1       | 7               | Counts instruction JTLB accesses that result in no match or a match on an invalid translation.                                                                                                                                                                                                                                   |
| JTLB data accesses            | 0       | 8               | Data JTLB accesses.                                                                                                                                                                                                                                                                                                              |
| JTLB data misses              | 1       | 8               | Counts data JTLB accesses that result in no match or a match on an invalid translation.                                                                                                                                                                                                                                          |
| I-Cache accesses              | 0       | 9               | Counts every time the instruction cache is accessed. All replays, wasted fetches, etc. are counted.                                                                                                                                                                                                                              |
| I-Cache misses                | 1       | 9               | Counts all instruction cache misses that result in a bus request.                                                                                                                                                                                                                                                                |
| D-Cache accesses              | 0       | 10              | Counts cached loads and stores.                                                                                                                                                                                                                                                                                                  |
| D-Cache writebacks            | 1       | 10              | Counts cache lines written back to memory due to replacement or cacheops.                                                                                                                                                                                                                                                        |
| D-Cache misses                | 0/1     | 11              | Counts loads and stores that miss in the cache.                                                                                                                                                                                                                                                                                  |
| SC instructions failed        | 1       | 19              | SC instruction that did not update memory.<br>Note: While this event and the SC instruction count event can be con-<br>figured to count in specific operating modes, the timing of the events<br>is much different, and the observed operating mode could change<br>between them, causing some inaccuracy in the measured ratio. |
| PREF completed with cache hit | 1 20    |                 | Counts PREF instructions that hit in the cache.                                                                                                                                                                                                                                                                                  |
| Exceptions Taken              | 0       | 23              | Any type of exception taken.                                                                                                                                                                                                                                                                                                     |
| EJTAG instruction triggers    | 0       | 49              | Number of times an EJTAG Instruction Trigger Point condition matched.                                                                                                                                                                                                                                                            |
| EJTAG data triggers           | 1       | 49              | Number of times an EJTAG Data Trigger Point condition matched.                                                                                                                                                                                                                                                                   |
| Pipeline Function             |         |                 |                                                                                                                                                                                                                                                                                                                                  |
| Cache fixup                   | 0       | 24              | Counts cycles where the DCC is in a fix-up and cannot accept a new instruction from the ALU. Fix-ups are replays within the DCC that occur when an instruction needs to re-access the cache or the DTLB.                                                                                                                         |
| General Stalls                |         | 1               |                                                                                                                                                                                                                                                                                                                                  |
| IFU stall cycles              | 0       | 25              | Counts the number of cycles in which the fetch unit is not providing a valid instruction to the ALU.                                                                                                                                                                                                                             |
| ALU stall cycles              | 1       | 25              | Counts the number of cycles in which the ALU pipeline cannot advance.                                                                                                                                                                                                                                                            |
| Stall cycles                  | 0       | 18              | Counts the total number of cycles in which no instructions are issued<br>by ICC to ALU (the RF stage does not advance). This includes both of<br>the previous two events. However, this is different from the sum of<br>them, because cycles when both stalls are active will only be counted<br>once.                           |
|                               |         |                 | due to this. This will include bubbles introduced by replays within the ers for each of the active events will be incremented.                                                                                                                                                                                                   |
| I-Cache miss stall cycles     | 0       | 37              | Cycles when ICC stalls because an I-Cache miss caused the ICC not to have any runnable instructions. Ignores the stalls due to ITLB misses as well as the 4 cycles following a redirect.                                                                                                                                         |

| Event Name                             | Counter        | Event<br>Number | Description                                                                                                                                                                                                                                             |
|----------------------------------------|----------------|-----------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| D-Cache miss stall cycles              | 1              | 37              | Counts all cycles in which the integer pipeline waits on a Load to return data due to a D-Cache miss.                                                                                                                                                   |
| D-Cache miss cycle cycles              | 0              | 39              | D-Cache miss is outstanding, but not necessarily stalling the pipeline.<br>The difference between this and D-Cache miss stall cycles can show<br>the gain from non-blocking cache misses.                                                               |
| Uncached stall cycles                  | 0              | 40              | Cycles in which the processor is stalled on an uncached fetch, load, or store.                                                                                                                                                                          |
| MDU stall cycles                       | 0              | 41              | Counts all cycles in which the integer pipeline waits on MDU return data.                                                                                                                                                                               |
| Cp2 stall cycles                       | 0              | 42              | Counts all cycles in which the integer pipeline waits on CP2 return data.                                                                                                                                                                               |
| CorExtend stall cycles                 | 1              | 42              | Counts all cycles in which the integer pipeline waits on CorExtend return data.                                                                                                                                                                         |
| ISPRAM stall cycles                    | 0              | 43              | Counts all pipeline bubbles that result from multi-cycle ISPRAM<br>access. Pipeline bubbles are defined as all cycles in which the ICC<br>doesn't present an instruction to the ALU. The four cycles after a redi-<br>rect are not counted.             |
| DSPRAM stall cycles                    | 1              | 43              | Counts stall cycles created by an instruction waiting for access to DSPRAM.                                                                                                                                                                             |
| CACHE instn stall cycles               | 0              | 44              | Counts all cycles in which pipeline is stalled due to CACHE instruc-<br>tions. Includes cycles in which CACHE instructions themselves are<br>stalled in the ALU, and cycles in which CACHE instructions cause<br>subsequent instructions to be stalled. |
| Load to Use stall cycles               | 0              | 45              | Counts all cycles in which the integer pipeline waits on Load return dependent data.                                                                                                                                                                    |
| Other interlocks stall cycles          | 0              | 46              | Counts all cycles in which the integer pipeline waits on return data from MFC0 and RDHWR instructions.                                                                                                                                                  |
| LFB full pipeline stall cycles         | 1              | 53              | Cycles in which the pipeline is stalled because the Load Fill Buffer in the DCC is full.                                                                                                                                                                |
| Write Through Buffer full stall cycles | 1              | 55              | Cycles in which the pipeline is stalled because the Write Through<br>Buffer in the BIU is full.                                                                                                                                                         |
| Implementation-specific events - Mod   | dules that can | be replaced b   | by the customer will have an event signal associated with them.                                                                                                                                                                                         |
| Cp2                                    | 1              | 28              | Set to 1 if COP2 is implemented.                                                                                                                                                                                                                        |
| ISPRAM                                 | 0              | 29              | Set to 1 if ISPRAM is implemented.                                                                                                                                                                                                                      |
| DSPRAM                                 | 1              | 29              | Set to 1 if DSPRAM is implemented.                                                                                                                                                                                                                      |
| CorExtend                              | 0              | 30              | Set to 1 if CorExtend is implemented.                                                                                                                                                                                                                   |

| Event Name                              | Counter | Event<br>Number | Description                                                                                                            |                |                |                  |  |  |
|-----------------------------------------|---------|-----------------|------------------------------------------------------------------------------------------------------------------------|----------------|----------------|------------------|--|--|
| Load Fill Queue < 1/4 full              | 0       | 52              |                                                                                                                        | Yer Occupancy: |                |                  |  |  |
| Load Fill Queue 1/4 to 1/2 full         | 1       | 52              | The following table shows what values fall into each of the bins for<br>the different buffer sizes that can be chosen. |                |                |                  |  |  |
| Load Fill Queue > 1/2 full              | 0       | 53              |                                                                                                                        |                |                |                  |  |  |
| Write Through Buffer < 1/4 full         | 0       | 54              |                                                                                                                        | State          | 4-entry Buffer | 8/9-entry Buffer |  |  |
|                                         |         |                 | -                                                                                                                      | < 1/4          | 0              | 0-1              |  |  |
| Write Through Buffer 1/4 to 1/2 full154 |         |                 | 1/4-1/2                                                                                                                | 1-2            | 2-4            |                  |  |  |
| Write Through Buffer > 1/2 full         | 0       | 55              |                                                                                                                        | > 1/2          | 3+             | 5+               |  |  |
|                                         |         |                 |                                                                                                                        | -              | -              | -                |  |  |

#### Table 6.57 Performance Counter Event Descriptions Sorted by Event Type (Continued)

#### Figure 6.49 Performance Counter Count Register

| 31      | 0 |
|---------|---|
| Counter |   |

#### Table 6.58 Performance Counter Count Register Field Descriptions

| Fie     | lds  |             |              |             |  |
|---------|------|-------------|--------------|-------------|--|
| Name    | Bits | Description | Read / Write | Reset State |  |
| Counter | 31:0 | Counter     | R/W          | Undefined   |  |

### 6.2.48 ErrCtl Register (CP0 Register 26, Select 0)

The *ErrCtl* register controls parity protection of data and instruction caches/SPRAM and provides for software testing of the way-selection and scratchpad RAMs. Parity protection can be enabled or disabled using the *PE* bit.

When parity is enabled and the *PO* bit is deasserted, the CACHE Index Store Tag and Index Store Data operations will internally generate parity to be written into the RAM arrays. However, when the *PO* bit is asserted, tag array parity is written using the *P* bit of the *TagLo* register and data array parity is written using the *Pl/PD* bits of *ErrCtl*.

The way-selection RAM test mode is enabled by setting the *WST* bit. It modifies the functionality of the CACHE Index Load Tag and Index Store Tag operations so that they modify the way-selection RAM and leave the Tag RAMs untouched. When this bit is set, the lower 6 bits of the *PA* field in the *TagLo* register are used as the source and destination for Index Load Tag and Index Store Tag CACHE operations.

The *WST* bit also enables the data RAM test mode. When this bit is set, the Index Store Data CACHE instruction is enabled. This CACHE operation writes the contents of the *DataLo* register to the word in the data array that is indicated by the index and byte address.

The *SPR* bit enables CACHE accesses to the optional Scratchpad RAMs. When this bit is set, Index Load Tag, Index Store Tag, and Index Store Data CACHE instructions will send reads or writes to the Scratchpad RAM port. The effects of these operations are dependent on the particular Scratchpad implementation.

A CACHE Index Load Tag operation to the instruction cache will update the *PI* field with the parity bits from the data array if parity is supported. A CACHE Index Load Tag operation to the data cache will cause the *PD* bits to be

updated with the byte parity for the selected word of the data array if parity is implemented. If parity is disabled or not implemented, the contents of the *PI* and *PD* fields after a CACHE Index Load Tag operation will be 0.

#### Figure 6.50 ErrCtl Register Format

| 31 | 30 | 29  | 28  | 27 | 8 | 7 4 | 3 |    | 0 |
|----|----|-----|-----|----|---|-----|---|----|---|
| PE | PO | WST | SPR | R  |   | PI  | ] | PD |   |

| Fie  | lds                            |                                                 |                                                                                                                                                                                                                                                              |            |             |
|------|--------------------------------|-------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name | Bit(s)                         | 1                                               | Description                                                                                                                                                                                                                                                  | Read/Write | Reset State |
| PE   | 31                             | -                                               | . This bit enables or disables the parity protec-<br>the instruction caches/SPRAM and the data<br>AM .                                                                                                                                                       | R or R/W   | 0           |
|      |                                | Encoding                                        | Meaning                                                                                                                                                                                                                                                      |            |             |
|      |                                | 0                                               | Parity disabled                                                                                                                                                                                                                                              |            |             |
|      |                                | 1                                               | Parity enabled                                                                                                                                                                                                                                               |            |             |
|      |                                | mented when<br>supported, thi                   | only write-able if the parity option was imple-<br>the microAptiv UP was built. If parity is not<br>is field is always read as 0. Software can test<br>port by attempting to write a 1 to this field,<br>k the value.                                        |            |             |
| РО   | over<br>the <i>F</i><br>ity fo |                                                 | rite. If set, the <i>Pl/PD</i> fields of this register<br>lculated parity for the data array. In addition,<br>the <i>TagLo</i> register overwrites calculated par-<br>array. This bit only has significance during<br>x Store Tag and CACHE Index Store Data | R/W        | 0           |
|      |                                | Encoding                                        | Meaning                                                                                                                                                                                                                                                      |            |             |
|      |                                | 0                                               | Use calculated parity                                                                                                                                                                                                                                        |            |             |
|      |                                | 1                                               | Override calculated parity                                                                                                                                                                                                                                   |            |             |
| WST  | 29                             | should be real<br>instructions.<br>Also enables | other the tag array or the way-select array<br>d/written on Index Load/Store Tag CACHE<br>the Index Store Data CACHE instruction<br>the contents of DataLo to the data array.                                                                                | R/W        | 0           |
| SPR  | 28                             |                                                 | ed CACHE instructions to operate on the AM instead of the cache                                                                                                                                                                                              | R/W        | 0           |
| R    | 27:8                           | Must be writt                                   | en as zero; returns zero on reads.                                                                                                                                                                                                                           | 0          | 0           |
| PI   | 7:4                            | Parity bit read<br>RAM.                         | d from or written to instruction cache data                                                                                                                                                                                                                  | R/W        | Undefined   |
| PD   | 3:0                            | Parity bits rea                                 | d from or written to data cache data RAM.                                                                                                                                                                                                                    | R/W        | Undefined   |

### Table 6.59 Errctl Register Field Descriptions

### 6.2.49 CacheErr Register (CP0 Register 27, Select 0)

The *CacheErr* register provides an interface with the cache error-detection logic. When a caches/SPRAM Parity Error exception is signaled, the fields of this register are set accordingly.

|   | Figure 6.51 CacheErr Register (Primary Caches) |    |    |    |    |    |    |    |    |    |      |    |       |   |
|---|------------------------------------------------|----|----|----|----|----|----|----|----|----|------|----|-------|---|
|   | 31                                             | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 2 | 20 | 19    | 0 |
| [ | ER                                             | EC | ED | ET | ES | EE | EB | EF | SP | EW | Wa   | у  | Index |   |

| Fields |      |                                                                                                   |              |             |
|--------|------|---------------------------------------------------------------------------------------------------|--------------|-------------|
| Name   | Bits | Description                                                                                       | Read / Write | Reset State |
| ER     | 31   | Error Reference. Indicates the type of reference that encountered an error.                       | R            | Undefined   |
|        |      | Encoding Meaning                                                                                  |              |             |
|        |      | 0 Instruction                                                                                     |              |             |
|        |      | 1 Data                                                                                            |              |             |
| EC     | 30   | Indicates the cache level at which the error was detected:                                        | R            | 0           |
|        |      | Encoding Meaning                                                                                  |              |             |
|        |      | 0 Primary                                                                                         |              |             |
|        |      | 1 Non-primary                                                                                     |              |             |
| ED     | 29   | Error Data. Indicates a data RAM error.                                                           | R            | Undefined   |
|        |      | Encoding Meaning                                                                                  |              |             |
|        |      | 0 No data RAM error detected                                                                      |              |             |
|        |      | 1 Data RAM error detected                                                                         |              |             |
| ET     | 28   | Error Tag. Indicates a tag RAM error.                                                             | R            | Undefined   |
|        |      | Encoding Meaning                                                                                  |              |             |
|        |      | 0 No tag RAM error detected                                                                       |              |             |
|        |      | 1 Tag RAM error detected                                                                          |              |             |
| ES     | 27   | Error source. Indicates whether error was caused by internal processor or external snoop request. | R            | Undefined   |
|        |      | Encoding Meaning                                                                                  |              |             |
|        |      | 0 Error on internal request                                                                       |              |             |
|        |      | 1 Error on external request                                                                       |              |             |
| EE     | 26   | Error external: Not supported.                                                                    | R            | 0           |

#### Table 6.60 CacheErr Register Field Descriptions (Primary Caches)

| Fields |       |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |              |             |  |  |
|--------|-------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------|-------------|--|--|
| Name   | Bits  | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | Read / Write | Reset State |  |  |
| EB     | 25    | Error Both. Indicates that a data caches/SPRAM parity error occurred in addition to an instruction caches/SPRAM parity error.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             | R            | Undefined   |  |  |
|        |       | Encoding Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |              |             |  |  |
|        |       | 0 No additional data caches/SPRAM parity error                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |              |             |  |  |
|        |       | 1 Additional data caches/SPRam parity error                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |              |             |  |  |
|        |       | In the case of an additional data caches/SPRAM parity error, the remainder of the bits in this register are set according to the instruction caches/SPRAM parity error.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |              |             |  |  |
| EF     | 24    | Error Fatal. Indicates that a fatal cache error has occurred.There are a few situations in which software will not be able to getall information about a cache error from the <i>CacheErr</i> register.These situations are fatal, because software cannot determine whichmemory locations have been affected by the error. To enable software to detect these cases, the <i>EF</i> bit (bit 24) has been added to the <i>CacheErr</i> register.The following cases are indicated as fatal cache errors by the <i>EF</i> bit:1Dirty parity error in dirty victim (dirty bit cleared)2Tag parity error in dirty victim3Data parity error in dirty victim4WB store miss and EW error at the requested indexIn addition to the above, simultaneous instruction and data cacheerrors as indicated by <i>CacheErr<sub>EB</sub></i> will cause information about thedata cache error to be unavailable. However, that situation is notindicated by <i>CacheErr<sub>EF</sub></i> | R            | Undefined   |  |  |
| SP     | 23    | Scratchpad. Indicates Scratchpad RAM parity error.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        | R            | 0           |  |  |
|        |       | Encoding Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |              |             |  |  |
|        |       | 0 No Scratchpad RAM error detected                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |              |             |  |  |
|        |       | 1 Scratchpad RAM error detected                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |              |             |  |  |
| EW     | 22    | Error Way. Indicates a parity error on the dirty bits that are stored in the way selection RAM array.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | R            | Undefined   |  |  |
|        |       | Encoding Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |              |             |  |  |
|        |       | 0 No way selection RAM error detected                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |              |             |  |  |
|        |       | 1 Way selection RAM error detected                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |              |             |  |  |
| Way    | 21:20 | Way. Specifies the cache way in which the error was detected. It is not valid if a Tag RAM error is detected ( <i>ET</i> =1) or Scratchpad RAM error is detected ( <i>SP</i> =1).                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         | R            | Undefined   |  |  |

| Table 6.60 CacheErr Register Field Descriptions | (Primary Caches) (Continued) |
|-------------------------------------------------|------------------------------|
|-------------------------------------------------|------------------------------|

| Fields     |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |              |             |
|------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------|-------------|
| Name Bits  | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            | Read / Write | Reset State |
| Index 19:0 | Index. Specifies the cache or Scratchpad RAM index of the double<br>word in which the error was detected. The way of the faulty cache is<br>written by hardware in the <i>Way</i> field. Software must combine the<br><i>Way</i> and <i>Index</i> read in this register with cache configuration infor-<br>mation in the <i>Config1</i> register in order to obtain an index which can<br>be used in an indexed CACHE instruction to access the faulty cache<br>data or tag. Note that <i>Index</i> is aligned as a byte index, so it does not<br>need to be shifted by software before it is used in an indexed<br>CACHE instruction. <i>Index</i> bits [4:3] are undefined upon tag RAM<br>errors, and <i>Index</i> bits above the MSB actually used for cache index-<br>ing will also be undefined.<br>Bits [19:16] are only used for errors in the Scratchpad RAM. | R            | Undefined   |

#### Table 6.60 CacheErr Register Field Descriptions (Primary Caches) (Continued)

### 6.2.50 TagLo Register (CP0 Register 28, Select 0)

The TagLo register acts as the interface to the cache tag array. The Index Store Tag and Index Load Tag operations of the CACHE instruction use the TagLo register as the source of tag information. Note that the microAptiv UP core does not implement the TagHi register.

When the WST bit of the ErrCtl register is asserted, this register becomes the interface to the way-selection RAM. In this mode, the fields are redefined to give appropriate access the contents of the WS array instead of the Tag array.

| Figure 6.52 TagLo Register Format (ErrCtl <sub>WST</sub> =0) |    |           |               |                   |     |    |     |    |   |   |   |   |   |   |   |
|--------------------------------------------------------------|----|-----------|---------------|-------------------|-----|----|-----|----|---|---|---|---|---|---|---|
| 31                                                           |    |           |               |                   | 10  | 9  | 8   | 7  | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
|                                                              |    | Р         | A             |                   |     | ŀ  | R   | V  | D | L |   | ] | R |   | Р |
|                                                              |    | Figure 6  | 53 Taglio Reg | gister Format (E  | rrC | FI |     | 1) |   |   |   |   |   |   |   |
|                                                              |    | i igule o | Jo Tageo Re   | gister i ormat (L |     | 'W | st- | "  |   |   |   |   |   |   |   |
| 31                                                           | 04 | 23 2      | 0 19          | 15                | 10  | 0  | Q   | 7  |   | 5 | 1 | 3 | 2 | 1 |   |
| 01                                                           | 24 | 23 2      | 0 19          | 15                | 10  | 9  | 0   | '  |   | 5 | 4 | 5 | 2 | 1 | 0 |

### Table 6.61 TagLo Register Field Descriptions

| Fields |        |                                                                                                                                                                                                                                                                                                                            |            |             |
|--------|--------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name   | Bit(s) | Description                                                                                                                                                                                                                                                                                                                | Read/Write | Reset State |
| PA     | 31:10  | $ErrCtl_{WST}=0$ : This field contains the physical address of the cache line. Bit 31 corresponds to bit 31 of the PA and bit 10 corresponds to bit 10 of the PA.<br>ErrCtl <sub>WST</sub> =1 : PA[31:24] is undefined.                                                                                                    | R/W        | Undefined   |
| WSDP   | 23:20  | $ErrCtl_{WST}$ =1 : Dirty Parity (Optional). This field contains<br>the value read from the WS array during a CACHE Index<br>Load WS operation.<br>If the <i>PO</i> field of the <i>ErrCtl</i> register is asserted, then this<br>field is used to store the dirty parity bits during a CACHE<br>Index Store WS operation. | R/W        | Undefined   |

| Fiel  | ds       |                                                                                                                                                                                                                                                            |            |             |
|-------|----------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name  | Bit(s)   | Description                                                                                                                                                                                                                                                | Read/Write | Reset State |
| WSD   | 19:16    | $ErrCtl_{WST}=1$ : Dirty bits. This field contains the value read from the WS array after a CACHE Index Load WS operation. It is used to store into the WS array during CACHE Index Store WS operations.                                                   | R/W        | Undefined   |
| WSLRU | 15:10    | $ErrCtl_{WST}=1$ : This field contains the value read from or to be stored to the WS array if the WST bit in the ErrCtl register is set.                                                                                                                   | R/W        | Undefined   |
| R     | 9:8, 4:1 | $ErrCtl_{WST} = 0$ : must be written as zero; returns zero on read.                                                                                                                                                                                        | 0          | 0           |
| R     | 9:8, 4:0 | $ErrCtl_{WST} = 1$ : must be written as zero; returns zero on read.                                                                                                                                                                                        | 0          | 0           |
| V     | 7        | This field indicates whether the cache line is valid.                                                                                                                                                                                                      | R/W        | Undefined   |
| D     | 6        | This field indicates whether the cache line is dirty. It will<br>only be set if bit 7 (valid) is also set.                                                                                                                                                 | R/W        | Undefined   |
| L     | 5        | Specifies the lock bit for the cache tag. When this bit is set, and the valid bit is set, the corresponding cache line will not be replaced by the cache replacement algorithm.                                                                            | R/W        | Undefined   |
| Р     | 0        | Parity. Specifies the parity bit for the cache tag. This bit is<br>updated with tag array parity on CACHE Index Load Tag<br>operations and used as tag array parity on Index Store Tag<br>operations when the PO bit of the <i>ErrCtl</i> register is set. | R/W        | Undefined   |

## 6.2.51 DataLo Register (CP0 Register 28, Select 1)

The DataLo register is a register that acts as the interface to the cache data array and is intended for diagnostic operations only. The Index Load Tag operation of the CACHE instruction reads the corresponding data values into the DataLo register. If the WST bit in the ErrCtl register is set, then the contents of DataLo can be written to the cache data array by doing an Index Store Data CACHE instruction. Note that the microAptiv UP core does not implement the DataHi register.

#### Figure 6.54 DataLo Register Format

31 0 DATA

#### Table 6.62 DataLo Register Field Description

| Fiel | ds     |                                                |            | Reset     |
|------|--------|------------------------------------------------|------------|-----------|
| Name | Bit(s) | Description                                    | Read/Write | State     |
| DATA | 31:0   | Low-order data read from the cache data array. | R/W        | Undefined |

#### 6.2.52 ErrorEPC (CP0 Register 30, Select 0)

The *ErrorEPC* register is a read/write register, similar to the *EPC* register, except that *ErrorEPC* is used on error exceptions. All bits of the *ErrorEPC* register are significant and must be writable. It is also used to store the program counter on Reset, Soft Reset, and nonmaskable interrupt (NMI) exceptions.

The *ErrorEPC* register contains the virtual address at which instruction processing can resume after servicing an error. This address can be:

- The virtual address of the instruction that caused the exception
- The virtual address of the immediately preceding branch or jump instruction when the error causing instruction is in a branch delay slot

Unlike the EPC register, there is no corresponding branch delay slot indication for the ErrorEPC register.

In processors that implement microMIPS, a read of the *ErrorEPC* register (via MFC0) returns the following value in the destination GPR:

 $GPR[rt] \leftarrow ErrorExceptionPC_{31..1} || ISAMode_0$ 

That is, the upper 31 bits of the error exception PC are combined with the lower bit of the *ISAMode* field and written to the GPR.

Similarly, a write to the *ErrorEPC* register (via MTC0) takes the value from the GPR and distributes that value to the error exception PC and the *ISAMode* field, as follows

```
 \begin{array}{l} \texttt{ErrprExceptionPC} \leftarrow \texttt{GPR[rt]}_{31..1} \mid \mid 0 \\ \texttt{ISAMode} \leftarrow 2\#0 \mid \mid \texttt{GPR[rt]}_{0} \end{array}
```

That is, the upper 31 bits of the GPR are written to the upper 31 bits of the error exception PC, and the lower bit of the error exception PC is cleared. The upper bit of the *ISAMode* field is cleared and the lower bit is loaded from the lower bit of the GPR.

#### Figure 6.55 ErrorEPC Register Format

| 31 |          | 0 |
|----|----------|---|
|    | ErrorEPC |   |

#### Table 6.63 ErrorEPC Register Field Description

| Fields   |        |                                  |            |             |
|----------|--------|----------------------------------|------------|-------------|
| Name     | Bit(s) | Description                      | Read/Write | Reset State |
| ErrorEPC | 31:0   | Error Exception Program Counter. | R/W        | Undefined   |

#### 6.2.53 DeSave Register (CP0 Register 31, Select 0)

The Debug Exception Save (DeSave) register is a read/write register that functions as a simple memory location. This register is used by the debug exception handler to save one of the GPRs that is then used to save the rest of the context

to a pre-determined memory area (such as in the EJTAG Probe). This register allows the safe debugging of exception handlers and other types of code in which the existence of a valid stack for context saving cannot be assumed.

#### Figure 6.56 DeSave Register Format

| 31     | 0 |
|--------|---|
| DESAVE |   |

#### Table 6.64 DeSave Register Field Description

| Fields | lds    |                                |            |             |
|--------|--------|--------------------------------|------------|-------------|
| Name   | Bit(s) | Description                    | Read/Write | Reset State |
| DESAVE | 31:0   | Debug exception save contents. | R/W        | Undefined   |

### 6.2.54 KScratchn Registers (CP0 Register 31, Selects 2 to 3)

The *KScratchn* registers are optional read/write registers available for scratchpad storage by kernel-mode software. These registers are 32 bits in width for 32-bit processors and 64 bits for 64-bit processors.

The existence of these registers is indicated by the *KScrExist* field in the *Config4* register. The *KScrExist* field specifies which of the selects are populated with a kernel scratch register.

Debug-mode software should not use these registers, but should instead use the *DeSave* register. If EJTAG is implemented, select 0 should not be used for a *KScratch* register. Select 1 is being reserved for future debug use and should not be used for a *KScratch* register.

#### Figure 6-57 KScratchn Register Format

31 0 Data

#### Table 6.65 KScratchn Register Field Descriptions

| Fie  | lds  |                                            | Read / | Reset     |
|------|------|--------------------------------------------|--------|-----------|
| Name | Bits | Description                                | Write  | State     |
| Data | 31:0 | Scratch pad data saved by kernel software. | R/W    | Undefined |

Chapter 7

## Hardware and Software Initialization of the microAptiv™ UP Core

The microAptiv UP processor core contains only a minimal amount of hardware initialization and relies on software to fully initialize the device.

This chapter contains the following sections:

- Section 7.1 "Hardware-Initialized Processor State"
- Section 7.2 "Software Initialized Processor State"

## 7.1 Hardware-Initialized Processor State

The microAptiv UP processor core, like most other MIPS processors, is not fully initialized by hardware reset. Only a minimal subset of the processor state is cleared. This is enough to bring the core up while running in unmapped and uncached code space. All other processor state can then be initialized by software. *SI\_ColdReset* is asserted after power-up to bring the device into a known state. Soft reset can be forced by asserting the *SI\_Reset* pin. This distinction is made for compatibility with other MIPS processors. In practice, both resets are handled identically with the exception of the setting of *Status<sub>SR</sub>*.

#### 7.1.1 Coprocessor 0 State

Much of the hardware initialization occurs in Coprocessor 0.

- · Random (TLB-based MMU cores only)- cleared to maximum value on Reset/SoftReset
- Wired (TLB-based MMU cores only)- cleared to 0 on Reset/SoftReset
- Status<sub>BEV</sub> cleared to 1 on Reset/SoftReset
- Status<sub>TS</sub> cleared to 0 on Reset/SoftReset
- Status<sub>SR</sub> cleared to 0 on Reset, set to 1 on SoftReset
- Status<sub>NMI</sub> cleared to 0 on Reset/SoftReset
- Status<sub>ERL</sub> set to 1 on Reset/SoftReset
- Status<sub>RP</sub> cleared to 0 on Reset/SoftReset
- WatchLol, R, W cleared to 0 on Reset/SoftReset

- Config fields related to static inputs set to input value by Reset/SoftReset
- Config<sub>K0</sub> set to 010 (uncached) on Reset/SoftReset
- Config<sub>KU</sub> set to 010 (uncached) on Reset/SoftReset (FM based MMU cores only)
- Config<sub>K23</sub> set to 010 (uncached) on Reset/SoftReset (FM based MMU cores only)
- ContextConfig set to 0x007ffff0 on Reset/SoftReset (MIPS32 configuration)
- *PageGrain<sub>Mask</sub>* set to 11 on Reset/SoftReset (MIPS32 compatibility mode)
- *Debug<sub>DM</sub>* cleared to 0 on Reset/SoftReset (unless EJTAGBOOT option is used to boot into DebugMode, see Chapter 10, "EJTAG Debug Support in the microAptiv<sup>™</sup> UP Core" on page 230 for details)
- Debug<sub>LSNM</sub> cleared to 0 on Reset/SoftReset
- *Debug<sub>IBUSEP</sub>* cleared to 0 on Reset/SoftReset
- Debug<sub>DBusEP</sub> cleared to 0 on Reset/SoftReset
- Debug<sub>IEXI</sub> cleared to 0 on Reset/SoftReset
- Debug<sub>SSt</sub> cleared to 0 on Reset/SoftReset

### 7.1.2 TLB Initialization

Each TLB entry has a "hidden" state bit which is set by Reset/SoftReset and is cleared when the TLB entry is written. This bit disables matches and prevents "TLB Shutdown" conditions from being generated by the power-up values in the TLB array (when two or more TLB entries match on a single address). This bit is not visible to software.

#### 7.1.3 Bus State Machines

All pending bus transactions are aborted and the state machines in the bus interface unit are reset when a Reset or SoftReset exception is taken.

#### 7.1.4 Static Configuration Inputs

All static configuration inputs (defining the bus mode and cache size for example) should only be changed during Reset.

#### 7.1.5 Fetch Address

Upon Reset/SoftReset, unless the EJTAGBOOT option is used, the fetch is directed to VA 0xBFC00000 (PA 0x1FC00000). This address is in KSeg1, which is unmapped and uncached, so that the TLB and caches do not require hardware initialization.

## 7.2 Software Initialized Processor State

Software is required to initialize the following parts of the device.

## 7.2.1 Register File

The register file powers up in an unknown state with the exception of r0 which is always 0. Initializing the rest of the register file is not required for proper operation in hardware. However, when simulating the operation of the core, unknown values can cause problems. Thus, initializing the register file in the boot code may avoid simulation problems.

## 7.2.2 TLB

Because of the hidden bit indicating initialization, the TLB-based MMU microAptiv UP does not require TLB initialization upon ColdReset. This is an implementation specific feature of the microAptiv UP core and cannot be relied upon if writing generic code for MIPS32/64 processors. When initializing the TLB, care must be taken to avoid creating a "TLB Shutdown" condition where two TLB entries could match on a single address. Unique virtual addresses should be written to each TLB entry to avoid this.

## 7.2.3 Caches

The cache tag and data arrays power up to an unknown state and are not affected by reset. Every tag in the cache arrays should be initialized to an invalid state using the CACHE instruction (typically the Index Invalidate function). This can be a long process, especially since the instruction cache initialization needs to be run in an uncached address region.

## 7.2.4 Coprocessor 0 State

Miscellaneous COP0 states need to be initialized prior to leaving the boot code. There are various exceptions which are blocked by *ERL*=1 or *EXL*=1 and which are not cleared by Reset. These can be cleared to avoid taking spurious exceptions when leaving the boot code.

- Cause: WP (Watch Pending), SW0/1 (Software Interrupts) should be cleared.
- Config: K0 should be set to the desired Cache Coherency Algorithm (CCA) prior to accessing Kseg0.
- Config: (FM Based MMU cores only) KU and K23 should be set to the desired CCA for USeg/KUSeg and KSeg2/3 respectively prior to accessing those regions.
- *Count*: Should be set to a known value if Timer Interrupts are used.
- *Compare*: Should be set to a known value if Timer Interrupts are used. The write to compare will also clear any pending Timer Interrupts (Thus, *Count* should be set before *Compare* to avoid any unexpected interrupts).
- Status: Desired state of the device should be set.
- Other COP0 state: Other registers should be written before they are read. Some registers are not explicitly writeable, and are only updated as a by-product of instruction execution or a taken exception. Uninitialized bits should be masked off after reading these registers.

## Chapter 8

## Caches of the microAptiv<sup>™</sup> UP Core

This chapter describes the caches present in the microAptiv<sup>™</sup> UP processor core. It contains the following sections:

- Section 8.1, "Cache Configurations"
- Section 8.2, "Cache Protocols"
- Section 8.3, "Instruction Cache"
- Section 8.4, "Data Cache"
- Section 8.5, "CACHE Instruction"
- Section 8.6, "Software Cache Testing"
- Section 8.7, "Memory Coherence Issues"

## 8.1 Cache Configurations

The microAptiv UP processor core supports separate instruction and data caches that may be flexibly configured at build time for various sizes, organizations and set-associativities. The use of separate caches allows instruction and data references to proceed simultaneously. Both caches are virtually indexed and physically tagged, allowing cache access to occur in parallel with virtual-to-physical address translation. Parity protection of the cache arrays is an optional feature.

The instruction and data caches are independently configured. For example, the data cache can be 2 KB in size and 2-way set associative, while the instruction cache can be 8 KB in size and 4-way set associative. Each cache is accessed in a single processor cycle.

Cache refills are performed using a 4-word fill buffer, which holds data returned from memory during a 4-beat burst transaction. The critical miss word is always returned first. The caches are blocking until the critical word is returned, but the pipeline may proceed while the other 3 beats of the burst are still active on the bus.

| Parameter                     | Instruction                   | Data                          |
|-------------------------------|-------------------------------|-------------------------------|
| Size                          | 0 - 64 KB                     | 0 - 64 KB                     |
| Number of Cache Sets          | 0, 64, 128, 256, 512 and 1024 | 0, 64, 128, 256, 512 and 1024 |
| Lines Per Set (Associativity) | 1 - 4 way set associative     | 1 - 4 way set associative     |
| Line Size                     | 16 Bytes                      | 16 Bytes                      |

Table 8.1 lists the instruction and data cache attributes:

## **Table 8.1 Instruction and Data Cache Attributes**

MIPS32® microAptiv™ UP Processor Core Family Software User's Manual, Revision 01.02

| Parameter                      | Instruction | Data                                                                                                                                              |
|--------------------------------|-------------|---------------------------------------------------------------------------------------------------------------------------------------------------|
| Read Unit                      | 32 bits     | 32 bits                                                                                                                                           |
| Minimum Write Unit             | 32 bits     | 8 bits                                                                                                                                            |
| Write Policy                   | N/A         | Software selectable options:<br>• write-back with write-allocate<br>• write-through with write-allocate<br>• write-through without write-allocate |
| Miss restart after transfer of | miss word   | miss word                                                                                                                                         |
| Cache Locking                  | per line    | per line                                                                                                                                          |

## Table 8.1 Instruction and Data Cache Attributes (Continued)

Table 8.2 shows the cache size and organization options; note that the same total cache size may be achieved with several different set associativities. Software can identify the instruction or data cache configuration on a microAptiv UP core by reading the appropriate bits of the *Config1* register; see Section 6.2.30, "Config Register (CP0 Register 16, Select 0)" on page 176.

| Cache Size (bytes) | Way Organization Options |  |
|--------------------|--------------------------|--|
| 0K                 | No cache                 |  |
| 1K                 | One 1K way               |  |
| 2К                 | One 2K way               |  |
|                    | Two 1K ways              |  |
| 3К                 | Three 1K ways            |  |
| 4K                 | One 4K way               |  |
|                    | Two 2K ways              |  |
|                    | Four 1K ways             |  |
| 6K                 | Three 2K ways            |  |
| 8K                 | One 8K way               |  |
|                    | Two 4K ways              |  |
|                    | Four 2K ways             |  |
| 12K                | Three 4K ways            |  |
| 16K                | One 16K way              |  |
|                    | Two 8K ways              |  |
|                    | Four 4K ways             |  |
| 24K                | Three 8K ways            |  |
| 32K                | Two 16K ways             |  |
|                    | Four 8K ways             |  |
| 48K                | Three 16K ways           |  |
| 64K                | Four 16K ways            |  |

## **Table 8.2 Instruction and Data Cache Sizes**

MIPS32® microAptiv™ UP Processor Core Family Software User's Manual, Revision 01.02

## 8.2 Cache Protocols

This section describes cache organization, attributes, and cache-line replacement for the instruction and data caches. This section also discusses issues relating to virtual aliasing.

## 8.2.1 Cache Organization

The instruction and data caches each consist of three arrays: tag, data and way-select. The caches are virtually indexed, since a virtual address is used to select the appropriate line within each of the three arrays. The caches are physically tagged, as the tag array contains a physical, not virtual, address.

The tag and data arrays hold n ways of information per set, corresponding to the n-way set associativity of the cache, where n can be between 1 and 4 for a cache in the microAptiv UP core. The way-select array holds information to choose the way to be filled, as well as dirty bits in the case of the data cache.

Figure 8.1 shows the format of each line in the tag, data and way-select arrays without Parity enabled.

| -               | 22    |     |       | 1    | 1     |   |      |
|-----------------|-------|-----|-------|------|-------|---|------|
| Tag (per way):  | РА    |     |       | Lock | Valid |   |      |
| Data (nor way): | 32    | 2   | 32    | 3    | 2     |   | 32   |
| Data (per way): | Wor   | rd3 | Word2 | Wo   | ord1  | W | ord0 |
| D Way-Select:   | 1-4   | 0-6 |       |      |       |   |      |
| D Way-Select.   | Dirty | LRU | ſ     |      |       |   |      |
| I Way-Select:   | 0-6   |     |       |      |       |   |      |
| i way-Select.   | LRU   |     |       |      |       |   |      |

## Figure 8.1 I-Cache and D-Cache Array Formats

Figure 8.2 shows the format of each line in the I-Cache tag, data and way-select arrays with Parity enabled.



#### Figure 8.2 I-Cache Array Formats

Figure 8.3 shows the format of each line in the D-Cache tag, data and way-select arrays with Parity enabled.

| Tag (per way):  | 1            |             |              | 2           | 2            |             |              | 1            | 1          |
|-----------------|--------------|-------------|--------------|-------------|--------------|-------------|--------------|--------------|------------|
|                 | Parity       |             | РА           |             |              |             | Lock         | Valid        |            |
| Data (per way): | 1<br>Parity1 | 5 By        | 8<br>rte15   |             | 9x<br>       |             |              | 1<br>Parity0 | 8<br>Byte0 |
| D Way-Select:   | 1<br>Parity3 | 1<br>Dirty3 | 1<br>Parity2 | 1<br>Dirty2 | 1<br>Parity1 | 1<br>Dirty1 | 1<br>Parity0 | 1<br>Dirty0  | 0-6<br>LRU |

#### Figure 8.3 D-Cache Array Formats

A tag entry consists of the upper 22 bits of the physical address (bits [31:10]), one valid bit for the line, and a lock bit. A data entry contains the four 32-bit words in the line, for a total of 16 bytes. All four words in the line are present or not in the data array together, hence the single valid bit stored with the tag. Once a valid line is resident in the cache, byte, halfword, triple-byte or full word stores can update all or a portion of the words in that line. The tag and data entries are repeated for each of the n lines in the set, per the associativity.

A way-select entry holds bits choosing the way to be replaced according to a Least Recently Used (LRU) algorithm. The LRU information applies to all the ways and there is one way-select entry for all the ways in the set. The number of bits in the way-select entry depends on the set associativity. In a direct mapped cache (n=1), there is no need for LRU bits, since fills can only go to one place only. Table 8.3 shows the number of LRU bits required as a function of associativity. The array with way-select entries for the data cache also holds dirty bit(s) for the lines. One dirty bit is required per line, as shown in Table 8.3. The instruction cache only supports reads, hence only LRU entries are stored in the instruction way-select array.

| Associativity ( <i>n</i> ) | LRU Bits | Dirty Bits (data cache only) |
|----------------------------|----------|------------------------------|
| 1                          | 0        | 1                            |
| 2                          | 1        | 2                            |

## Table 8.3 LRU and Dirty Width in Way-Select Array

MIPS32® microAptiv™ UP Processor Core Family Software User's Manual, Revision 01.02

| Associativity ( <i>n</i> ) | LRU Bits | Dirty Bits (data cache only) |
|----------------------------|----------|------------------------------|
| 3                          | 3        | 3                            |
| 4                          | 6        | 4                            |

#### Table 8.3 LRU and Dirty Width in Way-Select Array

## 8.2.2 Cacheability Attributes

The microAptiv UP core supports the following cacheability attributes:

- *Uncached*: Addresses in a memory area indicated as uncached are not read from the cache. Stores to such addresses are written directly to main memory, without changing cache contents.
- Write-back with write allocation: Loads and instruction fetches first search the cache, reading main memory only if the desired data does not reside in the cache. On data store operations, the cache is first searched to see if the target address is cache resident. If it is resident, the cache contents are updated, but main memory is not written. If the cache lookup misses on a store, main memory is read to bring the line into the cache and merge it with the new store data. Hence, the allocation policy on a cache miss is read- or write-allocate. Data stores will update the appropriate dirty bit in the way-select array to indicate that the line contains modified data. When a line with dirty data is displaced from the cache, it is written back to memory.
- Write-through with no write allocation: Loads and instruction fetches first search the cache, reading main memory only if the desired data does not reside in the cache. On data store operations, the cache is first searched to see if the target address is cache resident. If it is resident, the cache contents are updated, and main memory is also written. If the cache lookup misses on a store, only main memory is written. Hence, the allocation policy on a cache miss is read-allocate only.
- Write-through with write allocation: Loads and instruction fetches first search the cache, reading main memory only if the desired data does not reside in the cache. On data store operations, the cache is first searched to see if the target address is cache resident. If it is resident, the cache contents are updated, and main memory is also written. If the cache lookup misses on a store, main memory is read to bring the line into the cache and merge it with the new store data. In addition, the store data is also written to main memory. Hence, the allocation policy on a cache miss is read- or write-allocate.

Some segments of memory employ a fixed caching policy; for example the kseg1 is always uncacheable. Other segments of memory allow the caching policy to be selected by software. Generally, the cache policy for these programmable regions is defined by a cacheability attribute field associated with that region of memory. See Chapter 4, "Memory Management of the microAptiv<sup>TM</sup> UP Core" on page 75 for further details.

## 8.2.3 Replacement Policy

The replacement policy refers to how a way is chosen to hold an incoming cache line on a miss which will result in a cache fill, when a cache is at least two-way set associative. In a direct mapped cache (one-way set associative), the replacement policy is irrelevant since there is only one way available. The replacement policy is least recently used (LRU), but excluding any locked ways. The LRU bit(s) in the way-select array encode the order in which ways on that line have been accessed.

On a cache miss, the lock and LRU bits for the tag and way-select entries of the selected line may be used to determine the way which will be chosen. The number of lock bits and the number of LRU bits depend on the set associativity of the cache.

MIPS32® microAptiv™ UP Processor Core Family Software User's Manual, Revision 01.02

The LRU field in the way select array is updated as follows:

- On a cache hit, the associated way is updated to be the most recently used. The order of the other ways relative to each another is unchanged.
- On a cache refill, the filled way is updated to be the most recently used.
- On CACHE instructions, the update of the LRU bits depends on the type of operation to be performed:
  - Index (Writeback) Invalidate: Least-recently used.
  - Index Load Tag: No update.
  - Index Store Tag, WST=0: Most-recently used if valid bit is set in TagLo CP0 register. Least-recently used if valid bit is cleared in TagLo CP0 register.
  - Index Store Tag, WST=1: Update the field with the contents of the *TagLo* CP0 register (refer to Table 8.5, Table 8.6 or Table 8.7 for the valid values of this field).
  - Index Store Data: No update.
  - Hit Invalidate: Least-recently used if a hit is generated, otherwise unchanged.
  - Fill: Most-recently used.
  - Hit (Writeback) Invalidate: Least-recently used if a hit is generated, otherwise unchanged.
  - Hit Writeback: No update.
  - Fetch and Lock: Most-recently used.

If all ways are valid, then any locked ways will be excluded from consideration for replacement. For the unlocked ways, the LRU bits are used to identify the way which has been used least recently, and that way is selected for replacement. If all ways are locked: Fill data will not fill into the cache, and Write-back stores turn into Write-through Write-allocate stores.

If the way selected for replacement has its dirty bit asserted in the way-select array, then that 16-byte line will be written back to memory before the new fill can occur.

## 8.2.4 Virtual Aliasing

Since the caches are virtually indexed and physically tagged, a potential issue referred to as *virtual aliasing* might exist. Virtual aliasing occurs if the virtual bits used to index a cache array are not consistent with the overlapping physical bits, after the virtual address has been translated to a physical address. The possibility of virtual aliasing only occurs in address regions which are mapped through a TLB-based memory management unit, so it is only relevant when TLB option is selected for MMU implementation, otherwise it contains a fixed memory management unit.

In TLB-mapped address regions, virtual aliasing may occur if the cache size per way is greater than the page size. For example, consider a 16 KB cache organized as 2-way set associative. The size per way is then 8 KB, so virtual address bits [12:0] are used to index the array. If the address is in a translated region with a page size of 4 KB, then address bits [11:0] are untranslated but address bits [31:12] will be mapped and for these bits the virtual and physical addresses may be different. In this example, bit [12] could pose a potential problem due to virtual aliasing. Imagine two virtual addresses, VA0 and VA1, whose only difference is the value of bit [12], which map to the same physical address. These two virtual addresses would be indexed to two different lines by the cache, even though they were intended to represent the same physical address. Then if a program does a load using VA0 and a store using VA1, or vice-versa, the cache may not return the expected data.

Table 8.4 shows the overlapped virtual/physical address bits which could potentially be involved in virtual aliasing, given the possible minimum page sizes and cache way sizes supported by the microAptiv UP core. Virtual aliasing is generally only a problem for the D-cache, since stores don't happen to the I-cache. No special hardware mechanism is provided to prevent the possibility of virtual aliasing, so it must be handled by software. The software solution must ensure that the mapping of virtual address bits which overlap with physical address bits be handled consistently. The simplest approach is to ensure that the overlapping bits are unity-mapped (VA equals PA).

| Minimum Page Size<br>(KB) | Cache Way Size<br>(KB) | Overlapped address<br>bits with possible<br>aliasing |
|---------------------------|------------------------|------------------------------------------------------|
| 1                         | 2                      | [10]                                                 |
|                           | 4                      | [11:10]                                              |
|                           | 8                      | [12:10]                                              |
|                           | 16                     | [13:10]                                              |
| 4                         | 8                      | [12]                                                 |
|                           | 16                     | [13:12]                                              |
| 8                         | 16                     | [13]                                                 |

**Table 8.4 Potential Virtual Aliasing Bits** 

A related issue can occur in virtually indexed, physically tagged caches if the number of physical bits stored in the tag array do not fully overlap the physically translated bits for the smallest page size. For the microAptiv UP core, there are always 22 address bits stored in the cache tag, representing bits [31:10] of the physical address. Since the minimum page size is 1 KB for the microAptiv UP, with bits [31:10] physically translated by the TLB, the cache tag size does overlap the translated bits and this issue will not occur.

## 8.3 Instruction Cache

The instruction cache (I-cache) is an optional on-chip memory block of up to 64 KB. The virtually indexed, physically tagged cache allows the virtual-to-physical address translation to occur in parallel with the cache access rather than having to wait for the physical address translation.

The core supports instruction cache locking. Cache locking allows critical code or data segments to be locked into the cache on a "per-line" basis, enabling the system programmer to maximize the efficiency of the system cache.

The cache locking function is always enabled on all instruction cache entries. Entries can then be marked as locked or unlocked on a per entry basis using the CACHE instruction.

## 8.4 Data Cache

The data cache (D-cache) is an optional on-chip memory block of up to 64 KB. The virtually indexed, physically tagged cache allows the virtual-to-physical address translation to occur in parallel with the cache access rather than having to wait for the physical address translation.

The core also supports a data cache locking mechanism identical to the instruction cache. Critical data segments to be locked into the cache on a "per-line" basis. The locked contents can be updated on a store hit, but cannot be selected for replacement on a miss.

MIPS32® microAptiv™ UP Processor Core Family Software User's Manual, Revision 01.02

The cache locking function is always enabled on all data cache entries. Entries can then be marked as locked or unlocked on a per entry basis using the CACHE instruction.

## 8.5 CACHE Instruction

Both caches support the CACHE instructions, which allow users to manipulate the contents of the Data and Tag arrays, including the locking of individual cache lines. Note that before the CACHE instructions are allowed to execute, all initiated refills are completed and stores are sent to the write buffer. The CACHE instructions are described in detail in Chapter 12, "microAptiv<sup>TM</sup> UP Processor Core Instructions" on page 310.

The CACHE Index Load Tag and Index Store Tag instructions can be used to read and write the WS- RAM by setting the *WST* bit in the *ErrCtl* register. (The *ErrCtl* register is described in Section 6.2.48, "ErrCtl Register (CP0 Register 26, Select 0)" on page 207.) Note that when the *WST* bit is zero, the CACHE index instructions access the cache Tag array.

Not all values of the WS field are valid for defining the order in which the ways are selected. This is only an issue, however, if the WS-RAM is written after the initialization (invalidation) of the Tag array. Valid WS field encodings for way selection order is shown in Table 8.5, Table 8.6, and Table 8.7.

| Selection Order <sup>1</sup> | WS[5:0] | Selection Order | WS[5:0] |
|------------------------------|---------|-----------------|---------|
| 0123                         | 000000  | 2013            | 100010  |
| 0132                         | 000001  | 2031            | 110010  |
| 0213                         | 000010  | 2103            | 100110  |
| 0231                         | 010010  | 2130            | 101110  |
| 0312                         | 010001  | 2301            | 111010  |
| 0321                         | 010011  | 2310            | 111110  |
| 1023                         | 000100  | 3012            | 011001  |
| 1032                         | 000101  | 3021            | 011011  |
| 1203                         | 100100  | 3102            | 011101  |
| 1230                         | 101100  | 3120            | 111101  |
| 1302                         | 001101  | 3201            | 111011  |
| 1320                         | 101101  | 3210            | 111111  |

Table 8.5 Way Selection Encoding, 4 Ways

1. The order is indicated by listing the least-recently used way to the left and the most-recently used way to the right, etc.

| Table 8.6 Wa | y Selection | Encoding, | 3 Ways |
|--------------|-------------|-----------|--------|
|--------------|-------------|-----------|--------|

| Selection Order <sup>1</sup> | WS[5:0] <sup>2</sup> | Selection Order | WS[5:0] |
|------------------------------|----------------------|-----------------|---------|
| 012                          | 0xx00x               | 120             | 1xx10x  |
| 021                          | 0xx01x               | 201             | 1xx01x  |
| 102                          | 0xx10x               | 210             | 1xx11x  |

- 1. The order is indicated by listing the least-recently used way to the left and the most-recently used way to the right, etc.
- 2. A "?" indicates a don't care when written and unpredictable when read.

| Selection Order <sup>1</sup> | WS[5:0] <sup>2</sup> | Selection Order | WS[5:0] |
|------------------------------|----------------------|-----------------|---------|
| 01                           | xxx0xx               | 10              | xxx1xx  |

## Table 8.7 Way Selection Encoding, 2 Ways

1. The order is indicated by listing the least-recently used way to the left and the most-recently used way to the right, etc.

2. A "?" indicates a don't care when written and unpredictable when read.

## 8.6 Software Cache Testing

Typically, the cache RAM arrays will be tested using BIST. It is, however, possible for software running on the processor to test all of the arrays. Of course, testing of the I-cache arrays should be done from an uncacheable space with interrupts disabled in order to maintain the cache contents. There are multiple methods for testing these arrays in software, only one is presented here.

## 8.6.1 I-Cache/D-cache Tag Arrays

These arrays can be tested via the Index Load Tag and Index Store Tag varieties of the CACHE instruction. Index Store Tag will write the contents of the *TagLo* register into the selected tag entry. Index Load Tag will read the selected tag entry into the *TagLo*.

## 8.6.2 I-Cache Data Array

This array can be tested using the Index Store Data and Index Load Tag varieties of the CACHE instruction. The Index Store Data variety is enabled by setting the *WST* bit in the *ErrCtl* register.

The precode bits in the array can be tested by setting the *PCO* bit in the *ErrCtl* register. This will write the *PCl* field in the *ErrCtl* register instead of calculating the precode bits on a write.

The parity bits in the array can be tested by setting the *PO* bit in the *ErrCtl* register. This will use the *Pl* field in *ErrCtl* instead of calculating the parity on a write.

The rest of the data bits are read/written to/from the DataLo and DataHi registers.

## 8.6.3 I-Cache WS Array

The testing of this array is very similar to the testing of the tag array. By setting the *WST* bit in the *ErrCtl* register, Index Load Tag and Index Store Tag CACHE instructions will read and write the WS array instead of the tag array.

## 8.6.4 D-Cache Data Array

This array can be tested using the Index Store Tag CACHE, SW, and LW instructions. First, use Index Store Tag to set the initial state of the tags to valid with a known physical address (PA). Write the array using SW instructions to the PAs that are resident in the cache. The value can then be read using LW instructions and compared to the expected data.

MIPS32® microAptiv™ UP Processor Core Family Software User's Manual, Revision 01.02

## 8.6.5 D-cache WS Array

The dirty bits in this array will be tested when the data tag is tested. The LRU bits can be tested using the same mechanism as the I-cache WS array.

## 8.7 Memory Coherence Issues

A cache presents coherency issues within the memory hierarchy which must be considered in the system design. Since a cache holds a copy of memory data, it is possible for another memory master to modify a memory location, thus making other copies of that location stale if those copies are still in use. A detailed discussion of memory coherence is beyond the scope of this document, but following are a few related comments.

The microAptiv UP processor contains no direct hardware support for managing coherency with respect to its caches, so it must be handled via system design or software. The microAptiv UP data cache supports either write-back or write-through protocols.

In write-through mode, all data writes will eventually be sent to memory. Due to write buffers, however, there could be a delay in how long it takes for the write to memory to actually occur. If another memory master updates cacheable memory which could also be in the microAptiv UP caches, then those locations may need to be flushed from the cache. The only way to accomplish this invalidation is by use of the CACHE instruction.

In write-back mode, data writes only go to the cache and not to memory. So the processor cache may contain the *only* copy of data in the system until that data is written to main memory. Dirty lines are only written to memory when displaced from the cache as a new line is filled or if explicitly forced by certain flavors of the CACHE or PREF instructions.

The SYNC instruction may also be useful to software enforcing memory coherence, as it flushes the microAptiv UP core's write buffers.

Chapter 9

# Power Management of the microAptiv<sup>™</sup> UP Core

The microAptiv UP processor core offers a number of power management features, including low-power design, active power management and power-down modes of operation. The core is a static design that supports a WAIT instruction designed to signal the rest of the device that execution and clocking should be halted, reducing system power consumption during idle periods.

The core provides two mechanisms for system level low-power support discussed in the following sections.

- Section 9.1 "Register-Controlled Power Management"
- Section 9.2 "Instruction-Controlled Power Management"

## 9.1 Register-Controlled Power Management

The *RP* bit in the CP0 *Status* register enables a standard software mechanism for placing the system into a low power state. The state of the RP bit is available externally via the *SI\_RP* output signal. Three additional pins, *SI\_EXL*, *SI\_ERL*, and *EJ\_DebugM* support the power management function by allowing the user to change the power state if an exception or error occurs while the core is in a low power state.

Setting the *RP* bit of the CP0 *Status* register causes the core to assert the *SI\_RP* signal. The external agent can then decide whether to reduce the clock frequency and place the core into power down mode.

If an interrupt is taken while the device is in power down mode, that interrupt may need to be serviced depending on the needs of the application. The interrupt causes an exception which in turn causes the *EXL* bit to be set. The setting of the *EXL* bit causes the assertion of the *SI\_EXL* signal on the external bus, indicating to the external agent that an interrupt has occurred. At this time the external agent can choose to either speed up the clocks and service the interrupt or let it be serviced at the lower clock speed.

The setting of the *ERL* bit causes the assertion of the *SI\_ERL* signal on the external bus, indicating to the external agent that an error has occurred. At this time the external agent can choose to either speed up the clocks and service the error or let it be serviced at the lower clock speed.

Similarly, the *EJ\_DebugM* signal indicates that the processor is in debug mode. Debug mode is entered when the processor takes a debug exception. If fast handling of this is desired, the external agent can speed up the clocks.

The core provides four power down signals that are part of the system interface. Three of the pins change state as the corresponding bits in the CP0 *Status* register are set or cleared. The fourth pin indicates that the processor is in debug mode:

- The SI\_RP signal represents the state of the RP bit (27) in the CP0 Status register.
- The SI\_EXL signal represents the state of the EXL bit (1) in the CP0 Status register.
- The SI\_ERL signal represents the state of the ERL bit (2) in the CP0 Status register.

• The *EJ\_DebugM* signal indicates that the processor has entered debug mode.

## 9.2 Instruction-Controlled Power Management

The second mechanism for invoking power down mode is through execution of the WAIT instruction. If the bus is idle at the time the WAIT instruction reaches the M stage of the pipeline the internal clocks are suspended and the pipeline is frozen. However, the internal timer and some of the input pins (*Sl\_Int*[5:0], *Sl\_NMI*, *Sl\_Reset*, *Sl\_ColdReset*, and *EJ\_DINT*) continue to run. If the bus is not idle at the time the WAIT instruction reaches the M stage, the pipeline stalls until the bus becomes idle, at which time the clocks are stopped. When the CPU is in instruction controlled power management mode, any enabled interrupt, NMI, debug interrupt, or reset condition causes the CPU to exit this mode and resume normal operation. While the part is in this low-power mode, the *Sl\_SLEEP* signal is asserted to indicate to external agents what the state of the chip is.

## Chapter 10

# EJTAG Debug Support in the microAptiv<sup>™</sup> UP Core

The EJTAG debug logic in the microAptiv UP processor core provides three optional modules:

- 1. Hardware breakpoints
- 2. Test Access Port (TAP) for a dedicated connection to a debug host
- 3. Tracing of program counter/data address/data value trace to On-chip memory or to a Trace probe

These features are covered in the following sections:

- Section 10.1 "Debug Control Register"
- Section 10.2 "Hardware Breakpoints"
- Section 10.3 "Complex Breakpoint Usage"
- Section 10.4 "Test Access Port (TAP)"
- Section 10.5 "EJTAG TAP Registers"
- Section 10.6 "TAP Processor Accesses"
- Section 10.7 "SecureDebug"
- Section 10.8 "iFlowtrace<sup>™</sup> Mechanism"
- Section 10.9 "PC/Data Address Sampling"
- Section 10.10 "Fast Debug Channel"
- Section 10.11 "cJTAG Interface"

## **10.1 Debug Control Register**

The *Debug Control Register* (*DCR*) register controls and provides information about debug issues, and is always provided with the CPU core. The register is memory-mapped in drseg at offset 0x0.

The DataBrk and InstBrk bits indicate if hardware breakpoints are included in the implementation, and debug software is expected to read hardware breakpoint registers for additional information.

Hardware and software interrupts are maskable for non-debug mode with the *INTE* bit, which works in addition to the other mechanisms for interrupt masking and enabling. NMI is maskable in non-debug mode with the *NMIE* bit, and a pending NMI is indicated through the *NMIP* bit.

The *SRE* bit allows implementation-dependent masking of none, some or all sources for soft reset. The soft reset masking may only be applied to a soft reset source if that source can be efficiently masked in the system, thus resulting in no reset at all. If that is not possible, then that soft reset source should not be masked, since a partial soft reset may cause the system to fail or hang. There is no automatic indication of whether the SRE is effective, so the user must consult system documentation.

The *PE* bit reflects the *ProbEn* bit from the *EJTAG Control* register (*ECR*), whereby the probe can indicate to the debug software running on the CPU if the probe expects to service dmseg accesses. The reset value in the table below takes effect on both hard and soft resets.

| 31                     | 30  | 29  | 28 | 27        | 26   | 25           | 24   | 23   | 22  | 21   | 20   | 19   | 18          | 17          | 16          |
|------------------------|-----|-----|----|-----------|------|--------------|------|------|-----|------|------|------|-------------|-------------|-------------|
| EJTAG_Brk_<br>Override | 0   | ENM | (  | )         | PCIM | PCno<br>ASID | DASQ | DASe | DAS |      | 0    |      | FDC<br>Impl | Data<br>Brk | Inst<br>Brk |
| 15                     | 14  | 13  | 12 | 11        | 10   | 9            | 8    | 7    | 6   | 5    | 4    | 3    | 2           | 1           | 0           |
| IVM                    | DVM | C   | )  | RD<br>Vec | CBT  | PCS          |      | PCR  |     | PCSe | IntE | NMIE | NMI<br>pend | SRstE       | Prob<br>En  |

#### Figure 10.1 DCR Register Format

| Fields                 |      |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | Read /                                                                                               | Reset             |
|------------------------|------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------|-------------------|
| Name                   | Bits | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              | Write                                                                                                | State             |
| EJTAG_Brk<br>_Override | 31   | <ul> <li>Override <i>EjtagBrk</i> and <i>DINT</i> disable. Refer to Section 10.7 "SecureDebug".</li> <li>Re-enable <i>EjtagBrk</i> and <i>DINT</i> signal during boot.</li> <li>Allows <i>EjtagBrk</i> to be asserted by an EJTAG probe (or assertion of <i>DINT</i> signal), resulting in a request for a Debug Interrupt exception from the processor. This provides a means of recovering the cpu from crash, hang, loop or low-power mode.</li> <li>This feature can allow a Debug Executive to communicate with the probe over the Fast Debug Channel (FDC) and provides a host-based debugger the ability to query the target processor via Debug Executive commands, useful for determining cause of hang.</li> <li>Software can write this bit and read back to determine if the Secure Debug feature is implemented.</li> </ul> | R/W<br>If not<br>imple-<br>mented,<br>must be<br>written as<br>zero;<br>return<br>zeros on<br>reads. | 0                 |
| ENM                    | 29   | Endianess in which the processor is running in kernel<br>and Debug Mode:<br>Encoding Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             | R                                                                                                    | Externally<br>Set |
|                        |      | 0     Little endian       1     Big endian                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |                                                                                                      |                   |

#### Table 10.1 DCR Register Field Descriptions

| Field    | S    |                                                       |                                                                                                                 | Read / | Reset |
|----------|------|-------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------|--------|-------|
| Name     | Bits | _                                                     | Write                                                                                                           | State  |       |
| PCIM     | 26   | addresses or only th                                  | pling to capture all executed<br>nose that miss the instruction cache<br>supported and this bit will read as 0. | R      | 0     |
|          |      | Encoding                                              | Meaning                                                                                                         |        |       |
|          |      | 0 All I                                               | PCs captured                                                                                                    |        |       |
|          |      | 1 Capt                                                | ture only PCs that miss the cache.                                                                              |        |       |
| PCnoASID | 25   | or omits the ASID                                     | ne PCSAMPLE scan chain includes<br>field<br>included, so this bit will read as 0.                               | R      | 0     |
|          |      | Encoding                                              | Meaning                                                                                                         |        |       |
|          |      |                                                       | D included in PCSAMPLE scan                                                                                     |        |       |
|          |      | 1 ASI                                                 | D omitted from PCSAMPLE scan                                                                                    |        |       |
| DASQ     | 24   | point.                                                | ress Sampling using a data break-<br>ing is not supported so this bit will                                      | R      | 0     |
|          |      | Encoding                                              | Meaning                                                                                                         |        |       |
|          |      |                                                       | data addresses are sampled                                                                                      |        |       |
|          |      | 1 Sam                                                 | ple matches of data breakpoint 0                                                                                |        |       |
| DASe     | 23   | Enables Data Addre<br>Data address sampl<br>read as 0 | ess Sampling<br>ing is not supported so this bit will                                                           | R      | 0     |
|          |      | Encoding                                              | Meaning                                                                                                         |        |       |
|          |      |                                                       | a Address sampling disabled.                                                                                    |        |       |
|          |      | 1 Data                                                | a Address sampling enabled.                                                                                     |        |       |
| DAS 22   |      | mented.                                               | a Address Sampling feature is imple-<br>ing is not supported so this bit will                                   | R      | 0     |
|          |      | Encoding                                              | Meaning                                                                                                         |        |       |
|          |      |                                                       | DA Sampling implemented                                                                                         |        |       |
|          |      | 1 DA                                                  | Sampling implemented                                                                                            |        |       |
| FDCImpl  | 18   | Indicates if the fast                                 | debug channel is implemented                                                                                    | R      | 1     |
|          |      | Encoding                                              | Meaning                                                                                                         |        |       |
|          |      | 0 No f                                                | fast debug channel implemented                                                                                  |        |       |
|          |      | 1 Fast                                                | debug channel implemented                                                                                       |        |       |

| Fields  |      |                                    | Deed (                                                                                                                                | Deset           |                |
|---------|------|------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------|-----------------|----------------|
| Name    | Bits |                                    | Description                                                                                                                           | Read /<br>Write | Reset<br>State |
| DataBrk | 17   | Indicates if da                    | R                                                                                                                                     | Preset          |                |
|         |      | Encoding                           | Meaning                                                                                                                               |                 |                |
|         |      | 0                                  | No data hardware breakpoint imple-<br>mented                                                                                          |                 |                |
|         |      | 1                                  | Data hardware breakpoint imple-<br>mented                                                                                             |                 |                |
| InstBrk | 16   | Indicates if ins<br>mented:        | struction hardware breakpoint is imple-                                                                                               | R               | Preset         |
|         |      | Encoding                           | Meaning                                                                                                                               |                 |                |
|         |      | 0                                  | No instruction hardware breakpoint implemented                                                                                        |                 |                |
|         |      | 1                                  | Instruction hardware breakpoint implemented                                                                                           |                 |                |
| IVM     | 15   | Indicates if in<br>breakpoints is  | verted data value match on data hardware implemented:                                                                                 | R               | Preset         |
|         |      | Encoding                           | _                                                                                                                                     |                 |                |
|         |      | 0                                  | No inverted data value match on data<br>hardware breakpoints implemented                                                              |                 |                |
|         |      | 1                                  | Inverted data value match on data<br>hardware breakpoints implemented                                                                 |                 |                |
| DVM     | 14   | Indicates if a c<br>match is imple | lata value store on a data value breakpoint emented:                                                                                  | R               | Preset         |
|         |      | Encoding                           | Meaning                                                                                                                               |                 |                |
|         |      | 0                                  | No data value store on a data value<br>breakpoint match implemented                                                                   |                 |                |
|         |      | 1                                  | Data value store on a data value break-<br>point match implemented                                                                    |                 |                |
| RDVec   | 11   | value in the D                     | ation of the debug exception vector. The<br>ebugVectorAddr register is used for<br>tions when <i>ProbTrap</i> =0,and <i>RDVec</i> =1. | R/W             | 0              |
| CBT     | 10   | Indicates if co                    | mplex breakpoint block is implemented:                                                                                                | R               | Preset         |
|         |      | Encoding                           | Meaning                                                                                                                               |                 |                |
|         |      | 0                                  | No complex breakpoint block imple-<br>mented                                                                                          |                 |                |
|         |      | 1                                  | Complex breakpoint block imple-<br>mented                                                                                             |                 |                |

## Table 10.1 DCR Register Field Descriptions (Continued)

| Fields    |     |                                                                                                                                                                                                                                                                                         | Read / | Depat          |
|-----------|-----|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------|----------------|
| Name Bits |     | Description                                                                                                                                                                                                                                                                             | Write  | Reset<br>State |
| PCS       | 9   | Indicates if the PC Sampling feature is implemented.                                                                                                                                                                                                                                    | R      | 1              |
|           |     | Encoding Meaning                                                                                                                                                                                                                                                                        |        |                |
|           |     | 0 No PC Sampling implemented                                                                                                                                                                                                                                                            |        |                |
|           |     | 1 PC Sampling implemented                                                                                                                                                                                                                                                               |        |                |
| PCR       | 8:6 | PC Sampling rate. Values 0 to 7 map to values $2^5$ to $2^{12}$ cycles, respectively. That is, a PC sample is written out every 32, 64, 128, 256, 512, 1024, 2048, or 4096 cycles respectively. The external probe or software is allowed to set this value to the desired sample rate. | R/W    | 7              |
| PCSe      | 5   | If the PC sampling feature is implemented, then indi-<br>cates whether PC sampling is initiated or not. That is, a<br>value of 0 indicates that PC sampling is not enabled and<br>when the bit value is 1, then PC sampling is enabled and<br>the counters are operational.             | R/W    | 0              |
| IntE      | 4   | Hardware and software interrupt enable for Non-Debug<br>Mode, in conjunction with other disable mechanisms:                                                                                                                                                                             | R/W    | 1              |
|           |     | Encoding Meaning                                                                                                                                                                                                                                                                        |        |                |
|           |     | 0 Interrupt disabled                                                                                                                                                                                                                                                                    |        |                |
|           |     | 1 Interrupt enabled depending on other<br>enabling mechanisms                                                                                                                                                                                                                           |        |                |
| NMIE      | 3   | Non-Maskable Interrupt (NMI) enable for Non-Debug<br>Mode:                                                                                                                                                                                                                              | R/W    | 1              |
|           |     | Encoding Meaning                                                                                                                                                                                                                                                                        |        |                |
|           |     | 0 NMI disabled                                                                                                                                                                                                                                                                          |        |                |
|           |     | 1 NMI enabled                                                                                                                                                                                                                                                                           |        |                |
| NMIpend   | 2   | Indication for pending NMI:                                                                                                                                                                                                                                                             | R      | 0              |
|           |     | Encoding Meaning                                                                                                                                                                                                                                                                        |        |                |
|           |     | 0 No NMI pending                                                                                                                                                                                                                                                                        |        |                |
|           |     | 1 NMI pending                                                                                                                                                                                                                                                                           |        |                |
| SRstE     | 1   | Soft Reset Enable<br>This bit allows the system to mask soft resets. The core<br>does not internally mask soft resets. Rather the state of<br>this bit appears on the <i>EJ_SRstE</i> external output signal,<br>allowing the system to mask soft resets if desired.                    | R/W    | 1              |

| Table 10.1 | DCR Register | Field Descriptions | (Continued) |
|------------|--------------|--------------------|-------------|
|------------|--------------|--------------------|-------------|

| Fields   |                               |                                                                                             |                                                        | Read / | Reset                                           |
|----------|-------------------------------|---------------------------------------------------------------------------------------------|--------------------------------------------------------|--------|-------------------------------------------------|
| Name     | Bits                          |                                                                                             | Description                                            |        | State                                           |
| ProbEn 0 |                               | Probe Enable. This bit reflects the <i>ProbEn</i> bit in the <i>EJTAG Control</i> register: |                                                        | R      | Same value<br>as <i>ProbEn</i><br>in <i>ECR</i> |
|          |                               | Encoding                                                                                    | Meaning                                                |        | (see Table                                      |
|          |                               | 0                                                                                           | No accesses to dmseg allowed                           |        | 9-4)                                            |
|          |                               | 1                                                                                           | Accesses to dmseg by EJTAG probe ser-<br>vices allowed |        |                                                 |
| 0        | 30, 28:27,<br>21:19,<br>13:12 | Must be writt                                                                               | Must be written as zeros; return zeros on reads.       |        | 0                                               |

## Table 10.1 DCR Register Field Descriptions (Continued)

## **10.2 Hardware Breakpoints**

Hardware breakpoints provide for the comparison by hardware of executed instructions and data load/store transactions. It is possible to set instruction breakpoints on addresses even in ROM area. Data breakpoints can be set to cause a debug exception on a specific data transaction. Instruction and data hardware breakpoints are alike for many aspects, and are thus described in parallel in the following. The term hardware is not generally added to breakpoint, unless required to distinguish it from a software breakpoint.

There are two types of simple hardware breakpoints implemented in the microAptiv UP core: Instruction breakpoints and Data breakpoints. The microAptiv UP core may also contain a complex breakpoint unit.

A core may be configured with the following breakpoint options:

- No data or instruction breakpoints, without complex break support
- Two instruction and one data breakpoint, without complex break support
- Four instruction and two data breakpoints, without complex break support
- Six instruction and two data breakpoints, without support for complex breaks
- Six instruction and two data breakpoints, with support for complex breaks
- Eight instruction and four data breakpoints, without support for complex breaks
- Eight instruction and four data breakpoints, with support for complex breaks

Instruction breaks occur on instruction fetch operations, and the break is set on the virtual address on the bus between the CPU and the instruction cache. Instruction breaks can also be made on the ASID value used by the MMU. Finally, a mask can be applied to the virtual address to set breakpoints on a range of instructions.

Instruction breakpoints compare the virtual address of the executed instructions (the value of PC) and the ASID with the registers for each instruction breakpoint, including masking of address and ASID. When an instruction breakpoint

matches, a debug exception and/or a trigger is generated. An internal bit in the instruction breakpoint registers is set to indicate that the match occurred.

## 10.2.1 Data Breakpoints

Data breakpoints occur on load/store transactions. Breakpoints are set on virtual address and ASID values, similar to the Instruction breakpoint. Data breakpoints can be set on a load, a store, or both. Data breakpoints can also be set based on the value of the load/store operation. Finally, masks can be applied to both the virtual address and the load/store value.

Data breakpoints compare the transaction type (TYPE), which may be load or store, the virtual address of the transaction (ADDR), the ASID, accessed bytes (BYTELANE) and data value (DATA), with the registers for each data breakpoint including masking or qualification on the transaction properties. When a data breakpoint matches, a debug exception and/or a trigger is generated, and an internal bit in the data breakpoint registers is set to indicate that the match occurred. The match is precise in that the debug exception or trigger occurs on the instruction that caused the breakpoint to match.

## **10.2.2 Complex Breakpoints**

The complex breakpoint unit utilizes the instruction and data breakpoint hardware and looks for more specific matching conditions. There are several different types of enabling that allow more exact breakpoint specification. Tuples add an additional condition to data breakpoints of requiring an instruction breakpoint on the same instructions. Pass counters are counters that decrement each time a matching breakpoint condition is taken. When the counter reaches 0, the break or trigger effect of the breakpoint is enabled. Priming allows a breakpoint to only be enabled when another trigger condition has been detected. Data qualification allows instruction breakpoints to only be enabled when a corresponding load data triggerpoint has matched both address and data. Data qualified breakpoints are also disabled if a load is executed that matches on the address portion of the triggerpoint, but has a mismatching data value. The complex breakpoint features can be combined to create very complex sequences to match on.

In addition to the breakpoint logic, the complex break unit also includes a Stopwatch Timer block. This counter can be used to measure time spent in various sections. It can either be free-running, or it can be set up to start and stop counting based on a trigger from instruction breakpoints.

## **10.2.3 Conditions for Matching Breakpoints**

A number of conditions must be fulfilled in order for a breakpoint to match on an executed instruction or a data transaction, and the conditions for matching instruction and data breakpoints are described below. The breakpoints only match for instructions executed in non-debug mode, thus never on instructions executed in debug mode.

The match of an enabled breakpoint can either generate a debug exception or a trigger indication. The *BE* and/or *TE* bits in the *IBCn* or *DBCn* registers are used to enable the breakpoints.

Debug software should not configure breakpoints to compare on an ASID value unless a TLB is present in the implementation.

#### 10.2.3.1 Conditions for Matching Instruction Breakpoints

There are two methods for matching conditions:, Equality and Mask or Address Range.

#### Equality and Mask

When an instruction breakpoint is enabled, that breakpoint is evaluated for the address of every executed instruction in non-debug mode, including execution of instructions at an address causing an address error on an instruction fetch. The breakpoint is not evaluated on instructions from a speculative fetch or execution, nor for addresses which are unaligned with an executed instruction.

A breakpoint match depends on the virtual address of the executed instruction (PC) which can be masked at bit level, and match also can include an optional compare of ASID value. The registers for each instruction breakpoint have the values and mask used in the compare, and the equation that determines the match is shown below in C-like notation.

```
\label{eq:lb_match} \begin{array}{l} \text{IB\_match} = & ( & ! & \textit{IBCn}_{\text{ASIDuse}} & | & | & ( & \text{ASID} == & \textit{IBASIDn}_{\text{ASID}} & ) & ) & \& \\ & ( & < \text{all 1's>} == & ( & \textit{IBMn}_{\text{IBM}} & | & ~ ( & \text{PC} & \textit{IBAn}_{\text{IBA}} & ) & ) \end{array}
```

The match indication for instruction breakpoints is always precise, i.e. indicated on the instruction causing the IB\_match to be true.

#### Address Range

Cores may optionally support the address range triggered instruction breakpoints. When this feature is configured, the following changes are made to the instruction breakpoint registers:

- *IBAn* : represents the upper limit of a address range boundary
- IBMn : represents the lower limit of the address range boundary

In addition, the following bits must be supported:

*IBCn[6].hwarts* : a preset value of 1 indicates that the address range triggered instruction breakpoint feature is supported for this particular instruction breakpoint channel. This bit is read-only.

*IBCn*[5].excl: a value of 0 indicates that the breakpoint will match for addresses inclusive (within) the range defined by *IBMn* and *IBAn*. A value of 1 indicates that the breakpoint will match for addresses exclusive (outside) to the range defined by *IBMn* and *IBAn*. This bit is writeable.

*IBCn[4].hwart* : a value of 0 indicates that the breakpoint will match using the "Equality and Mask" equation as found in Section 10.2.3.1 "Conditions for Matching Instruction Breakpoints". A value of 1 indicates that the breakpoint will match using address ranges using the equation below:

```
IB_match =
(!IBCnTCuse || ( TC == IBCnTC ) ) &&
( ! IBCnASIDuse || ( ASID == IBASIDnASID ) ) &&
( (~IBCnhwarts || ~IBCnhwart) &&
        (( IBMnIBM | ~ ( PC ^ IBAnIBA ) ) == ~0) ||
        (( IBCnhwarts && IBCnhwart) &&
        ((~IBCnhwarts && IBCnhwart) &&
        ((~IBCnexcl && (IBM <= PC <= IBA)) ||
        ( IBCnexcl && (IBM > PC || PC > IBA)
)
```

Or if microMIPS is supported:

```
IB_range_match =
(!IBCnTCuse || ( TC == IBCnTC ) ) &&
( ! IBCnASIDuse || ( ASID == IBASIDnASID ) ) &&
```

Also note that addresses that overlap a boundary is considered for both exclusive and inclusive breakpoint matches.

#### 10.2.3.2 Conditions for Matching Data Breakpoints

There are two methods for matching conditions, namely 1) by Equality and Mask or 2) by Address Range:

#### Equality and Mask

When a data breakpoint is enabled, that breakpoint is evaluated for every data transaction due to a load/store instruction executed in non-debug mode, including load/store for coprocessor, and transactions causing an address error on data access. The breakpoint is not evaluated due to a PREF instruction or other transactions which are not part of explicit load/store transactions in the execution flow, nor for addresses which are not the explicit load/store source or destination address.

A breakpoint match depends on the transaction type (TYPE) as load or store, the address, and optionally the data value of a transaction. The registers for each data breakpoint have the values and mask used in the compare, and the equation that determines the match is shown below in C-like notation.

The overall match equation is the DB\_match.

```
DB_match =
    ( ( TYPE == load ) && ! DBCn<sub>NoLB</sub> ) ||
    ( ( TYPE == store ) && ! DBCn<sub>NoSB</sub> ) ) &&
    DB_addr_match && ( DB_no_value_compare || DB_value_match )
```

The match on the address part, DB\_addr\_match, depends on the virtual address of the transaction (ADDR), the ASID value, and the accessed bytes (BYTELANE) where BYTELANE[0] is 1 only if the byte at bits [7:0] on the bus is accessed, and BYTELANE[1] is 1 only if the byte at bits [15:8] is accessed, etc. The DB\_addr\_match is shown below.

```
DB_addr_match =
    ( ! DBCn<sub>ASIDuse</sub> || ( ASID == DBASIDn<sub>ASID</sub> ) ) &&
    ( <all 1's> == ( DBMn<sub>DBM</sub> | ~ ( ADDR ^ DBAn<sub>DBA</sub> ) ) ) &&
    ( <all 0's> != ( ~ BAI & BYTELANE ) )
```

The size of *DBCn*<sub>BAI</sub> and BYTELANE is 4 bits.

Data value compare is included in the match condition for the data breakpoint depending on the bytes (BYTELANE as described above) accessed by the transaction, and the contents of breakpoint registers. The DB\_no\_value\_compare is shown below.

```
DB_no_value_compare = ( <all 1's> == ( DBCn_{BLM} | DBCn_{BAI} | ~ BYTELANE ) )
```

The size of  $DBCn_{BLM}$ ,  $DBCn_{BAI}$  and BYTELANE is 4 bits.

In case a data value compare is required, DB\_no\_value\_compare is false, then the data value from the data bus (DATA) is compared and masked with the registers for the data breakpoint. The *DBCIVM* bit inverts the sense of the match - if set, the value match term will be high if the data value is not the same as the data in the *DBVn* register. The endianess is not considered in these match equations for value, as the compare uses the data bus value directly, thus debug software is responsible for setup of the breakpoint corresponding with endianess.

```
DB_value_match =

DBCnIVM ^

(((DATA[7:0] == DBVn<sub>DBV[7:0]</sub>) || ! BYTELANE[0] || DBCn<sub>BLM[0]</sub> || DBCn<sub>BAI[0]</sub>) &&

((DATA[15:8] == DBVn<sub>DBV[15:8]</sub>) || ! BYTELANE[1] || DBCn<sub>BLM[1]</sub> || DBCn<sub>BAI[1]</sub>) &&

((DATA[23:16] == DBVn<sub>DBV[23:16]</sub>) || ! BYTELANE[2] || DBCn<sub>BLM[2]</sub> || DBCn<sub>BAI[2]</sub>) &&

((DATA[31:24] == DBVn<sub>DBV[31:24]</sub>) || ! BYTELANE[3] || DBCn<sub>BLM[3]</sub> || DBCn<sub>BAI[3]</sub>))
```

The match for a data breakpoint is always precise, since the match expression is fully evaluated at the time the load/store instruction is executed. A true DB\_match can thereby be indicated on the very same instruction causing the DB\_match to be true.

#### Address Range

Cores may optionally support the address range triggered data breakpoints. When this feature is configured, the following changes are made to the data breakpoint registers:

- DBAn : represents the upper limit of a address range boundary
- *DBMn* : represents the lower limit of the address range boundary

In addition, the following bits must be supported:

*DBCn*[10].*hwarts*: a preset value of 1 indicates that the address range triggered data breakpoint feature is supported for this particular data breakpoint channel. This bit is read-only.

*DBCn*[9].*exc*: a value of 0 indicates that the breakpoint will match for addresses inclusive (within) the range defined by *DBMn* and *DBAn*. A value of 1 indicates that the breakpoint will match for addresses exclusive (outside) to the range defined by *DBMn* and *DBAn*. This bit is writeable.

*DBCn*[8].*hwart*: a value of 0 indicates that the breakpoint will match using the "Equality and Mask" equation as found in Section 10.2.3.2 "Conditions for Matching Data Breakpoints". A value of 1 indicates that the breakpoint will match using address ranges using the equation below:

When address range triggered data breakpoints is enabled, *DBCn.BLM*[3:0] must be set to 4'b1111 because value matching is not supported with this feature. Addresses that overlap a boundary is considered for both exclusive and inclusive breakpoint matches.

## **10.2.4 Debug Exceptions from Breakpoints**

Instruction and data breakpoints may be set up to generate a debug exception when the match condition is true, as described below.

## 10.2.4.1 Debug Exception by Instruction Breakpoint

If the breakpoint is enabled by BE bit in the *IBCn* register, then a debug instruction break exception occurs if the IB\_match equation is true. The corresponding BS[n] bit in the *IBS* register is set when the breakpoint generates the debug exception.

The debug instruction break exception is always precise, so the *DEPC* register and *DBD* bit in the *Debug* register point to the instruction that caused the IB\_match equation to be true.

The instruction receiving the debug exception does not update any registers due to the instruction, nor does any load or store by that instruction occur. Thus a debug exception from a data breakpoint can not occur for instructions receiving a debug instruction break exception.

The debug handler usually returns to the instruction causing the debug instruction break exception, whereby the instruction is executed. Debug software is responsible for disabling the breakpoint when returning to the instruction, otherwise the debug instruction break exception reoccurs.

## 10.2.4.2 Debug Exception by Data Breakpoint

If the breakpoint is enabled by BE bit in the DBCn register, then a debug exception occurs when the DB\_match condition is true. The corresponding BS[n] bit in the DBS register is set when the breakpoint generates the debug exception.

A debug data break exception occurs when a data breakpoint indicates a match. In this case the *DEPC* register and *DBD* bit in the *Debug* register points to the instruction that caused the DB\_match equation to be true.

The instruction causing the debug data break exception does not update any registers due to the instruction, and the following applies to the load or store transaction causing the debug exception:

- A store transaction is not allowed to complete the store to the memory system.
- A load transaction with no data value compare, i.e. where the DB\_no\_value\_compare is true for the match, is not allowed to complete the load.
- A load transaction for a breakpoint with data value compare must occur from the memory system, since the value is required in order to evaluate the breakpoint.

The result of this is that the load or store instruction causing the debug data break exception appears as not executed, with the exception that a load from the memory system does occur for a breakpoint with data value compare, but the register file is not updated by the load.

If both data breakpoints without and with data value compare would match the same transaction and generate a debug exception, then the following rules apply with respect to updating the BS[n] bits.

- On both a load and store the *BS*[*n*] bits are required to be set for all matching breakpoints without a data value compare.
- On a store the *BS*[*n*] bits are allowed but not required to be set for all matching breakpoints with a data value compare, but either all or none of the *BS*[*n*] bits must be set for these breakpoints.
- On a load then none of the *BS*[*n*] bits for breakpoints with data value compare are allowed to be set, since the load is not allowed to occur due to the debug exception from a breakpoint without a data value compare, and a valid data value is therefore not returned.

Any BS[n] bit set prior to the match and debug exception are kept set, since BS[n] bits are only cleared by debug software.

The debug handler usually returns to the instruction causing the debug data break exception, whereby the instruction is re-executed. This re-execution may result in a repeated load from system memory, since the load may have occurred previously in order to evaluate the breakpoint as described above. I/O devices with side effects on loads may not be re-accessible without changing the system behavior. The Load Data Value register was introduced to capture the value that was read and allow debug software to synthesize the load instruction without re-accessing memory. Debug software is responsible for disabling breakpoints when returning to the instruction, otherwise the debug data break exception will reoccur.

## 10.2.5 Breakpoint Used as Triggerpoint

Both instruction and data hardware breakpoints can be setup by software so that a matching breakpoint does not generate a debug exception, but only an indication through the BS[n] bit. The *TE* bit in the *IBCn* or *DBCn* register controls if an instruction or data breakpoint is used as a so-called triggerpoint. The triggerpoints are, like breakpoints, only compared for instructions executed in non-debug mode.

The BS[n] bit in the IBS or DBS register is set when the respective IB\_match or DB\_match bit is true.

The triggerpoint feature can be used to start and stop tracing.

## **10.2.6 Instruction Breakpoint Registers**

The registers for instruction breakpoints are described below. These registers have implementation information and are used to set up the instruction breakpoints. All registers are in drseg, and the addresses are shown in Table 10.2.

| Offset in drseg                                                                           | Register<br>Mnemonic | Register Name and Description            |  |  |  |  |  |
|-------------------------------------------------------------------------------------------|----------------------|------------------------------------------|--|--|--|--|--|
| 0x1000                                                                                    | IBS                  | Instruction Breakpoint Status            |  |  |  |  |  |
| 0x1100 + n * 0x100                                                                        | IBAn                 | Instruction Breakpoint Address n         |  |  |  |  |  |
| 0x1108 + n * 0x100                                                                        | IBMn                 | Instruction Breakpoint Address Mask n    |  |  |  |  |  |
| 0x1110 + n * 0x100                                                                        | IBASIDn              | Instruction Breakpoint ASID n            |  |  |  |  |  |
| 0x1118 + n * 0x100                                                                        | IBCn                 | Instruction Breakpoint Control n         |  |  |  |  |  |
| 0x1120 + n * 0x100                                                                        | IBCCn                | Instruction Breakpoint Complex Control n |  |  |  |  |  |
| 0x1128 + n * 0x100                                                                        | IBPCn                | Instruction Breakpoint Pass Counter n    |  |  |  |  |  |
| n is breakpoint number in range 0 to 5 (or 3 or 1, depending on the implemented hardware) |                      |                                          |  |  |  |  |  |

 Table 10.2 Addresses for Instruction Breakpoint Registers

An example of some of the registers; IBAO is at offset 0x1100 and IBC2 is at offset 0x1318.

## 10.2.6.1 Instruction Breakpoint Status (IBS) Register (0x1000)

The *Instruction Breakpoint Status (IBS)* register holds implementation and status information about the instruction breakpoints. This register is required only if instruction breakpoints are implemented.

0

| Figure 10.2 IBS Register Format |         |       |       |      |    |  |  |  |  |  |
|---------------------------------|---------|-------|-------|------|----|--|--|--|--|--|
| 31                              | 30      | 29 28 | 27 24 | 23 6 | 5  |  |  |  |  |  |
| Res                             | ASIDsup | Res   | BCN   | Res  | BS |  |  |  |  |  |

| Fields  |        |                                                                                                                                                                                                                | Read/Wr |                              |  |
|---------|--------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------|------------------------------|--|
| Name    | Bit(s) | Description                                                                                                                                                                                                    | ite     | Reset State                  |  |
| Res     | 31     | Must be written as zero; returns zero on read.                                                                                                                                                                 | R       | 0                            |  |
| ASIDsup | 30     | Indicates that ASID compare is supported in instruction<br>breakpoints.<br>0: No ASID compare.<br>1: ASID compare ( <i>IBASIDn</i> register implemented).                                                      | R       | 1                            |  |
| Res     | 29:28  | Must be written as zero; returns zero on read.                                                                                                                                                                 | R       | 0                            |  |
| BCN     | 27:24  | Number of instruction breakpoints implemented.                                                                                                                                                                 | R       | 0, 2, 4, 6 or 8 <sup>a</sup> |  |
| Res     | 23:8   | Must be written as zero; returns zero on read.                                                                                                                                                                 | R       | 0                            |  |
| BS      | 7:0    | Break status for breakpoint n is at $BS[n]$ , with n from 0 to 7 <sup>b</sup> . The bit is set to 1 when the condition for the corresponding breakpoint has matched and <i>IBCnTE</i> or <i>IBCnBE</i> are set | R/W     | Undefined                    |  |

## **Table 10.3 IBS Register Field Descriptions**

#### 10.2.6.2 Instruction Breakpoint Address n (IBAn) Register (0x1100 + n \* 0x100)

The *Instruction Breakpoint Address n* (*IBAn*) register has the address used in the condition for instruction breakpoint *n*. This register is required only if instruction breakpoints are implemented.

#### Figure 10.3 IBAn Register Format

| 31  | 0 |
|-----|---|
| IBA |   |

## Table 10.4 IBAn Register Field Descriptions

| Fields |        |                                               | Read/W |           |  |  |
|--------|--------|-----------------------------------------------|--------|-----------|--|--|
| Name   | Bit(s) | Description                                   |        |           |  |  |
| IBA    | 31:0   | Instruction breakpoint address for condition. | R/W    | Undefined |  |  |

#### 10.2.6.3 Instruction Breakpoint Address Mask n (IBMn) Register (0x1108 + n\*0x100)

The Instruction *Breakpoint Address Mask n (IBMn)* register has the mask for the address compare used in the condition for instruction breakpoint n. A 1 indicates that the corresponding address bit will not be considered in the match. A mask value of all 0's would require an exact address match, while a mask value of all 1's would match on any address. This register is required only if instruction breakpoints are implemented.

## Figure 10.4 IBMn Register Format

| 31 |     | 0 |
|----|-----|---|
|    | IBM |   |

## Table 10.5 IBMn Register Field Descriptions

|        |                                                    |                                       | Read/W                                                                                                                                   | 1                                                                                                                     |
|--------|----------------------------------------------------|---------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------|
| Bit(s) |                                                    | Description                           | rite                                                                                                                                     | Reset State                                                                                                           |
| 31:0   | Instruction breakpoint address mask for condition: |                                       | R/W                                                                                                                                      | Undefined                                                                                                             |
|        | Encoding                                           | Encoding Meaning                      |                                                                                                                                          |                                                                                                                       |
|        | 0                                                  | Corresponding address bit not masked. |                                                                                                                                          | 1                                                                                                                     |
|        | 1                                                  | Corresponding address bit masked.     |                                                                                                                                          | l                                                                                                                     |
|        |                                                    | 31:0 Instruction b                    | 31:0     Instruction breakpoint address mask for condition:       Encoding     Meaning       0     Corresponding address bit not masked. | 31:0     Instruction breakpoint address mask for condition:     R/W       0     Corresponding address bit not masked. |

#### 10.2.6.4 Instruction Breakpoint ASID n (IBASIDn) Register (0x1110 + n\*0x100)

The Instruction Breakpoint ASID n (IBASIDn) register has the ASID value used in the compare for instruction breakpoint n. The number of bits in the ASID field is 8, to match the ASID size in the TLB. This register is required only if instruction breakpoints are implemented.

#### Figure 10.5 IBASIDn Register Format

| 31 | 8   | 7 0  |
|----|-----|------|
|    | Res | ASID |

## Table 10.6 IBASIDn Register Field Descriptions

| Fie  | lds    | Read/Wr                                          |     |             |
|------|--------|--------------------------------------------------|-----|-------------|
| Name | Bit(s) | Description                                      | ite | Reset State |
| Res  | 31:8   | Must be written as zero; returns zero on read.   | R 0 |             |
| ASID | 7:0    | Instruction breakpoint ASID value for a compare. |     | Undefined   |

#### 10.2.6.5 Instruction Breakpoint Control n (IBCn) Register (0x1118 + n\*0x100)

The *Instruction Breakpoint Control n (IBCn)* register controls the setup of instruction breakpoint *n*. This register is required only if instruction breakpoints are implemented.

## Figure 10.6 IBCn Register Format

| 31  | 24 | 23      | 22 7 | 6      | 5    | 4     | 3   | 2  | 1   | 0  |  |
|-----|----|---------|------|--------|------|-------|-----|----|-----|----|--|
| Res |    | ASIDuse | Res  | hwarts | excl | hwart | Res | TE | Res | BE |  |

## Table 10.7 IBCn Register Field Descriptions

| Fields  |       |                                                                                         |                                                                                                                                                                                                                                              |            |             |
|---------|-------|-----------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name    | Bits  |                                                                                         | Description                                                                                                                                                                                                                                  | Read/Write | Reset State |
| Res     | 31:24 | Must be written                                                                         | n as zero; returns zero on read.                                                                                                                                                                                                             | R          | 0           |
| ASIDuse | 23    | Use ASID valu                                                                           | e in compare for instruction breakpoint n:                                                                                                                                                                                                   | R/W        | Undefined   |
|         |       | Encoding                                                                                | Meaning                                                                                                                                                                                                                                      |            |             |
|         |       | 0                                                                                       | Don't use ASID value in compare                                                                                                                                                                                                              |            |             |
|         |       | 1                                                                                       | Use ASID value in compare                                                                                                                                                                                                                    |            |             |
| Res     | 22:7  | Must be writter                                                                         | n as zero; returns zero on read.                                                                                                                                                                                                             | R          | 0           |
| hwarts  | 6     | gered instruction                                                                       | of 1 indicates that the address- range trig-<br>on breakpoint feature is supported for this<br>action breakpoint channel.                                                                                                                    | R          | Preset      |
| excl    | 5     | addresses withi<br><i>IBMn</i> and <i>IBA</i><br>point will mate                        | dicates that the breakpoint will match for<br>n (inclusive of) the range defined by<br>n. A value of 1 indicates that the break-<br>h for addresses outside (exclusive to) the<br>by <i>IBMn</i> and <i>IBAn</i> .                           | R/W        | 0           |
| hwart   | 4     | using the "Equa<br>under 10.2.3.1<br>Breakpoints".<br>A value of 1 in<br>using the "Add | dicates that the breakpoint will match<br>ality and Mask" equation as found section<br>"Conditions for Matching Instruction<br>dicates that the breakpoint will match<br>ress Range" equation in section<br>ditions for Matching Instruction | R/W        | 0           |
| Res     | 3     | Must be written                                                                         | n as zero; returns zero on read.                                                                                                                                                                                                             | R          | 0           |
| TE      | 2     | Use instruction                                                                         | breakpoint n as triggerpoint:                                                                                                                                                                                                                | R/W        | 0           |
|         |       | Encoding                                                                                | Meaning                                                                                                                                                                                                                                      |            |             |
|         |       | 0                                                                                       | Don't use it as triggerpoint                                                                                                                                                                                                                 |            |             |
|         |       | 1                                                                                       | Use it as triggerpoint                                                                                                                                                                                                                       |            |             |
| Res     | 1     | Must be written                                                                         | n as zero; returns zero on read.                                                                                                                                                                                                             | R          | 0           |
| BE      | 0     | Use instruction                                                                         | breakpoint n as breakpoint:                                                                                                                                                                                                                  | R/W        | 0           |
|         |       | Encoding                                                                                | Meaning                                                                                                                                                                                                                                      |            |             |
|         |       | 0                                                                                       | Don't use it as breakpoint                                                                                                                                                                                                                   |            |             |
|         |       | 1                                                                                       | Use it as breakpoint                                                                                                                                                                                                                         |            |             |

## 10.2.6.6 Instruction Breakpoint Complex Control n (IBCCn) Register (0x1120 + n\*0x100)

The Instruction Breakpoint Complex Control n (IBCCn) register controls the complex break conditions for instruction breakpoint *n*. This register is required only if complex breakpoints are implemented and only for implemented instruction breakpoints.

|    | Figure 10.7 IBCCn | Re | gister F | orm | nat |         |   |   |   |    |    |   |
|----|-------------------|----|----------|-----|-----|---------|---|---|---|----|----|---|
| 31 |                   | 14 | 13       | 10  | 9   | 8       | 5 | 4 | 3 | 2  | 1  | 0 |
|    | Res               |    | PrCn     | ł   | CBE | DBrkNur | n | Q |   | Re | es |   |

| Fields  |               |                                                                                                                                                                                                             |            |                                                                                                                                                                              |
|---------|---------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Name    | Bits          | Description                                                                                                                                                                                                 | Read/Write | Reset State                                                                                                                                                                  |
| Res     | 31:14,<br>3:0 | Must be written as zero; returns zero on read.                                                                                                                                                              | R          | 0                                                                                                                                                                            |
| PrCnd   | 13:12         | Upper bits of priming condition for instruction breakpoint<br>n. The microAptiv UP core only supports 4 priming con-<br>ditions, so the upper 2 bits are read as 0.                                         | R          | 0                                                                                                                                                                            |
| PrCnd   | 11:10         | Priming condition for instruction breakpoint n.<br>00 - Bypass, no priming needed<br>Other - Varies depending on the break number; refer to<br>Table 10.10 for mapping.                                     | R/W        | 0                                                                                                                                                                            |
| CBE     | 9             | Complex Break Enable. Enables this breakpoint for use<br>in a complex sequence as a priming condition for another<br>breakpoint, to start or stop the stopwatch timer, or as part<br>of a tuple breakpoint. | R/W        | 0                                                                                                                                                                            |
| DBrkNum | 8:5           | Indicates which data breakpoint channel is used to qualify this instruction breakpoint.                                                                                                                     | R          | 6I/2D Complex Breakpoint<br>Configuration:<br>IBCC02 - 0<br>IBCC36 - 1<br>8I/4D Complex Breakpoint<br>Configuration:<br>IBCC01 - 0<br>IBCC23 - 1<br>IBCC45 - 2<br>IBCC67 - 3 |
| Q       | 4             | Qualify this breakpoint based on the data breakpoint indi-<br>cated in <i>DBrkNum</i> .<br>0 - Not dependent on qualification<br>1 - Breakpoint must be qualified to be taken                               | R/W        | 0                                                                                                                                                                            |

## **Table 10.8 IBCCn Register Field Descriptions**

#### 10.2.6.7 Instruction Breakpoint Pass Counter n (IBPCn) Register (0x1128 + n\*0x100)

The Instruction Breakpoint Pass Counter n (IBPCn) register controls the pass counter associated with instruction breakpoint *n*. This register is required only if complex breakpoints are implemented and only for implemented instruction breakpoints.

If complex breakpoints are implemented, there will be an 8b pass counter for each of the instruction breakpoints on the microAptiv UP core.

## Figure 10.8 IBPCn Register Format

| 31 | 8 | 7 0     |
|----|---|---------|
|    | 0 | PassCnt |

# Table 10.9 IBPCn Register Field Descriptions

| Fiel    | ds   |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |            |             |
|---------|------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name    | Bits | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   | Read/Write | Reset State |
| 0       | 31:8 | Ignored on write, returns zero on read.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       | R          | 0           |
| PassCnt | 7:0  | Prevents a break/trigger action until the matching condi-<br>tions on breakpoint n have been seen this number of<br>times.<br>Each time the matching condition is seen, this value will<br>be decremented by 1. When the value reaches 0, subse-<br>quent matches will cause a break or trigger as requested<br>and the counter will stay at 0.<br>The break or trigger action is imprecise if the <i>PassCnt</i><br>register was last written to a non-zero value. It will<br>remain imprecise until this register is written to 0 by soft-<br>ware.<br>The instruction pass counter should not be set on instruc-<br>tion breakpoints that are being used as part of a tuple<br>breakpoint. | R/W        | 0           |

## 10.2.7 Data Breakpoint Registers

The registers for data breakpoints are described below. These registers have implementation information and are used the setup the data breakpoints. All registers are in drseg, and the addresses are shown in Table 10.10.

| Offset in drseg                                                                           | Register<br>Mnemonic | Register Name and Description     |  |  |  |  |  |
|-------------------------------------------------------------------------------------------|----------------------|-----------------------------------|--|--|--|--|--|
| 0x2000                                                                                    | DBS                  | Data Breakpoint Status            |  |  |  |  |  |
| 0x2100 + 0x100 * n                                                                        | DBAn                 | Data Breakpoint Address n         |  |  |  |  |  |
| 0x2108 + 0x100 * n                                                                        | DBMn                 | Data Breakpoint Address Mask n    |  |  |  |  |  |
| 0x2110 + 0x100 * n                                                                        | DBASIDn              | Data Breakpoint ASID n            |  |  |  |  |  |
| 0x2118 + 0x100 * n                                                                        | DBCn                 | Data Breakpoint Control n         |  |  |  |  |  |
| 0x2120 + 0x100 * n                                                                        | DBVn                 | Data Breakpoint Value n           |  |  |  |  |  |
| 0x2128 + 0x100 * n                                                                        | DBCCn                | Data Breakpoint Complex Control n |  |  |  |  |  |
| 0x2130 + 0x100 * n                                                                        | DBPCn                | Data Breakpoint Pass Counter n    |  |  |  |  |  |
| 0x2ff0 DVM Data Value Match Register                                                      |                      |                                   |  |  |  |  |  |
| n is breakpoint number as 0, 1, 2 or 3 (or just 0, depending on the implemented hardware) |                      |                                   |  |  |  |  |  |

 Table 10.10 Addresses for Data Breakpoint Registers

2 1 0

BS

An example of some of the registers; DBM0 is at offset 0x2108 and DBV1 is at offset 0x2220.

#### 10.2.7.1 Data Breakpoint Status (DBS) Register (0x2000)

The *Data Breakpoint Status* (*DBS*) register holds implementation and status information about the data breakpoints. This register is required only if data breakpoints are implemented.

Res

Figure 10.9 DBS Register Format

The ASIDsup field indicates whether ASID compares are supported.

BCN

24 23

| Res HSI                                     | Doup Res |                                                                                                                                                                      |         | B                         |  |  |  |  |
|---------------------------------------------|----------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------|---------------------------|--|--|--|--|
| Table 10.11 DBS Register Field Descriptions |          |                                                                                                                                                                      |         |                           |  |  |  |  |
| Fie                                         | elds     |                                                                                                                                                                      | Read/Wr |                           |  |  |  |  |
| Name                                        | Bit(s)   | Description                                                                                                                                                          | ite     | Reset State               |  |  |  |  |
| Res                                         | 31       | Must be written as zero; returns zero on read.                                                                                                                       | R       | 0                         |  |  |  |  |
| ASID                                        | 30       | Indicates that ASID compares are supported in data<br>breakpoints.<br>0: Not supported<br>1: Supported                                                               | R       | 1                         |  |  |  |  |
| Res                                         | 29:28    | Must be written as zero; returns zero on read.                                                                                                                       | R       | 0                         |  |  |  |  |
| BCN                                         | 27:24    | Number of data breakpoints implemented.                                                                                                                              | R       | 4, 2, 1 or 0 <sup>a</sup> |  |  |  |  |
| Res                                         | 23:4     | Must be written as zero; returns zero on read.                                                                                                                       | R       | 0                         |  |  |  |  |
| BS                                          | 3:0      | Break status for breakpoint n is at $BS[n]$ , with n from 0 to 1 <sup>b</sup> . The bit is set to 1 when the condition for the corresponding breakpoint has matched. | R/W0    | Undefined                 |  |  |  |  |

[a] Based on actual hardware implemented.

[b] In case of only 1 data breakpoint bit 1 become reserved.

## 10.2.7.2 Data Breakpoint Address n (DBAn) Register (0x2100 + 0x100 \* n)

The Data Breakpoint Address n (DBAn) register has the address used in the condition for data breakpoint n. This register is required only if data breakpoints are implemented.

#### Figure 10.10 DBAn Register Format

31

31

30

Res ASIDsup Res

29 28 27

DBA

0

| Fie  | lds    |                                        | Read/W      |           |  |
|------|--------|----------------------------------------|-------------|-----------|--|
| Name | Bit(s) | Description                            | Reset State |           |  |
| DBA  | 31:0   | Data breakpoint address for condition. | R/W         | Undefined |  |

## Table 10.12 DBAn Register Field Descriptions

#### 10.2.7.3 Data Breakpoint Address Mask n (DBMn) Register (0x2108 + 0x100 \* n)

The Data Breakpoint Address Mask n (DBMn) register has the mask for the address compare used in the condition for data breakpoint n. A 1 indicates that the corresponding address bit will not be considered in the match. A mask value of all 0's would require an exact address match, while a mask value of all 1's would match on any address. This register is required only if data breakpoints are implemented.

#### Figure 10.11 DBMn Register Format

| 31 |     | 0 |
|----|-----|---|
|    | DBM |   |

## Table 10.13 DBMn Register Field Descriptions

| Fiel | ds     |                                                                                                                               |      |             |  |  |
|------|--------|-------------------------------------------------------------------------------------------------------------------------------|------|-------------|--|--|
| Name | Bit(s) | Description                                                                                                                   | rite | Reset State |  |  |
| DBM  | 31:0   | Data breakpoint address mask for condition:<br>0: Corresponding address bit not masked<br>1: Corresponding address bit masked | R/W  | Undefined   |  |  |

#### 10.2.7.4 Data Breakpoint ASID n (DBASIDn) Register (0x2110 + 0x100 \* n)

The Data Breakpoint ASID n (DBASIDn) register has the ASID value used in the compare for data breakpoint n. This register is required only if data breakpoints are implemented.

#### Figure 10.12 DBASIDn Register Format

| 31 |     | 8 | 7    | 0 |
|----|-----|---|------|---|
|    | Res |   | ASID |   |

## Table 10.14 DBASIDn Register Field Descriptions

| Fie  | lds    |                                                | Read/Wr |             |
|------|--------|------------------------------------------------|---------|-------------|
| Name | Bit(s) | Description                                    | ite     | Reset State |
| Res  | 31:8   | Must be written as zero; returns zero on read. | R       | 0           |
| ASID | 7:0    | Data breakpoint ASID value for compares.       | R/W     | Undefined   |

\_\_\_\_\_

## 10.2.7.5 Data Breakpoint Control n (DBCn) Register (0x2118 + 0x100 \* n)

\_\_\_

The Data Breakpoint Control n (DBCn) register controls the setup of data breakpoint n. This register is required only if data breakpoints are implemented.

\_

| Figure 10.13 DBCn Register Format |    |         |       |       |      |      |     |        |      |       |     |     |    |     |    |  |
|-----------------------------------|----|---------|-------|-------|------|------|-----|--------|------|-------|-----|-----|----|-----|----|--|
| 31                                | 24 | 23      | 22 18 | 17 14 | 13   | 12   | 11  | 10     | 9    | 8     | 7 4 | 3   | 2  | 1   | 0  |  |
|                                   | Re | ASIDuse | Res   | BAI   | NoSB | NoLB | Res | hwarts | excl | hwart | BLM | Res | TE | IVM | BE |  |

.. .. \_\_\_

| Fields  |       |                                     |                                                                                                                                              |            |             |
|---------|-------|-------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name    | Bits  | -                                   | Description                                                                                                                                  | Read/Write | Reset State |
| Res     | 31:24 | Must be writte                      | n as zero; returns zero on reads.                                                                                                            | R          | 0           |
| ASIDuse | 23    | Use ASID value                      | ue in compare for data breakpoint n:                                                                                                         | R/W        | Undefined   |
|         |       | Encoding                            | Meaning                                                                                                                                      |            |             |
|         |       | 0                                   | Don't use ASID value in compare                                                                                                              |            |             |
|         |       | 1                                   | Use ASID value in compare                                                                                                                    |            |             |
| Res     | 22:18 | Must be writte                      | n as zero; returns zero on reads.                                                                                                            | R          | 0           |
| BAI     | 17:14 | byte. BAI[0] ig                     | nore controls ignore of access to a specific<br>gnores access to byte at bits [7:0] of the<br>1] ignores access to byte at bits [15:8], etc. | R/W        | Undefined   |
|         |       | Encoding                            | Meaning                                                                                                                                      |            |             |
|         |       | 0                                   | Condition depends on access to corre-<br>sponding byte                                                                                       |            |             |
|         |       | 1                                   | Access for corresponding byte is ignored                                                                                                     |            |             |
| NoSB    | 13    | Controls if cor<br>on a store trans | ndition for data breakpoint is not fulfilled saction:                                                                                        | R/W        | Undefined   |
|         |       | Encoding                            | Meaning                                                                                                                                      |            |             |
|         |       | 0                                   | Condition may be fulfilled on store transaction                                                                                              |            |             |
|         |       | 1                                   | Condition is never fulfilled on store transaction                                                                                            |            |             |
| NoLB    | 12    | Controls if cor<br>on a load trans  | ndition for data breakpoint is not fulfilled action:                                                                                         | R/W        | Undefined   |
|         |       | Encoding                            | Meaning                                                                                                                                      |            |             |
|         |       | 0                                   | Condition may be fulfilled on load transaction                                                                                               |            |             |
|         |       | 1                                   | Condition is never fulfilled on load transaction                                                                                             |            |             |
| Res     | 11    | Must be writte                      | n as zero; returns zero on reads.                                                                                                            | R          | 0           |
| hwarts  | 10    | gered data brea                     | of 1 indicates that the address range trig-<br>akpoint feature is supported for this par-<br>eakpoint channel.                               | R          | Preset      |

## Table 10.15 DBCn Register Field Descriptions

| Fields<br>Name Bits |     |                                                                                                                                                                                                                                                                                                                                                      |            |             |  |
|---------------------|-----|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|--|
|                     |     | Description                                                                                                                                                                                                                                                                                                                                          | Read/Write | Reset State |  |
| excl                | 9   | A value of 0 indicates that the breakpoint will match for<br>addresses inclusive (within) the range defined by <i>DBMn</i><br>and <i>DBAn</i> . A value of 1 indicates that the breakpoint<br>will match for addresses exclusive (outside) of the<br>range defined by <i>DBMn</i> and <i>DBAn</i> .                                                  | R/W        | 0           |  |
| hwart               | 8   | A value of 0 indicates that the breakpoint will match<br>using the "Equality and Mask" equation as found sec-<br>tion under 10.2.3.2 "Conditions for Matching Data<br>Breakpoints".<br>A value of 1 indicates that the breakpoint will match<br>using the "Address Range" equation in section<br>10.2.3.2 "Conditions for Matching Data Breakpoints" | R/W        | 0           |  |
| BLM                 | 7:4 | Byte lane mask for value compare on data breakpoint.<br>BLM[0] masks byte at bits [7:0] of the data bus, BLM[1] masks byte at bits [15:8], etc.:                                                                                                                                                                                                     | R/W        | Undefined   |  |
|                     |     | Encoding Meaning                                                                                                                                                                                                                                                                                                                                     |            |             |  |
|                     |     | 0 Compare corresponding byte lane                                                                                                                                                                                                                                                                                                                    |            |             |  |
|                     |     | 1 Mask corresponding byte lane                                                                                                                                                                                                                                                                                                                       |            |             |  |
| Res                 | 3   | Must be written as zero; returns zero on reads.                                                                                                                                                                                                                                                                                                      | R          | 0           |  |
| TE                  | 2   | Use data breakpoint n as triggerpoint:                                                                                                                                                                                                                                                                                                               | R/W        | 0           |  |
|                     |     | Encoding Meaning                                                                                                                                                                                                                                                                                                                                     |            |             |  |
|                     |     | 0 Don't use it as triggerpoint                                                                                                                                                                                                                                                                                                                       |            |             |  |
|                     |     | 1 Use it as triggerpoint                                                                                                                                                                                                                                                                                                                             |            |             |  |
| IVM                 | 1   | Invert Value Match. When set, the data value compare<br>will be inverted. i.e., a break or trigger will be taken if<br>the value does not match the specified value                                                                                                                                                                                  | R/W        | 0           |  |
| BE                  | 0   | Use data breakpoint n as breakpoint:                                                                                                                                                                                                                                                                                                                 | R/W        | 0           |  |
|                     |     | Encoding Meaning                                                                                                                                                                                                                                                                                                                                     |            |             |  |
|                     |     | 0 Don't use it as breakpoint                                                                                                                                                                                                                                                                                                                         |            |             |  |
|                     |     | 1 Use it as breakpoint                                                                                                                                                                                                                                                                                                                               |            |             |  |

## 10.2.7.6 Data Breakpoint Value n (DBVn) Register (0x2120 + 0x100 \* n)

The Data Breakpoint Value n (DBVn) register has the value used in the condition for data breakpoint n. This register is required only if data breakpoints are implemented.

## Figure 10.14 DBVn Register Format

| 31 | 0   |
|----|-----|
|    | DBV |

| Fie  | lds    |                                      | Read/Wr |             |  |
|------|--------|--------------------------------------|---------|-------------|--|
| Name | Bit(s) | Description                          |         | Reset State |  |
| DBV  | 31:0   | Data breakpoint value for condition. | R/W     | Undefined   |  |

## **Table 10.16 DBVn Register Field Descriptions**

## 10.2.7.7 Data Breakpoint Complex Control n (DBCCn) Register (0x2128 + n\*0x100)

The Data Breakpoint Complex Control n (*DBCCn*) register controls the complex break conditions for data breakpoint n. This register is required only if complex breakpoints are implemented and only for implemented data breakpoints.

| Figure 10.15 DBCCn Register Format |          |     |     |       |     |         |   |   |     |    |
|------------------------------------|----------|-----|-----|-------|-----|---------|---|---|-----|----|
| 31 20                              | 19 16    | 15  | 14  | 13 10 | 9   | 8 5     | 4 | 3 | 2 1 | 10 |
| Res                                | TIBrkNum | TUP | Res | PrCnd | CBE | DBrkNum | Q |   | Res | 5  |

## Table 10.17 DBCCn Register Field Descriptions

| Fields   |                   |                                                                                                                                                                                                                                     |            |                                                                                                                                                                        |  |  |
|----------|-------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|
| Name     | Bits              | Description                                                                                                                                                                                                                         | Read/Write | Reset State                                                                                                                                                            |  |  |
| Res      | 31:20,<br>14, 3:0 | Must be written as zero; returns zero on read.                                                                                                                                                                                      | R          | 0                                                                                                                                                                      |  |  |
| TIBrkNum | 19:16             | Tuple Instruction Break Number. Indicates which<br>instruction breakpoint will be paired with this data break-<br>point to form a tuple breakpoint.                                                                                 | R          | 6I/2D Complex Breakpoint<br>Configuration:<br>DBCC0 - 0<br>DBCC1 - 3<br>8I/4D Complex Breakpoint<br>Configuration:<br>DBCC0 - 0<br>DBCC1 - 2<br>DBCC2 - 4<br>DBCC3 - 6 |  |  |
| TUP      | 15                | Tuple Enable. Qualify this data breakpoint with a match<br>on the TIBrkNum instruction breakpoint on the same<br>instruction.                                                                                                       | R/W        | 0                                                                                                                                                                      |  |  |
| PrCnd    | 13:12             | Upper bits of priming condition for D breakpoint n.<br>microAptiv UP only supports 4 priming conditions so the<br>upper 2 bits are read only as 0.                                                                                  | R          | 0                                                                                                                                                                      |  |  |
| PrCnd    | 11:10             | Priming condition for D Breakpoint n.<br>00 - Bypass, no priming needed<br>Other - Varies depending on the break number, refer to<br>Table 10.20 for mapping.                                                                       | R/W        | 0                                                                                                                                                                      |  |  |
| CBE      | 9                 | Complex Break Enable - enables this breakpoint for use<br>as a priming or qualifying condition for another break-<br>point.                                                                                                         | R/W        | 0                                                                                                                                                                      |  |  |
| DQBrkNum | 8:5               | Indicates which data breakpoint channel is used to qualify<br>this data breakpoint.<br>Data qualification of data breakpoints is not supported on<br>the microAptiv UP core and this field will read as 0 and<br>cannot be written. | R          | 0                                                                                                                                                                      |  |  |

| Fields |      |                                                                                                                                                                                                                                         |            |             |
|--------|------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name   | Bits | Description                                                                                                                                                                                                                             | Read/Write | Reset State |
| DQ     | 4    | Qualify this breakpoint based on the data breakpoint indi-<br>cated in <i>DBrkNum</i> .<br>Data qualification of data breakpoints is not supported on<br>the microAptiv UP core and this field will read as 0 and<br>cannot be written. | R          | 0           |

## Table 10.17 DBCCn Register Field Descriptions (Continued)

## 10.2.7.8 Data Breakpoint Pass Counter n (DBPCn) Register (0x2130 + n\*0x100)

The *Data Breakpoint Pass Counter n* (*DBPCn*) register controls the pass counter associated with data breakpoint n. This register is required only if complex breakpoints are implemented and only for implemented data breakpoints.

If complex breakpoints are implemented, there will be an 16b pass counter for each of the data breakpoints on the microAptiv UP core.

### Figure 10.16 DBPCn Register Format

| 31 | 16 | 15 0    |
|----|----|---------|
|    | 0  | PassCnt |

## Table 10.18 DBPCn Register Field Descriptions

| Fields       |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |     |             |  |  |
|--------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----|-------------|--|--|
| Name Bits    | Name Bits Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |     | Reset State |  |  |
| 0 31:16      | Ignored on write, returns zero on read.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | R   | 0           |  |  |
| PassCnt 15:0 | Prevents a break/trigger action until the matching condi-<br>tions on data breakpoint n have been seen this number of<br>times.<br>Each time the matching condition is seen, this value will<br>be decremented by 1. When the value reaches 0, subse-<br>quent matches will cause a break or trigger as requested<br>and the counter will stay at 0.<br>The break or trigger action is imprecise if the <i>PassCnt</i><br>register was last written to a non-zero value. It will<br>remain imprecise until this register is written to 0 by soft-<br>ware. | R/W | 0           |  |  |

## 10.2.7.9 Data Value Match (DVM) Register (0x2ffo)

The Data Value Match (DVM) register captures the data value of a load that takes a precise data value breakpoint. This allows debug software to synthesize the load instruction without re-executing it in case it is to a system register that has destructive reads. This register is required only if data breakpoints are implemented.

## Figure 10.17 DVM Register Format

31

0 LDV

| Fie  | lds    |                                                                        | Read/W |             |
|------|--------|------------------------------------------------------------------------|--------|-------------|
| Name | Bit(s) | Description                                                            | rite   | Reset State |
| LDV  | 31:0   | Load data value for the last precise load data value breakpoint taken. | R      | Undefined   |

#### Table 10.19 DVM Register Field Descriptions

## **10.2.8 Complex Breakpoint Registers**

The registers for complex breakpoints are described Table 10.20. These registers have implementation information and are used to setup the data breakpoints. All registers are in drseg.

| Offset in drseg                           | Register<br>Mnemonic | Register Name and Description                                                                               |
|-------------------------------------------|----------------------|-------------------------------------------------------------------------------------------------------------|
| 0x1120 + 0x100 * n                        | IBCCn                | Instruction Breakpoint Complex Control n - described above with instruction breakpoint registers            |
| 0x1128 + 0x100 * n                        | IBPCn                | Instruction Breakpoint Pass Counter n - described above with instruction breakpoint registers               |
| 0x2128 + 0x100 * n                        | DBCCn                | Data Breakpoint Complex Control n - described above with data breakpoint registers                          |
| 0x2130 + 0x100 * n                        | DBPCn                | Data Breakpoint Pass Counter n - described above with data breakpoint registers                             |
| 0x8000                                    | CBTControl           | Complex Break and Triggerpoint Control - indicates which of the complex breakpoint features are implemented |
| 0x8300 + 0x20 * n                         | PrCndAIn             | Prime Condition Register A for Instruction breakpoint n                                                     |
| 0x84e0 + 0x20 * n                         | PrCndADn             | Prime Condition Register A for Data breakpoint n                                                            |
| 0x8900                                    | STCtl                | Stopwatch Timer Control                                                                                     |
| 0x8908                                    | STCnt                | Stopwatch Timer Count                                                                                       |
| <i>n</i> is breakpoint number from 0 to 7 | range dependent on   | implemented hardware)                                                                                       |

Table 10.20 Addresses for Complex Breakpoint Registers

mber from 0 to 7 (range dependent on implemented hardware)

#### 10.2.8.1 Complex Break and Trigger Control (CBTC) Register (0x8000)

The CBTC register contains configuration bits that indicate which features of complex break are implemented as well as a control bit for the stopwatch timer. On the microAptiv UP core, if complex break is implemented, all of the separate features will be present. This register is required only if complex breakpoints are implemented.

#### Figure 10.18 CBTC Register Format

| 31  | 9 | 8      | 7 5 | 4   | 3  | 2   | 1  | 0   |
|-----|---|--------|-----|-----|----|-----|----|-----|
| Res |   | STMode | Res | STP | РР | DQP | ТР | PCP |

| Fields |           |                                                                                                                                                                                      |            |             |
|--------|-----------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|-------------|
| Name   | Bits      | Description                                                                                                                                                                          | Read/Write | Reset State |
| Res    | 31:9, 7:5 | Reserved                                                                                                                                                                             | R          | 0           |
| STMode | 8         | Stopwatch Timer Mode: controls whether the stopwatch<br>timer is free-running or controlled by triggerpoints:<br>0 - free-running<br>1 - started and stopped by instruction triggers | R/W        | 1           |
| STP    | 4         | Stopwatch Timer Present - indicates whether stopwatch timer is implemented.                                                                                                          | R          | 1           |
| РР     | 3         | Priming Present - indicates whether primed breakpoints are supported                                                                                                                 | R          | 1           |
| DQP    | 2         | Data Qualify Present - indicates whether data qualified breakpoints are supported.                                                                                                   | R          | 1           |
| TP     | 1         | Tuple Present - indicates whether any tuple breakpoints are implemented.                                                                                                             | R          | 1           |
| PCP    | 0         | Pass Counters Present - indicates whether any break-<br>points have pass counters associated with them.                                                                              | R          | 1           |

Table 10.21 CBTC Register Field Descriptions

#### 10.2.8.2 Priming Condition A (PrCndAl/Dn) Registers

The Prime Condition registers hold implementation specific information about which triggerpoints are used for the priming conditions for each breakpoint register. On the microAptiv UP core, these connections are predetermined and these registers are read-only. This register is required only if complex breakpoints are implemented.

The architecture allows for up to 16 priming conditions to be specified and there can be up to 4 priming condition registers per breakpoint (A/B/C/D). The microAptiv UP core only allows for 4 priming conditions and thus only implements the PrCndA registers. The general description is shown in Table 10.22. The actual priming conditions for each of the breakpoints are shown in Table 10.23.

| Figure 10.19 PrCndA Register Format |      |       |       |       |  |  |  |
|-------------------------------------|------|-------|-------|-------|--|--|--|
| 31                                  | 24   | 23 16 | 15 8  | 7 0   |  |  |  |
| Co                                  | ond3 | Cond2 | Cond1 | Cond0 |  |  |  |

| Fiel          | lds                            |                                                                                                                       | Read/Wr |             |
|---------------|--------------------------------|-----------------------------------------------------------------------------------------------------------------------|---------|-------------|
| Name          | Bit(s)                         | Description                                                                                                           | ite     | Reset State |
| CondN         | 31:24<br>23:16<br>15:8<br>7:0  | Specifies which triggerpoint is connected to priming condition 3, 2, 1, or 0 <sup>a</sup> for the current breakpoint. | R       | Preset      |
|               | 31:30<br>23:22<br>15:14<br>7:6 | Reserved                                                                                                              | R       | 0           |
|               | 29:28<br>21:20<br>13:12<br>5:4 | Trigger type<br>00 - Special/Bypass<br>01 - Instruction<br>10 - Data<br>11 - Reserved                                 | R       | Preset      |
|               | 27:24<br>19:16<br>11:8<br>3:0  | Break Number, 0-14                                                                                                    | R       | Preset      |
| [a] Condition | 0 is always B                  | ypass and will read as 8'b0                                                                                           |         |             |

## Table 10.22 PrCndA Register Field Descriptions

## Table 10.23 Priming Conditions and Register Values for 6I/2D Configuration

| Break | Cond0  | Cond1 | Cond2 | Cond3 | PrCndA Value | drseg<br>offset |
|-------|--------|-------|-------|-------|--------------|-----------------|
| Inst0 | Bypass | Data0 | Inst1 | Inst2 | 0x1211_2000  | 0x8300          |
| Inst1 | Bypass | Data0 | Inst0 | Inst2 | 0x1210_2000  | 0x8320          |
| Inst2 | Bypass | Data0 | Inst0 | Inst1 | 0x1110_2000  | 0x8340          |
| Inst3 | Bypass | Data1 | Inst4 | Inst5 | 0x1514_2100  | 0x8360          |
| Inst4 | Bypass | Data1 | Inst3 | Inst5 | 0x1513_2100  | 0x8380          |
| Inst5 | Bypass | Data1 | Inst3 | Inst4 | 0x1413_2100  | 0x83a0          |
| Data0 | Bypass | Inst0 | Inst1 | Inst2 | 0x1211_1000  | 0x84e0          |
| Data1 | Bypass | Inst3 | Inst4 | Inst5 | 0x1514_1300  | 0x8500          |

## Table 10.24 Priming Conditions and Register Values for 8I/4D Configuration

| Break | Cond0  | Cond1 | Cond2 | Cond3 | PrCndA Value | drseg<br>offset |
|-------|--------|-------|-------|-------|--------------|-----------------|
| Inst0 | Bypass | Data0 | Inst1 | Inst2 | 0x1211_2000  | 0x8300          |
| Inst1 | Bypass | Data0 | Inst0 | Inst2 | 0x1210_2000  | 0x8320          |
| Inst2 | Bypass | Data1 | Inst3 | Inst4 | 0x1413_2100  | 0x8340          |
| Inst3 | Bypass | Data1 | Inst2 | Inst4 | 0x1412_2100  | 0x8360          |
| Inst4 | Bypass | Data2 | Inst5 | Inst6 | 0x1615_2200  | 0x8380          |

| Break | Cond0  | Cond1 | Cond2 | Cond3 | PrCndA Value | drseg<br>offset |
|-------|--------|-------|-------|-------|--------------|-----------------|
| Inst5 | Bypass | Data2 | Inst4 | Inst6 | 0x1614_2200  | 0x83a0          |
| Inst6 | Bypass | Data3 | Inst7 | Inst0 | 0x1017_2300  | 0x83c0          |
| Inst7 | Bypass | Data3 | Inst6 | Inst0 | 0x1016_2300  | 0x83e0          |
| Data0 | Bypass | Inst0 | Inst1 | Data1 | 0x2111_1000  | 0x84e0          |
| Data1 | Bypass | Inst2 | Inst3 | Data2 | 0x2213_1200  | 0x8500          |
| Data2 | Bypass | Inst4 | Inst5 | Data3 | 0x2315_1400  | 0x8520          |
| Data3 | Bypass | Inst6 | Inst7 | Data0 | 0x2017_1600  | 0x8540          |

#### 10.2.8.3 Stopwatch Timer Control (STCtl) Register (0x8900)

The *Stopwatch Timer Control* (*STCtl*) register gives configuration information about how the stopwatch timer register is controlled. On the microAptiv UP core, the break channels that control the stopwatch timer are fixed and this register is read-only. This register is required only if stopwatch timer is implemented.

#### Figure 10.20 STCtl Register Format

| 31  | 18 | 17        | 14 | 13 10      | 9   | 8    | 5     | 4 1        | 0  |   |
|-----|----|-----------|----|------------|-----|------|-------|------------|----|---|
| Res |    | StopChan1 | 1  | StartChan1 | En1 | Stop | Chan0 | StartChan0 | En | 0 |

| Fields     |        |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | Read/Wr |             |
|------------|--------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------|-------------|
| Name       | Bit(s) | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                              | ite     | Reset State |
| Res        | 31:18  | Must be written as zero; returns zero on read.                                                                                                                                                                                                                                                                                                                                                                                                                                           | R       | 0           |
| StopChan1  | 17:14  | Indicates the instruction breakpoint channel that will<br>stop the counter if the timer is under pair1 breakpoint<br>control                                                                                                                                                                                                                                                                                                                                                             | R       | 0           |
| StartChan1 | 13:10  | Indicates the instruction breakpoint channel that will<br>start the counter if the timer is under pair1 breakpoint<br>control                                                                                                                                                                                                                                                                                                                                                            | R       | 0           |
| En1        | 9      | Enables the second pair (pair1) of breakpoint registers to<br>control the timer when under breakpoint control. If the<br>stopwatch timer is configured to be under breakpoint<br>control (by setting <i>CBTControlSTM</i> )and this bit is set,<br>the breakpoints indicated in the StartChan1 and<br>StopChan1 fields will control the timer.<br>The microAptiv UP core only supports 1 pair of stop-<br>watch control breakpoints so this field is not writable<br>and will read as 0. | R       | 0           |
| StopChan0  | 8:5    | Indicates the instruction breakpoint channel that will<br>stop the counter if the timer is under pair0 breakpoint<br>control.                                                                                                                                                                                                                                                                                                                                                            | R       | 0x4         |

| Fiel       | ds     |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           | Read/Wr |             |
|------------|--------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------|-------------|
| Name       | Bit(s) | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | ite     | Reset State |
| StartChan0 | 4:1    | Indicates the instruction breakpoint channel that will<br>start the counter if the timer is under pair0 breakpoint<br>control.                                                                                                                                                                                                                                                                                                                                                            | R       | 0x1         |
| En0        | 0      | <ul> <li>Enables the first pair (pair0) of breakpoint registers to control the timer when under breakpoint control. If the stopwatch timer is configured to be under breakpoint control (by setting <i>CBTControlSTM</i>)and this bit is set, the breakpoints indicated in the StartChan0 and StopChan0 fields will control the timer.</li> <li>The microAptiv UP core only supports 1 pair of stopwatch control breakpoints so this field is not writable and will read as 1.</li> </ul> | R       | 1           |

#### **Table 10.25 STCtl Register Field Descriptions**

#### 10.2.8.4 Stopwatch Timer Count (STCnt) Register (0x8908)

The Stopwatch Timer Count (STCnt) register is the count value for the stopwatch timer. This register is required only if the stopwatch timer is implemented.

#### Figure 10.21 STCnt Register Format

| 31 | 0     |
|----|-------|
|    | Count |

### Table 10.26 STCtl Register Field Descriptions

| Fie   | lds    |                       | Read/Wr |             |
|-------|--------|-----------------------|---------|-------------|
| Name  | Bit(s) | Description           | ite     | Reset State |
| Count | 31:0   | Current counter value | R/W     | 0           |

## 10.3 Complex Breakpoint Usage

## 10.3.1 Checking for Presence of Complex Break Support

Software should verify that the complex breakpoint hardware is implemented prior to attempting to use it. The full sequence of steps is shown below for general use. Spots where the microAptiv UP core has restricted behavior are noted.

1. Read the *Config1EP* bit to check for the presence of EJTAG logic. EJTAG logic is always present on the microAptiv UP core.

- 2. Read the *DebugNoDCR* bit to check for the presence of the *Debug Control Register* (*DCR*). The *DCR* will always be implemented on the microAptiv UP core.
- 3. Read the DCRCBT bit to check for the presence of any complex break and trigger features
- 4. Read the *CBTControl* register to check for the presence of each individual feature. If the microAptiv UP core implements any complex break and trigger features, it will implement all of them
- 5. If Pass Counters are implemented, they may not be implemented for all break channels and may have different counter sizes. To determine the size and presence of each pass counter, software can write -1 to each of the *IBPCn* and *DBPCn* registers and read it back. If the microAptiv UP core implements pass counters, it will implement an 8b counter for each instruction breakpoint and a 16b counter for each data breakpoint.
- 6. If tuples are implemented, they may only be supported on a subset of the data breakpoint channels. This can be checked by seeing if the DBBCn<sub>TUP</sub> bit can be set to 1. Additionally, some cores may support dynamically changing which instruction breakpoint is associated with a given data breakpoint. This can be checked by attempting to write the DBCCn<sub>TIBrkNum</sub> field. If the microAptiv UP core implements tuple support, it will support it for all data breakpoint channels and the instruction breakpoint association will be fixed.
- 7. If Priming Conditions are supported, a core may only support a subset of the possible priming condition values. This can be checked by 4'hf to the *xBCCnPrCnd* field. If only 1 or 2 bits can be written, the available priming conditions will be described in the *PrCndA* registers. If 3 bits are writable, *PrCndA* and *PrCndB* will describe the conditions, and if all 4 bits are writable, the *PrCndA*,*PrCndB*,*PrCndC*, and *PrCndD* registers will all exist. Some cores may also support changing the priming conditions and this can be checked by attempting to write to the *PrCnd* registers. If the microAptiv UP core supports priming conditions, it will support 4 statically mapped priming conditions per breakpoint which will be described in the *PrCndA* registers.
- 8. If support for qualified breakpoints is indicated, it may only be supported for some of the breakpoints. Additionally, the data breakpoint used for the qualification may be configurable. Software can check this by writing to the *xBCCnDQ* and *xBCCnDQBrkNum* fields. If the microAptiv UP core support qualified breakpoints, it will only support it on instruction breakpoints and the data break used for qualification will be fixed for each instruction breakpoint.
- 9. If the stopwatch timer is implemented, either one or two pairs of instruction breakpoints may be available for controlling it and it may be possible to dynamically select which instruction breakpoints are used. This can be tested by writing to the *STCtl* register.

## 10.3.2 General Complex Break Behavior

There is some general complex break behavior that is common to all complex breakpoints. . This behavior is described below:

- Resets to a disabled state when the core is reset, the complex break functionality will be disabled and debug software that is not aware of complex break should continue to function normally.
- Complex break state is not updated on exceptional instructions
- Complex breakpoints are evaluated at the end of the pipeline and complex breakpoint exceptions are taken imprecisely on the following instruction.
- There is no hazard between enabling and enabled events. When an instruction causes an enabling event, the following instruction sees the enabled state and reacts accordingly.

## 10.3.3 Usage of Pass Counters

Pass counters specify that the breakpoint conditions must match N times before the breakpoint action will be enabled.

- Controlled by writing to the per-breakpoint pass counter register
- Resets to 0
- Writing to a non-zero value enables the pass counter. When enabled, each time the breakpoint conditions match, the counter will be decremented by 1. After the counter value reaches 0, the breakpoint action (breakpoint exception, trigger, or complex break enable) will occur on any subsequent matches and the counter will not decrement further. The action does not occur on the match that causes the 1->0 counter decrement.
- If the breakpoint also has priming conditions and/or data qualified specified, the pass counter will only decrement when the priming and/or qualified conditions have been met.
- If a data breakpoint is configured to be a tuple breakpoint, the data pass counter will only decrement on instructions where both the instruction and data break conditions match. The pass counter for the instruction break involved in a tuple should not be enabled if the tuple is enabled.
- When a pass counter has been enabled, it will be treated as enabled until the pass counter is explicitly written to 0. Namely, breakpoint exceptions will continue to be taken imprecisely until the pass counter is disabled by writing to 0.
- The counter register will be updated as matches are detected. The current count value can be read from the register while operating in debug mode. Note that this behavior is architecturally recommended, but not required.

## 10.3.4 Usage of Tuple Breakpoints

A tuple breakpoint is the logical AND of a data breakpoint and an instruction breakpoint. Tuple breakpoints are specified as a condition on a data breakpoint. If the *DBCCnTUP* bit is set, the data breakpoint will not match unless there the corresponding instruction breakpoint conditions are also met.

- Uses the data breakpoint resources to specify the break action, break status, pass counters, and priming conditions.
- The instruction breakpoint involved in the tuple should be configured as follows:
  - $IBCCn_{CBE} = 1$
  - $IBCCn_{PrCnd} = IBCCn_{DQ} = IBCn_{TE} = IBCn_{BE} = IBPCn = 0$

### 10.3.5 Usage of Priming Conditions

Priming conditions provide a way to have one breakpoint enabled by another one. Prior to the priming condition being satisfied, any breakpoint matches are ignored.

- Priming condition resets to bypass which specifies that no priming is required
- 3 other priming conditions are available for each breakpoint. These condition vary from breakpoint to breakpoint (since it makes no sense for a breakpoint to prime itself). The conditions for each of the breakpoints are listed in Table 10.23.

- The priming breakpoint must have *xBCn<sub>TE</sub>* or *xBCCn<sub>CBE</sub>* set.
- When the priming condition has been seen, the primed breakpoint will remain primed until its *xBCCn* register is written
- The primed state is stored with the breakpoint being primed and not with the breakpoint that is doing the priming.
- Each Prime condition is the comparator output after it has been qualified by its own Prime condition, data qualification, and pass counter. Using this, several stages of priming are possible (e.g. data cycle D followed by instruction A followed by instruction B N times followed by instruction C).

## 10.3.6 Usage of Data Qualified Breakpoints

Each of the instruction breakpoints can be set to be data qualified. In qualified mode, a breakpoint will recognize its conditions only after the specified data breakpoint matches both address and data. If the data breakpoint matches address, but has a mismatch on the data value, the instruction breakpoint will be unqualified and will not match until a subsequent qualifying match.

This feature can be used similarly to the ASID qualification that is available on cores with TLBs. If an RTOS loads a process ID for the current process, that load can be used as the qualifying breakpoint. When a matching process ID is loaded (entering the desired RTOS process), qualified instruction breakpoints will be enabled. When a different process ID is loaded (leaving the desired RTOS process), the qualified instruction breakpoints are disabled. Alternatively, with the InvertValueMatch feature of the data breakpoint, the instruction breakpoints could be enabled on any process ID other than the specified one.

- The qualifying data break must have DBCn<sub>TE</sub> or DBCCn<sub>CBE</sub> set.
- The qualifying data break should have data comparison enabled (via settings of DBCn<sub>BLM</sub> and DBCn<sub>BAI</sub>)
- The qualifying data break should not have pass counters, priming conditions, or tuples enabled.
- The qualifying data access can be either a load or store, depending on the settings of DBCn<sub>NoSB</sub> and DBCn<sub>NoLB</sub>
- The Qualified/Unqualified state is stored with the instruction breakpoint that is being qualified. Writing its *IBCCn* register will disqualify that breakpoint.
- Qualified instruction breakpoint can also have priming conditions and/or pass counters enabled. The pass counter will only decrement when the priming and qualifying conditions have been met. The instruction breakpoint action (break, trigger, or complex enable) will only occur when all priming, qualifying, and pass counter conditions have been met.
- Qualified instruction breakpoint can be used to prime another breakpoint

## 10.3.7 Usage of Stopwatch Timers

The stopwatch timer is a drseg memory mapped count register. It can be configured to be free running or controlled by instruction breakpoints. This could be used to measure the amount of time that is spent in a particular function by starting the counter upon function entry and stopping it upon exit.

• Count value is reset to 0

- Reset state has counter stopped and under breakpoint control so that the counter is not running when the core is not being debugged.
- Bit in CBTControl register controls whether the counter is free-running or breakpoint controlled.
- Counter does not count in debug mode
- When breakpoint controlled, the involved instruction breakpoints must have *IBCn<sub>TE</sub>* or *IBCCn<sub>CBE</sub>* set in order to start or stop the timer.

## 10.4 Test Access Port (TAP)

The following main features are supported by the TAP module:

- 5-pin industry standard JTAG Test Access Port (*TCK*, *TMS*, *TDI*, *TDO*, *TRST\_N*) interface which is compatible with IEEE Std. 1149.1.
- Target chip and EJTAG feature identification available through the Test Access Port (TAP) controller.
- The processor can access external memory on the EJTAG Probe serially through the EJTAG pins. This is achieved through Processor Access (PA), and is used to eliminate the use of the system memory for debug routines.
- Support for both ROM based debugger and debugging both through TAP.

## 10.4.1 EJTAG Internal and External Interfaces

The external interface of the EJTAG module consists of the 5 signals defined by the IEEE standard.

| Pin | Туре | Description                                                                                                                                                                                                                                                                                                       |
|-----|------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| ТСК | I    | Test Clock Input<br>Input clock used to shift data into or out of the Instruction or data regis-<br>ters. The <i>TCK</i> clock is independent of the processor clock, so the EJTAG<br>probe can drive <i>TCK</i> independently of the processor clock frequency.<br>The core signal for this is called $EJ_TCK$ . |
| TMS | Ι    | Test Mode Select Input<br>The <i>TMS</i> input signal is decoded by the TAP controller to control test<br>operation. <i>TMS</i> is sampled on the rising edge of <i>TCK</i> .<br>The core signal for this is called $EJ_TMS$ .                                                                                    |
| TDI | I    | Test Data Input<br>Serial input data ( <i>TDI</i> ) is shifted into the Instruction register or data regis-<br>ters on the rising edge of the <i>TCK</i> clock, depending on the TAP controller<br>state.<br>The core signal for this is called <i>EJ_TDI</i> .                                                   |
| TDO | 0    | Test Data Output<br>Serial output data is shifted from the Instruction or data register to the<br>TDO pin on the falling edge of the $TCK$ clock. When no data is shifted<br>out, the $TDO$ is 3-stated.<br>The core signal for this is called $EJ_TDO$ with output enable controlled<br>by $EJ_TDOzstate$ .      |

#### Table 10.27 EJTAG Interface Pins

| Pin    | Туре | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
|--------|------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| TRST_N | Ι    | Test Reset Input (Optional pin)<br>The $TRST_N$ pin is an active-low signal for asynchronous reset of the<br>TAP controller and instruction in the TAP module, independent of the<br>processor logic. The processor is not reset by the assertion of $TRST_N$ .<br>The core signal for this is called $EJ_TRST_N$<br>This signal is optional, but power-on reset must apply a low pulse on this<br>signal at power-on and then leave it high, in case the signal is not available<br>as a pin on the chip. If available on the chip, then it must be low on the<br>board when the EJTAG debug features are unused by the probe. |

#### Table 10.27 EJTAG Interface Pins (Continued)

## 10.4.2 Test Access Port Operation

The TAP controller is controlled by the Test Clock (*TCK*) and Test Mode Select (*TMS*) inputs. These two inputs determine whether an Instruction register scan or data register scan is performed. The TAP consists of a small controller, driven by the *TCK* input, which responds to the *TMS* input as shown in the state diagram in Figure 10.22. The TAP uses both clock edges of *TCK*. *TMS* and *TDI* are sampled on the rising edge of *TCK*, while *TDO* changes on the falling edge of *TCK*.

At power-up the TAP is forced into the *Test-Logic-Reset* by low value on *TRST\_N*. The TAP instruction register is thereby reset to IDCODE. No other parts of the EJTAG hardware are reset through the *Test-Logic-Reset* state.

When test access is required, a protocol is applied via the *TMS* and *TCK* inputs, causing the TAP to exit the *Test-Logic-Reset* state and move through the appropriate states. From the *Run-Test/Idle* state, an Instruction register scan or a data register scan can be issued to transition the TAP through the appropriate states shown in Figure 10.22.

The states of the data and instruction register scan blocks are mirror images of each other adding symmetry to the protocol sequences. The first action that occurs when either block is entered is a capture operation. For the data registers, the *Capture-DR* state is used to capture (or parallel load) the data into the selected serial data path. In the Instruction register, the *Capture-IR* state is used to capture status information into the Instruction register.

From the *Capture* states, the TAP transitions to either the *Shift* or *Exit1* states. Normally the *Shift* state follows the *Capture* states of that test data or status information can be shifted out for inspection and new data shifted in. Following the *Shift* state, the TAP either returns to the *Run-Test/Idle* state via the *Exit1* and *Update* states or enters the *Pause* state via *Exit1*. The reason for entering the *Pause* state is to temporarily suspend the shifting of data through either the Data or Instruction Register while a required operation, such as refilling a host memory buffer, is performed. From the Pause state shifting can resume by re-entering the *Shift* state via the *Exit2* state or terminate by entering the *Run-Test/Idle* states.

Upon entering the data or Instruction register scan blocks, shadow latches in the selected scan path are forced to hold their present state during the Capture and Shift operations. The data being shifted into the selected scan path is not output through the shadow latch until the TAP enters the *Update-DR* or *Update-IR* state. The *Update* state causes the shadow latches to update (or parallel load) with the new data that has been shifted into the selected scan path.



Figure 10.22 TAP Controller State Diagram

#### 10.4.2.1 Test-Logic-Reset State

In the *Test-Logic-Reset* state the boundary scan test logic is disabled. The test logic enters the *Test-Logic-Reset* state when the *TMS* input is held HIGH for at least five rising edges of *TCK*. The BYPASS instruction is forced into the instruction register output latches during this state. The controller remains in the *Test-Logic-Reset* state as long as *TMS* is HIGH.

#### 10.4.2.2 Run-Test/Idle State

The controller enters the *Run-Test/Idle* state between scan operations. The controller remains in this state as long as *TMS* is held LOW. The instruction register and all test data registers retain their previous state. The instruction cannot change when the TAP controller is in this state.

When TMS is sampled HIGH on the rising edge of TCK, the controller transitions to the Select\_DR state.

#### 10.4.2.3 Select\_DR\_Scan State

This is a temporary controller state in which all test data registers selected by the current instruction retain their previous state. If *TMS* is sampled LOW at the rising edge of *TCK*, then the controller transitions to the *Capture\_DR* state. A HIGH on *TMS* causes the controller to transition to the *Select\_IR* state. The instruction cannot change while the TAP controller is in this state.

#### 10.4.2.4 Select\_IR\_Scan State

This is a temporary controller state in which all test data registers selected by the current instruction retain their previous state. If *TMS* is sampled LOW on the rising edge of *TCK*, the controller transitions to the *Capture\_IR* state. A

HIGH on *TMS* causes the controller to transition to the *Test-Reset-Logic* state. The instruction cannot change while the TAP controller is in this state.

#### 10.4.2.5 Capture\_DR State

In this state the boundary scan register captures the value of the register addressed by the Instruction register, and the value is then shifted out in the *Shift\_DR*. If *TMS* is sampled LOW at the rising edge of *TCK*, the controller transitions to the *Shift\_DR* state. A HIGH on *TMS* causes the controller to transition to the *Exit1\_DR* state. The instruction cannot change while the TAP controller is in this state.

#### 10.4.2.6 Shift\_DR State

In this state the test data register connected between *TDI* and *TDO* as a result of the current instruction shifts data one stage toward its serial output on the rising edge of *TCK*. If *TMS* is sampled LOW on the rising edge of *TCK*, the controller remains in the *Shift\_DR* state. A HIGH on *TMS* causes the controller to transition to the *Exit1\_DR* state. The instruction cannot change while the TAP controller is in this state.

#### 10.4.2.7 Exit1\_DR State

This is a temporary controller state in which all test data registers selected by the current instruction retain their previous state. If *TMS* is sampled LOW at the rising edge of *TCK*, the controller transitions to the *Pause\_DR* state. A HIGH on *TMS* causes the controller to transition to the *Update\_DR* state which terminates the scanning process. The instruction cannot change while the TAP controller is in this state.

#### 10.4.2.8 Pause\_DR State

The *Pause\_DR* state allows the controller to temporarily halt the shifting of data through the test data register in the serial path between *TDI* and *TDO*. All test data registers selected by the current instruction retain their previous state. If *TMS* is sampled LOW on the rising edge of *TCK*, the controller remains in the *Pause\_DR* state. A HIGH on *TMS* causes the controller to transition to the *Exit2\_DR* state. The instruction cannot change while the TAP controller is in this state.

#### 10.4.2.9 Exit2\_DR State

This is a temporary controller state in which all test data registers selected by the current instruction retain their previous state. If *TMS* is sampled LOW at the rising edge of *TCK*, the controller transitions to the *Shift\_DR* state to allow another serial shift of data. A HIGH on *TMS* causes the controller to transition to the *Update\_DR* state which terminates the scanning process. The instruction cannot change while the TAP controller is in this state.

#### 10.4.2.10 Update\_DR State

When the TAP controller is in this state the value shifted in during the *Shift\_DR* state takes effect on the rising edge of the *TCK* for the register indicated by the Instruction register.

If *TMS* is sampled LOW at the rising edge of *TCK*, the controller transitions to the *Run-Test/Idle* state. A HIGH on *TMS* causes the controller to transition to the *Select\_DR\_Scan* state. The instruction cannot change while the TAP controller is in this state and all shift register stages in the test data registers selected by the current instruction retain their previous state.

#### 10.4.2.11 Capture\_IR State

In this state the shift register contained in the Instruction register loads a fixed pattern  $(00001_2)$  on the rising edge of *TCK*. The data registers selected by the current instruction retain their previous state.

If *TMS* is sampled LOW at the rising edge of *TCK*, the controller transitions to the *Shift\_IR* state. A HIGH on *TMS* causes the controller to transition to the *Exit1\_IR* state. The instruction cannot change while the TAP controller is in this state.

#### 10.4.2.12 Shift\_IR State

In this state the instruction register is connected between *TDI* and *TDO* and shifts data one stage toward its serial output on the rising edge of *TCK*. If *TMS* is sampled LOW at the rising edge of *TCK*, the controller remains in the *Shift\_IR* state. A HIGH on *TMS* causes the controller to transition to the *Exit1\_IR* state.

#### 10.4.2.13 Exit1\_IR State

This is a temporary controller state in which all registers retain their previous state. If *TMS* is sampled LOW at the rising edge of *TCK*, the controller transitions to the *Pause\_IR* state. A HIGH on *TMS* causes the controller to transition to the *Update\_IR* state which terminates the scanning process. The instruction cannot change while the TAP controller is in this state and the instruction register retains its previous state.

#### 10.4.2.14 Pause\_IR State

The *Pause\_IR* state allows the controller to temporarily halt the shifting of data through the instruction register in the serial path between *TDI* and *TDO*. If *TMS* is sampled LOW at the rising edge of *TCK*, the controller remains in the *Pause\_IR* state. A HIGH on *TMS* causes the controller to transition to the *Exit2\_IR* state. The instruction cannot change while the TAP controller is in this state.

#### 10.4.2.15 Exit2\_IR State

This is a temporary controller state in which the instruction register retains its previous state. If *TMS* is sampled LOW at the rising edge of *TCK*, then the controller transitions to the *Shift\_IR* state to allow another serial shift of data. A HIGH on *TMS* causes the controller to transition to the *Update\_IR* state which terminates the scanning process. The instruction cannot change while the TAP controller is in this state.

#### 10.4.2.16 Update\_IR State

The instruction shifted into the instruction register takes effect on the rising edge of TCK.

If *TMS* is sampled LOW at the rising edge of *TCK*, the controller transitions to the *Run-Test/Idle* state. A HIGH on *TMS* causes the controller to transition to the *Select\_DR\_Scan* state.

#### 10.4.3 Test Access Port (TAP) Instructions

The TAP Instruction register allows instructions to be serially input into the device when TAP controller is in the *Shift-IR* state. Instructions are decoded and define the serial test data register path that is used to shift data between *TDI* and *TDO* during data register scanning.

The Instruction register is a 5-bit register. In the current EJTAG implementation only some instructions have been decoded; the unused instructions default to the BYPASS instruction.

| Value | Instruction | Function                                                     |
|-------|-------------|--------------------------------------------------------------|
| 0x01  | IDCODE      | Select Chip Identification data register                     |
| 0x03  | IMPCODE     | Select Implementation register                               |
| 0x08  | ADDRESS     | Select Address register                                      |
| 0x09  | DATA        | Select Data register                                         |
| 0x0A  | CONTROL     | Select EJTAG Control register                                |
| 0x0B  | ALL         | Select the Address, Data, and EJTAG Control registers        |
| 0x0C  | EJTAGBOOT   | Set EjtagBrk, ProbEn, and ProbTrap to 1 as reset value       |
| 0x0D  | NORMALBOOT  | Set EjtagBrk, ProbEn, and ProbTrap to 0 as reset value       |
| 0x0E  | FASTDATA    | Selects the Data and Fastdata registers                      |
| 0x10  | TCBCONTROLA | Selects the TCBTCONTROLA register in the Trace Control Block |
| 0x11  | TCBCONTROLB | Selects the TCBTCONTROLB register in the Trace Control Block |
| 0x12  | TCBDATA     | Selects the TCBDATA register in the Trace Control Block      |
| 0x14  | PCSAMPLE    | Selects the PCsample register                                |
| 0x17  | FDC         | Selects Fast Debug Channel.                                  |
| 0x1F  | BYPASS      | Bypass mode                                                  |

**Table 10.28 Implemented EJTAG Instructions** 

#### 10.4.3.1 BYPASS Instruction

The required BYPASS instruction allows the processor to remain in a functional mode and selects the Bypass register to be connected between *TDI* and *TDO*. The BYPASS instruction allows serial data to be transferred through the processor from *TDI* to *TDO* without affecting its operation. The bit code of this instruction is defined to be all ones by the IEEE 1149.1 standard. Any unused instruction is defaulted to the BYPASS instruction.

#### 10.4.3.2 IDCODE Instruction

The IDCODE instruction allows the processor to remain in its functional mode and selects the Device Identification (ID) register to be connected between *TDI* and *TDO*. The Device ID register is a 32-bit shift register containing information regarding the IC manufacturer, device type, and version code. Accessing the Identification Register does not interfere with the operation of the processor. Also, access to the Identification Register is immediately available, via a TAP data scan operation, after power-up when the TAP has been reset with on-chip power-on or through the optional *TRST\_N* pin.

#### 10.4.3.3 IMPCODE Instruction

This instruction selects the Implementation register for output, which is always 32 bits.

#### 10.4.3.4 ADDRESS Instruction

This instruction is used to select the Address register to be connected between *TDI* and *TDO*. The EJTAG Probe shifts 32 bits through the *TDI* pin into the Address register and shifts out the captured address via the *TDO* pin.

#### 10.4.3.5 DATA Instruction

This instruction is used to select the Data register to be connected between *TDI* and *TDO*. The EJTAG Probe shifts 32 bits of *TDI* data into the Data register and shifts out the captured data via the *TDO* pin.

#### 10.4.3.6 CONTROL Instruction

This instruction is used to select the *EJTAG Control* register to be connected between *TDI* and *TDO*. The EJTAG Probe shifts 32 bits of *TDI* data into the *EJTAG Control* register and shifts out the *EJTAG Control* register bits via *TDO*.

#### 10.4.3.7 ALL Instruction

This instruction is used to select the concatenation of the Address and Data register, and the *EJTAG Control* register between *TDI* and *TDO*. It can be used in particular if switching instructions in the instruction register takes too many *TCK* cycles. The first bit shifted out is bit 0.

#### Figure 10.23 Concatenation of the EJTAG Address, Data and Control Registers



#### 10.4.3.8 EJTAGBOOT Instruction

When the EJTAGBOOT instruction is given and the Update-IR state is left, then the reset values of the ProbTrap, ProbEn and *EjtagBrk* bits in the *EJTAG Control* register are set to 1 after a hard or soft reset.

This EJTAGBOOT indication is effective until a NORMALBOOT instruction is given, *TRST\_N* is asserted or a rising edge of *TCK* occurs when the TAP controller is in Test-Logic-Reset state.

It is possible to make the CPU go into debug mode just after a hard or soft reset, without fetching or executing any instructions from the normal memory area. This can be used for download of code to a system which have no code in ROM.

The Bypass register is selected when the EJTAGBOOT instruction is given.

#### 10.4.3.9 NORMALBOOT Instruction

When the NORMALBOOT instruction is given and the Update-IR state is left, then the reset value of the ProbTrap, ProbEn and *EjtagBrk* bits in the *EJTAG Control* register are set to 0 after hard or soft reset.

The Bypass register is selected when the NORMALBOOT instruction is given.

#### 10.4.3.10 FASTDATA Instruction

This selects the Data and the Fastdata registers at once, as shown in Figure 10.24.

#### Figure 10.24 TDI to TDO Path When in Shift-DR State and FASTDATA Instruction is Selected



#### 10.4.3.11 PCsample Register (PCSAMPLE Instruction)

This selects the PCsample Register. The use of the PCsample Register is described in more detail in Section 10.9 "PC/Data Address Sampling"

#### 10.4.3.12 FDC Instruction

This selects the Fast Debug Channel. The use of the FDC is described in more detail in Section 10.10 "Fast Debug Channel".

#### 10.4.3.13 TCBCONTROLA Instruction

This instruction is used to select the TCBCONTROLA register to be connected between *TDI* and *TDO*. This register is only implemented if the Trace Control Block is present. If no TCB is present, then this instruction will select the Bypass register.

#### 10.4.3.14 TCBCONTROLB Instruction

This instruction is used to select the TCBCONTROLB register to be connected between *TDI* and *TDO*. This register is only implemented if the Trace Control Block is present. If no TCB is present, then this instruction will select the Bypass register.

#### 10.4.3.15 TCBDATA Instruction

This instruction is used to select the TCBDATA register to be connected between *TDI* and *TDO*. This register is only implemented if the Trace Control Block is present. If no TCB is present, then this instruction will select the Bypass register. It should be noted that the TCBDATA register is only an access register to other TCB registers. The width of the TCBDATA register is dependent on the specific TCB register.

## **10.5 EJTAG TAP Registers**

The EJTAG TAP Module has one Instruction register and a number of data registers, all accessible through the TAP:

### 10.5.1 Instruction Register

The Instruction register is accessed when the TAP receives an Instruction register scan protocol. During an Instruction register scan operation the TAP controller selects the output of the Instruction register to drive the *TDO* pin. The shift register consists of a series of bits arranged to form a single scan path between *TDI* and *TDO*. During an Instruction register scan operations, the TAP controls the register to capture status information and shift data from *TDI* to *TDO*. Both the capture and shift operations occur on the rising edge of *TCK*. However, the data shifted out from the *TDO* occurs on the falling edge of *TCK*. In the Test-Logic-Reset and *Capture-IR* state, the instruction shift register is set to  $00001_2$ , as for the IDCODE instruction. This forces the device into the functional mode and selects the Device ID register. The Instruction register is 5 bits wide. The instruction shifted in takes effect for the following data register scan operation. A list of the implemented instructions are listed in Table 10.28.

## 10.5.2 Data Registers Overview

The EJTAG uses several data registers, which are arranged in parallel from the primary *TDI* input to the primary *TDO* output. The Instruction register supplies the address that allows one of the data registers to be accessed during a data register scan operation. During a data register scan operation, the addressed scan register receives TAP control signals to capture the register and shift data from *TDI* to *TDO*. During a data register scan operation, the TAP selects the output of the data register to drive the *TDO* pin. The register is updated in the *Update-DR* state with respect to the write bits.

This description applies in general to the following data registers:

- Bypass Register
- Device Identification Register
- Implementation Register
- EJTAG Control Register (ECR)
- Processor Access Address Register
- Processor Access Data Register
- FastData Register

#### 10.5.2.1 Bypass Register

The *Bypass* register consists of a single scan register bit. When selected, the Bypass register provides a single bit scan path between *TDI* and *TDO*. The Bypass register allows abbreviating the scan path through devices that are not involved in the test. The Bypass register is selected when the Instruction register is loaded with a pattern of all ones to satisfy the IEEE 1149.1 Bypass instruction requirement.

#### 10.5.2.2 Device Identification (ID) Register

The *Device Identification* register is defined by IEEE 1149.1, to identify the device's manufacturer, part number, revision, and other device-specific information. Table 10.29 shows the bit assignments defined for the read-only Device Identification Register, and inputs to the core determine the value of these bits. These bits can be scanned out of the *ID* register after being selected. The register is selected when the Instruction register is loaded with the IDCODE instruction.

# 31 28 27 12 11 1 0 Version PartNumber ManufID R

#### Figure 10.25 Device Identification Register Format

| Field      | S      |                                                                                                                                                                      | Read/ |                     |
|------------|--------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------|---------------------|
| Name       | Bit(s) | Description                                                                                                                                                          | Write | Reset State         |
| Version    | 31:28  | <b>Version</b> (4 bits)<br>This field identifies the version number of the processor derivative.                                                                     | R     | EJ_Version[3:0]     |
| PartNumber | 27:12  | <b>Part Number</b> (16 bits)<br>This field identifies the part number of the processor<br>derivative.                                                                | R     | EJ_PartNumber[15:0] |
| ManufID    | 11:1   | Manufacturer Identity (11 bits)<br>Accordingly to IEEE 1149.1-1990, the manufacturer<br>identity code shall be a compressed form of the<br>JEDEC Publications 106-A. | R     | EJ_ManufID[10:0]    |
| R          | 0      | Reserved                                                                                                                                                             | R     | 1                   |

Table 10.29 Device Identification Register

#### 10.5.2.3 Implementation Register

This 32-bit read-only register is used to identify the features of the EJTAG implementation. Some of the reset values are set by inputs to the core. The register is selected when the Instruction register is loaded with the IMPCODE instruction.

|         |    |          | Figu         | re 10.26 | 5 I | mplementa | tion Reg | gist | er Form | at |          |   |
|---------|----|----------|--------------|----------|-----|-----------|----------|------|---------|----|----------|---|
| 31 2    | 9  | 28 25    | 24           | 23 2     | 1   | 20 17     | 16       | 15   | 14      | 13 |          | 0 |
| EJTAGve | er | Reserved | DINT-<br>sup | ASIDsiz  | e   | Reserved  | MIPS16   | 0    | NoDMA   |    | Reserved |   |

### **Table 10.30 Implementation Register Descriptions**

| Field    | S      |                                  |                                                           |        | Read/Wr |             |
|----------|--------|----------------------------------|-----------------------------------------------------------|--------|---------|-------------|
| Name     | Bit(s) |                                  | Description                                               |        | ite     | Reset State |
| EJTAGver | 31:29  | EJTAG Version.<br>2: Version 2.6 |                                                           |        | R       | 5           |
| Reserved | 28:25  | Reserved                         |                                                           |        | R       | 0           |
| DINTsup  | 24     |                                  | f the <i>DINT</i> signal from the probe is supp           | orted: | R       | EJ_DINTsup  |
|          |        | Encoding                         | Meaning                                                   |        |         |             |
|          |        | 0                                | <i>DINT</i> signal from the probe is not supported        |        |         |             |
|          |        | 1                                | Probe can use <i>DINT</i> signal to make debug interrupt. |        |         |             |

| Field    | ls     |                     |                           | Read/Wr |                            |  |
|----------|--------|---------------------|---------------------------|---------|----------------------------|--|
| Name     | Bit(s) |                     | Description               | ite     | Reset State                |  |
| ASIDsize | 23:21  | Size of ASID field  | in implementation:        | R       | TLB-based MMU              |  |
|          |        | Encoding            | Meaning                   |         | cores - 2;<br>FM-based MMU |  |
|          |        | 0                   | No ASID in implementation |         | cores - 0                  |  |
|          |        | 1                   | 6-bit ASID                | _       |                            |  |
|          |        | 2                   | 8-bit ASID                | _       |                            |  |
|          |        | 3                   | Reserved                  |         |                            |  |
| Reserved | 20:17  | Reserved            |                           | R       | 0                          |  |
| MIPS16   | 16     | Indicates whether I | MIPS16 is implemented:    | R       | 0                          |  |
|          |        | Encoding            | Meaning                   |         |                            |  |
|          |        | 0                   | No MIPS16 support         |         |                            |  |
|          |        | 1                   | MIPS16 implemented        |         |                            |  |
| Reserved | 15     | Reserved            |                           | R       | 0                          |  |
| NoDMA    | 14     | No EJTAG DMA S      | Support                   | R       | 1                          |  |
| Reserved | 13:0   | Reserved            |                           | R       | 0                          |  |

#### **Table 10.30 Implementation Register Descriptions**

#### 10.5.2.4 EJTAG Control Register

This 32-bit register controls the various operations of the TAP modules. This register is selected by shifting in the CONTROL instruction. Bits in the *EJTAG Control* register can be set/cleared by shifting in data; status is read by shifting out the contents of this register. This *EJTAG Control* register can only be accessed by the TAP interface.

The *EJTAG Control* register is not updated in the *Update-DR* state unless the Reset occurred (Rocc) bit 31, is either 0 or written to 0. This is in order to ensure prober handling of processor accesses.

The value used for reset indicated in the table below takes effect on both hard and soft CPU resets, but not on TAP controller resets by e.g. *TRST\_N. TCK* clock is not required when the hard or soft CPU reset occurs, but the bits are still updated to the reset value when the *TCK* applies. The first 5 *TCK* clocks after hard or soft CPU resets may result in reset of the bits, due to synchronization between clock domains.

| 0.4  | ~~~~~ | ~ ~ |      |      |        |      |       |     |       | •      |               |     | 10            |     |    | 0 0 |
|------|-------|-----|------|------|--------|------|-------|-----|-------|--------|---------------|-----|---------------|-----|----|-----|
|      |       |     |      |      |        |      |       |     |       |        | 14            |     |               |     |    |     |
| Rocc | Psz   | Res | Doze | Halt | PerRst | PRnW | PrAcc | Res | PrRst | ProbEn | Prob-<br>Trap | Res | Ejtag-<br>Brk | Res | DM | Rs  |

Figure 10.27 EJTAG Control Register Format

| Fields<br>Name Bit(s) |        |                                                                                           |                                                                |                                                                                                                                                                                   | Read/       |           |
|-----------------------|--------|-------------------------------------------------------------------------------------------|----------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------|-----------|
| Name                  | Bit(s) |                                                                                           | Des                                                            | Write                                                                                                                                                                             | Reset State |           |
| Rocc                  | 31     | Reset Occurre<br>The bit indica                                                           |                                                                | reset has occurred:                                                                                                                                                               | R/W         | 1         |
|                       |        | Encoding                                                                                  | 9                                                              | Meaning                                                                                                                                                                           |             |           |
|                       |        | 0                                                                                         | No reset o cleared.                                            | ccurred since bit last                                                                                                                                                            |             |           |
|                       |        | 1                                                                                         | Reset occu                                                     | urred since bit last cleared.                                                                                                                                                     |             |           |
|                       |        | applied.<br>This bit must<br>that the incide<br>The <i>EJTAG</i> (<br><i>Update-DR</i> st | be cleared b<br>ent was detec<br>Control regis<br>ate unless R | e 1 value as long as reset is<br>y the probe, to acknowledge<br>cted.<br>ster is not updated in the<br><i>OCC</i> is 0, or written to 0. This is<br>handling of processor access. |             |           |
| Psz[1:0]              | 30:29  | address bits o                                                                            | e used in con<br>f the Addres<br>r access trans                | bination with the lower two<br>s register to determine the size<br>saction. The bits are only valid                                                                               | R           | Undefined |
|                       |        | PAA[1:0]                                                                                  | Psz[1:0]                                                       |                                                                                                                                                                                   |             |           |
|                       |        | 00                                                                                        | 00                                                             | Byte (LE, byte 0; BE, byte 3)                                                                                                                                                     |             |           |
|                       |        | 01                                                                                        | 00                                                             | Byte (LE, byte 1; BE, byte 2)                                                                                                                                                     |             |           |
|                       |        | 10                                                                                        | 00                                                             | Byte (LE, byte 2; BE, byte 1)                                                                                                                                                     |             |           |
|                       |        | 11                                                                                        | 00                                                             | Byte (LE, byte 3; BE, byte 0)                                                                                                                                                     |             |           |
|                       |        | 00                                                                                        | 01                                                             | Halfword (LE, bytes 1:0;<br>BE, bytes 3:2)                                                                                                                                        |             |           |
|                       |        | 10                                                                                        | 01                                                             | Halfword (LE, bytes 3:2;<br>BE, bytes 1:0)                                                                                                                                        |             |           |
|                       |        | 00                                                                                        | 10                                                             | Word (LE, BE; bytes 3, 2, 1, 0)                                                                                                                                                   |             |           |
|                       |        | 00                                                                                        | 11                                                             | Triple (LE, bytes 2, 1, 0; BE,<br>bytes 3, 2,1)                                                                                                                                   |             |           |
|                       |        | 01                                                                                        | 11                                                             | Triple (LE, bytes 3, 2, 1; BE, bytes 2, 1, 0)                                                                                                                                     |             |           |
|                       |        | All o                                                                                     | others                                                         | Reserved                                                                                                                                                                          |             |           |
|                       |        | the byte numb                                                                             | ber in a 32-b<br>bits 23:16                                    | E=big endian, the byte# refers to<br>it register, where byte 3 = bits<br>; byte 1 = bits 15:8; byte 0=bits<br>endianess.                                                          |             |           |
| Res                   | 28:23  | Reserved                                                                                  |                                                                |                                                                                                                                                                                   | R           | 0         |

Table 10.31 EJTAG Control Register Descriptions

| Fields |                                                                                                                                                                             |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            | Read/ |             |
|--------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------|-------------|
| Name   | Bit(s)                                                                                                                                                                      | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | Write | Reset State |
| Doze   | 22                                                                                                                                                                          | Doze state<br>The Doze bit indicates any kind of low-power mode. The<br>value is sampled in the Capture-DR state of the TAP con-<br>troller:                                                                                                                                                                                                                                                                                                                                                                                               | R     | 0           |
|        |                                                                                                                                                                             | Encoding Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |       |             |
|        |                                                                                                                                                                             | 0 CPU not in low-power mode.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |       |             |
|        |                                                                                                                                                                             | 1 CPU is in low-power mode.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |       |             |
|        |                                                                                                                                                                             | Doze includes the Reduced Power (RP) and WAIT power-reduction modes.                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |       |             |
| Halt   | Halt 21 Halt state<br>The Halt bit indicates if the internal system bus clor<br>running or stopped. The value is sampled in the Cap<br>ture-DR state of the TAP controller: |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            | R     | 0           |
|        |                                                                                                                                                                             | Encoding Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |       |             |
|        |                                                                                                                                                                             | 0 Internal system clock is running                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |       |             |
|        |                                                                                                                                                                             | 1 Internal system clock is stopped                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |       |             |
| PerRst | 20                                                                                                                                                                          | Peripheral Reset<br>When the bit is set to 1, it is only guaranteed that the<br>peripheral reset has occurred in the system when the read<br>value of this bit is also 1. This is to ensure that the setting<br>from the <i>TCK</i> clock domain gets effect in the CPU clock<br>domain, and in peripherals.<br>When the bit is written to 0, then the bit must also be read<br>as 0 before it is guaranteed that the indication is cleared in<br>the CPU clock domain also.<br>This bit controls the <i>EJ_PerRst</i> signal on the core. | R/W   | 0           |
| PRnW   | 19                                                                                                                                                                          | Processor Access Read and Write<br>This bit indicates if the pending processor access is for a<br>read or write transaction, and the bit is only valid while<br><i>PrAcc</i> is set.                                                                                                                                                                                                                                                                                                                                                       | R     | Undefined   |
|        |                                                                                                                                                                             | Encoding Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |       |             |
|        |                                                                                                                                                                             | 0 Read transaction                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |       |             |
|        |                                                                                                                                                                             | 1 Write transaction                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |       |             |
|        |                                                                                                                                                                             |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |       |             |

## Table 10.31 EJTAG Control Register Descriptions (Continued)

| Fie   | Fields |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | Read/ |             |
|-------|--------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------|-------------|
| Name  | Bit(s) | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              | Write | Reset State |
| PrAcc | 18     | Description         Processor Access (PA)         Read value of this bit indicates if a Processor Access (PA)         to the EJTAG memory is pending:         Encoding Meaning         0       No pending processor access         1       Pending processor access         1       Pending processor access         The probe's software must clear this bit to 0 to indicate the end of the PA. Write of 1 is ignored.         A pending Processor Access is cleared when <i>Rocc</i> is set, but another PA may occur just after the reset if a debug exception occurs.         Finishing a Processor Access is not accepted while the <i>Rocc</i> bit is set. This is to avoid that a Processor Access occurring after the reset is finished due to indication of a Processor Access that occurred before the reset.         The FASTDATA access can clear this bit. |       | 0           |
| Res   | 17     | Reserved                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | R     | 0           |
| PrRst | 16     | Processor Reset (implementation-dependent behavior)<br>When the bit is set to 1, then it is only guaranteed that this<br>setting has taken effect in the system when the read value<br>of this bit is also 1. This is to ensure that the setting from<br>the <i>TCK</i> clock domain gets effect in the CPU clock<br>domain, and in peripherals.<br>When the bit is written to 0, then the bit must also be read<br>as 0 before it is guaranteed that the indication is cleared in<br>the CPU clock domain also.<br>This bit controls the <i>EJ_PrRst</i> signal. If the signal is<br>used in the system, then it must be ensured that both the<br>processor and all devices required for a reset are properly<br>reset. Otherwise the system may fail or hang. The bit<br>resets itself, since the <i>EJTAG Control</i> register is reset by<br>hard or soft reset.     | R/W   | 0           |

 Table 10.31 EJTAG Control Register Descriptions (Continued)

| Fields |        |                                                                                                                              | Read/ |                             |
|--------|--------|------------------------------------------------------------------------------------------------------------------------------|-------|-----------------------------|
| Name   | Bit(s) | Description                                                                                                                  | Write | Reset State                 |
| ProbEn | 15     | Probe EnableThis bit indicates to the CPU if the EJTAG memory is<br>handled by the probe so processor accesses are answered: | R/W   | 0 or 1<br>from<br>EJTAGBOOT |

## Table 10.31 EJTAG Control Register Descriptions (Continued)

| Fields   |        |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             | Read/ |                             |
|----------|--------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------|-----------------------------|
| Name     | Bit(s) | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | Write | Reset State                 |
| ProbTrap | 14     | Probe Trap<br>This bit controls the location of the debug exception vec-<br>tor:                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            | R/W   | 0 or 1<br>from<br>EJTAGBOOT |
|          |        | Encoding Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |       |                             |
|          |        | 0 In normal memory 0xBFC0.0480                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |       |                             |
|          |        | 1 In EJTAG memory at 0xFF20.0200 in dmseg                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |       |                             |
|          |        | Valid setting of the <i>ProbTrap</i> bit depends on the setting<br>of the <i>ProbEn</i> bit, as described for the <i>ProbEn</i> bit.<br>The <i>ProbTrap</i> should not be set to 1, for debug exception<br>vector in EJTAG memory, unless the <i>ProbEn</i> bit is also<br>set to 1 to indicate that the EJTAG memory may be<br>accessed.<br>The read value indicates the effective value to the CPU,<br>due to synchronization issues between <i>TCK</i> and CPU<br>clock domains; however, it is ensured that change of the<br><i>ProbTrap</i> bit prior to setting the <i>EjtagBrk</i> bit will have<br>effect for the <i>EjtagBrk</i> .<br>The reset value of the bit depends on whether the EJTAG-<br>BOOT indication is given or not: |       |                             |
|          |        | Encoding Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |       |                             |
|          |        | 0         Processor is in non-debug mode (No EJTAGBOOT indication given)           1         Processor is in debug mode (EJTAG-BOOT indication given)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |       |                             |
|          |        |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             | _     |                             |
| Res      | 13     | Reserved                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | R     | 0                           |
| EjtagBrk | 12     | EJTAG Break<br>Setting this bit to 1 causes a debug exception to the pro-<br>cessor, unless the CPU was in debug mode or another<br>debug exception occurred.<br>When the debug exception occurs, the processor core<br>clock is restarted if the CPU was in low-power mode. This<br>bit is cleared by hardware when the debug exception is<br>taken.<br>The reset value of the bit depends on whether the EJTAG-<br>BOOT indication is given or not:                                                                                                                                                                                                                                                                                       | R/W1  | 0 or 1<br>from<br>EJTAGBOOT |
|          |        | Encoding Meaning                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |       |                             |
|          |        | 0 Processor is in non-debug mode (No<br>EJTAGBOOT indication given)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |       |                             |
|          |        | 1 Processor is in debug mode (EJTAG-<br>BOOT indication given)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |       |                             |
| Res      | 11:4   | Reserved                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | R     | 0                           |

 Table 10.31 EJTAG Control Register Descriptions (Continued)

| Fields |        | S                             |                                         | Read/ |             |
|--------|--------|-------------------------------|-----------------------------------------|-------|-------------|
| Name   | Bit(s) | -                             | Description                             | Write | Reset State |
| DM     | 3      | Debug Mode<br>This bit indica | tes the debug or non-debug mode:        | R     | 0           |
|        |        | Encoding                      | Meaning                                 |       |             |
|        |        | 0                             | Processor is in non-debug mode          |       |             |
|        |        | 1                             | Processor is in debug mode              |       |             |
|        |        | The bit is samp controller.   | oled in the Capture-DR state of the TAP |       |             |
| Res    | 2:0    | Reserved                      |                                         | R     | 0           |

### Table 10.31 EJTAG Control Register Descriptions (Continued)

## 10.5.3 Processor Access Address Register

The Processor Access Address (*PAA*) register is used to provide the address of the processor access in the dmseg, and the register is only valid when a processor access is pending. The length of the Address register is 32 bits, and this register is selected by shifting in the ADDRESS instruction.

#### 10.5.3.1 Processor Access Data Register

The *Processor Access Data (PAD)* register is used to provide data value to and from a processor access. The length of the Data register is 32 bits, and this register is selected by shifting in the DATA instruction.

The register has the written value for a processor access write due to a CPU store to the dmseg, and the output from this register is only valid when a processor access write is pending. The register is used to provide the data value for processor access read due to a CPU load or fetch from the dmseg, and the register should only be updated with a new value when a processor access write is pending.

The *PAD* register is 32 bits wide. Data alignment is not used for this register, so the value in the *PAD* register matches data on the internal bus. The undefined bytes for a PA write are undefined, and for a *PAD* read then 0 (zero) must be shifted in for the unused bytes.

The organization of bytes in the *PAD* register depends on the endianess of the core, as shown in Figure 10.28. The endian mode for debug/kernel mode is determined by the state of the *SI\_Endian* input at power-up.



#### Figure 10.28 Endian Formats for the PAD Register

Least significant byte is at lowest address. Word is addressed by byte address of least significant byte.

The size of the transaction and thus the number of bytes available/required for the *PAD* register is determined by the Psz field in the *ECR*.

## 10.5.4 Fastdata Register (TAP Instruction FASTDATA)

The width of the *Fastdata* Register is 1 bit. During a Fastdata access, the *Fastdata* register is written and read, i.e., a bit is shifted in and a bit is shifted out. During a Fastdata access, the *Fastdata* register value shifted in specifies whether the Fastdata access should be completed or not. The value shifted out is a flag that indicates whether the Fastdata access was successful or not (if completion was requested).

#### Figure 10.29 Fastdata Register Format



| Fields |      |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             | Read/ | Power-up  |
|--------|------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------|-----------|
| Name   | Bits | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | Write | State     |
| SPrAce | 0    | Shifting in a zero value requests completion of the Fast-<br>data access. The <i>PrAcc</i> bit in the <i>EJTAG Control</i> regis-<br>ter is overwritten with zero when the access succeeds.<br>(The access succeeds if <i>PrAcc</i> is one and the operation<br>address is in the legal dmseg Fastdata area.) When suc-<br>cessful, a one is shifted out. Shifting out a zero indicates a<br>Fastdata access failure.<br>Shifting in a one does not complete the Fastdata access<br>and the <i>PrAcc</i> bit is unchanged. Shifting out a one indi-<br>cates that the access would have been successful if<br>allowed to complete and a zero indicates the access would<br>not have successfully completed. | R/W   | Undefined |

#### Table 10.32 Fastdata Register Field Description

The FASTDATA access is used for efficient block transfers between dmseg (on the probe) and target memory (on the processor). An "upload" is defined as a sequence of processor loads from target memory and stores to dmseg. A "download" is a sequence of processor loads from dmseg and stores to target memory. The "Fastdata area" specifies the legal range of dmseg addresses (0xFF20.0000 - 0xFF20.000F) that can be used for uploads and downloads. The Data + Fastdata registers (selected with the FASTDATA instruction) allow efficient completion of pending Fastdata area accesses.

During Fastdata uploads and downloads, the processor will stall on accesses to the Fastdata area. The *PrAcc* (processor access pending bit) will be 1 indicating the probe is required to complete the access. Both upload and download accesses are attempted by shifting in a zero *SPrAcc* value (to request access completion) and shifting out *SPrAcc* to see if the attempt will be successful (i.e., there was an access pending and a legal Fastdata area address was used). Downloads will also shift in the data to be used to satisfy the load from dmseg's Fastdata area, while uploads will shift out the data being stored to dmseg's Fastdata area.

As noted above, two conditions must be true for the Fastdata access to succeed. These are:

- *PrAcc* must be 1, i.e., there must be a pending processor access.
- The Fastdata operation must use a valid Fastdata area address in dmseg (0xFF20.0000 to 0xFF20.000F).

Table 10.33 shows the values of the PrAcc and SPrAcc bits and the results of a Fastdata access.

| Probe<br>Operation | Address<br>Match<br>Check | PrAcc in<br>the<br>Control<br>Register | LSB<br>(SPrAcc)<br>Shifted In | Action in the<br>Data Register | PrAcc<br>Changes<br>To | LSB<br>Shifted<br>Out | Data Shifted<br>Out        |
|--------------------|---------------------------|----------------------------------------|-------------------------------|--------------------------------|------------------------|-----------------------|----------------------------|
| Download using     | Fails                     | х                                      | х                             | none                           | unchanged              | 0                     | invalid                    |
| FASTDATA           | Passes                    | 1                                      | 1                             | none                           | unchanged              | 1                     | invalid                    |
|                    |                           | 1                                      | 0                             | write data                     | 0 (SPrAcc)             | 1                     | valid (previ-<br>ous) data |
|                    |                           | 0                                      | х                             | none                           | unchanged              | 0                     | invalid                    |
| Upload using       | Fails                     | х                                      | х                             | none                           | unchanged              | 0                     | invalid                    |
| FASTDATA           | Passes                    | 1                                      | 1                             | none                           | unchanged              | 1                     | invalid                    |
|                    |                           | 1                                      | 0                             | read data                      | 0 (SPrAcc)             | 1                     | valid data                 |
|                    |                           | 0                                      | Х                             | none                           | unchanged              | 0                     | invalid                    |

 Table 10.33 Operation of the FASTDATA access

There is no restriction on the contents of the Data register. It is expected that the transfer size is negotiated between the download/upload transfer code and the probe software. Note that the most efficient transfer size is a 32-bit word.

The Rocc bit of the Control register is not used for the FASTDATA operation.

## **10.6 TAP Processor Accesses**

The TAP modules support handling of fetches, loads and stores from the CPU through the dmseg segment, whereby the TAP module can operate like a *slave unit* connected to the on-chip bus. The core can then execute code taken from the EJTAG Probe and it can access data (via a load or store) which is located on the EJTAG Probe. This occurs

in a serial way through the EJTAG interface: the core can thus execute instructions e.g. debug monitor code, without occupying the memory.

Accessing the dmseg segment (EJTAG memory) can only occur when the processor accesses an address in the range from 0xFF20.0000 to 0xFF2F.FFFF, the *ProbEn* bit is set, and the processor is in debug mode (DM=1). In addition the LSNM bit in the CP0 Debug register controls transactions to/from the dmseg.

When a debug exception is taken, while the *ProbTrap* bit is set, the processor will start fetching instructions from address 0xFF20.0200.

A pending processor access can only finish if the probe writes 0 to *PrAcc* or by a soft or hard reset.

## 10.6.1 Fetch/Load and Store from/to EJTAG Probe Through dmseg

- 1. The internal hardware latches the requested address into the PA Address register (in case of the Debug exception: 0xFF20.0200).
- The internal hardware sets the following bits in the *EJTAG Control* register: *PrAcc* = 1 (selects Processor Access operation) *PRnW* = 0 (selects processor read operation) *Psz[1:0]* = value depending on the transfer size
- 3. The EJTAG Probe selects the *EJTAG Control* register, shifts out this control register's data and tests the *PrAcc* status bit (Processor Access): when the *PrAcc* bit is found 1, it means that the requested address is available and can be shifted out.
- 4. The EJTAG Probe checks the *PRnW* bit to determine the required access.
- 5. The EJTAG Probe selects the PA Address register and shifts out the requested address.
- 6. The EJTAG Probe selects the PA Data register and shifts in the instruction corresponding to this address.
- 7. The EJTAG Probe selects the *EJTAG Control* register and shifts a PrAcc = 0 bit into this register to indicate to the processor that the instruction is available.
- 8. The instruction becomes available in the instruction register and the processor starts executing.
- 9. The processor increments the program counter and outputs an instruction read request for the next instruction. This starts the whole sequence again.

Using the same protocol, the processor can also execute a load instruction to access the EJTAG Probe's memory. For this to happen, the processor must execute a load instruction (e.g. a LW, LH, LB) with the target address in the appropriate range.

Almost the same protocol is used to execute a store instruction to the EJTAG Probe's memory through dmseg. The store address must be in the range: 0xFF20.0000 to 0xFF2F.FFFF, the *ProbEn* bit must be set and the processor has to be in debug mode (*DM*=1). The sequence of actions is found below:

- 1. The internal hardware latches the requested address into the PA Address register
- 2. The internal hardware latches the data to be written into the PA Data register.

- The internal hardware sets the following bits in the *EJTAG Control* register: *PrAcc* = 1 (selects Processor Access operation) *PRnW* = 1 (selects processor write operation) *Psz*[1:0] = value depending on the transfer size
- 4. The EJTAG Probe selects the *EJTAG Control* register, shifts out this control register's data and tests the *PrAcc* status bit (Processor Access): when the *PrAcc* bit is found 1, it means that the requested address is available and can be shifted out.
- 5. The EJTAG Probe checks the *PRnW* bit to determine the required access.
- 6. The EJTAG Probe selects the PA Address register and shifts out the requested address.
- 7. The EJTAG Probe selects the PA Data register and shifts out the data to be written.
- 8. The EJTAG Probe selects the *EJTAG Control* register and shifts a *PrAcc* = 0 bit into this register to indicate to the processor that the write access is finished.
- 9. The EJTAG Probe writes the data to the requested address in its memory.
- 10. The processor detects that PrAcc bit = 0, which means that it is ready to handle a new access.

The above examples imply that no reset occurs during the operations, and that *Rocc* is cleared.

Note: Probe accesses and external bus accesses are serialized by the core. A probe access will not begin until all external bus requests have completed. Similarly, a new probe or external bus access will not begin until a pending probe access has completed.

## 10.7 SecureDebug

For security reasons, users can optionally disable certain EJTAG capabilities via the SecureDebug feature in order to prevent untrusted access to the core through debug mode.

### 10.7.1 Disabling EJTAG Debugging

#### 10.7.1.1 EJ\_DisableProbeDebug Signal

An input signal to the core is defined, *EJ\_DisableProbeDebug*, which when asserted, forces *ProbEn=0* and *ProbTrap=0*. *EJ\_DisableProbeDebug* overrides any other *ProbEn* or *ProbTrap* settings.

Suggested implementation of the *EJ\_DisableProbeDebug* signal is for a microcontroller to provide a bit within non-volatile memory (outside the core) that is pre-programmed to set or clear this control signal.

| Signal               | Description                                                                                                                                                                                                                            | Direction | Compliance                  |
|----------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------|-----------------------------|
| EJ_DisableProbeDebug | <ul> <li>When asserted:</li> <li>ProbEn = 0</li> <li>ProbTrap = 0</li> <li>EjtagBrk is disabled<sup>1</sup>.</li> <li>EJTAGBOOT is disabled.</li> <li>PC Sampling is disabled.</li> <li>DINT signal is ignored<sup>1</sup>.</li> </ul> | Input     | Required for<br>SecureDebug |

Table 10.34 EJ\_DisableProbeDebug Signal Overview

1. An override is provided.

#### 10.7.1.2 Override for EjtagBrk and DINT Disable

An override for the *EjtagBrk* and *DINT* disable caused by the *EJ\_DisableProbeDebug* signal is provided by the Memory Protection Unit (MPU) *Config* register field *EjtagBrk\_Override*. This override feature is only available if the Memory Protection Unit is implemented.

The override can be asserted by the CPU during the trusted boot process. Its purpose is to allow a probe to assert *EjtagBrk* or the assertion of the *DINT* signal, which requests a Debug Interrupt exception, thus providing a means of recovering the CPU from a crash or hang. This feature allows a Debug Executive, if one is provided in target firmware, to communicate with the probe over the Fast Debug Channel (FDC) in order to get the attention of the target by causing a debug exception. It also allows a host-based debugger to query the target via Debug Executive commands to determine the cause of the hang.

## 10.7.2 EJTAG Features Unmodified by SecureDebug

SecureDebug will not modify the following EJTAG features:

- FDC (Fast Debug Channel) over EJTAG. This is required to provide a path for an EJTAG probe to send and receive messages via the Debug Executive when one is included in the target code. The physical EJTAG serial connection, pins, and protocol must function correctly as well as a cJTAG (2-wire) connection for FDC.
- *RST*\* signal. This is the hardware signal on the EJTAG connector that connects to the target system reset circuit. It can be asserted by an EJTAG probe.

## **10.8 iFlowtrace™ Mechanism**

There is only one optional trace mechanism that is available to extract additional information about program execution. iFlowtrace is a light-weight instruction-only tracing scheme that is sufficient to reconstruct the execution flow in the core and it can only be controlled by debug software. This tracing scheme has been kept very simple to minimize the impact on die size.

The iFlowtrace tracing scheme is not a strict subset of the PDtrace tracing methodology, and its trace format outputs differ from those of PDtrace. Trace formats, using simplified instruction state descriptors, were designed for the iFlowtrace trace to simplify the trace mechanism and to obtain better compression.

Tracing is disabled if the processor enters Debug Mode (refer to the EJTAG specification for description of Debug Mode). This is true for both Normal Trace Mode as well as Special Trace Mode.

The presence of the iFlowtrace mechanism is indicated by the CPO Config3<sub>1TL</sub> register bit.

## 10.8.1 A Simple Instruction-Only Tracing Scheme

A trace methodology can often be mostly defined by its inputs and outputs. Hence this basic scheme is described by the inputs to the core tracing logic and by the trace output format from the core. We assume here that the execution flow of the program is traced at the end of the execution path in the core similar to PDtrace.

#### 10.8.1.1 Trace Inputs

- 1. *In\_TraceOn*: when on, legal trace words are coming from the core and at the point when it is turned on, that is for the first traced instruction, a full PC value is output. When off, it cannot be assumed that legal trace words are available at the core interface.
- 2. *In\_Stall*: This says, stall the processor to avoid buffer overflow that can lose trace information. When off, a buffer overflow will simply throw away trace data and start over again. When on, the processor is signalled from the tracing logic to stall until the buffer is sufficiently drained and then the pipeline is restarted.

#### 10.8.1.2 Normal Trace Mode Outputs

- 1. Stall cycles in the pipe are ignored by the tracing logic and are not traced. This is indicated by the signal *Out\_Valid* that is turned off when no valid instruction is being traced. When *Out\_Valid* is asserted, instructions are traced out as described in the rest of this section. The traced instruction PC is a virtual address.
- 2. In the output format, every sequentially executed instruction is traced as 1'b0.
- 3. Every instruction that is not sequential to the previous one is traced as either a 10 or an 11 (read this as a serial bitstream from left to right). This implies that the target instruction of a branch or jump is traced this way, not the actual branch or jump instruction (this is similar to PDtrace):
- 4. A 10 instruction implies a taken branch for a conditional branch instruction whose condition is unpredictable statically, but whose branch target can be computed statically and hence the new PC does not need to be traced out. Note that if this branch was not taken, it would have been indicated by a 0 bit, that is sequential flow.
- 5. A 11 instruction implies a taken branch for an indirect jump-like instruction whose branch target could not be computed statically and hence the taken branch address is now given in the trace. This includes, for example, instructions like jr, jalr, and interrupts:
  - 11 00 followed by 8 bits of 1-bit shifted offset from the last PC. The bit assignments of this format on the bus between the core tracing logic and the ITCB is:

[3:0] = 4'b0011 [11:4] = PCdelta[8:1]

• 11 01 - followed by 16 bits of 1-bit shifted offset from the last PC. The bit assignments of this format on the bus between the core tracing logic and the ITCB is:

[3:0] = 4'b1011 [19:4] = PCdelta[16:1]

٠

11 10 - followed by 31 of the most significant bits of the PC value, followed by a bit (NCC) that indicates no code compression. Note that for a MIPS32 or MIPS64 instruction, NCC=1, and for microMIPS instruction NCC=0. This trace record will appear at all transition points between MIPS32/MIPS64 and microMIPS instruction execution.

This form is also a special case of the 11 format and it is used when the instruction is not a branch or jump, but nevertheless the full PC value needs to be reconstructed. This is used for synchronization purposes, similar to the Sync in PDtrace. In iFlowtrace rev 2.0 onwards, the sync period is user-defined, and is counted down and when an internal counter runs through all the values, this format is used. The bit assignments of this format on the bus between the core tracing logic and the ITCB is:

[3:0] = 4'b0111 [34:4] = PC[31:1] [35] = NCC

11 11 - Used to indicate trace resumption after a discontinuity occurred. The next format is a 1110 that sends a full PC value. A discontinuity might happen due to various reasons, for example, an internal buffer overflow, and at trace-on/trace-off trigger action.

## 10.8.2 Special Trace Modes

iFlowtrace 2.0 adds special trace modes which can only be active when the normal tracing mode is disabled. Software can determine which modes are supported by attempting to write the enable bits in the *IFCTL* register. Software can check the Illegal bit in the *IFCTL* register—if an unsupported combination of modes is requested, the bit will be set and the trace contents will be unpredictable. The special trace modes are described below.

#### 10.8.2.1 Mode Descriptions

#### Delta Cycle Mode

This mode is specified in combination with the other special trace modes. It is enabled via the CYC bit in the Control/Status Register. When delta cycle reporting is enabled, each trace message will include a 10b delta cycle value which reports the number of cycles that have elapsed since the last message was generated. A value of 0 indicates that the two messages were generated in the same cycle. A value of 1 indicates that they were generated in consecutive cycles. If 1023 cycles elapse without an event being traced, a counter rollover message is generated.

Note: If the processor clocks stop due to execution of the WAIT instruction, the delta cycle counter will also stop and will report 'active' cycles between events rather than 'total' cycles.

#### Breakpoint Match Mode

This modes uses EJTAG data and instruction breakpoint hardware to enable a trace of PC values. Instead of starting or stopping trace, a triggerpoint will cause a single breakpoint match trace record. This record indicates that there was a triggerpoint match, the breakpoint ID of the matching breakpoint, and the PC value of an instruction that matched the instruction of data breakpoint. This mode can only be used when normal tracing mode is turned off. This mode can not be used in conjunction with other special trace modes. This mode is enabled or disabled via the BM field in the Control/Status register (see Section 10.8.6 "ITCB Register Interface for Software Configurability").

The breakpoints used in this mode must have the TE bet set to enable the match condition.

Software should avoid setting up overlapping breakpoints. The behavior when multiple matches occur on the same instruction is to report a BreakpointID of 7.

#### Filtered Data Tracing Mode

This mode uses EJTAG data breakpoint hardware to enable a trace of data values. Rather than starting or stopping trace as in normal trace mode, a data triggerpoint will cause a filtered data trace record. This record indicates that there was a data triggerpoint match, the breakpoint ID of the matching breakpoint, whether it was a load or store, the size of the request, low order address bits, and the data value. This mode can only be used when normal tracing mode is turned off. This mode can not be used in conjunction with other special trace modes. This mode can be enabled or disabled via the FDT bit in the Control/Status register (see Section 10.8.6 "ITCB Register Interface for Software Configurability").

The corresponding data breakpoint must have the TE bit set to enable the match condition.

Software should avoid setting up overlapping data breakpoints. The behavior when multiple matches on one load or store are detected is to report a BreakpointID of 7.

#### Extended Filtered Data Tracing Mode

Extends Filtered Data Tracing Mode by adding the virtual address of the load/store instruction to the generated trace information. (see Section "Filtered Data Tracing Mode" above).

This behavior is enabled/disabled by the FDT\_CAUSE field in the *IFCTL* Control/Status register (see Section 10.8.6 "ITCB Register Interface for Software Configurability"). FDT\_CAUSE only has effect if the FDT field is also set.

The extended trace sequence is a FDT trace message followed by the Breakpoint Match (BM) trace message. If the  $IFCTL_{CYC}$  field is set, the FDT trace message will have a DeltaCycle Message value of '0' directly followed by the Breakpoint Match message. This message sequence (FDT, delta cycle of 0, and BM) indicates to the trace disassembler that Extended Filtered Data Tracing mode is enabled ( $IFCTL_{FDT}$  CAUSE=1).

#### Function Call/Return and Exception Tracing Mode

In this mode, the PC value of function calls and returns and/or exceptions and returns are traced out. This mode can only be used when normal tracing mode is turned off. This mode cannot be used in conjunction with other special trace modes. The function call/return and exception/return are independently enabled or disabled via the FCR and ER bits in the Control//Status register (see Section 10.8.6 "ITCB Register Interface for Software Configurability").

These events are reported for the following instructions:

- MIPS32 function calls: JAL, JALR, JALR.HB, JALX
- microMIPS function calls: JAL, JALR, JALR.HB, JALX, JALR16, JALRS16, JALRS, JALRS.HB, JALS
- MIPS32 function returns: JR, JR.HB
- microMIPS function returns: JR, JR.HB, JRC, JRADDIUSP, JR16
- Exceptions: Reported on the first instruction of the exception handler
- Exception returns: ERET

• MCU ASE Interrupt returns: IRET

#### Other Trace Messages

In any of the special trace modes, it is possible to embed messages into the trace stream directly from a program. This is done by writing to the *UserTraceData1* or *UseTraceData2* Cop0 registers. When *UserTraceData1* register is written, a trace message of type "User Triggered Message 1" (UTM1) is generated. When *UserTraceData2* register is written, a trace message of type "User Triggered Message 2" (UTM2) is generated. Please refer to 6.2.43 "User Trace Data1 Register (CP0 Register 23, Select 3)/User Trace Data2 Register (CP0 Register 24, Select 3)" on page 197.

Overflow messages can also be generated when tracing off-chip if the IO control bit is 0 and trace data is generated faster than it is consumed. No overflow will be generated when using on-chip trace.

#### 10.8.2.2 Special Trace Mode Outputs

The normal and special trace modes cannot be enabled at the same time because the trace message encoding is not unique between the two modes. The software reading the trace stream must be aware of which mode is selected to know how to interpret the bits in the trace stream. The message types for each type of special trace message are unique.

- 00 (as above, read a bitstream from left to right) Delta Cycle Rollover message. The output format is: [1:0] = 2'b00
- 010 User Trace Message. The format of this type of message is: [2:0] = 3'b010
   [34:3] = Data[31:0]
   [35] = UTM2/UTM1 (1=UTM2, 0=UTM1)
   [44:36] = DeltaCycle (if enabled)
- 011 Reserved
- 10 Breakpoint Match Message. The output format during this trace mode is:
  [1:0] = 2'b01
  [5:2] = BreakpointID
  [6] = Instruction Breakpoint
  [37:7] = MatchingPC[31:1]
  [38] = NCC
  [48:39] = DeltaCycle (if enabled)
  Note that for a MIPS32 or MIPS64 instruction, NCC=1, and for microMIPS instruction NCC=0.
- 110 Filtered Data Message. The output format during this trace mode is:
  [2:0] = 3'b011
  [6:3] = BreakpointID
  [7] = Load/Store (1=Load, 0=Store)
  [8] = FullWord (1=32b data, 0=<32b)</li>
  [14:5] = Addr[7:2]
  [46:15] = {32b data value} OR
  [46:15] = {BE[3:0], 4'b0, 24b data value} OR
  [46:15] = {BE[3:0], 12'b0, 16b data value} OR
  [46:15] = {BE[3:0], 20'b0, 8b data value}
  [56:47] = DeltaCycle (if enabled)

1110 - Function Call/Return/Exception Tracing. The output format during this trace mode is:
[3:0] = 4'b0111
[4] = FC
[5] = Ex
[6] = R
[37:8] = PC[31:1]
[38] = NCC
[48:39] = Delta Cycle (if enabled)
Note that for a MIPS32 or MIPS64 instruction, NCC=1, and for microMIPS instruction NCC=0. FC=1 implies a function call, Ex=1 implies the start of an exception handler, and R=1 implies a function or exception return.

1111- Overflow message. The format of this type of message is:
 [3:0] = 4'b1111

#### 10.8.3 ITCB Overview

٠

The iFlowtrace Control Block (ITCB) is responsible for accepting trace signals from the CPU core, formatting them, and storing them into an on-chip FIFO. The figure also shows the Probe Interface Block (PIB) which reads the FIFO and outputs the memory contents through a narrow off-chip trace port.



Figure 10.30 Trace Logic Overview

#### 10.8.4 ITCB iFlowtrace Interface

The iFlowtrace interface consists of 57 data signals plus a valid signal. The 57 data signals encode information about what the CPU is doing in each clock cycle. Valid indicates that the CPU is executing an instruction in this cycle and

therefore the 57 data signals carry valid execution information. The iFlowtrace data bus is encoded as shown in Table 10.35. Note that all the non-defined upper bits of the bus are zeroes.

| Valid | Data (LSBs)                                            | Description                                                                                                                                                                                                                                                  |
|-------|--------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 0     | Х                                                      | No instructions executed in this cycle                                                                                                                                                                                                                       |
| 1     | 0                                                      | Normal Mode: Sequential instruction executed                                                                                                                                                                                                                 |
| 1     | 01                                                     | Normal Mode: Branch executed, destination predictable from code                                                                                                                                                                                              |
| 1     | <8>0011                                                | Normal Mode: Discontinuous instruction executed, PC offset is 8 bit signed offset                                                                                                                                                                            |
| 1     | <16>1011                                               | Normal Mode: Discontinuous instruction executed, PC offset is 16 bit signed off-<br>set                                                                                                                                                                      |
| 1     | <ncc>&lt;31&gt;0111</ncc>                              | Normal Mode: Discontinuous instruction or synchronization record, No Code<br>Compression (NCC) bit included as well as 31 MSBs of the PC value                                                                                                               |
| 1     | 00                                                     | Special Mode: Delta Cycle Rollover message                                                                                                                                                                                                                   |
| 1     | <10><32>010                                            | Special Mode: User add-in Trace Message. 32 bit user data as well as 10 bit delta cycle if enabled.                                                                                                                                                          |
| 1     | <10> <ncc>&lt;31&gt;&lt;1&gt;&lt;4&gt;01</ncc>         | Special Mode: Breakpoint Match Message. 4-bit breakpoint ID, 1 bit indicates breakpoint type, 31 MSBs of the PC value, NCC bit included as well as 10-bit delta cycle if enable.                                                                             |
| 1     | <10><32><6><1><1><4>011                                | Special Mode: Filtered Data Message. 4 bit breakpoint ID, 1 bit load or store indi-<br>cation, 1 bit full word indication, 6 bit of addr[7:2], 32 bit of the data information<br>included as well as 10 bit delta cycle if enabled.                          |
| 1     | <10> <ncc>&lt;31&gt;<r><ex><fc>011</fc></ex></r></ncc> | Special Mode: Function Call/Return/Exception Tracing. 1 bit function call indica-<br>tion, 1 bit exception indication, 1 bit function or exception return indication, 31<br>MSBs of the PC value, NCC bit included as well as 10 bit delta cycle if enabled. |
| 1     | 1111                                                   | Internal overflow                                                                                                                                                                                                                                            |

#### Table 10.35 Data Bus Encoding

## 10.8.5 TCB Storage Representation

Records from iFlowtrace are inserted into a memory stream exactly as they appear in the iFlowtrace data output. Records are concatenated into a continuous stream starting at the LSB. When a trace word is filled, it is written to memory along with some tag bits. Each record consists of a 64-bit word, which comprises 58 message bits and 6 tag bits or header bits that clarify information about the message in that word.

The ITCB includes a 58-bit shift register to accumulate trace messages. When 58 or more bits are accumulated, the 58 bits and 6 tag bits are sent to the memory write interface. Messages may span a trace word boundary; in this case, the 6 tag bits indicate the bit number of the first full trace message in the 58-bit data field.

The tag bits are slightly encoded so they can serve a secondary purpose of indicating to off-chip trace hardware when a valid trace word transmission begins. The encoding ensures that at least one of the 4 LSBs of the tag is always a 1 for a valid trace message. The tag values are shown in Table 10.36. The longest trace message is 57 bits (filtered data trace in special trace mode with delta cycle), so the starting position indicated by the tag bits is always between 0 and 56.

| Starting Bit of First Full | Encoding  |
|----------------------------|-----------|
| Trace Message              | (decimal) |
| 0                          | 58        |

| Table | 10.36 | Tag Bit | Encoding |
|-------|-------|---------|----------|
|-------|-------|---------|----------|

| Starting Bit of First Full<br>Trace Message | Encoding<br>(decimal) |
|---------------------------------------------|-----------------------|
| 16                                          | 59                    |
| 32                                          | 60                    |
| 48                                          | 61                    |
| Unused                                      | 0,16,32,48            |
| Reserved                                    | 62,63                 |
| Others                                      | StartingBit           |

## Table 10.36 Tag Bit Encoding

When trace stops (ON set to zero), any partially filled trace words are written to memory. Any unused space above the final message is filled with 1's. The decoder distinguishes 1111 patterns used for fill in this position from an 1111 overflow message by recognizing that it is the last trace word.

These trace formats are written to a trace memory that is either on-chip or off-chip. No particular size of SRAM is specified; the size is user selectable based on the application needs and area trade-offs. Each trace word can typically store about 20 to 30 instructions in normal trace mode, so a 1 KWord trace memory could store the history of 20K to 30K executed instructions.

The on-chip SRAM or trace memory is written continuously as a circular buffer. It is accessible via drseg address mapped registers. There are registers for the read pointer, write pointer, and trace word. The write pointer register includes a wrap bit that indicates that the pointer has wrapped since the last time the register was written. Before starting trace, the write pointer would typically be set to 0. To read the trace memory, the read pointer should be set to 0 if there has not been a wrap, or to the value of the write pointer if there has been. Reading the trace word register will read the entry pointed to by the read pointer and will automatically increment the read pointer. Software can continue reading until all valid entries have been read out.

## 10.8.6 ITCB Register Interface for Software Configurability

The ITCB includes a drseg memory interface to allow software to set up tracing and read the current status. If an on-chip trace buffer is also implemented, there are additional registers included for accessing it.

## 10.8.6.1 iFlowtrace Control/Status (IFCTL) Register (offset 0x3fc0)

The Control/Status register provides the mechanism for turning on the different trace modes. Figure 10.31 has the format of the register and Table 10.37 describes the register fields.

| 31      | 30 | 1 | 6 15      | 14  | 13  | 12 | 11 | 10  | 9   | 8   | 5 | 4     | 3   | 2 | 1  | 0  |
|---------|----|---|-----------|-----|-----|----|----|-----|-----|-----|---|-------|-----|---|----|----|
| Illegal |    | 0 | FDT_CAUSE | СҮС | FDT | BM | ER | FCR | EST | SyP |   | OfClk | OfC | Ю | En | On |

#### Figure 10.31 Control/Status Register

| Fields    |      |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | Read / | Reset |                                        |  |
|-----------|------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------|-------|----------------------------------------|--|
| Name      | Bits | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | Write  | State | Compliance                             |  |
| 0 30:16   |      | Reserved for future use. Read as zeros, must be written as zeros                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | R      | 0     | Required                               |  |
| Illegal   | 31   | This bit is set by hardware and indicates if the currently<br>enabled trace output modes are an illegal combination. A<br>value of 1 indicates an unsupported setting. A value of 0<br>indicates that the currently selected settings are legal.                                                                                                                                                                                                                                                                                                                               | R      | 0     | Required                               |  |
| FDT_CAUSE | 15   | Extended Filtered Data Trace mode (FDT). Adds causing<br>load/store virtual address to Filtered Data Trace.<br>FDT_CAUSE only has effect if FDT is set.<br>The extended trace sequence is a FDT trace message fol-<br>lowed by the Breakpoint Match (BM) trace message. If<br>CYC is set, the FDT trace message will have a DeltaCycle<br>Message value of '0' directly followed by the Breakpoint<br>match (BM) message. This message sequence (FDT, delta<br>cycle of 0, and BM) indicates to the trace disassembler<br>that Extended Filtered Data Tracing mode is enabled. | R/W    | 0     | Optional for<br>iFlowtrace rev<br>2.0+ |  |
| СҮС       | 14   | Delta Cycle Mode: This mode can be set in combination<br>with the EST special trace modes. When set, a delta cycle<br>value is included in each of the trace messages and indi-<br>cates the number of cycles since the last message was gen-<br>erated. If this tracing mode is not implemented, the field is<br>read-only and read as zero.                                                                                                                                                                                                                                  | R/W    | 0     | Optional for<br>iFlowtrace rev<br>2.0+ |  |
| FDT       | 13   | Filtered Data Trace mode. If set, on a data breakpoint<br>match, the data value of the matching breakpoint is traced.<br>Normal tracing is inhibited when this mode is active. If<br>this tracing mode is not implemented, the field is<br>read-only and read as zero.                                                                                                                                                                                                                                                                                                         | R/W    | 0     | Optional for<br>iFlowtrace rev<br>2.0+ |  |
| BM        | 12   | Breakpoint Match. If set, only instructions that match<br>instruction or data breakpoints are traced. Normal tracing<br>is inhibited when this mode is active. If this tracing mode<br>is not implemented, the field is read-only and read as zero.                                                                                                                                                                                                                                                                                                                            | R/W    | 0     | Optional for<br>iFlowtrace rev<br>2.0+ |  |
| ER        | 11   | Trace exceptions and exception returns. If set, trace<br>includes markers for exceptions and exception returns.<br>Can be used in conjunction with the FCR bit. Inhibits nor-<br>mal tracing. If this tracing mode is not implemented, the<br>field is read-only and read as zero.                                                                                                                                                                                                                                                                                             | R/W    | 0     | Optional for<br>iFlowtrace rev<br>2.0+ |  |
| FCR 10    |      | Trace Function Calls and Returns. If set, trace includes<br>markers for function calls and returns. Can be used in con-<br>junction with the ER bit. If this tracing mode is not imple-<br>mented, the field is read-only and read as zero.                                                                                                                                                                                                                                                                                                                                    | R/W    | 0     | Optional for<br>iFlowtrace rev<br>2.0+ |  |
| EST       | 9    | Enable Special Tracing Modes. If set, normal tracing is<br>inhibited, allowing the user to choose one of several spe-<br>cial tracing modes. Setting this bit inhibits normal trace<br>mode. If no special tracing modes are implemented, this<br>field is read-only, and read as zero.                                                                                                                                                                                                                                                                                        | R/W    | 0     | Optional for<br>iFlowtrace rev<br>2.0+ |  |
| SyP       | 8:5  | Synchronization Period. The synchronization period is set to $2^{(SyP+8)}$ instructions. Thus a value of 0x0 implies 256 instructions, and a value of 0xF implies 8M instructions.                                                                                                                                                                                                                                                                                                                                                                                             | R/W    | 0     | Required for<br>iFlowtrace rev<br>2.0+ |  |

Table 10.37 Control/Status Register Field Descriptions

| Fields |                                                                                                                                                                                                                                                                                     |                                                                                                                                                                                                                                                                                                                                                | Read /         | Reset  |            |  |
|--------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------|--------|------------|--|
| Name   | Bits                                                                                                                                                                                                                                                                                | Description                                                                                                                                                                                                                                                                                                                                    | Write          | State  | Compliance |  |
| OfClk  | 4 Controls the Off-chip clock ratio. When the bit is set, this<br>implies 1:2, that is, the trace clock is running at 1/2 the<br>core clock, and when the bit is clear, implies 1:4 ratio, that<br>is, the trace clock is at 1/4 the core clock. Ignored unless<br>OfC is also set. |                                                                                                                                                                                                                                                                                                                                                | R/W            | 0      | Required   |  |
| OfC    | 3                                                                                                                                                                                                                                                                                   | Off-chip. 1 enables the PIB (if present) to unload the trace<br>memory. 0 disables the PIB and would be used when<br>on-chip storage is desired or if a PIB is not present. This<br>bit is settable only if the design supports both on-chip and<br>off-chip modes. Otherwise is a read-only bit indicating<br>which mode is supported.        | R/W<br>or<br>R | Preset | Required   |  |
| Ю      | 2                                                                                                                                                                                                                                                                                   | Inhibit overflow. If set, the CPU is stalled whenever the trace memory is full. Ignored unless OfC is also set.                                                                                                                                                                                                                                | R/W            | 0      | Required   |  |
| En     | 1                                                                                                                                                                                                                                                                                   | Trace enable. This bit may be set by software or by<br>Trace-on/Trace-off action bits from the Complex Trigger<br>block. Software writes EN with the desired initial state of<br>tracing when the ITCB is first turned on and EN is con-<br>trolled by hardware thereafter. EN turning on and off does<br>not flush partly filled trace words. | R/W            | 0      | Required   |  |
| On     | 0                                                                                                                                                                                                                                                                                   | Software control of trace collection. 0 disables all collec-<br>tion and flushes out any partially filled trace words.                                                                                                                                                                                                                         | R/W            | 0      | Required   |  |

## Table 10.37 Control/Status Register Field Descriptions (Continued)

## 10.8.6.2 ITCBTW Register (offset 0x3F80)

The *ITCBTW* register is used to read Trace Words from the on-chip trace memory. The TW read is the TW pointed to by the *ITCBRDP* register. A side effect of reading the *ITCBTW* register is that the *ITCBRDP* register increments to the next TW in the on-chip trace memory. If *ITCBRDP* is at the max size of the on-chip trace memory, the increment wraps back to address zero.

Note that this is a 64b register. On a 32b processor, software must read the upper word (offset 0x3F84) first as the address increment takes place on a read of the lower word (0x3F80).

The format of the *ITCBTW* register is shown below, and the field is described in Table 10.38.

## Figure 10.32 ITCBTW Register Format

| 63 | 0    |
|----|------|
|    | Data |

## **Table 10.38 ITCBTW Register Field Descriptions**

| Fields |       | lds  | Description | Read/ | Reset     | Compliance |
|--------|-------|------|-------------|-------|-----------|------------|
|        | Names | Bits |             | Write | State     |            |
|        | Data  | 63:0 | Trace Word  | R     | Undefined | Required   |

### 10.8.6.3 ITCBRDP Register (Offset 0x3f88)

The *ITCBRDP* register is the addresspointer to on-chiptrace memory. It points to the TW read when reading the *ITCBTW* register. This value will be automatically incremented after a read of the *ITCBTW* register.

The format of the *ITCBRDP* register is shown below, and the field is described in Table 10.39. The value of n depends on the size of the on-chip trace memory. As the address points to a 64-bit TW, lower three bits are always zero.

#### Figure 10.33 ITCBRDP Register Format

| 31 | n+1 | n 0     |
|----|-----|---------|
|    |     | Address |

## Table 10.39 ITCBRDP Register Field Descriptions

| Fields  |          | Description                                      | Read/ | Reset     | Compliance |  |
|---------|----------|--------------------------------------------------|-------|-----------|------------|--|
| Names   | Bits     |                                                  | Write | State     |            |  |
| Data    | 31:(n+1) | Reserved. Must be written zero, reads back zero. | 0     | 0         | Required   |  |
| Address | n:0      | Byte address of on-chip trace memory word.       | R/W   | Undefined | Required   |  |

#### 10.8.6.4 ITCBWRP Register (Offset 0x3f90)

The *ITCBWRP* register is the address pointer to on-chip trace memory. It points to the location where the next new TW for on-chip trace will be written. The top bit in the register indicates whether the pointer has wrapped. If it has, then the write pointer will also point to the oldest trace word. and the read pointer can be set to that to read the entire array in order. If it is cleared, then the read pointer can be set to 0 to read up to the write pointer position.

The format of the *ITCBWRP* register is shown below, and the field is described in Table 10.40. The value of n depends on the size of the on-chip trace memory. As the address points to a 64-bit TW, lower three bits are always zero.

### Figure 10.34 ITCBWRP Register Format

| 31   | 30 n+ | l n | 0       |
|------|-------|-----|---------|
| Wrap | 0     |     | Address |

#### Table 10.40 ITCBWRP Register Field Descriptions

| Fields  |          | Description                                                      | Read/ | Reset     | Compliance |
|---------|----------|------------------------------------------------------------------|-------|-----------|------------|
| Names   | Bits     |                                                                  | Write | State     |            |
| Wrap    | 31       | Indicates that the entire array has been written at least once   | R/W   | Undefined | Required   |
| 0       | 30:(n+1) | Reserved. Must be written zero, reads back zero.                 | 0     | 0         | Required   |
| Address | n:0      | Byte address of the next on-chip trace memory word to be written | R/W   | Undefined | Required   |

## 10.8.7 ITCB iFlowtrace Off-Chip Interface

The off-chip interface consists of a 4-bit data port ( $TR\_DATA$ ) and a trace clock ( $TR\_CLK$ ).  $TR\_CLK$  can be a DDR clock; that is, both edges are significant.  $TR\_DATA$  and  $TR\_CLK$  follow the same timing and have the same output structure as the PDtrace TCB described in MIPS specifications. The trace clock is synchronous to the system clock but running at a divided frequency. The *OfClk* bit in the *Control/Status* register indicates the ratio between the trace clock and the core clock. The Trace clock is always 1/2 of the trace port data rate, hence the "full speed" ITCB outputs data at the CPU core clock rate but the trace clock is half that, hence the 1:2 OfClk value is the full speed, and the 1:4 OfClk ratio is half-speed.

When a 64-bit trace word is ready to transmit, the PIB reads it from the FIFO and begins sending it out on  $TR_DATA$ . It is sent in 4-bit increments starting at the LSBs. In a valid trace word, the 4 LSBs are never all zero, so a probe listening on the  $TR_DATA$  port can easily determine when the transmission begins and then count 15 additional cycles to collect the whole 64-bit word. Between valid transmissions,  $TR_DATA$  Is held at zero and  $TR_CLK$  continues to run.

*TR\_CLK* runs continuously whenever a probe is connected. An optional signal *TR\_PROBE\_N* may be pulled high when a probe is not connected and could be used to disable the off-chip trace port. If not present, this signal must be tied low at the Probe Interface Block (PIB) input.

The following encoding is used for the 6 tag bits to tell the PIB receiver that a valid transmission is starting:

```
// if (srcount == 0), EncodedSrCount = 111010 = 58
// else if (srcount == 16) EncodedSrCount = 111011 = 59
// else if (srcount == 32) EncodedSrCount = 111100 = 60
// else if (srcount == 48) EncodedSrCount = 111101 = 61
// else EncodedSrCount = srcount
```

## 10.8.8 Breakpoint-Based Enabling of Tracing

Each hardware breakpoint in the EJTAG block (see the MIPS EJTAG Specification, MD00047, revision 4.14) has a control bit associated with it that enables a trigger signal to be generated on a break match condition. In special trace mode, this trigger can be used to insert an event record into the trace stream. In normal trace mode, this trigger signal can be used to turn trace on or off, thus allowing a user to control the trace on/off functionality using breakpoints. Similar to the TraceIBPC and TraceDBPC registers in PDtrace, registers are defined to control the start and stop of iFlowtrace. The details on the actual register names and drseg addresses are shown in Table 10.41.

| Register Name   | drseg Address | Reset Value | Description                                                                                                                      |
|-----------------|---------------|-------------|----------------------------------------------------------------------------------------------------------------------------------|
| ITrigiFlowTrcEn | 0x3FD0        | 0           | Register that controls whether or not hard-<br>ware instruction breakpoints can trigger<br>iFlowtrace tracing functionality      |
| DTrigiFlowTrcEn | 0x3FD8        | 0           | Register that controls whether or not hard-<br>ware data and tuple breakpoints can trig-<br>ger iFlowtrace tracing functionality |

Table 10.41 drseg Registers that Enable/Disable Trace from Breakpoint-Based Triggers

The bits in each register are defined as follows:

• Bit 28 (IE/DE): Used to specify whether the trigger signal from EJTAG simple or complex instruction (data or tuple) break should trigger iFlowtrace tracing functions or not. A value of 0 disables trigger signals from EJTAG instruction breaks, and 1 enables triggers for the same.

• Bits 14:0 (IBrk/DBrk): Used to explicitly specify which instruction (data or tuple) breaks enable or disable iFlowtrace. A value of 0 implies that trace is turned off (unconditional trace stop) and a value of 1 specifies that the trigger enables trace (unconditional trace start).

## 10.9 PC/Data Address Sampling

It is often useful for program profiling and analysis to periodically sample the value of the PC. This information can be used for statistical profiling akin to gprof, and is also very useful for detecting hot-spots in the code. In a multi-threaded environment, this information can be used to understand thread behavior, and to verify thread scheduling mechanisms in the absence of a full-fledged tracing facility like PDtrace.

The PC sampling feature is optional within EJTAG, but EJTAG and the TAP controller must be implemented if PC Sampling is required. When implemented, PC sampling can be turned on or off using an enable bit; when the feature is enabled, the PC value is continually sampled.

The presence or absence of the PC Sampling feature is indicated by the *PCS* (PC Sample) bit in the Debug Control Register. If PC sampling is implemented, and the *PCSe* (PC Sample Enable) bit in the Debug Control Register is also set to one, then the PC values are constantly sampled at the defined rate ( $DCR_{PCR}$ ) and written to a TAP register. The old value in the TAP register is overwritten by the new value, even if this register has not been read out by the debug probe.

The presence or absence of Data Address Sampling is indicated by the DAS (Data Address Sample) bit in the Debug Control Register and enabled by the DASe (Data Address Sampling Enable) bit in the Debug Control Register.

The sample rate is specified by the 3-bit *PCR* (PC Sample Rate) field (bits 8:6) in the Debug Control Register (*DCR*). These three bits encode a value  $2^5$  to  $2^{12}$  in a manner similar to the specification of SyncPeriod. When the implementation allows these bits to be written, the internal PC sample counter will be reset by each write, so that counting for the requested sample rate is immediately restarted.

The sample format includes a New data bit, the sampled value, the ASID of the sampled value (if not disabled by PCnoASID, bit 25 in *DCR*). Figure 10.35shows the format of the sampled values in the PCSAMPLE TAP register for MIPS32. The New data bit is used by the probe to determine if the sampled data just read out is new or has already been read and must be discarded.

| 40              | 33  | 32 | 1                  | 0   |  |
|-----------------|-----|----|--------------------|-----|--|
| ASID (if enable | ed) |    | PC or Data Address | New |  |

#### Figure 10.35 PCSAMPLE TAP Register Format (MIPS32)

The sampled PC value is the PC of the graduating instruction in the current cycle. If the processor is stalled when the PC sample counter overflows, then the sampled PC is the PC of the next graduating instruction. The processor continues to sample the PC value even when it is in Debug mode.

Note that some of the smaller sample periods can be shorter than the time needed to read out the sampled value. That is, it might take 41 (TCK) clock ticks to read a MIPS32 sample, while the smallest sample period is 32 (processor) clocks. While the sample is being read out, multiple samples may be taken and discarded, needlessly wasting power. To reduce unnecessary overhead, the TAP register includes only those fields that are enabled. If both PC Sampling and Data Sampling are enabled, then both samples are included in the PCSample scan register. PC Sample is in the least significant bits followed by a Data Address Sample. If either PC Sampling or Data Address Sampling is disabled, then the TAP register does not include that sample. The total scan length is 49 \* 2 = 82 bits if all fields are present and enabled.

### 10.9.1 PC Sampling in Wait State

Note that the processor samples PC even when it is asleep, that is, in a WAIT state. This permits an analysis of the amount of time spent by a processor in WAIT state which may be used for example to revert to a low power mode during the non-execution phase of a real-time application. But counting cycles to update the PC sample value is a waste of power. Hence, when in a WAIT state, the processor must simply switch the New bit to 1 each time it is set to 0 by the probe hardware. Hence, the external agent or probe reading the PC value will detect a WAIT instruction for as long as the processor remains in the WAIT state. When the processor leaves the WAIT state, then counting is resumed as before.

## 10.9.2 Cache Miss PC Sampling

EJTAG revision 5.0 adds a new optional mechanism for triggering PC sampling when an instruction fetch misses in the I-cache. When PCIM (bit 26 in *DCR*) is 1, PC addresses that hit the cache are not sampled. When the PCSR counter triggers, the most recent instruction whose fetch missed the cache is stored and available for EJTAG to shift out through PCSAMPLE. Over time, this collection mode results in an overall picture of the instruction cache behavior and can be used to increase performance by re-arranging code to minimize cache thrashing.

### 10.9.3 Data Address Sampling

EJTAG revision 5.0 extends the PC sampling mechanism to allow sampling of data (load and store) addresses. This feature is enabled with DASe, bit 23 in the Debug Control Register. When enabled, the PCSAMPLE scan register includes a data address sample. All load and store addresses can be captured, or they can be qualified using a data breakpoint trigger. DASQ=1 configures data sampling to record a data address only when it triggers data breakpoint 0. To be used for Data Address Sampling qualification, data breakpoint 0 must be enabled using its TE (trigger enable) bit.

PCSR controls how often data addresses are sampled. When the PCSR counter triggers, the most recent load/store address generated is accepted and made available to shift out through PCSAMPLE.

## 10.10 Fast Debug Channel

The Fast Debug Channel (FDC) mechanism provides an efficient means to transfer data between the core and an external device using the EJTAG TAP pins. The external device would typically be an EJTAG probe and that is the term used here, but it could be something else. FDC utilizes two First In First Out (FIFO) structures to buffer data between the core and probe. The probe uses the FDC TAP instruction to access these FIFOs, while the core itself accesses them using memory accesses. To transfer data out of the core, the core writes one or more pieces of data to the transmit FIFO. At this time, the core can resume doing other work. An external probe would examine the status of the transmit FIFO periodically. If there is data to be read, the probe goes back to waiting for more data. The core can either choose to be informed of the empty transmit FIFO via an interrupt, or it can choose to periodically check the status. Receiving data works in a similar manner - the probe writes to the receive FIFO. At that time, the core is either interrupted, or finds out via polling a status bit. The core can then do load accesses to the receive FIFO and receive data being sent to it by the probe. The TAP transfer is bidirectional - a single shift can be pulling transmit data and putting receive data at the same time.

The primary advantage of FDC over normal processor accesses or fastdata accesses is that it does not require the core to be blocked when the probe is reading or writing to the data transfer FIFOs. This significantly reduces the core overhead and makes the data transfer far less intrusive to the code executing on the core.

Refer to the EJTAG Specification [12] for the general details on FDC. The remainder of this section describes implementation specific behavior and register values.

The FDC memory mapped registers are located in the common device memory map (CDMM) region. FDC has a device ID of 0xFD.

## 10.10.1 Common Device Memory Map

Software on the core accesses FDC through memory-mapped registers, located within the Common Device Memory Map (CDMM). The CDMM is a region of physical address space that is reserved for mapping IO device configuration registers within a MIPS processor. The base address and enabling of this region is controlled by the *CDMMBase* CP0 register, as described in 6.2.29 "CDMMBase Register (CP0 Register 15, Select 2)" on page 175.

Refer to MIPS® Architecture For Programmers Volume III [9] for full details on the CDMM.

## 10.10.2 Fast Debug Channel Interrupt

The FDC block can generate an interrupt to inform software of incoming data being available or space being available in the outgoing FIFO. This interrupt is handled similarly to the timer or performance counter interrupts. The *CauseFDCI* bit indicates that the interrupt is pending. Traditionally, this interrupt is also sent to the core output SI\_FDCI where it is combined with one of the SI\_Int pins. However, this is no longer needed as the core will internally route the interrupt to the IP number set by the *IntCtl.IPFDCI* field. Note that this interrupt is a regular interrupt and not a debug interrupt.

The FDC Configuration Register (see Section 10.10.6.2 "FDC Configuration (FDCFG) Register (Offset 0x8)") includes fields for enabling and setting the threshold for generating each interrupt. Receive and transmit interrupt thresholds are specified independently, but they are ORed together to form a single interrupt.

The following interrupt thresholds are supported:

- Interrupts Disabled: No interrupt will be generated and software must poll the status registers to determine if incoming data is available or if there is space for outgoing data.
- Minimum Core Overhead: This setting minimizes the core overhead by not generating an interrupt until the receive FIFO (RxFIFO) is completely full or the transmit FIFO (TxFIFO) is completely empty.
- Minimum latency: To have the core take data as soon as it is available, the receive interrupt can be fired whenever the RxFIFO is not empty. There is a complimentary TxFIFO not full setting although that may not be quite as useful.
- Maximum bandwidth: When configured for minimum core overhead, bandwidth between the probe and core can
  be wasted if the core does not service the interrupt before the next transfer occurs. To reduce the chances of this
  happening, the interrupt threshold can be set to almost full or almost empty to generate an interrupt earlier. This
  setting causes receive interrupts to be generated when there are 0 or 1 unused RxFIFO entries. Transmit interrupts are generated when there are 0 or 1 used TxFIFO entries (see note in following section about this condition)

## 10.10.3 microAptiv<sup>™</sup> UP Core FDC Buffers

Figure 10.36 shows the general organization of the transmit and receive buffers on the microAptiv UP core.



## Figure 10.36 Fast Debug Channel Buffer Organization

One particular thing to note is the asynchronous crossings between the *EJ\_TCK* and *SI\_ClkIn* clock domains. This crossing is handled with a handshake interface that safely transfers data between the domains. Two data registers are included in this interface, one in the source domain and one in the destination domain. The control logic actively manages these registers so that they can be used as FIFO entries. The fact that one FIFO entry is in the *EJ\_TCK* clock domain is normally transparent, but it can create some unexpected behavior:

- TxFIFO availability: Data is first written into the *SI\_Clk* FIFO entries, then into the *EJ\_TCK* FIFO entry, requiring several *EJ\_TCK* cycles to complete the handshake and move the data. *EJ\_TCK* is generally much slower than *SI\_ClkIn*, and may even be stopped (although that would be uncommon when this feature is in use). This can result in not enough space for new data, even though there are only N-1 data values queued up. To prevent the loss of data, the *TxF* flag in *FDSTAT* is set when all of the *SI\_ClkIn* FIFO entries are full. Software writes to the FIFO should always check the *TxF* bit before attempting the write and should not make any assumptions about being able to use all entries arbitrarily. i.e., software seeing the *FxE* bit set should not assume that it can write *TxCnt* data words without checking for full.
- TxFIFO Almost Empty Interrupt: As transmit data moves from SI\_ClkIn to EJ\_TCK, both of the flops will temporarily look full. This makes it difficult to determine when just 1 FIFO entry is in use. To enable a simpler condition, the almost empty TxInterrupt condition is set when all of the SI\_ClkIn FIFO entries are empty. When this

condition is met, there will be 0 or 1 valid entries. However, the interrupt will not be asserted when there is only one valid entry if it is an *SI\_ClkIn* entry

• The RxFIFO has similar characteristics, but these are even less visible to software since *SI\_ClkIn* must be running to access the FDC registers.

## 10.10.4 Sleep mode

FDC data transfers do not prevent the core from entering sleep mode and will proceed normally in sleep mode. The FDC block monitors the TAP interface signals with a free-running clock. When new receive data is available or transmit data can be sent, the gated clock will be enabled for a few cycles to transfer the data and then allowed to stop again. If FDC interrupts are enabled, transferring data may cause an interrupt to be generated which can wake the core up.

## 10.10.5 FDC TAP Register

The FDC TAP instruction performs a 38-bit bidirectional transfer of the FDC TAP register. The register format is shown in Figure 10.37 and the fields are described in Figure 10.42

|     | 37                     | 36                | 35 | 32        | 31   | 0 |
|-----|------------------------|-------------------|----|-----------|------|---|
| In  | Probe Data<br>Accept   | Data In<br>Valid  |    | ChannelID | Data |   |
| Out | Receive<br>Buffer Full | Data Out<br>Valid |    | ChannenD  | Data |   |

#### Figure 10.37 FDC TAP Register Format

| Fields                 |       |                                                                                                                                                                                                                                                     | Read / | Reset     |
|------------------------|-------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------|-----------|
| Name                   | Bits  | Description                                                                                                                                                                                                                                         | Write  | State     |
| Probe Data<br>Accept   | 37    | Indicates to core that the probe is accepting the data that was scanned out.                                                                                                                                                                        | W      | Undefined |
| Data In<br>Valid       | 36    | Indicates to core that the probe is sending new data to the receive FIFO.                                                                                                                                                                           | W      | Undefined |
| Receive<br>Buffer Full |       |                                                                                                                                                                                                                                                     | R      | 0x0       |
| Data Out<br>Valid      | 36    | Indicates to probe that the core is sending new data from the transmit FIFO                                                                                                                                                                         | R      | 0         |
| ChannelID              | 35:32 | Channel number associated with the data being scanned in<br>or out. This field can be used to indicate the type of data<br>that is being sent and allow independent communication<br>channels<br>Scanning in a value with ChannelID=0xd and Data In | R/W    | Undefined |
|                        |       | Valid = $0$ will generate a receive interrupt. This can be used when the probe has completed sending data to the core.                                                                                                                              |        |           |

#### Table 10.42 FDC TAP Register Field Descriptions

| Fie  | lds  |                                    | Read / | Reset     |
|------|------|------------------------------------|--------|-----------|
| Name | Bits | Description                        | Write  | State     |
| Data | 31:0 | Data value being scanned in or out | R/W    | Undefined |

#### Table 10.42 FDC TAP Register Field Descriptions

## 10.10.6 Fast Debug Channel Registers

This section describes the Fast Debug Channel registers. CPU access to FDC is via loads and stores to the FDC device in the Common Device Memory Map (CDMM) region. These registers provide access control, configuration and status information, as well as access to the transmit and receive FIFOs. The registers and their respective offsets are shown in Table 10.43

| Offset in CDMM<br>device block | Register<br>Mnemonic | Register Name and Description                |
|--------------------------------|----------------------|----------------------------------------------|
| 0x0                            | FDACSR               | FDC Access Control and Status Register       |
| 0x8                            | FDCFG                | FDC Configuration Register                   |
| 0x10                           | FDSTAT               | FDC Status Register                          |
| 0x18                           | FDRX                 | FDC Receive Register                         |
| 0x20 + 0x8* n                  | FDTXn                | FDC Transmit Register n ( $0 \le n \le 15$ ) |

### 10.10.6.1 FDC Access Control and Status (FDACSR) Register (Offset 0x0)

This is the general CDMM Access Control and Status register which defines the device type and size and controls user and supervisor access to the remaining FDC registers. The Access Control and Status register itself is only accessible in kernel mode. Figure 10.38 has the format of an Access Control and Status register (shown as a 64-bit register), and Table 10.44 describes the register fields.

## Figure 10.38 FDC Access Control and Status Register

| 63 32 | 31 24 | 23 22 | 21 16   | 15 12  | 11 4 | 3  | 2  | 1  | 0  |
|-------|-------|-------|---------|--------|------|----|----|----|----|
| 0     | DevID | 0     | DevSize | DevRev | 0    | Uw | Ur | Sw | Sr |

#### Table 10.44 FDC Access Control and Status Register Field Descriptions

| Fields  |       |                                                                                                                                                             | Read / | Reset |
|---------|-------|-------------------------------------------------------------------------------------------------------------------------------------------------------------|--------|-------|
| Name    | Bits  | Description                                                                                                                                                 | Write  | State |
| DevType | 31:24 | This field specifies the type of device.                                                                                                                    | R      | 0xfd  |
| DevSize | 21:16 | This field specifies the number of extra 64-byte blocks allocated to this device. The value 0x2 indicates that this device uses 2 extra, or 3 total blocks. | R      | 0x2   |
| DevRev  | 15:12 | This field specifies the revision number of the device. The value 0x0 indicates that this is the initial version of FDC                                     | R      | 0x0   |

| Fie  | elds |                                                                                                                                                                                                                                                                                                        | Read / | Reset |  |
|------|------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------|-------|--|
| Name | Bits | Description                                                                                                                                                                                                                                                                                            | Write  | State |  |
| Uw   | 3    | This bit indicates if user-mode write access to this device<br>is enabled. A value of 1 indicates that access is enabled. A<br>value of 0 indicates that access is disabled. An attempt to<br>write to the device while in user mode with access dis-<br>abled is ignored.                             | R/W    | 0     |  |
| Ur   | 2    | This bit indicates if user-mode read access to this device is<br>enabled. A value of 1 indicates that access is enabled. A<br>value of 0 indicates that access is disabled. An attempt to<br>read from the device while in user mode with access dis-<br>abled will return 0 and not change any state. | R/W    | 0     |  |
| Sw   | 1    | This bit indicates if supervisor-mode write access to this<br>device is enabled. A value of 1 indicates that access is<br>enabled. A value of 0 indicates that access is disabled. An<br>attempt to write to the device while in supervisor mode<br>with access disabled is ignored.                   | R/W    | 0     |  |
| Sr   | 0    | This bit indicates if supervisor-mode read access to this device is enabled. A value of 1 indicates that access is enabled. A value of 0 indicates that access is disabled. An attempt to read from the device while in supervisor mode with access disabled will return 0 and not change any state.   | R/W    | 0     |  |
| 0    | 11:4 | Reserved for future use. Ignored on write; returns zero on read.                                                                                                                                                                                                                                       | R      | 0     |  |

## 10.10.6.2 FDC Configuration (FDCFG) Register (Offset 0x8)

The FDC configuration register holds information about the current configuration of the Fast Debug Channel mechanism. Figure 10.39 has the format of the FDC Configuration register, and Table 10.45 describes the register fields.

#### Figure 10.39 FDC Configuration Register

| 31 | 20 | 19     | 18      | 17     | 16     | 15 |            | 8 | 7 |            | 0 |
|----|----|--------|---------|--------|--------|----|------------|---|---|------------|---|
|    | 0  | Tx_Int | tThresh | Rx_Int | Thresh |    | TxFIFOSize |   |   | RxFIFOSize |   |

| Fie  | lds   |                                                                   | Read / | Reset |
|------|-------|-------------------------------------------------------------------|--------|-------|
| Name | Bits  | Description                                                       |        | State |
| 0    | 31:20 | Reserved for future use. Read as zeros, must be written as zeros. | R      | 0     |

| Fie         | Fields |                           |                                                                                                              | Read / | Reset  |
|-------------|--------|---------------------------|--------------------------------------------------------------------------------------------------------------|--------|--------|
| Name        | Bits   |                           | Description                                                                                                  | Write  | State  |
| TxIntThresh | 19:18  |                           | er transmit interrupts are enabled and the IFO needed to generate an interrupt.                              | R/W    | 0      |
|             |        | Encoding                  | Meaning                                                                                                      |        |        |
|             |        | 0                         | Transmit Interrupt Disabled                                                                                  |        |        |
|             |        | 1                         | Empty                                                                                                        |        |        |
|             |        | 2                         | Not Full                                                                                                     |        |        |
|             |        | 3                         | Almost Empty - zero or one entry in use (see 10.10.2 for specifics)                                          |        |        |
| RxIntThresh | 17:16  |                           | Controls whether receive interrupts are enabled and the state of the RxFIFO needed to generate an interrupt. |        | 0      |
|             |        | Encoding                  | Meaning                                                                                                      |        |        |
|             |        | 0                         | Receive Interrupt Disabled                                                                                   |        |        |
|             |        | 1                         | Full                                                                                                         |        |        |
|             |        | 2                         | Not empty                                                                                                    |        |        |
|             |        | 3                         | Almost Full - zero or one entry free                                                                         |        |        |
| TxFIFOSize  | 15:8   | This field holds<br>FIFO. | This field holds the total number of entries in the transmit FIFO.                                           |        | Preset |
| RxFIFOSize  | 7:0    | This field holds<br>FIFO. | the total number of entries in the receive                                                                   | R      | Preset |

### Table 10.45 FDC Configuration Register Field Descriptions (Continued)

## 10.10.6.3 FDC Status (FDSTAT) Register (Offset 0x10)

The FDC Status register holds up to date state information for the FDC mechanism. Figure 10.40 shows the format of the FDC Status register, and Table 10.46 describes the register fields.

### Figure 10.40 FDC Status Register

| 31 | 24       | 23 16    | 15 8 | 7 4    | 3   | 2   | 1   | 0   |
|----|----------|----------|------|--------|-----|-----|-----|-----|
|    | Tx_Count | Rx_Count | 0    | RxChan | RxE | RxF | TxE | TxF |

| Fie      | lds                   |                                                                                                                      | Read / | Reset     |  |
|----------|-----------------------|----------------------------------------------------------------------------------------------------------------------|--------|-----------|--|
| Name     | Name Bits Description |                                                                                                                      |        |           |  |
| Tx_Count | 31:24                 | This optional field is not implemented and will read as 0                                                            | R      | 0         |  |
| Rx_Count | 23:16                 | This optional field is not implemented and will read as 0                                                            | R      | 0         |  |
| 0        | 15:8                  | Reserved for future use. Must be written as zeros and read as zeros.                                                 | R      | 0         |  |
| RxChan   | 7:4                   | This field indicates the channel number used by the top item in the receive FIFO. This field is only valid if RxE=0. | R      | Undefined |  |

#### Table 10.46 FDC Status Register Field Descriptions

| Fie  | elds |                                                                                      | Read / | Reset |
|------|------|--------------------------------------------------------------------------------------|--------|-------|
| Name | Bits | Description                                                                          | Write  | State |
| RxE  | 3    | If RxE is set, the receive FIFO is empty. If RxE is not set, the FIFO is not empty.  | R      | 1     |
| RxF  | 2    | If RxF is set, the receive FIFO is full. If RxF is not set, the FIFO is not full.    | R      | 0     |
| TxE  | 1    | If TxE is set, the transmit FIFO is empty. If TxE is not set, the FIFO is not empty. | R      | 1     |
| TxF  | 0    | If TxF is set, the transmit FIFO is full. If TxF is not set, the FIFO is not full.   | R      | 0     |

## Table 10.46 FDC Status Register Field Descriptions (Continued)

## 10.10.6.4 FDC Receive (FDRX) Register (Offset 0x18)

This register exposes the top entry in the receive FIFO. A read from this register returns the top item in the FIFO and removes it from the FIFO itself. The result of a write to this register is **UNDEFINED**. The result of a read when the FIFO is empty is also **UNDEFINED** so software must check the *RxE* flag in *FDSTAT* prior to reading. Figure 10.41 shows the format of the *FDC Receive* register, and Table 10.47 describes the register fields.

### Figure 10.41 FDC Receive Register

| 31 |        | 0 |
|----|--------|---|
|    | RxData |   |

## Table 10.47 FDC Receive Register Field Descriptions

| Fields                |      |                                                       | Read / | Reset     |
|-----------------------|------|-------------------------------------------------------|--------|-----------|
| Name Bits Description |      | Description                                           | Write  | State     |
| RxData                | 31:0 | This register holds the top entry in the receive FIFO | R      | Undefined |

## 10.10.6.5 FDC Transmit n (FDTXn) Registers (Offset 0x20 + 0x8\*n)

These sixteen registers access the bottom entry in the transmit FIFO. The different addresses are used to generate a 4b channel identifier that is attached to the data value. This allows software to track different event types without needing to reserve a portion of the 32b data as a tag. A write to one of these registers results in a write to the transmit FIFO of the data value and channel ID corresponding to the register being written. Reads from these registers are **UNDE-FINED**. Attempting to write to the transmit FIFO if it is full has **UNDEFINED** results. Hence, the software running on the core must check the *TxF* flag in *FDSTAT* to ensure that there is space for the write. Figure 10.42 shows the format of the FDC Transmit register, and Table 10.48 describes the register fields.

## Figure 10.42 FDC Transmit Register

| 31 |        | 0 |
|----|--------|---|
|    | TxData |   |

| Fields |      |                                                           | Read /                             | Reset     |
|--------|------|-----------------------------------------------------------|------------------------------------|-----------|
| Name   | Bits | Description                                               | Write                              | State     |
| TxData | 31:0 | This register holds the bottom entry in the transmit FIFO | W, Unde-<br>fined value<br>on read | Undefined |

### Table 10.48 FDC Transmit Register Field Descriptions

### Table 10.49 FDTXn Address Decode

| Address | Channel | Address | Channel | Address | Channel | Address | Channel |
|---------|---------|---------|---------|---------|---------|---------|---------|
| 0x20    | 0x0     | 0x40    | 0x4     | 0x60    | 0x8     | 0x80    | 0xc     |
| 0x28    | 0x1     | 0x48    | 0x5     | 0x68    | 0x9     | 0x88    | 0xd     |
| 0x30    | 0x2     | 0x50    | 0x6     | 0x70    | 0xa     | 0x90    | 0xe     |
| 0x38    | 0x3     | 0x58    | 0x7     | 0x78    | 0xb     | 0x98    | 0xf     |

## 10.11 cJTAG Interface

The cJTAG external IP block, provided as part of the microAptiv UP processor core, converts a 4-wire EJTAG (IEEE 1149.1) interface to a 2-wire cJTAG (IEEE1149.7) interface. A high-level view of cJTAG is shown in Figure 10.43. Operation of the conversion adapter is transparent to software.

Refer to the *cJTAG Adapter User's Manual* [13] for more details.

#### Figure 10.43 cJTAG Interface



Chapter 11

# **Instruction Set Overview**

This chapter provides a general overview on the three CPU instruction set formats of the MIPS architecture: Immediate, Jump, and Register. Refer to Chapter 12, "microAptiv<sup>™</sup> UP Processor Core Instructions" on page 310 for a complete listing and description of instructions.

This chapter discusses the following topics

- Section 11.1 "CPU Instruction Formats"
- Section 11.2 "Load and Store Instructions"
- Section 11.3 "Computational Instructions"
- Section 11.4 "Jump and Branch Instructions"
- Section 11.5 "Control Instructions"
- Section 11.6 "Coprocessor Instructions"
- Section 11.7 "Enhancements to the MIPS Architecture"
- Section 11.8 "MCU ASE Instructions"

## **11.1 CPU Instruction Formats**

Each CPU instruction consists of a single 32-bit word, aligned on a word boundary. There are three instruction formats: immediate (I-type), jump (J-type), and register (R-type) (shown in Figure 11.1). The use of a small number of instruction formats simplifies instruction decoding, allowing the compiler to synthesize more complicated (and less frequently used) operations and addressing modes from these three formats as needed.

| I-Туре | e (Imme       | diate              | )    |                                                                     |           |            |            |           |  |  |
|--------|---------------|--------------------|------|---------------------------------------------------------------------|-----------|------------|------------|-----------|--|--|
|        | 31            | 26 25              | 21   | 20 16                                                               | 15        |            |            | 0         |  |  |
|        | ор            |                    | rs   | rt                                                                  |           | immediat   | te         |           |  |  |
|        |               | .)                 |      |                                                                     |           |            |            |           |  |  |
| • •    | e (Jump<br>31 | )<br>26 <b>2</b> 5 |      |                                                                     |           |            |            | 0         |  |  |
|        | ор            | 20 45              |      |                                                                     | target    |            |            |           |  |  |
| R-Tvn  | e (Regi       | ister)             |      |                                                                     |           |            |            |           |  |  |
| • •    | 31            | 26 25              | 21   | 20 16                                                               | 15 11     | 10 6       | 5          | 0         |  |  |
|        | ор            |                    | rs   | rt                                                                  | rd        | sa         | funct      |           |  |  |
|        | ор            |                    | 6-bi | t operatio                                                          | on code   |            |            |           |  |  |
|        | rs            |                    | 5-bi | 5-bit source register specifier                                     |           |            |            |           |  |  |
|        | rt            |                    |      | t target (s<br>dition                                               | source/de | stination  | ) register | or branch |  |  |
|        | immed         | iate               |      | 16-bit immediate value, branch displacement or address displacement |           |            |            |           |  |  |
|        | target        |                    | 26-t | oit jump t                                                          | arget add | lress      |            |           |  |  |
|        | rd            |                    | 5-bi | t destinat                                                          | ion regis | ter specif | ier        |           |  |  |
|        | sa            |                    |      | t shift an                                                          | -         | -          |            |           |  |  |
|        | funct         |                    | 6-bi | t function                                                          | n field   |            |            |           |  |  |
|        |               |                    |      |                                                                     |           |            |            |           |  |  |

## Figure 11.1 Instruction Formats

## **11.2 Load and Store Instructions**

Load and store instructions are immediate (I-type) instructions that move data between memory and the general registers. The only addressing mode that load and store instructions directly support is *base register plus 16-bit signed immediate offset*.

## 11.2.1 Scheduling a Load Delay Slot

A load instruction that does not allow its result to be used by the instruction immediately following is called a *delayed load instruction*. The instruction slot immediately following this delayed load instruction is referred to as the *load delay slot*.

In the microAptiv UP core, the instruction immediately following a load instruction can use the contents of the loaded register; however in such cases hardware interlocks insert additional real cycles. Although not required, the scheduling of load delay slots can be desirable, both for performance and R-Series processor compatibility.

## 11.2.2 Defining Access Types

Access type indicates the size of a core data item to be loaded or stored, set by the load or store instruction opcode.

Regardless of access type or byte ordering (endianness), the address given specifies the low-order byte in the addressed field. For a big-endian configuration, the low-order byte is the most-significant byte; for a little-endian configuration, the low-order byte is the least-significant byte.

The access type, together with the three low-order bits of the address, define the bytes accessed within the addressed word as shown in Table 11.1. Only the combinations shown in Table 11.1 are permissible; other combinations cause address error exceptions.

|             |   |                   |   | Bytes Accessed |    |       |    |                        |    |      |   |  |  |
|-------------|---|-------------------|---|----------------|----|-------|----|------------------------|----|------|---|--|--|
|             |   | ow Ord<br>dress E | - | (31-           | -  | ndian | 0) | Little Endian<br>(310) |    |      |   |  |  |
| Access Type | 2 | 1                 | 0 |                | Ву | /te   |    |                        | Ву | Byte |   |  |  |
| Word        | 0 | 0                 | 0 | 0              | 1  | 2     | 3  | 3                      | 2  | 1    | 0 |  |  |
| Triplebyte  | 0 | 0                 | 0 | 0              | 1  | 2     |    |                        | 2  | 1    | 0 |  |  |
|             | 0 | 0                 | 1 |                | 1  | 2     | 3  | 3                      | 2  | 1    |   |  |  |
| Halfword    | 0 | 0                 | 0 | 0              | 1  |       |    |                        |    | 1    | 0 |  |  |
|             | 0 | 1                 | 0 |                |    | 2     | 3  | 3                      | 2  |      |   |  |  |
| Byte        | 0 | 0                 | 0 | 0              |    |       |    |                        |    |      | 0 |  |  |
|             | 0 | 0                 | 1 |                | 1  |       |    |                        |    | 1    |   |  |  |
|             | 0 | 1                 | 0 |                |    | 2     |    |                        | 2  |      |   |  |  |
|             | 0 | 1                 | 1 |                |    |       | 3  | 3                      |    |      |   |  |  |

Table 11.1 Byte Access Within a Word

## **11.3 Computational Instructions**

Computational instructions can be either in register (R-type) format, in which both operands are registers, or in immediate (I-type) format, in which one operand is a 16-bit immediate.

Computational instructions perform the following operations on register values:

- Arithmetic
- Logical
- Shift
- Multiply
- Divide

These operations fit in the following four categories of computational instructions:

- ALU Immediate instructions
- Three-operand Register-type Instructions
- Shift Instructions

• Multiply And Divide Instructions

## 11.3.1 Cycle Timing for Multiply and Divide Instructions

Any multiply instruction in the integer pipeline is transferred to the multiplier as remaining instructions continue through the pipeline; the product of the multiply instruction is saved in the HI and LO registers. If the multiply instruction is followed by an MFHI or MFLO before the product is available, the pipeline interlocks until this product does become available. Refer to Chapter 2, "Pipeline of the microAptiv<sup>™</sup> UP Core" on page 41 for more information on instruction latency and repeat rates.

## **11.4 Jump and Branch Instructions**

Jump and branch instructions change the control flow of a program. All jump and branch instructions occur with a delay of one instruction: that is, the instruction immediately following the jump or branch (this is known as the instruction in the *delay slot*) always executes while the target instruction is being fetched from storage.

## 11.4.1 Overview of Jump Instructions

Subroutine calls in high-level languages are usually implemented with Jump or Jump and Link instructions, both of which are J-type instructions. In J-type format, the 26-bit target address shifts left 2 bits and combines with the high-order 4 bits of the current program counter to form an absolute address.

Returns, dispatches, and large cross-page jumps are usually implemented with the Jump Register or Jump and Link Register instructions. Both are R-type instructions that take the 32-bit byte address contained in one of the general purpose registers.

For more information about jump instructions, refer to the individual instructions in Chapter 12, "microAptiv<sup>™</sup> UP Processor Core Instructions" on page 310.

## 11.4.2 Overview of Branch Instructions

All branch instruction target addresses are computed by adding the address of the instruction in the delay slot to the 16-bit *offset* (shifted left 2 bits and sign-extended to 32 bits). All branches occur with a delay of one instruction.

If a conditional branch likely is not taken, the instruction in the delay slot is nullified.

Branches, jumps, ERET, and DERET instructions should not be placed in the delay slot of a branch or jump.

## **11.5 Control Instructions**

Control instructions allow the software to initiate traps; they are always R-type.

## **11.6 Coprocessor Instructions**

CP0 instructions perform operations on the System Control Coprocessor registers to manipulate the memory management and exception handling facilities of the processor. Refer to Chapter 12, "microAptiv<sup>™</sup> UP Processor Core Instructions" on page 310 for a listing of CP0 instructions.

## **11.7 Enhancements to the MIPS Architecture**

The core execution unit implements the MIPS32 architecture, which includes the following instructions.

- CLOCount Leading Ones
- CLZCount Leading Zeros
- MADDMultiply and Add Word
- MADDUMultiply and Add Unsigned Word
- MSUBMultiply and Subtract Word
- MSUBUMultiply and Subtract Unsigned Word
- MULMultiply Word to Register
- SSNOPSuperscalar Inhibit NOP

## 11.7.1 CLO - Count Leading Ones

The CLO instruction counts the number of leading ones in a word. The 32-bit word in the GPR *rs* is scanned from most-significant to least-significant bit. The number of leading ones is counted and the result is written to the GPR *rd*. If all 32 bits are set in the GPR *rs*, the result written to the GPR *rd* is 32.

## 11.7.2 CLZ - Count Leading Zeros

The CLZ instruction counts the number of leading zeros in a word. The 32-bit word in the GPR *rs* is scanned from most-significant to least-significant bit. The number of leading zeros is counted and the result is written to the GPR *rd*. If all 32 bits are cleared in the GPR *rs*, the result written to the GPR *rd* is 32.

## 11.7.3 MADD - Multiply and Add Word

The MADD instruction multiplies two words and adds the result to the HI/LO register pair. The 32-bit word value in the GPR *rs* is multiplied by the 32-bit value in the GPR *rt*, treating both operands as signed values, to produce a 64-bit result. The product is added to the 64-bit concatenated values in the HI and LO register pair. The resulting value is then written back to the HI and LO registers. No arithmetic exception occurs under any circumstances.

## 11.7.4 MADDU - Multiply and Add Unsigned Word

The MADDU instruction multiplies two unsigned words and adds the result to the HI/LO register pair. The 32-bit word value in the GPR *rs* is multiplied by the 32-bit value in the GPR *rt*, treating both operands as unsigned values, to produce a 64-bit result. The product is added to the 64-bit concatenated values in the HI and LO register pair. The resulting value is then written back to the HI and LO registers. No arithmetic exception occurs under any conditions.

## 11.7.5 MSUB - Multiply and Subtract Word

The MSUB instruction multiplies two words and subtracts the result from the HI/LO register pair. The 32-bit word value in the GPR *rs* is multiplied by the 32-bit value in the GPR *rt*, treating both operands as signed values, to produce a 64-bit result. The product is subtracted from the 64-bit concatenated values in the HI and LO register pair. The

resulting value is then written back to the HI and LO registers. No arithmetic exception occurs under any circumstances.

## 11.7.6 MSUBU - Multiply and Subtract Unsigned Word

The MSUBU instruction multiplies two unsigned words and subtracts the result from the HI/LO register pair. The 32-bit word value in the GPR *rs* is multiplied by the 32-bit value in the GPR *rt*, treating both operands as unsigned values, to produce a 64-bit result. The product is subtracted from the 64-bit concatenated values in the HI and LO register pair. The resulting value is then written back to the HI and LO registers. No arithmetic exception occurs under any circumstances.

## 11.7.7 MUL - Multiply Word

The MUL instruction multiplies two words and writes the result to a GPR. The 32-bit word value in the GPR *rs* is multiplied by the 32-bit value in the GPR *rt*, treating both operands as signed values, to produce a 64-bit result. The least-significant 32-bits of the product are written to the GPR *rd*. The contents of the HI and LO register pair are not defined after the operation. No arithmetic exception occurs under any circumstances.

## 11.7.8 SSNOP- Superscalar Inhibit NOP

The MIPS32 microAptiv UP processor cores treat this instruction as a regular NOP.

## **11.8 MCU ASE Instructions**

The MCU ASE includes some new instructions which are particularly useful in microcontroller applications.

## 11.8.1 ACLR

This instruction allows a bit within an uncached I/O control register to be atomically cleared; that is, the read-modify byte write sequence performed by this instruction cannot be interrupted.

## 11.8.2 ASET

This instruction allows a bit within an uncached I/O control register to be atomically set; that is, the read-modify byte write sequence performed by this instruction cannot be interrupted.

### 11.8.3 IRET

This instruction can be used as a replacement for the ERET instruction when returning from an interrupt. This instruction implements the Automated Interrupt Epilogue feature, which automates restoring some of the COP0 registers from the stack and updating the C0\_Status register in preparation for returning to non-exception mode. This instruction also implements the optional Interrupt Chaining feature, which allows a subsequent interrupt to be handled without returning to non-exception mode.

Chapter 12

# microAptiv<sup>™</sup> UP Processor Core Instructions

This chapter supplements the *MIPS32*® *Architecture Reference Manual, Volume II* by describing instruction behavior that is specific to a MIPS32 microAptiv UP processor core. For complete descriptions of all MIPS32 and mciroMIPS32 instructions, refer to *MIPS*® *Architecture For Programmers, Volume II: The MIPS32*® *Instruction Set* [7] and *MIPS*® *Architecture For Programmers, Volume II: The microMIPS32*® *Instruction Set* [8].

This chapter is divided into the following sections:

- Section 12.1 "Understanding the Instruction Descriptions"
- Section 12.2 "microAptiv<sup>™</sup> UP Core Opcode Map"
- Section 12.3 "MIPS32<sup>®</sup> Instruction Set for the microAptiv<sup>™</sup> UP Core"

The microAptiv UP processor core also supports theMIPS32 microMIPS architecture. The microMIPS instruction set is described in Chapter 13, "microMIPS<sup>TM</sup> Instruction Set Architecture" on page 349.

The microAptiv UP processor core also supports the instructions in the MIPS DSP Module Revision 2. The MIPS DSP Module Revision 2 instruction set is described in Chapter 3, "The MIPS® DSP Module" on page 72.

## **12.1 Understanding the Instruction Descriptions**

Refer to *Volume II* of the *MIPS32 Architecture Reference Manual* for detailed information about the instruction descriptions, namely, the instruction fields, definition of terms, and functional notation. This section provides basic information

## 12.2 microAptiv<sup>™</sup> UP Core Opcode Map

### Key

- CAPITALIZED text indicates an opcode mnemonic
- Italicized text refers the reader to indicates to the specified opcode submap for further instruction bit decode.
- Entries containing the α symbol indicate that a reserved instruction fault occurs if the core executes this instruction.
- Entries containing the β symbol indicate that a coprocessor unusable exception occurs if the core executes this instruction

| ор   | code | bits 2826 |        |      |       |          |      |       |          |
|------|------|-----------|--------|------|-------|----------|------|-------|----------|
|      |      | 0         | 1      | 2    | 3     | 4        | 5    | 6     | 7        |
| bits | 3129 | 000       | 001    | 010  | 011   | 100      | 101  | 110   | 111      |
| 0    | 000  | Special   | RegImm | J    | JAL   | BEQ      | BNE  | BLEZ  | BGTZ     |
| 1    | 001  | ADDI      | ADDIU  | SLTI | SLTIU | ANDI     | ORI  | XORI  | LUI      |
| 2    | 010  | COP0      | β      | COP2 | β     | BEQL     | BNEL | BLEZL | BGTZL    |
| 3    | 011  | α         | α      | α    | α     | Special2 | θΑΛΞ | α     | Σπεχιαλ3 |
| 4    | 100  | LB        | LH     | LWL  | LW    | LBU      | LHU  | LWR   | α        |
| 5    | 101  | SB        | SH     | SWL  | SW    | α        | α    | SWR   | CACHE    |
| 6    | 110  | LL        | β      | LWC2 | PREF  | α        | β    | α     | α        |
| 7    | 111  | SC        | β      | SWC2 | α     | α        | β    | α     | α        |

## Table 12.1 Encoding of the Opcode Field

## Table 12.2 Special Opcode Encoding of Function Field

| fun | ction | bits 20 |       |              |      |         |       |                |      |
|-----|-------|---------|-------|--------------|------|---------|-------|----------------|------|
|     |       | 0       | 1     | 2            | 3    | 4       | 5     | 6              | 7    |
| bit | s 53  | 000     | 001   | 010          | 011  | 100     | 101   | 110            | 111  |
| 0   | 000   | SLL     | β     | SRL/<br>ROTR | SRA  | SLLV    | α     | SRLV/<br>ROTRV | SRAV |
| 1   | 001   | JR      | JALR  | MOVZ         | MOVN | SYSCALL | BREAK | α              | SYNC |
| 2   | 010   | MFHI    | MTHI  | MFLO         | MTLO | α       | α     | α              | α    |
| 3   | 011   | MULT    | MULTU | DIV          | DIVU | α       | α     | α              | α    |
| 4   | 100   | ADD     | ADDU  | SUB          | SUBU | AND     | OR    | XOR            | NOR  |
| 5   | 101   | α       | α     | SLT          | SLTU | α       | α     | α              | α    |
| 6   | 110   | TGE     | TGEU  | TLT          | TLTU | TEQ     | α     | TNE            | α    |
| 7   | 111   | α       | α     | α            | α    | α       | α     | α              | α    |

## Table 12.3 Special2 Opcode Encoding of Function Field

| fun | oction | bits 20             |       |     |     |      |       |     |       |
|-----|--------|---------------------|-------|-----|-----|------|-------|-----|-------|
|     |        | 0                   | 1     | 2   | 3   | 4    | 5     | 6   | 7     |
| bit | s 53   | 000                 | 001   | 010 | 011 | 100  | 101   | 110 | 111   |
| 0   | 000    | MADD                | MADDU | MUL | α   | MSUB | MSUBU | α   | α     |
| 1   | 001    | α                   | α     | α   | α   | α    | α     | α   | α     |
| 2   | 010    | $UDI^1$ or $\alpha$ |       |     |     |      |       |     |       |
| 3   | 011    |                     |       |     |     |      |       |     |       |
| 4   | 100    | CLZ                 | CLO   | α   | α   | α    | α     | α   | α     |
| 5   | 101    | α                   | α     | α   | α   | α    | α     | α   | α     |
| 6   | 110    | α                   | α     | α   | α   | α    | α     | α   | α     |
| 7   | 111    | α                   | α     | α   | α   | α    | α     | α   | SDBBP |

1. CorExtend instructions are a build-time option of the microAptiv UP Pro core, if not implemented this instructions space will cause a reserved instruction exception. If assembler support exists, the mnemonics for CorExtend instructions are most likely UDI0, UDI1, ..., UDI15.

| fun | ction | bits 20 |     |     |       |     |     |     |     |
|-----|-------|---------|-----|-----|-------|-----|-----|-----|-----|
|     |       | 0       | 1   | 2   | 3     | 4   | 5   | 6   | 7   |
| bit | s 53  | 000     | 001 | 010 | 011   | 100 | 101 | 110 | 111 |
| 0   | 000   | EXT     | α   | α   | α     | INS | α   | α   | α   |
| 1   | 001   | α       | α   | α   | α     | α   | α   | α   | α   |
| 2   | 010   | α       | α   | α   | α     | α   | α   | α   | α   |
| 3   | 011   | α       | α   | α   | α     | α   | α   | α   | α   |
| 4   | 100   | BSHFL   | α   | α   | α     | α   | α   | α   | α   |
| 5   | 101   | α       | α   | α   | α     | α   | α   | α   | α   |
| 6   | 110   | α       | α   | α   | α     | α   | α   | α   | α   |
| 7   | 111   | α       | α   | α   | ΡΔΗΩΡ | α   | α   | α   | α   |

## Table 12.4 Special3 Opcode Encoding of Function Field

## Table 12.5 Regimm Encoding of rt Field

|      | rt   | bits 1816 |        |         |         |      |     |      |       |
|------|------|-----------|--------|---------|---------|------|-----|------|-------|
|      |      | 0         | 1      | 2       | 3       | 4    | 5   | 6    | 7     |
| bits | 2019 | 000       | 001    | 010     | 011     | 100  | 101 | 110  | 111   |
| 0    | 00   | BLTZ      | BGEZ   | BLTZL   | BGEZL   | α    | α   | α    | α     |
| 1    | 01   | TGEI      | TGEIU  | TLTI    | TLTIU   | TEQI | α   | TNEI | α     |
| 2    | 10   | BLTZAL    | BGEZAL | BLTZALL | BGEZALL | α    | α   | α    | α     |
| 3    | 11   | α         | α      | α       | α       | α    | α   | α    | ΣΨΝΧΙ |

## Table 12.6 COP2 Encoding of rs Field

|           | rs | bits 2321 |     |         |       |      |     |      |       |
|-----------|----|-----------|-----|---------|-------|------|-----|------|-------|
|           |    | 0         | 1   | 2       | 3     | 4    | 5   | 6    | 7     |
| bits 2524 |    | 000       | 001 | 010     | 011   | 100  | 101 | 110  | 111   |
| 0         | 00 | MFC2      | α   | CFC2    | ΜΦΗΧ2 | MTC2 | α   | CTC2 | MTHX2 |
| 1         | 01 | BC2       |     | $BC2^1$ |       |      |     |      |       |
| 2         | 10 |           | СО  |         |       |      |     |      |       |
| 3         | 11 |           |     |         |       |      |     |      |       |

1. The core will treat the entire row as a *BC2* instruction. However compiler and assembler support only exists for the first one. Some compiler and assembler products may allow the user to add new instructions.

| rt      | bits 16 |       |
|---------|---------|-------|
| bits 17 | 0       | 1     |
| 0       | BC2F    | BC2T  |
| 1       | BC2FL   | BC2TL |

|      | rs   | bits 2321 |     |        |       |      |     |        |     |
|------|------|-----------|-----|--------|-------|------|-----|--------|-----|
|      |      | 0         | 1   | 2      | 3     | 4    | 5   | 6      | 7   |
| bits | 2524 | 000       | 001 | 010    | 011   | 100  | 101 | 110    | 111 |
| 0    | 00   | MFC0      | α   | α      | α     | MTC0 | α   | α      | α   |
| 1    | 01   | α         | α   | ΡΔΠΓΠΡ | ΜΦΜΧ0 | α    | α   | ΩΡΠΓΠΡ | α   |
| 2    | 10   | СО        |     |        |       |      |     |        |     |
| 3    | 11   |           |     |        |       |      |     |        |     |

## Table 12.8 COP0 Encoding of rs Field

## Table 12.9 COP0 Encoding of Function Field When rs=CO

| fun | ction | bits 20 |      |       |     |     |     |       |       |
|-----|-------|---------|------|-------|-----|-----|-----|-------|-------|
|     |       | 0       | 1    | 2     | 3   | 4   | 5   | 6     | 7     |
| bit | s 53  | 000     | 001  | 010   | 011 | 100 | 101 | 110   | 111   |
| 0   | 000   | α       | TLBR | TLBWI | α   | α   | α   | TLBWR | α     |
| 1   | 001   | TLBP    | α    | α     | α   | α   | α   | α     | α     |
| 2   | 010   | α       | α    | α     | α   | α   | α   | α     | α     |
| 3   | 011   | ERET    | IAXK | α     | α   | α   | α   | α     | DERET |
| 4   | 100   | WAIT    | α    | α     | α   | α   | α   | α     | α     |
| 5   | 101   | α       | α    | α     | α   | α   | α   | α     | α     |
| 6   | 110   | α       | α    | α     | α   | α   | α   | α     | α     |
| 7   | 111   | α       | α    | α     | α   | α   | α   | α     | α     |

## 12.3 MIPS32® Instruction Set for the microAptiv<sup>™</sup> UP Core

This section provides a summary of the MIPS32 instructions for the microAptiv UP cores (microMIPS32 instructions are described in Chapter 13, "microMIPS<sup>TM</sup> Instruction Set Architecture" on page 349).

Table 12.10 lists the instructions in alphabetical order. Instructions that have implementation-dependent behavior are described in subsequent sections; all other MIPS32 instructions are described in detail in the *MIPS®* Architecture For Programmers, Volume II: The MIPS32® Instruction Set [7] and are not duplicated here.

| Instruction | Description                                                       | Function                       |
|-------------|-------------------------------------------------------------------|--------------------------------|
| ADD         | Integer Add                                                       | Rd = Rs + Rt                   |
| ADDI        | Integer Add Immediate                                             | Rt = Rs + Immed                |
| ADDIU       | Unsigned Integer Add Immediate                                    | $Rt = Rs +_U Immed$            |
| ADDU        | Unsigned Integer Add                                              | $Rd = Rs +_U Rt$               |
| AND         | Logical AND                                                       | Rd = Rs & Rt                   |
| ANDI        | Logical AND Immediate                                             | $Rt = Rs \& (0_{16}    Immed)$ |
| ACLR        | Atomic Bit Clear                                                  | See MCU ASE Instructions       |
| ASET        | Atomic Bit Set                                                    | See MCU ASE Instructions       |
| В           | Unconditional Branch<br>(Assembler idiom for: BEQ r0, r0, offset) | PC += (int)offset              |

| Instruction | Description                                                 | Function                                                                               |
|-------------|-------------------------------------------------------------|----------------------------------------------------------------------------------------|
| BAL         | Branch and Link<br>(Assembler idiom for: BGEZAL r0, offset) | GPR[31] = PC + 8<br>PC += (int)offset                                                  |
| BC2F        | Branch On COP2 Condition False                              | if COP2Condition(cc) == 0<br>PC += (int)offset                                         |
| BC2FL       | Branch On COP2 Condition False Likely                       | if COP2Condition(cc) == 0<br>PC += (int)offset<br>else<br>Ignore Next Instruction      |
| BC2T        | Branch On COP2 Condition True                               | if COP2Condition(cc) == 1<br>PC += (int)offset                                         |
| BC2TL       | Branch On COP2 Condition True Likely                        | if COP2Condition(cc) == 1<br>PC += (int)offset<br>else<br>Ignore Next Instruction      |
| BEQ         | Branch On Equal                                             | if Rs == Rt<br>PC += (int)offset                                                       |
| BEQL        | Branch On Equal Likely                                      | if Rs == Rt<br>PC += (int)offset<br>else<br>Ignore Next Instruction                    |
| BGEZ        | Branch on Greater Than or Equal To Zero                     | if !Rs[31]<br>PC += (int)offset                                                        |
| BGEZAL      | Branch on Greater Than or Equal To Zero And<br>Link         | GPR[31] = PC + 8<br>if !Rs[31]<br>PC += (int)offset                                    |
| BGEZALL     | Branch on Greater Than or Equal To Zero And<br>Link Likely  | GPR[31] = PC + 8<br>if !Rs[31]<br>PC += (int)offset<br>else<br>Ignore Next Instruction |
| BGEZL       | Branch on Greater Than or Equal To Zero<br>Likely           | if !Rs[31]<br>PC += (int)offset<br>else<br>Ignore Next Instruction                     |
| BGTZ        | Branch on Greater Than Zero                                 | if !Rs[31] && Rs != 0<br>PC += (int)offset                                             |
| BGTZL       | Branch on Greater Than Zero Likely                          | if !Rs[31] && Rs != 0<br>PC += (int)offset<br>else<br>Ignore Next Instruction          |
| BLEZ        | Branch on Less Than or Equal to Zero                        | $if Rs[31] \parallel Rs == 0$<br>PC += (int)offset                                     |
| BLEZL       | Branch on Less Than or Equal to Zero Likely                 | if Rs[31]    Rs == 0<br>PC += (int)offset<br>else<br>Ignore Next Instruction           |

| Table 12.10 Instruction Set (Continued) | Table | 12.10 Instruction | n Set (Continued) |
|-----------------------------------------|-------|-------------------|-------------------|
|-----------------------------------------|-------|-------------------|-------------------|

| Instruction | Description                              | Function                                                                              |  |  |  |
|-------------|------------------------------------------|---------------------------------------------------------------------------------------|--|--|--|
| BLTZ        | Branch on Less Than Zero                 | if Rs[31]<br>PC += (int)offset                                                        |  |  |  |
| BLTZAL      | Branch on Less Than Zero And Link        | GPR[31] = PC + 8<br>if Rs[31]<br>PC += (int)offset                                    |  |  |  |
| BLTZALL     | Branch on Less Than Zero And Link Likely | GPR[31] = PC + 8<br>if Rs[31]<br>PC += (int)offset<br>else<br>Ignore Next Instruction |  |  |  |
| BLTZL       | Branch on Less Than Zero Likely          | if Rs[31]<br>PC += (int)offset<br>else<br>Ignore Next Instruction                     |  |  |  |
| BNE         | Branch on Not Equal                      | if Rs != Rt<br>PC += (int)offset                                                      |  |  |  |
| BNEL        | Branch on Not Equal Likely               | if Rs != Rt<br>PC += (int)offset<br>else<br>Ignore Next Instruction                   |  |  |  |
| BREAK       | Breakpoint                               | Break Exception                                                                       |  |  |  |
| CACHE       | Cache Operation                          | See Cache Description                                                                 |  |  |  |
| CFC2        | Move Control Word From Coprocessor 2     | Rt = CCR[2, n]                                                                        |  |  |  |
| CLO         | Count Leading Ones                       | Rd = NumLeadingOnes(Rs)                                                               |  |  |  |
| CLZ         | Count Leading Zeroes                     | Rd = NumLeadingZeroes(Rs)                                                             |  |  |  |
| COP0        | Coprocessor 0 Operation                  | See Coprocessor Description                                                           |  |  |  |
| COP2        | Coprocessor 2 Operation                  | See Coprocessor 2 Description                                                         |  |  |  |
| CTC2        | Move Control Word To Coprocessor 2       | CCR[2, n] = Rt                                                                        |  |  |  |
| DERET       | Return from Debug Exception              | PC = DEPC<br>Exit Debug Mode                                                          |  |  |  |
| DI          | Disable Interrupts                       | Rt=Status<br>Status <sub>IE</sub> =0                                                  |  |  |  |
| DIV         | Divide                                   | LO = (int)Rs / (int)Rt<br>HI = (int)Rs % (int)Rt                                      |  |  |  |
| DIVU        | Unsigned Divide                          | LO = (uns)Rs / (uns)Rt<br>HI = (uns)Rs % (uns)Rt                                      |  |  |  |
| EHB         | Execution Hazard Barrier                 | Stall until execution hazards are cleared                                             |  |  |  |
| EI          | Enable Interrupts                        | Rt=Status<br>Status <sub>IE</sub> =1                                                  |  |  |  |

| Instruction | Description                                | Function                                                                                     |
|-------------|--------------------------------------------|----------------------------------------------------------------------------------------------|
| ERET        | Return from Exception                      | if SR[2]<br>PC = ErrorEPC<br>else<br>PC = EPC<br>SR[1] = 0<br>SR[2] = 0<br>LL = 0            |
| EXT         | Extract Bit Field                          | Rt=ExtractField(Rs,msbd,lsb)                                                                 |
| INS         | Insert Bit Field                           | Rt=InsertField(Rt,Rs,msb,lsb)                                                                |
| IRET        | Return from Exception                      | See MCU ASE Instructions                                                                     |
| J           | Unconditional Jump                         | PC = PC[31:28]    offset<<2                                                                  |
| JAL         | Jump and Link                              | GPR[31] = PC + 8<br>PC = PC[31:28]    offset<<2                                              |
| JALR        | Jump and Link Register                     | Rd = PC + 8 $PC = Rs$                                                                        |
| JALR.HB     | Jump and Link Register with Hazard Barrier | Rd = PC + 8<br>PC = Rs<br>Stall until all execution and instruc-<br>tion hazards are cleared |
| JR          | Jump Register                              | PC = Rs                                                                                      |
| JR.HB       | Jump Register with Hazard Barrier          | PC = Rs<br>Stall until all execution and instruc-<br>tion hazards are cleared                |
| LB          | Load Byte                                  | Rt = (byte)Mem[Rs+offset]                                                                    |
| LBU         | Unsigned Load Byte                         | Rt = (ubyte))Mem[Rs+offset]                                                                  |
| LH          | Load Halfword                              | Rt = (half)Mem[Rs+offset]                                                                    |
| LHU         | Unsigned Load Halfword                     | Rt = (uhalf)Mem[Rs+offset]                                                                   |
| LL          | Load Linked Word                           | Rt = Mem[Rs+offset]<br>LL = 1<br>LLAdr = Rs + offset                                         |
| LUI         | Load Upper Immediate                       | Rt = immediate << 16                                                                         |
| LW          | Load Word                                  | Rt = Mem[Rs+offset]                                                                          |
| LWC2        | Load Word To Coprocessor 2                 | CPR[2, n, 0] = Mem[Rs+offset]                                                                |
| LWL         | Load Word Left                             | See LWL instruction.                                                                         |
| LWR         | Load Word Right                            | See LWR instruction.                                                                         |
| MADD        | Multiply-Add                               | HI, LO += (int)Rs * (int)Rt                                                                  |
| MFC0        | Move From Coprocessor 0                    | Rt = CPR[0, n, sel]                                                                          |
| MFC2        | Move From Coprocessor 2                    | $Rt = CPR[2, n, sel_{310}]$                                                                  |
| MFHC2       | Move From High Word Coprocessor2           | $Rt=CPR[2,n,sel]_{6332}$                                                                     |
| MFHI        | Move From HI                               | Rd = HI                                                                                      |
| MFLO        | Move From LO                               | Rd = LO                                                                                      |

Table 12.10 Instruction Set (Continued)

| Instruction | Description                                           | Function                                         |
|-------------|-------------------------------------------------------|--------------------------------------------------|
| MOVN        | Move Conditional on Not Zero                          | if $GPR[rt] \neq 0$ then<br>GPR[rd] = GPR[rs]    |
| MOVZ        | Move Conditional on Zero                              | if GPR[rt] = 0 then<br>GPR[rd] = GPR[rs]         |
| MSUB        | Multiply-Subtract                                     | HI, LO -= (int)Rs * (int)Rt                      |
| MSUBU       | Multiply-Subtract Unsigned                            | HI, LO -= (uns)Rs * (uns)Rt                      |
| MTC0        | Move To Coprocessor 0                                 | CPR[0, n, sel] = Rt                              |
| MTC2        | Move To Coprocessor 2                                 | $CPR[2, n, sel]_{310} = Rt$                      |
| MTHC2       | Move To High Word Coprocessor 2                       | $CPR[2, n, sel]_{6332} = Rt$                     |
| MTHI        | Move To HI                                            | HI = Rs                                          |
| MTLO        | Move To LO                                            | LO = Rs                                          |
| MUL         | Multiply with register write                          | HI   LO =Unpredictable<br>Rd = LO                |
| MULT        | Integer Multiply                                      | HI   LO = (int)Rs * (int)Rd                      |
| NOP         | No Operation<br>(Assembler idiom for: SLL r0, r0, r0) |                                                  |
| NOR         | Logical NOR                                           | $Rd = \sim (Rs \mid Rt)$                         |
| OR          | Logical OR                                            | Rd = Rs   Rt                                     |
| ORI         | Logical OR Immediate                                  | $Rt = Rs \mid Immed$                             |
| PREF        | Prefetch                                              | Load Specified Line into Cache                   |
| RDHWR       | Read HardWare Register                                | Rt=HWR[Rd]                                       |
| RDPGPR      | Read GPR from Previous Shadow Set                     | Rd=SGPR[SRSCtl <sub>PSS</sub> , Rt]              |
| ROTR        | Rotate Word Right                                     | $Rd = Rt_{sa-10} \parallel Rt_{31sa}$            |
| ROTRV       | Rotate Word Right Variable                            | $Rd = Rt_{Rs-10}    Rt_{31Rs}$                   |
| SB          | Store Byte                                            | (byte)Mem[Rs+offset] = Rt                        |
| SC          | Store Conditional Word                                | if LL =1<br>mem[Rxoffs] = Rt<br>Rt = LL          |
| SDBBP       | Software Debug Breakpoint                             | Trap to SW Debug Handler                         |
| SEB         | Sign Extend Byte                                      | Rd=SignExtend(Rt <sub>70</sub> )                 |
| SEH         | Sign Extend Half                                      | Rd=SignExtend(Rt <sub>150</sub> )                |
| SH          | Store Halfword                                        | (half)Mem[Rs+offset] = Rt                        |
| SLL         | Shift Left Logical                                    | $Rd = Rt \ll sa$                                 |
| SLLV        | Shift Left Logical Variable                           | Rd = Rt << Rs[4:0]                               |
| SLT         | Set on Less Than                                      | if (int)Rs < (int)Rt<br>Rd = 1<br>else<br>Rd = 0 |

| Table 12.10 Instructio | n Set (Continued) |
|------------------------|-------------------|
|------------------------|-------------------|

| Instruction | Description                                             | Function                                                    |  |  |  |  |  |
|-------------|---------------------------------------------------------|-------------------------------------------------------------|--|--|--|--|--|
| SLTI        | Set on Less Than Immediate                              | if (int)Rs < (int)Immed $Rt = 1$ else $Rt = 0$              |  |  |  |  |  |
| SLTIU       | Set on Less Than Immediate Unsigned                     | if (uns)Rs < (uns)Immed<br>Rt = 1<br>else<br>Rt = 0         |  |  |  |  |  |
| SLTU        | Set on Less Than Unsigned                               | if (uns)Rs < (uns)Immed<br>Rd = 1<br>else<br>Rd = 0         |  |  |  |  |  |
| SRA         | Shift Right Arithmetic                                  | Rd = (int)Rt >> sa                                          |  |  |  |  |  |
| SRAV        | Shift Right Arithmetic Variable                         | Rd = (int)Rt >> Rs[4:0]                                     |  |  |  |  |  |
| SRL         | Shift Right Logical                                     | Rd = (uns)Rt >> sa                                          |  |  |  |  |  |
| SRLV        | Shift Right Logical Variable                            | Rd = (uns)Rt >> Rs[4:0]                                     |  |  |  |  |  |
| SSNOP       | Superscalar Inhibit No Operation                        | Nop                                                         |  |  |  |  |  |
| SUB         | Integer Subtract                                        | Rt = (int)Rs - (int)Rd                                      |  |  |  |  |  |
| SUBU        | Unsigned Subtract                                       | Rt = (uns)Rs - (uns)Rd                                      |  |  |  |  |  |
| SW          | Store Word                                              | Mem[Rs+offset] = Rt                                         |  |  |  |  |  |
| SWC2        | Store Word From Coprocessor 2                           | Mem[Rs+offset] = CPR[2, n, 0]                               |  |  |  |  |  |
| SWL         | Store Word Left                                         | See SWL instruction description.                            |  |  |  |  |  |
| SWR         | Store Word Right                                        | See SWR instruction description.                            |  |  |  |  |  |
| SYNC        | Synchronize                                             | See SYNC instruction below.                                 |  |  |  |  |  |
| SYNCI       | Synchronize Caches to Make Instruction Writes Effective | Force D\$ writeback and I\$ invalidate on specified address |  |  |  |  |  |
| SYSCALL     | System Call                                             | SystemCallException                                         |  |  |  |  |  |
| TEQ         | Trap if Equal                                           | if Rs == Rt<br>TrapException                                |  |  |  |  |  |
| TEQI        | Trap if Equal Immediate                                 | if Rs == (int)Immed<br>TrapException                        |  |  |  |  |  |
| TGE         | Trap if Greater Than or Equal                           | if (int)Rs >= (int)Rt<br>TrapException                      |  |  |  |  |  |
| TGEI        | Trap if Greater Than or Equal Immediate                 | if (int)Rs >= (int)Immed<br>TrapException                   |  |  |  |  |  |
| TGEIU       | Trap if Greater Than or Equal Immediate<br>Unsigned     | if (uns)Rs >= (uns)Immed<br>TrapException                   |  |  |  |  |  |
| TGEU        | Trap if Greater Than or Equal Unsigned                  | if (uns)Rs >= (uns)Rt<br>TrapException                      |  |  |  |  |  |
| TLBP        | Probe TLB for Matching Entry                            | See TLBP instruction below.                                 |  |  |  |  |  |
| TLBR        | Read Index for TLB Entry                                | See TLBR instruction below.                                 |  |  |  |  |  |
| TLBWI       | Write Indexed TLB Entry                                 | See TLBWI instruction below.                                |  |  |  |  |  |

| Table 12.10 Instruction Set | (Continued) |
|-----------------------------|-------------|
|                             | (Continued) |

| Instruction | Description                          | Function                                 |
|-------------|--------------------------------------|------------------------------------------|
| TLBWR       | Write Random TLB Entry               | See TLBWR instruction below.             |
| TLT         | Trap if Less Than                    | if (int)Rs < (int)Rt<br>TrapException    |
| TLTI        | Trap if Less Than Immediate          | if (int)Rs < (int)Immed<br>TrapException |
| TLTIU       | Trap if Less Than Immediate Unsigned | if (uns)Rs < (uns)Immed<br>TrapException |
| TLTU        | Trap if Less Than Unsigned           | if (uns)Rs < (uns)Rt<br>TrapException    |
| TNE         | Trap if Not Equal                    | if Rs != Rt<br>TrapException             |
| TNEI        | Trap if Not Equal Immediate          | if Rs != (int)Immed<br>TrapException     |
| WAIT        | Wait for Interrupts                  | Stall until interrupt occurs             |
| WRPGPR      | Write to GPR in Previous Shadow Set  | SGPR[SRSCtl <sub>PSS</sub> ,Rd]=Rt       |
| WSBH        | Word Swap Bytes within Halfwords     | Rd=SwapBytesWithinHalfs(Rt)              |
| XOR         | Exclusive OR                         | $Rd = Rs \wedge Rt$                      |
| XORI        | Exclusive OR Immediate               | $Rt = Rs^{(uns)}Immed$                   |

| Table 12 | 2.10 Inst | ruction Set ( | (Continued) |
|----------|-----------|---------------|-------------|
|----------|-----------|---------------|-------------|

| 31 | 26               | 6 25 | 21   | 20 | 16              | 15 | 14  | 12 | 11 |        | 4 | 3 | 0 |  |
|----|------------------|------|------|----|-----------------|----|-----|----|----|--------|---|---|---|--|
|    | REGIMM<br>000001 |      | base | 1  | ATOMIC<br>00111 | 0  | Bit |    |    | offset |   |   |   |  |
|    | 6                |      | 5    |    | 5               | 1  | 3   |    |    | 12     |   |   |   |  |

Format: ACLR bit, offset(base)

Purpose: Atomically Clear Bit within Byte

Description: Disable interrupts; temp ← memory[GPR[base] + offset]; temp ← (temp and ~(1
<< bit)) ; memory[GPR[base] + offset] ← temp; Enable Interrupts</pre>

MIPS and MCU ASE

The contents of the 8-bit byte at the memory location specified by the effective address are fetched. The specified bit within the byte is cleared to zero. The modified byte is stored in memory at the location specified by the effective address. The 12-bit signed *offset* is added to the contents of GPR *base* to form the effective address. The read-modify-write sequence cannot be interrupted.

Transactions with locking semantics occur in some memory interconnects/busses. It is implementation-specific whether this instruction uses such locking transactions.

#### **Restrictions:**

The operation of the processor is **UNDEFINED** if an ACLR instruction is executed in the delay slot of a branch or jump instruction.

#### **Operation:**

#### **Exceptions:**

TLB Refill, TLB Invalid, TLB Modified, Address Error, Watch

#### **Programming Notes:**

| 31                | 26 | 25 24   | 23 21 | 20 16 | 15 12        | 11 0   |
|-------------------|----|---------|-------|-------|--------------|--------|
| POOL32B<br>001000 |    | A0<br>0 | bit   | base  | ACLR<br>1011 | offset |
| 6                 |    | 2       | 3     | 5     | 4            | 12     |

Format: ACLR bit, offset(base)

microMIPS and MCU ASE

Purpose: Atomically Clear Bit within Byte

Description: Disable interrupts; temp ← memory[GPR[base] + offset]; temp ← (temp and ~(1
<< bit)) ; memory[GPR[base] + offset] ← temp; Enable Interrupts</pre>

The contents of the byte at the memo ry location specified by the effective address are fetched. The specified bit within the byte is cleared to zero. The modified byte is stored in memory at the location specified by the effective address. The 12-bit signed *offset* is added to the contents of GPR *base* to form the effective address. The read-modify-write sequence cannot be interrupted.

Transactions with locking semantics occur in some memory interconnects/busses. It is implementation-specific whether this instruction uses such locking transactions.

#### **Restrictions:**

The operation of the processor is **UNDEFINED** if an ACLR instruction is executed in the delay slot of a branch or jump instruction.

#### **Operation:**

#### **Exceptions:**

TLB Refill, TLB Invalid, TLB Modified, Address Error, Watch

#### **Programming Notes:**

| 31 | 2               | 6 25 | 5 21 | 20              | 16 | 15 | 14 12 | 11     | 4 | 3 | 0 |
|----|-----------------|------|------|-----------------|----|----|-------|--------|---|---|---|
|    | EGIMM<br>000001 |      | base | ATOMIC<br>00111 |    | 1  | Bit   | offset |   |   |   |
|    | 6               |      | 5    | 5               |    | 1  | 3     | 12     |   |   |   |

Format: ASET bit, offset(base)

MIPS and MCU ASE

Purpose: Atomically Set Bit within Byte

Description: Disable interrupts;temp ← memory[GPR[base] + offset]; temp ← (temp or (1 << bit)) ; memory[GPR[base] + offset] ← temp; Enable Interrupts

The contents of the 8-bit byte at the memory location specified by the effective address are fetched. The specified bit within the byte is set to one. The modified byte is stored in memory at the location specified by the effective address. The 12-bit signed *offset* is added to the contents of GPR *base* to form the effective address. The read-modify-write sequence cannot be interrupted.

Transactions with locking semantics occur in some memory interconnects/busses. It is implementation-specific whether this instruction uses such locking transactions.

#### **Restrictions:**

The operation of the processor is **UNDEFINED** if an ASET instruction is executed in the delay slot of a branch or jump instruction.

#### **Operation:**

#### **Exceptions:**

TLB Refill, TLB Invalid, TLB Modified, Address Error, Watch

#### **Programming Notes:**

| 31                | 26 2 | 25 2    | 24 | 23 21 | 20   | 16 15 |              | 12 | 11 0   |
|-------------------|------|---------|----|-------|------|-------|--------------|----|--------|
| POOL32B<br>001000 |      | A0<br>0 |    | bit   | base |       | ASET<br>0011 |    | offset |
| 6                 |      | 2       |    | 3     | 5    |       | 4            |    | 12     |

Format: ASET bit, offset(base)

microMIPS AND MCU ASE

Purpose: Atomically Set Bit within Byte

Description: Disable interrupts;temp ← memory[GPR[base] + offset]; temp ← (temp or (1 << bit)) ; memory[GPR[base] + offset] ← temp; Enable Interrupts

The contents of the byte at the memo ry location specified by the effective address are fetched. The specified bit within the byte is set to one. The modified byte is stored in memory at the location specified by the effective address. The 12-bit signed *offset* is added to the contents of GPR *base* to form the effective address. The read-modify-write sequence cannot be interrupted.

Transactions with locking semantics occur in some memory interconnects/busses. It is implementation-specific whether this instruction uses such locking transactions.

#### **Restrictions:**

The operation of the processor is **UNDEFINED** if an ASET instruction is executed in the delay slot of a branch or jump instruction.

#### **Operation:**

#### **Exceptions:**

TLB Refill, TLB Invalid, TLB Modified, Address Error, Watch

#### **Programming Notes:**



Purpose: Perform Cache Operation

To perform the cache operation specified by op.

#### **Description:**

The 16-bit offset is sign-extended and added to the contents of the base register to form an effective address. The effective address is used in one of the following ways based on the operation to be performed and the type of cache, as described in the following table.

| Operation<br>Requires an | Type of<br>Cache | Usage of Effective Address                                                                                                                                                                                                                                                                                                                          |
|--------------------------|------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Address                  | Virtual          | The effective address is used to address the cache. An address translation may or may not be performed on the effective address (with the possibility that a TLB Refill or TLB Invalid exception might occur)                                                                                                                                       |
| Address                  | Physical         | The effective address is translated by the MMU to a physical address. The physical address is then used to address the cache                                                                                                                                                                                                                        |
| Index                    | N/A              | The effective address is translated by the MMU to a physical address. It is imple-<br>mentation dependent whether the effective address or the translated physical<br>address is used to index the cache. As such, a kseg0 address should always be used<br>for cache operations that require an index. See the Programming Notes section<br>below. |
|                          |                  | Assuming that the total cache size in bytes is CS, the associativity is A, and the number of bytes per tag is BPT, the following calculations give the fields of the address which specify the way and the index:                                                                                                                                   |
|                          |                  | OffsetBit ← Log2(BPT)                                                                                                                                                                                                                                                                                                                               |
|                          |                  | IndexBit ← Log2(CS / A)<br>WayBit ← IndexBit + Ceiling(Log2(A))                                                                                                                                                                                                                                                                                     |
|                          |                  | Way $\leftarrow$ Addr <sub>WayBit-1IndexBit</sub>                                                                                                                                                                                                                                                                                                   |
|                          |                  | $Index \leftarrow Addr_{IndexBit-1OffsetBit}$                                                                                                                                                                                                                                                                                                       |

#### Figure 12.1 Usage of Address Fields to Select Index and Way



A TLB Refill and TLB Invalid (both with cause code equal TLBL) exception can occur on any operation. For index operations (where the address is used to index the cache but need not match the cache tag) s oftware should use unmapped addresses to avoid TLB exceptions. This instruction never causes TLB Modified exceptions nor TLB Refill exceptions with a cause code of TLBS.

The effective address may be an arb itrarily-aligned by address. The CACHE instruction never causes an A ddress Error Exception due to an non-aligned address.

A Cache Error exception may occur as a by-product of some operations performed by this instruction. For example, if a Writeback operation detects a cache or bus error during the processing of the operation, that error is reported via a Cache Error exception. Similarly, a Bus Error Exception may occur if a bus operation invoked by this instruction is terminated in an error. However, cache error exceptions must not be triggered by an Index Load Tag or Index Store tag operation, as these operations are used for initialization and diagnostic purposes.

An Address Error Exception (when the *Cause* register's *ExcCode* value is AdEL) may occur if the effective address references a portion of the kern el address space that would no rmally result in such an exception. It is implementation-dependent whether such an exception does occur.

It is implementation-dependent whether a data watch is triggered by a cache instruction whose address matches the Watch register address match conditions. Data watch is not triggered by a cache instruction whose address matches the Watch register address match conditions. The preferred implementation is not to match on the cache instruction.

Bits [17:16] of the instruction specify the cache on which to perform the operation, as follows:

| Code | Name | Cache                           |
|------|------|---------------------------------|
| 0b00 | Ι    | Primary Instruction             |
| 0b01 | D    | Primary Data or Unified Primary |
| 0b10 | Т    | Tertiary                        |
| 0b11 | S    | Secondary                       |

Table 12.12 Encoding of Bits[17:16] of CACHE Instruction

Bits [20:18] of the instruction specify the operation to perform. To provide software with a consistent base of cache operations, certain encodings must be supported on all processors. The remaining encodings are recommended.

Table 12.13 Encoding of Bits [20:18] of the CACHE Instruction

| Code  | Caches | Name                                                | Effective<br>Address<br>Operand<br>Type | Operation                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | Compliance<br>Implemented                    |
|-------|--------|-----------------------------------------------------|-----------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------|
| 06000 | Ι      | Index Invalidate                                    | Index                                   | Set the state of the cache block at the specified<br>index to invalid.<br>This required encoding may be used by software<br>to invalidate the entire instruction cache by step-<br>ping through all valid indices.                                                                                                                                                                                                                                                                                                                                                                                                                                      | Required                                     |
|       | D      | Index Writeback<br>Invalidate / Index<br>Invalidate | Index                                   | For a write-back cache: If the state of the cache<br>block at the specified index is valid and dirty,<br>write the block back to the memory address                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     | Required                                     |
|       | S, T   | Index Writeback<br>Invalidate / Index<br>Invalidate | Index                                   | <ul> <li>specified by the cache tag. After that operation has completed, set the state of the cache block to invalid. If the block is valid but not dirty, set the state of the block to invalid.</li> <li>For a write-through cache: Set the state of the cache block at the specified index to invalid.</li> <li>This required encoding may be used by software to invalidate the entire data cache by stepping through all valid indices. Note that Index Store Tag should be used to initialize the cache at power-up.</li> </ul>                                                                                                                   | Required if<br>S, T cache is<br>implemented. |
| 06001 | All    | Index Load Tag                                      | Index                                   | Read the tag for the cache block at the specified<br>index into the <i>TagLo</i> and <i>TagHi</i> Coprocessor 0<br>registers. If the <i>DataLo</i> and <i>DataHi</i> registers<br>are implemented, also read the data correspond-<br>ing to the byte index into the <i>DataLo</i> and<br><i>DataHi</i> registers. This operation must not cause<br>a Cache Error Exception.<br>The granularity and alignment of the data read<br>into the <i>DataLo</i> and <i>DataHi</i> registers are imple-<br>mentation-dependent, but are typically the result<br>of an aligned access to the cache, ignoring the<br>appropriate low-order bits of the byte index. | Recommended                                  |
| 0b010 | All    | Index Store Tag                                     | Index                                   | Write the tag for the cache block at the specified<br>index from the <i>TagLo</i> and <i>TagHi</i> Coprocessor<br>0 registers. This operation must not cause a<br>Cache Error Exception.<br>This required encoding may be used by software<br>to initialize the entire instruction or data caches<br>by stepping through all valid indices. Doing so<br>requires that the <i>TagLo</i> and <i>TagHi</i> registers<br>associated with the cache be initialized first.                                                                                                                                                                                    | Required                                     |

# Table 12.13 Encoding of Bits [20:18] of the CACHE Instruction (Continued)

| Code  | Caches | Name                                                                                                                                                                  | Effective<br>Address<br>Operand<br>Type | Operation                                                                                                                                                                                                                                                                                                                                                                                                                               | Compliance<br>Implemented                                                                       |
|-------|--------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------|
| 0b011 | All    | Index Store Data                                                                                                                                                      | Index                                   | Write the data for the cache block at the speci-<br>fied index from the <i>DataLo</i> Coprocessor 0 reg-<br>ister. This operation must not cause a Cache<br>Error Exception.<br>This required encoding may be used by software<br>to initialize the entire instruction or data caches<br>by stepping through all valid indices. Doing so<br>requires that the <i>DataLo</i> register associated<br>with the cache be initialized first. | Required                                                                                        |
| 0b100 | I, D   | Hit Invalidate                                                                                                                                                        | Address                                 | If the cache block contains the specified address, set the state of the cache block to invalid.                                                                                                                                                                                                                                                                                                                                         | Required (Instruction<br>Cache Encoding<br>Only), Recom-<br>mended otherwise.                   |
|       | S, T   | Hit Invalidate                                                                                                                                                        | Address                                 | This required encoding may be used by software<br>to invalidate a range of addresses from the<br>instruction cache by stepping through the<br>address range by the line size of the cache.                                                                                                                                                                                                                                              | Optional. If<br>Hit_Invalidate_D is<br>implemented, the S<br>and T variants are<br>recommended. |
| 0b101 | Ι      | Fill                                                                                                                                                                  | Address                                 | Fill the cache from the specified address.                                                                                                                                                                                                                                                                                                                                                                                              | Recommended                                                                                     |
|       | D      | Hit Writeback Inval-<br>idate / Hit Invalidate       Address       For a write-back cache: If the cache block con-<br>tains the specified address and it is valid and |                                         | tains the specified address and it is valid and dirty, write the contents back to memory. After                                                                                                                                                                                                                                                                                                                                         | Required                                                                                        |
|       | S, T   | Hit Writeback Inval-<br>idate / Hit Invalidate                                                                                                                        | Address                                 | that operation is completed, set the state of the<br>cache block to invalid. If the block is valid but<br>not dirty, set the state of the block to invalid.<br>For a write-through cache: If the cache block<br>contains the specified address, set the state of<br>the cache block to invalid.<br>This required encoding may be used by software                                                                                       | Required if<br>S, T cache is<br>implemented.                                                    |
|       |        |                                                                                                                                                                       |                                         | to invalidate a range of addresses from the data<br>cache by stepping through the address range by<br>the line size of the cache.                                                                                                                                                                                                                                                                                                       |                                                                                                 |
| 0b110 | D      | Hit Writeback                                                                                                                                                         | Address                                 | If the cache block contains the specified address<br>and it is valid and dirty, write the contents back                                                                                                                                                                                                                                                                                                                                 | Recommended                                                                                     |
|       | S, T   | Hit Writeback                                                                                                                                                         | Address                                 | to memory. After the operation is completed,<br>leave the state of the line valid, but clear the<br>dirty state. For a write-through cache, this oper-<br>ation may be treated as a nop.                                                                                                                                                                                                                                                | Optional, if<br>Hit_Writeback_D is<br>implemented, the S<br>and T variants are<br>recommended.  |

## Table 12.13 Encoding of Bits [20:18] of the CACHE Instruction (Continued)

| Code  | Caches | Name           | Effective<br>Address<br>Operand<br>Type | Operation                                                                                                                                                                                                                                                                                                                                                                     | Compliance<br>Implemented |
|-------|--------|----------------|-----------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------|
| 0b111 | I, D   | Fetch and Lock | Address                                 | If the cache does not contain the specified<br>address, fill it from memory, performing a<br>writeback if required, and set the state to valid<br>and locked. If the cache already contains the<br>specified address, set the state to locked. In<br>set-associative or fully-associative caches, the<br>way selected on a fill from memory is imple-<br>mentation-dependent. | Recommended               |
|       |        |                |                                         | The lock state may be cleared by executing an<br>Index Invalidate, Index Writeback Invalidate,<br>Hit Invalidate, or Hit Writeback Invalidate oper-<br>ation to the locked line, or via an Index Store<br>Tag operation to the line that clears the lock bit.<br>Note that clearing the lock state via Index Store                                                            |                           |
|       |        |                |                                         | Tag depends on the implementation-dependent<br>cache tag and cache line organization, and that<br>Index and Index Writeback Invalidate opera-<br>tions are dependent on cache line organization.<br>Only Hit and Hit Writeback Invalidate opera-<br>tions are generally portable across implementa-<br>tions.                                                                 |                           |
|       |        |                |                                         | It is implementation-dependent whether a<br>locked line is displaced as the result of an exter-<br>nal invalidate or intervention that hits on the<br>locked line. Software must not depend on the<br>locked line remaining in the cache if an external<br>invalidate or intervention would invalidate the<br>line if it were not locked.                                     |                           |
|       |        |                |                                         | It is implementation-dependent whether a Fetch<br>and Lock operation affects more than one line.<br>For example, more than one line around the ref-<br>erenced address may be fetched and locked. It is<br>recommended that only the single line contain-<br>ing the referenced address be affected.                                                                          |                           |

#### **Restrictions:**

The operation of this instruction is **UNDEFINED** for any operation/cache combination that is not implemented.

The operation of this instruction is **UNDEFINED** if the operation requires an address, and that address is uncacheable.

The operation of the instruction is **UNPREDICTABLE** if the cache line that contains the CACHE instruction is the target of an invalidate or a writeback invalidate.

If access to Coprocessor 0 is not enabled, a Coprocessor Unusable Exception is signaled.

Any use of this instruction that can cause cache-line writebacks should be followed by a subsequent SYNC instruc-

tion to avoid hazards where the writeback data is not yet visible at the next level of the memory hierarchy.

#### **Operation:**

```
if IsCoprocessorEnabled(0) then
    vAddr ← GPR[base] + sign_extend(offset)
    (pAddr, uncached) ← AddressTranslation(vAddr, DataReadReference)
    CacheOp(op, vAddr, pAddr)
else
    SignalException(CoprocessorUnusable, 0)
endif
```

#### **Exceptions:**

TLB Refill Exception.

**TLB Invalid Exception** 

Coprocessor Unusable Exception

Address Error Exception

Cache Error Exception

Bus Error Exception

#### **Programming Notes:**

For cache operations that require an index, it is implementation-dependent whether the effective address or the translated physical address is used as the cache index. Therefore, the index value should always be converted to a kseg0 address by ORing the index with 0x80000000 before being used by the cache instruction. For example, the following code sequence performs a data cache Index Store Tag operation using the index passed in GPR a0:

| li    | a1,  | 0x8000000        | /* | Base of | kseg0  | segment */                  |
|-------|------|------------------|----|---------|--------|-----------------------------|
| or    | a0,  | a0, a1           | /* | Convert | index  | to kseg0 address */         |
| cache | DCII | ndexStTag, 0(a1) | /* | Perform | the ir | ndex store tag operation */ |

| 31 | 26             | 25      | 6                           | 5 | 0              |
|----|----------------|---------|-----------------------------|---|----------------|
|    | COP0<br>010000 | C0<br>1 | 0<br>00 0000 0000 0000 0000 |   | IRET<br>111000 |
| L  | 6              | 1       | 20                          |   | 6              |

Format: IRET

**MIPS and MCU ASE** 

Purpose: Interrupt Return with Automated Interrupt Epilogue

Optionally jump directly to another interrupt vector without returning to original return address.

#### **Description:**

IRET is used to automate some of the operations that are required when returning from an interrupt handler. It can be used in place of the ERET instruction at the end of interrupt handlers. The IRET instruction is only appropriate when using Shadow Register Sets and EIC Interrupt mode. The automated operations of this instruction can be used to reverse the effects of the automated operations of the Auto-Prologue feature.

If the EIC mode of interrupts and the Interrupt Chaining feature are used, the IRET instruction can be used to shorten the time between returning from the current interrupt handler and handling the next requested interrupt.

If Automated Prologue feature is disabled, then IRET behaves exactly as ERET.

If either *Status<sub>ERL</sub>* or *Status<sub>BEV</sub>* bits are set, then IRET behaves exactly as ERET.

If Interrupt Chaining is disabled:

• Interrupts are disabled. COP0 *Status, SRSCtl*, and *EPC* registers are restored from the stack. GPR 29 is incremented for the stack frame size. IRET then clears execution and instruction hazards, conditionally restores *SRSCtl<sub>CSS</sub>* from *SRSCtl<sub>PSS</sub>*, and returns to the interrupted instruction pointed by the *EPC* register at the completion of interrupt processing.

If Interrupt Chaining is enabled:

- Interrupts are disabled. COP0 *Status* register is restored from the stack. The priority output of the External Interrupt Controller is compared with the *IPL* field of the *Status* register.
- If *Status*<sub>IPI</sub> has a higher priority than that of the External Interrupt Controller value:

COP0 *SRSCtl* and *EPC* registers are restored from the stack. GPR 29 is incremented for the stack frame size. IRET then clears execution and instruction hazards, conditionally restores  $SRSCtl_{CSS}$  from  $SRSCtl_{PSS}$ , and returns to the interrupted instruction pointed by the *EPC* register at the completion of interrupt processing.

• If *Status<sub>IPL</sub>* field has a lower priority than that of the External Interrupt Controller value:

The value of GPR 29 is first saved to a temporary register then GPR 29 is incremented for the stack frame size. The EIC is signalled that the next pending interrupt has been accepted. This signalling will update the  $Cause_{RIPL}$  and  $SRSCtl_{EICSS}$  fields from the EIC output values. The  $SRSCtl_{EICSS}$  field is copied to the  $SRSCtl_{CSS}$  field while the  $Cause_{RIPL}$  field is copied to the  $Status_{IPL}$  field. The saved temporary register is copied to the GPR 29 of the current SRS. The *KSU*, *ERL* and *EXL* fields of the *Status* register are optionally set to zero. No barrier for execution hazards nor instruction hazards is created. IRET finishes by jumping to the interrupt vector driven by the EIC.

IRET does not execute the next instruction (i.e., it has no delay slot).

#### **Restrictions:**

The operation of the processor is **UNDEFINED** if an IRET is executed in the delay slot of a branch or jump instruction.

The operation of the processor is **UNDEFINED** if an IRET is executed when eith er Shadow Register Sets are not enabled or when EIC interrupt mode is not enabled.

An IRET placed between an LL and SC instruction will always cause the SC to fail.

The effective addresses used for the stack memory transactions must be naturally-aligned. If either of the two leastsignificant bits of the address is non-zero, an Address Error exception occurs.

IRET implements a software barrier that resolves all execution and instruction hazards created by Coprocessor 0 state changes (for Release 2 implementations, refer to the SY NCI instruction for additional information on resolving instruction hazards created by writing the instruction stream). The effects of this barrier are seen starting with the instruction fetch and decode of the instruction at the PC to which the IRET returns.

In a Release 2 implementation, IRET does not restore  $SRSCtl_{CSS}$  from  $SRSCtl_{PSS}$  if  $Status_{BEV} = 1$ , or if  $Status_{ERL} = 1$  because any exception that sets  $Status_{ERL}$  to 1 (Reset, Soft Reset, NMI, or cache error) does not save  $SRSCtl_{CSS}$  in  $SRSCtl_{PSS}$ . If software sets  $Status_{ERL}$  to 1, it must be aware of the operation of an IRET that may be subsequently executed.

The stack memory transactions behave as individual L W operations with respect to exception reporting. BadVAddr would report the faulting addr ess for unaligned access and the faulting word address for un privileged access, TLB Refill and TLB Invalid exceptions. For TLB exceptions, the faulting word address would be reflected in the *Context*, and *EntryHi* registers. The *CacheError* register would reflect the faulting word address for Cache Errors.

#### **Operation:**

```
if (( IntCtl_{APE} == 0) | (Status_{ERL} == 1) | (Status_{BEV} == 1))
   Act as ERET // read Operation section of ERET description
else
   if (ISAMode)
       EPC \leftarrow PC_..1 || 1 // in case of memory exception
   else
       EPC \leftarrow PC // in case of memory exception
   endif
   temp \leftarrow 0x4 + GPR[29]
   ClearHazards()
   if ( (IntCtl_{ICE} == 0) \mid ((IntCtl_{ICE} == 1) &
    (tempStatus_{IPL} > EIC_{RIPL}))
       temp \leftarrow 0x8 + GPR[29]
       temp \leftarrow 0x0 + GPR[29]
       endif
   Status \leftarrow tempStatus
   if ( (IntCtl_{ICE} == 0) \mid ((IntCtl_{ICE} == 1) &
        (tempStatus<sub>IPL</sub> > EIC<sub>RIPL</sub>)) )
       GPR[29] ← GPR[29] + DecodedValue(IntCtl<sub>StkDec</sub>)
       SRSCtl ← tempSRSCtl
       \texttt{EPC} \leftarrow \texttt{tempEPC}
       temp \leftarrow EPC
       \text{Status}_{\text{EXL}} \leftarrow 0
       if (ArchitectureRevision \geq 2) and (SRSCtl_{HSS} > 0)
       and (Status_{\rm BEV} = 0) then
           SRSCtl_{CSS} \leftarrow SRSCtl_{PSS}
```

```
endif
          if IsMicroMIPSImplemented() then
               \texttt{PC} \leftarrow \texttt{temp}_{..1} || \texttt{0}
               ISAMode \leftarrow \text{temp}_0
          else
               PC \leftarrow temp
          endif
          LLbit \leftarrow 0
          Cause_{TC} \leftarrow 0
          ClearHazards()
     else
          Signal_EIC_for_Next_Interrupt()
          (wait for EIC outputs to update)
          Cause_{RIPL} \leftarrow EIC_{RIPL}
          \text{SRSCtl}_{\text{EICSS}} \leftarrow \text{EIC}_{\text{SS}}
          temp29 \leftarrow GPR[29]
          GPR[29] \leftarrow GPR[29] + DecodedValue(IntCtl_{StkDec})
          \texttt{Status}_{\texttt{IPL}} \leftarrow \texttt{Cause}_{\texttt{RIPL}}
          \texttt{SRSCtl}_{\texttt{CSS}} \leftarrow \texttt{SRSCtl}_{\texttt{EICSS}}
          NewShadowSet \leftarrow SRSCtl<sub>EICSS</sub>
          GPR[29] \leftarrow temp29
          if (IntCtl<sub>ClrEXL</sub> == 1)
               \texttt{Status}_{\texttt{EXL}} \ \leftarrow \ \texttt{0}
               Status_{KSU} \leftarrow 0
          endif
          \texttt{Cause}_{\texttt{IC}} \gets \texttt{1}
          ClearHazards()
          PC ← CalcIntrptAddress()
     endif
endif
function LoadStackWord(vaddr)
     if vAddr_{1,0} \neq 0^2 then
          SignalException (AddressError)
     endif
     (pAddr, CCA) \leftarrow AddressTranslation (vAddr, DATA, LOAD)
    memword 

LoadMemory (CCA, WORD, pAddr, vAddr, DATA)
    LoadStackWord \leftarrow memword
endfunction LoadStackWord
function CalcIntrptAddress()
    if Status_{BEV} = 1
          vectorBase \leftarrow 0 \times BFC0.0200
     else
          if (ArchitectureRevision \geq 2)
               vectorBase \leftarrow EBase..12 || 0<sup>11</sup>)
          else
               vectorBase \leftarrow 0 \times 8000.0000
          endif
     endif
     if (Cause_{IV} = 0)
          vectorOffset = 0x180
    else
          if (\text{Status}_{\text{BEV}} = 1) or (\text{IntCtl}_{\text{VS}} = 0)
               vectorOffset = 0x200
```

```
else
           if ( Config3_{VEIC} = 1 and EIC_Option=1)
               VectorNum = Cause<sub>RIPL</sub>
           elseif (Config_{VEIC} = 1 and EIC_Option=2)
              VectorNum = EIC_VectorNum
           elseif (Config3_{VEIC} = 0)
              VectorNum = VIntPriorityEncoder()
           endif
           if (Config3_{VEIC} = 1 \text{ and } EIC_Option=3)
               vectorOffset = EIC_VectorOffset
           else
               vectorOffset = 0x200 + (VectorNum x (IntCtl<sub>VS</sub> || <math>0^5))
           endif
       endif
       endif
   CalcIntrptAddress = vectorBase | vectorOffset
endfunction CalcIntrptAddress
```

#### **Exceptions:**

Coprocessor Unusable Exception, TLB Refill, TLB Invalid, Address Error, Watch, Cache Error, Bus Error Exceptions

| 31 | 26                | 25 6                    | 5 | 0                   |
|----|-------------------|-------------------------|---|---------------------|
|    | POOL32A<br>000000 | 000 0000 0011 0100 1101 |   | POOL32AXf<br>111100 |
|    | 6                 | 20                      |   | 6                   |

Format: IRET

microMIPS and MCU ASE

Purpose: Interrupt Return with Automated Interrupt Epilogue

Optionally jump directly to another interrupt vector without returning to original return address.

#### **Description:**

IRET automates some of the operations that are required when returning from an interrupt handler and can be used in place of the ERET instruction at the end of interrupt handlers. IRET is only appropriate when using Shadow Register Sets and the EIC Interrupt mode. The automated operations of this instruction can be used to reverse the effects of the automated operations of the Auto-Prologue feature.

If the EIC interrupt mode and the Interrupt Chaining feature are used, the IRET instruction can be used to shorten the time between returning from the current interrupt handler and handling the next requested interrupt.

If the Automated Prologue feature is disabled, then IRET behaves exactly like ERET.

If either the Status<sub>ERL</sub> or Status<sub>BEV</sub> bits are set, then IRET behaves exactly like ERET.

If Interrupt Chaining is disabled:

Interrupts are disabled. COP0 *Status, SRSCtl*, and *EPC* registers are restored from the stack. GPR 29 is incremented for the stack frame size. IRET then clears execution and instruction hazards, conditionally restores *SRSCtl<sub>CSS</sub>* from *SRSCtl<sub>PSS</sub>*, and returns at the completion of interrupt processing to the interrupted instruction pointed to by the *EPC* register. If Interrupt Chaining is enabled:

Interrupts are disabled. COP0 *Status* register is restored from the stack. The priority output of the External Interrupt Controller is compared with the IPL field of the *Status* register.

If *Status<sub>IPL</sub>* has a higher priority than the External Interrupt Controller value:

COP0 *SRSCtl* and *EPC* registers are restored from the stack. GPR 29 is incremented for the stack frame size. IRET then clears execution and instruction hazards, conditionally restores *SRSCtl<sub>CSS</sub>* from *SRSCtl<sub>PSS</sub>*, and returns to the interrupted instruction pointed to by the *EPC* register at the completion of interrupt processing.

If *Status<sub>IPL</sub>* has a lower priority than the External Interrupt Controller value:

The value of GPR 29 is first saved to a temporary register and then GPR 29 is incremented for the stack frame size. The EIC is signalled that the next pending interrupt has been accepted. This signalling will update the  $Cause_{RIPL}$  and  $SRSCtl_{EICSS}$  fields from the EIC output values. The  $SRSCtl_{EICSS}$  field is copied to the  $SRSCtl_{CSS}$  field, while the  $Cause_{RIPL}$  field is copied to the  $Status_{IPL}$  field. The saved temporary register is copied to the GPR 29 of the current SRS. The KSU and EXL fields of the *Status* register are optionally set to zero. No barrier for execution hazards or instruction hazards is created. IRET finishes by jumping to the interrupt vector driven by the EIC.

IRET does not execute the next instruction (i.e., it has no delay slot).

#### **Restrictions:**

The operation of the processor is **UNDEFINED** if IRET is executed in the delay slot of a branch or jump instruction.

The operation of the processor is **UNDEFINED** if IRET is executed when either Shadow Register Sets are not enabled, or the EIC interrupt mode is not enabled.

An IRET placed between an LL and SC instruction will always cause the SC to fail.

The effective addresses used for stack transactions must be naturally-aligned. If either of the two least-significant bits of the address is non-zero, an Address Error exception occurs.

IRET implements a software barrier that resolves all execution and instruction hazards created by Coprocessor 0 state changes (for Release 2 implementations, refer to the SY NCI instruction for additional information on resolving instruction hazards created by writing the instruction stream). The effects of this barrier begin with the instruction fetch and decode of the instruction at the PC to which the IRET returns.

In a Release 2 implementation, IRET does not restore  $SRSCtl_{CSS}$  from  $SRSCtl_{PSS}$  if  $Status_{BEV} = 1$  or  $Status_{ERL} = 1$ , because any exception that sets  $Status_{ERL}$  to 1 (Reset, Soft Reset, NMI, or cache error) does not save  $SRSCtl_{CSS}$  in  $SRSCtl_{PSS}$ . If software sets  $Status_{ERL}$  to 1, it must be aware of the operation of an IRET that may be subsequently executed.

The stack transactions behave as individual L W operations with respect to exception reporting. BadVAddr would report the faulting address for an unaligned access, and the faulting word address for unprivileged access, TLB Refill, and TLB Invalid exceptions. For TLB exceptions, the faulting word address would be reflected in the *Context* and *EntryHi* registers. The *CacheError* register would reflect the faulting word address for Cache Errors.

#### **Operation:**

```
if (( IntCtl_{APE} == 0) | (Status<sub>ERL</sub> == 1) | (Status<sub>BEV</sub>== 1))
   Act as ERET // read Operation section of ERET description
else
    if (ISAMode)
       EPC \leftarrow PC_{11} \mid \mid 1 \mid / \mid in case of memory exception
   else
       EPC \leftarrow PC // in case of memory exception
    endif
    temp \leftarrow 0x4 + GPR[29]
   ClearHazards()
    if ( (IntCtl_{ICE} == 0) | ((IntCtl_{ICE} == 1) &
    (tempStatus<sub>IPL</sub> > EIC<sub>RIPL</sub>)))
       temp \leftarrow 0x8 + GPR[29]
       temp \leftarrow 0x0 + GPR[29]
        endif
   Status \leftarrow tempStatus
    if ( (IntCtl_{ICE} == 0) | ((IntCtl_{ICE} == 1) &
        (tempStatus<sub>IPL</sub> > EIC<sub>RIPL</sub>)))
       GPR[29] \leftarrow GPR[29] + DecodedValue(IntCtl_{stkDec})
       SRSCtl ← tempSRSCtl
       EPC ← tempEPC
       temp \leftarrow EPC
       \text{Status}_{\text{EXL}} \leftarrow 0
       if (ArchitectureRevision \geq 2) and (SRSCtl<sub>HSS</sub> > 0) and (Status<sub>BEV</sub> = 0) then
            SRSCtl<sub>CSS</sub> ← SRSCtl<sub>PSS</sub>
       endif
       if IsMicroMIPSImplemented() then
```

```
PC \leftarrow temp_{..1} \mid \mid 0
              \texttt{ISAMode} \leftarrow \texttt{temp}_0
         else
             PC \leftarrow temp
         endif
         LLbit \leftarrow 0
         Cause_{TC} \leftarrow 0
         ClearHazards()
    else
         Signal_EIC_for_Next_Interrupt()
         (wait for EIC outputs to update)
         Cause_{RIPL} \leftarrow EIC_{RIPL}
         \text{SRSCtl}_{\text{EICSS}} \leftarrow \text{EIC}_{\text{SS}}
         temp29 \leftarrow GPR[29]
         GPR[29] \leftarrow GPR[29] + DecodedValue(IntCtl_{StkDec})
         Status_{IPL} \leftarrow Cause_{RIPL}
         SRSCtl_{CSS} \leftarrow SRSCtl_{EICSS}
         NewShadowSet \leftarrow SRSCtl<sub>EICSS</sub>
         GPR[29] \leftarrow temp29
         if (IntCtl<sub>ClrEXL</sub> == 1)
             \text{Status}_{\text{EXL}} \leftarrow 0
              \text{Status}_{\text{KSU}} \leftarrow 0
         endif
         Cause_{TC} \leftarrow 1
         ClearHazards()
         PC ← CalcIntrptAddress()
    endif
endif
function LoadStackWord(vaddr)
    if vAddr_{1,0} \neq 0^2 then
         SignalException (AddressError)
    endif
    (pAddr, CCA) \leftarrow AddressTranslation (vAddr, DATA, LOAD)
    \texttt{LoadStackWord} \leftarrow \texttt{memword}
endfunction LoadStackWord
function CalcIntrptAddress()
    if StatusBEV = 1
         vectorBase \leftarrow 0xBFC0.0200
    else
         if ( ArchitectureRevision \geq 2)
             vectorBase \leftarrow EBase..12 || 0<sup>11</sup>)
         else
              vectorBase \leftarrow 0x8000.0000
         endif
    endif
    if (Cause<sub>IV</sub> = 0)
         vectorOffset = 0x180
    else
         if (\text{Status}_{\text{BEV}} = 1) or (\text{IntCtl}_{\text{VS}} = 0)
              vectorOffset = 0x200
         else
              if ( Config3_{VEIC} = 1 and EIC_Option=1)
```

```
\label{eq:second} \begin{array}{c} \mbox{VectorNum} = \mbox{Cause}_{RIPL} \\ \mbox{elseif} & (\mbox{Config3}_{VEIC} = 1 \mbox{ and } EIC\_Option=2) \\ & \mbox{VectorNum} = EIC\_VectorNum \\ \mbox{elseif} & (\mbox{Config3}_{VEIC} = 0 \ ) \\ & \mbox{VectorNum} = \mbox{VintPriorityEncoder}() \\ \mbox{endif} \\ & \mbox{if} & (\mbox{Config3}_{VEIC} = 1 \mbox{ and } EIC\_Option=3) \\ & \mbox{vectorOffset} = EIC\_VectorOffset \\ & \mbox{else} \\ & \mbox{vectorOffset} = 0x200 \ + \ (\mbox{VectorNum} \ x \ (\mbox{IntCtl}_{VS} \parallel 0^5)) \\ & \mbox{endif} \\ & \mbox{calcIntrptAddress} = \mbox{vectorOffset} \ | \mbox{vectorOffset} \\ & \mbox{endif} \\
```

#### **Exceptions:**

Coprocessor Unusable Exception, TLB Refill, TLB Invalid, Address Error, Watch, Cache Error, Bus Error Exceptions

| 31           | 26 | 25 21 | 20 16 | 15 0   |
|--------------|----|-------|-------|--------|
| LL<br>110000 |    | base  | rt    | offset |
| 6            |    | 5     | 5     | 16     |

Format: LL rt, offset(base)

Purpose: Load Linked Word

To load a word from memory for an atomic read-modify-write

**Description:** GPR[rt] ← memory[GPR[base] + offset]

The LL and SC instructions provide the primitives to implement atomic read-modify-write (RMW) operations for synchronizable memory locations.

MIPS32

The contents of the 32-bit word at the memory location specified by the aligned effective address are fetched and written into GPR *rt*. The 16-bit signed *offset* is added to the contents of GPR *base* to form an effective address.

This begins a RMW sequence on the current processor. There can be only one active RMW sequence per processor. When an LL is executed it starts an active RMW sequence replacing any other sequence that was active. The RMW sequence is completed by a subsequent SC instruction that either completes the RMW sequence atomically and succeeds, or does not and fails.

Executing LL on one processor does not cause an action that, by itself, causes an SC for the same block to fail on another processor.

An execution of LL does not have to be followed by execution of SC; a p rogram is free t o abandon the RMW sequence without attempting a write.

#### **Restrictions:**

The addressed location must be synchronizable by all processors and I/O devices sharing the location; if it is not, the result in **UNPREDICTABLE**. Which storage is synchronizable is a function of both CPU and system implementations. See the documentation of the SC instruction for the formal definition.

The effective address must be naturally-aligned. If either of the 2 least-s ignificant bits of the effective address is non-zero, an Address Error exception occurs.

#### **Operation:**

```
vAddr ← sign_extend(offset) + GPR[base]
if vAddr<sub>1..0</sub> ≠ 0<sup>2</sup> then
    SignalException(AddressError)
endif
(pAddr, CCA) ← AddressTranslation (vAddr, DATA, LOAD)
memword ← LoadMemory (CCA, WORD, pAddr, vAddr, DATA)
GPR[rt] ← memword
LLbit ← 1
```

#### **Exceptions:**

TLB Refill, TLB Invalid, Address Error, Reserved Instruction, Watch

#### **Programming Notes:**

There is no Load Linked Word Unsigned operation corresponding to Load Word Unsigned.

| 31             | 26 25 | 21 20 | 16 15 | 0      |
|----------------|-------|-------|-------|--------|
| PREF<br>110011 | base  |       | hint  | offset |
| 6              | 5     | ·     | 5     | 16     |

Format: PREF hint, offset(base)

#### Purpose: Prefetch

To move data between memory and cache.

#### **Description:** prefetch\_memory(GPR[base] + offset)

PREF adds the 16-bit signed *offset* to the contents of GPR *base* to form an effective byte address. The *hint* field supplies information about the way that the data is expected to be used.

PREF does not cause addressing-related exceptions, including TLB exceptions. If the address specified would cause an addressing exception, the exception condition is ignored and no data movement occurs. However even if no data is moved, some action that is not architecturally visible, such as writeback of a dirty cache line, can take place.

It is implementation-dependent whether a Bus Error or Cache Error exception is reported if such an error is detected as a byproduct of the action taken by the PREF instruction.

PREF neither generates a memory operation nor modifies the state of a cache line for a location with an *uncached* memory access type, whether this type is specified by the address segment (e.g., kseg1), the programmed coherency attribute of a segment (e.g., the use of the *K0*, *KU*, or *K23* fields in the *Config* register), or the per-page coherency attribute provided by the TLB.

If PREF results in a memory operation, the memory access type and coherency attribute used for the operation are determined by the memory access type and coherency attribute of the effective address, just as it would be if the memory operation had been caused by a load or store to the effective address.

Any of the following conditions causes the core to treat a PREF instruction as a NOP.

- A reserved *hint* value is used
- The address has a translation error
- The address maps to an uncacheable page

In all other cases, except when *hint* equals 25, execution of the PREF instruction initiates an external bus read transaction. PREF is a n on-blocking operation and does not cause the pipeline to stall while waiting for the data to be returned. The values of the *hint* field for PREF are shown below

| Value | Name     | Data Use and Desired Prefetch Action                                                                |
|-------|----------|-----------------------------------------------------------------------------------------------------|
| 0     | load     | Use: Prefetched data is expected to be read (not modified).<br>Action: Fetch data as if for a load. |
| 1     | store    | Use: Prefetched data is expected to be stored or modified.<br>Action: Fetch data as if for a store. |
| 2-3   | Reserved | Reserved - treated as a NOP.                                                                        |

MIPS32

| 4     | load_streamed                                   | Use: Prefetched data is expected to be read (not modified) but not reused<br>extensively; it "streams" through cache.<br>Action: Fetch data as if for a load and place it in the cache so that it does not<br>displace data prefetched as "retained."             |
|-------|-------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 5     | store_streamed                                  | Use: Prefetched data is expected to be stored or modified but not reused exten-<br>sively; it "streams" through cache.<br>Action: Fetch data as if for a store and place it in the cache so that it does not<br>displace data prefetched as "retained."           |
| 6     | load_retained                                   | Use: Prefetched data is expected to be read (not modified) and reused exten-<br>sively; it should be "retained" in the cache.<br>Action: Fetch data as if for a load and place it in the cache so that it is not dis-<br>placed by data prefetched as "streamed." |
| 7     | store_retained                                  | Use: Prefetched data is expected to be stored or modified and reused exten-<br>sively; it should be "retained" in the cache.<br>Action: Fetch data as if for a store and place it in the cache so that it is not dis-<br>placed by data prefetched as "streamed." |
| 8-24  | Reserved                                        | Reserved - treated as a NOP.                                                                                                                                                                                                                                      |
| 25    | writeback_invalidate (also<br>known as "nudge") | Use: Data is no longer expected to be used.<br>Action: Schedule a writeback of any dirty data. The cache line is marked as<br>invalid upon completion of the writeback. If cache line is clean or locked, no<br>action is taken.                                  |
| 26-29 | Reserved                                        | Reserved - treated as a NOP.                                                                                                                                                                                                                                      |
| 30    | PrepareForStore                                 | Use: Prepare the cache for writing an entire line, without the overhead involved in filling the line from memory.<br>Reserved - treated as a NOP.                                                                                                                 |
| 31    | Reserved                                        | Reserved - treated as a NOP.                                                                                                                                                                                                                                      |

#### **Restrictions:**

None

#### **Operation:**

vAddr ← GPR[base] + sign\_extend(offset)
(pAddr, CCA) ← AddressTranslation(vAddr, DATA, LOAD)
Prefetch(CCA, pAddr, vAddr, DATA, hint)

#### **Exceptions:**

Bus Error, Cache Error

Prefetch does not take any TLB-related or address-related exceptions under any circumstances.

#### **Programming Notes:**

Prefetch cannot move data to or from a mapped location unless the translation for that location is present in the TLB. Locations in memory pages that have not been accessed recently may not have translations in the TLB, so prefetch may not be effective for such locations.

Prefetch does not cause addressing exceptions. A prefetch may be used using an address pointer before the validity of

MIPS32® microAptiv™ UP Processor Core Family Software User's Manual, Revision 01.02

the pointer is determined without worrying about an addressing exception.

It is implementation-dependent whether a Bus Error or Cache Error exception is reported if such an error is detected as a byproduct of the action taken by the PREF instruct ion. Typically, this only occurs in systems which have high-reliability requirements.

Prefetch operations have no effect on cache lines that were previously locked with the CACHE instruction.



Format. SC II, OIISEL(Dase)

Purpose: Store Conditional Word

To store a word to memory to complete an atomic read-modify-write

**Description:** if atomic\_update then memory[GPR[base] + offset]  $\leftarrow$  GPR[rt], GPR[rt]  $\leftarrow$  1 else GPR[rt]  $\leftarrow$  0

The LL and SC instructions provide primitives to implement atomic read-modify-write (RMW) operations for synchronizable memory locations.

The32-bit word in GPR *rt* is conditionally stored in memory at the location specified by the aligned effective address. The 16-bit signed *offset* is added to the contents of GPR *base* to form an effective address.

The SC completes the RMW sequence begun by the preceding LL instruction executed on the processor. To complete the RMW sequence atomically, the following occur:

- The32-bit word of GPR rt is stored into memory at the location specified by the aligned effective address.
- A 1, indicating success, is written into GPR rt.

Otherwise, memory is not modified and a 0, indicating failure, is written into GPR rt.

If the following event occurs between the execution of LL and SC, the SC fails:

• An ERET instruction is executed.

If either of the following events occurs between the execution of LL and SC, the SC may succeed or it may fail; the success or failure is not predictable. Portable programs should not cause one of these events.

- A memory access instruction (load, store, or prefetch) is executed on the processor executing the LL/SC.
- The instructions executed starting with the LL and ending with the SC do not lie in a 2048-byte contiguous region of virtual memory. (The region does not have to be aligned, other than the alignment required for instruction words.)

The following conditions must be true or the result of the SC is UNPREDICTABLE:

- Execution of SC must have been preceded by execution of an LL instruction.
- An RMW sequence executed without intervening events that would cause the SC to fail must use the same address in the LL and SC. The address is the same if the virtual address, physical address, and cache-coherence algorithm are identical.

#### **Restrictions:**

The effective address must be naturally-aligned. If either of the 2 least-significant bits of the address is non-zero, an Address Error exception occurs.

#### **Operation:**

```
vAddr ← sign_extend(offset) + GPR[base]
if vAddr<sub>1..0</sub> ≠ 0<sup>2</sup> then
   SignalException(AddressError)
endif
(pAddr, CCA) ← AddressTranslation (vAddr, DATA, STORE)
dataword ← GPR[rt]
if LLbit then
   StoreMemory (CCA, WORD, dataword, pAddr, vAddr, DATA)
endif
GPR[rt] ← 0 || LLbit
```

#### **Exceptions:**

TLB Refill, TLB Invalid, TLB Modified, Address Error, Watch

#### **Programming Notes:**

LL and SC are used to atomically update memory locations, as shown below.

```
L1:

LL T1, (T0) # load counter

ADDI T2, T1, 1 # increment

SC T2, (T0) # try to store, checking for atomicity

BEQ T2, 0, L1 # if not atomic (0), try again

NOP # branch-delay slot
```

Exceptions between the LL and SC caus e SC to fail, so persistent exceptions must be avoided. Some examples of these are arithmetic operations that trap, system calls, and floating point operations that trap or require software emulation assistance.

LL and SC function on a single processor for *cached noncoherent* memory so that parallel programs can be run on uniprocessor systems that do not support *cached coherent* memory access types.

| 31                | 26 | 25 21 2 | 20 16                   | 15 11 | 10 6  | 5 0            |   |
|-------------------|----|---------|-------------------------|-------|-------|----------------|---|
| SPECIAL<br>000000 |    | 00      | 0<br>) 0000 0000 0000 0 |       | stype | SYNC<br>001111 |   |
| 6                 |    |         | 15                      |       | 5     | 6              | - |

MIPS32

Format: SYNC (stype = 0 implied)

**Purpose:** Synchronize Shared Memory

To order loads and stores.

#### **Description:**

Simple Description:

- SYNC affects only *uncached* and *cached coherent* loads and stores. The loads and stores that occur before the SYNC must be completed before the loads and stores after the SYNC are allowed to start.
- Loads are completed when the destination register is written. Stores are completed when the stored value is visible to every other processor in the system.
- SYNC is required, potentially in conjunction with SSNOP (in Release 1 of the Architecture) or EHB (in Release 2 of the Architecture), to guarantee that memory reference results are visible across operating mode changes. For example, a SYNC is required on entry to and exit from Debug Mode to guarantee that memory affects are handled correctly.

#### Detailed Description:

- SYNC does not guarantee the order in which instruction fetches are performed. The *stype* values 1-31 are reserved for future extensions to the architecture. A value of zero will always be defined such that it performs all defined synchronization operations. Non-zero values may be defined to remove some synchronization operations. As such, software should never use a non-zero value of the *stype* field, as this may inadvertently cause future failures if non-zero values remove synchronization operations.
- The SYNC instruction stalls until all loads, stores, refills are completed and all write buffers are empty.

#### **Restrictions:**

The effect of SYNC on the global order of loads and stores for memory access types other than *uncached* and *cached coherent* is **UNPREDICTABLE**.

#### **Operation:**

SyncOperation(stype)

#### **Exceptions:**

None

| 31  | 26        | 25 2 | 4 6                     | 5      | 0      |
|-----|-----------|------|-------------------------|--------|--------|
|     | COP0      | CO   | 0                       | TLBR   |        |
| (   | 010000    | 1    | 000 0000 0000 0000 0000 | 000001 |        |
|     | 6         | 1    | 19                      | 6      |        |
| For | mat: TLBR |      |                         | N      | AIPS32 |

Purpose: Read Entry from TLB

#### **Description:**

The EntryHi, EntryLo0, EntryLo1, and PageMask registers are loaded with the contents of the TLB entry pointed to by the Index register. Note that the value written to the EntryHi, EntryLo0, and EntryLo1 registers may be different from that originally written to the TLB via these registers in that:

• The value returned in the G bit in both the *EntryLo0* and *EntryLo1* registers comes from the single G bit in the TLB entry. Recall that this bit was set from the logical AND of the two G bits in *EntryLo0* and *EntryLo1* when the TLB was written.

#### **Restrictions:**

The operation is **UNDEFINED** if the contents of the Index register ar e greater than or equal to the number of TLB entries in the processor.

If access to Coprocessor 0 is not enabled, a Coprocessor Unusable Exception is signaled.

#### **Operation:**

```
\begin{split} \mathbf{i} \leftarrow \mathrm{Index} \\ \mathrm{if} \ \mathbf{i} > (\mathrm{TLBEntries} - 1) \ \mathrm{then} \\ \mathbf{UNDEFINED} \\ \mathrm{endif} \\ \mathrm{PageMask}_{\mathrm{Mask}} \leftarrow \mathrm{TLB[i]}_{\mathrm{Mask}} \\ \mathrm{EntryHi} \leftarrow \\ & (\mathrm{TLB[i]}_{\mathrm{VPN2}} \ \mathrm{and} \ \mathrm{not} \ \mathrm{TLB[i]}_{\mathrm{Mask}}) \ || \\ & 0^{5} \ || \ \mathrm{TLB[i]}_{\mathrm{ASID}} \\ \mathrm{EntryLo1} \leftarrow 0 \ || \\ & (\mathrm{TLB[i]}_{\mathrm{PFN1}} \ \mathrm{and} \ \mathrm{not} \ \mathrm{TLB[i]}_{\mathrm{Mask}}) \ || \\ & \mathrm{TLB[i]}_{\mathrm{C1}} \ || \ \mathrm{TLB[i]}_{\mathrm{D1}} \ || \ \mathrm{TLB[i]}_{\mathrm{V1}} \ || \ \mathrm{TLB[i]}_{\mathrm{G}} \\ \mathrm{EntryLo0} \leftarrow 0 \ || \\ & (\mathrm{TLB[i]}_{\mathrm{PFN0}} \ \mathrm{and} \ \mathrm{not} \ \mathrm{TLB[i]}_{\mathrm{Mask}}) \ || \\ & \mathrm{TLB[i]}_{\mathrm{C0}} \ || \ \mathrm{TLB[i]}_{\mathrm{Mask}} \ || \\ & \mathrm{TLB[i]}_{\mathrm{C0}} \ || \ \mathrm{TLB[i]}_{\mathrm{D0}} \ || \ \mathrm{TLB[i]}_{\mathrm{V0}} \ || \ \mathrm{TLB[i]}_{\mathrm{G}} \end{split}
```

#### **Exceptions:**

Coprocessor Unusable

| 31  | 26   | 25 | 24 6                    | 5      | 0 |
|-----|------|----|-------------------------|--------|---|
| CO  | OP0  | CO | 0                       | TLBWI  |   |
| 010 | 0000 | 1  | 000 0000 0000 0000 0000 | 000010 |   |
|     | 6    | 1  | 19                      | 6      |   |
|     |      |    |                         |        |   |

Format: TLBWI

MIPS32

Purpose: Write TLB Entry Indexed by Index Register

#### **Description:**

The TLB entry pointed to by the Index register is written from the contents of the *EntryHi*, *EntryLo0*, *EntryLo1*, and *PageMask* registers. The information written to the TLB entry may be different from that in the *EntryHi*, *EntryLo0*, and *EntryLo1* registers, in that:

• The single G bit in the TLB entry is set from the logical AND of the G bits in the *EntryLo0* and *EntryLo1* registers.

#### **Restrictions:**

The operation is **UNDEFINED** if the contents of the Index register ar e greater than or equal to the number of TLB entries in the processor.

If access to Coprocessor 0 is not enabled, a Coprocessor Unusable Exception is signaled.

#### **Operation:**

```
\begin{split} & i \leftarrow \text{Index} \\ & \text{TLB}[i]_{\text{Mask}} \leftarrow \text{PageMask}_{\text{Mask}} \\ & \text{TLB}[i]_{\text{VPN2}} \leftarrow \text{EntryHi}_{\text{VPN2}} \\ & \text{TLB}[i]_{\text{ASID}} \leftarrow \text{EntryHi}_{\text{ASID}} \\ & \text{TLB}[i]_{\text{G}} \leftarrow \text{EntryLol}_{\text{G}} \text{ and EntryLo0}_{\text{G}} \\ & \text{TLB}[i]_{\text{PFN1}} \leftarrow \text{EntryLo1}_{\text{PFN}} \text{ and not PageMask}_{\text{Mask}} \\ & \text{TLB}[i]_{\text{C1}} \leftarrow \text{EntryLo1}_{\text{C}} \\ & \text{TLB}[i]_{\text{D1}} \leftarrow \text{EntryLo1}_{\text{V}} \\ & \text{TLB}[i]_{\text{V1}} \leftarrow \text{EntryLo1}_{\text{V}} \\ & \text{TLB}[i]_{\text{PFN0}} \leftarrow \text{EntryLo0}_{\text{PFN}} \text{ and not PageMask}_{\text{Mask}} \\ & \text{TLB}[i]_{\text{C0}} \leftarrow \text{EntryLo0}_{\text{PFN}} \text{ and not PageMask}_{\text{Mask}} \\ & \text{TLB}[i]_{\text{D0}} \leftarrow \text{EntryLo0}_{\text{D}} \\ & \text{TLB}[i]_{\text{D0}} \leftarrow \text{EntryLo0}_{\text{D}} \\ & \text{TLB}[i]_{\text{V0}} \leftarrow \text{EntryLo0}_{\text{V}} \end{split}
```

#### **Exceptions:**

Coprocessor Unusable

| 31   | 26         | 25 | 24 6                    | 5      | 0      |
|------|------------|----|-------------------------|--------|--------|
| С    | OP0        | CO | 0                       | TLBWR  |        |
| 01   | 0000       | 1  | 000 0000 0000 0000 0000 | 000110 |        |
| L    | 6          | 1  | 19                      | 6      |        |
| Form | nat: TLBWF | ર  |                         |        | MIPS32 |

Purpose: Write TLB Entry Indexed by Random Register

#### **Description:**

The TLB entry pointed to by the *Random* register is written from the contents of the *EntryHi*, *EntryLo0*, *EntryLo1*, and *PageMask* registers. The information written to the TLB entry may be different from that in the *EntryHi*, *EntryLo0*, and *EntryLo1* registers, in that:

• The single G bit in the TLB entry is set from the logical AND of the G bits in the *EntryLo0* and *EntryLo1* registers.

#### **Restrictions:**

If access to Coprocessor 0 is not enabled, a Coprocessor Unusable Exception is signaled.

#### **Operation:**

```
\begin{split} \label{eq:second} \begin{split} & i \leftarrow \text{Random} \\ & \text{TLB}[i]_{\text{Mask}} \leftarrow \text{PageMask}_{\text{Mask}} \\ & \text{TLB}[i]_{\text{VPN2}} \leftarrow \text{EntryHi}_{\text{VPN2}} \text{ and not PageMask}_{\text{Mask}} \\ & \text{TLB}[i]_{\text{ASID}} \leftarrow \text{EntryHi}_{\text{ASID}} \\ & \text{TLB}[i]_{\text{G}} \leftarrow \text{EntryLol}_{\text{G}} \text{ and EntryLoO}_{\text{G}} \\ & \text{TLB}[i]_{\text{PFN1}} \leftarrow \text{EntryLol}_{\text{PFN}} \text{ and not PageMask}_{\text{Mask}} \\ & \text{TLB}[i]_{\text{C1}} \leftarrow \text{EntryLol}_{\text{C}} \\ & \text{TLB}[i]_{\text{D1}} \leftarrow \text{EntryLol}_{\text{D}} \\ & \text{TLB}[i]_{\text{V1}} \leftarrow \text{EntryLol}_{\text{V}} \\ & \text{TLB}[i]_{\text{PFN0}} \leftarrow \text{EntryLoO}_{\text{PFN}} \text{ and not PageMask}_{\text{Mask}} \\ & \text{TLB}[i]_{\text{C0}} \leftarrow \text{EntryLoO}_{\text{C}} \\ & \text{TLB}[i]_{\text{D0}} \leftarrow \text{EntryLoO}_{\text{D}} \\ & \text{TLB}[i]_{\text{D0}} \leftarrow \text{EntryLoO}_{\text{D}} \\ & \text{TLB}[i]_{\text{V0}} \leftarrow \text{EntryLoO}_{\text{V}} \end{split}
```

#### **Exceptions:**

Coprocessor Unusable

| 31      | 26   | 25 | 24 |                               | 6 | 5 | 0        |
|---------|------|----|----|-------------------------------|---|---|----------|
| COP0    |      | CO | 1  | Implementation-Dependent Code |   | 1 | WAIT<br> |
| 6       |      | 1  |    | 19                            |   |   | 6        |
| Format: | WAIT |    |    |                               |   |   | MIPS32   |

Purpose: Enter Standby Mode

Wait for Event

#### **Description:**

The WAIT instruction forces the core into low power mode. The pipeline is stalled and when all external requests are completed, the processor's main clock is stopped. The processor will restart when reset (*SI\_Reset* or *SI\_ColdReset*) is signaled, or a non-masked interrupt is taken (*SI\_NMI*, *SI\_Int*, or *EJ\_DINT*). Note that the microAptiv UPcore does not use the code field in this instruction.

If the pipeline restarts as the result of an enabled interrupt, that interrupt is taken between the WAIT instruction and the following instruction (*EPC* for the interrupt points at the instruction following the WAIT instruction).

#### **Restrictions:**

The operation of the processor is **UNDEFINED** if a WAIT instruction is placed in the delay slot of a branch or a jump.

If access to Coprocessor 0 is not enabled, a Coprocessor Unusable Exception is signaled.

#### **Operation:**

```
I: Enter lower power mode
I+1:/* Potential interrupt taken here */
```

#### **Exceptions:**

Coprocessor Unusable Exception

### Chapter 13

# microMIPS<sup>™</sup> Instruction Set Architecture

The microMIPS<sup>™</sup> architecture minimizes the code footprint of applications, thus reducing the cost of memory, which is particularly high for embedded memory. At the same time, the high performance of MIPS cores is main-tained. Using this technology, the customer can generate best results without spending time to profile its application. The smaller code footprint typically leads to reduced power consumption per executed task because of the smaller number of memory accesses.

microMIPS is a replacement for the existing MIPS16e ASE. It is also an alternative to the MIPS instruction encoding and can be implemented in parallel or stand-alone.

Overview of changes from the existing MIPS ISA:

- 16-bit and 32-bit opcodes; for MIPS64, also includes 48-bit opcodes
- Optimized opcode/operand field definitions based on statistics
- Branch and jump delay slots are retained for maximum compatibility and lowest risk
- Removal of branch likely instructions, emulation by assembler
- Fine-tuned register allocation algorithm in compilers for smallest code size

# 13.1 Overview

### 13.1.1 MIPSr3<sup>™</sup> Architecture

MIPSr3 is a family of architectures which includes Release 3.0 of the MIPS Architecture and the first release of the microMIPS architecture. Enhancements included in the MIPSr3 Architecture are:

- MIPS Release 3 ISA and microMIPS ISA.
- The MIPS16e ASE is phased out and is replaced by microMIPS. Therefore these two ASEs never co-exist within the same processor core.
- Branch likely instructions are phased out in microMIPS and are emulated by the assembler. They remain available in the MIPS encoding.

Unless otherwise described in this document, all other aspects of the MIPSr3 architecture are identical to MIPS Release 2.

# 13.1.2 Default ISA Mode

The instruction sets available in an implementation are reported in the *Config3*<sub>ISA</sub> register field (bits 15:14). *Config1*<sub>CA</sub> (bit 2) is not used for microMIPS.

For implementations that support both microMIPS and MIPS, the selected ISA mode following reset is determined by the setting of the *Config3*<sub>ISA</sub> register field, which is a read-only field set by a hardware signal external to the processor core.

For implementations that support both microMIPS and MIPS, the selected ISA mode of an exception handler is determined by the setting of the  $Config3_{ISAOnExc}$  register field (bit 16). The  $Config3_{ISAOnExc}$  register field is writeable by software and has a reset value that is set by a hardware signal external to the processor core. This register field allows privileged software to change the ISA mode to be used for subsequent exceptions. All exception types whose vectors are offsets of the *EBASE* register have this capability.

For implementations that support both microMIPS and MIPS, the selected ISA mode of a debug exception is determined by the setting of the *ISAonDebug* register field in the *EJTAG TAP Control* register. This register field is writeable by EJTAG probe software and has a reset value that is set by a hardware signal external to the processor core.

# 13.1.3 Software Detection

Software can determine if microMIPS is implemented by checking the state of the *ISA* (Instruction Set Architecture) field in the *Config3* CP0 register. *Config1*<sub>CA</sub> (bit 2) is not used for microMIPS.

Software can determine if the MIPS ISA is implemented by checking the state of the *ISA* (Instruction Set Architecture) register field in the *Config3* CP0 register.

Software can determine which ISA is used when handling an exception by checking the state of the *ISAOnExc* (ISA on Exception) field in the *Config3* CP0 register.

Debug Probe Software can determine which ISA is used when handling a debug exception by checking the state of the *ISAOnDebug* field in the *EJTAG TAP Control* register.

# 13.1.4 Compliance and Subsetting

This document does not change the instruction subsets as defined by the other MIPS architecture reference manuals, including the subsets defined by the various ASEs.

# 13.1.5 Mode Switch

The MIPS architecture defines an ISA mode for each processor. An ISA mode value of 0 indicates MIPS instruction decoding. In processors implementing microMIPS, an ISA mode value of 1 selects microMIPS instruction decoding.

In microMIPS implementations, the ISA mode is not directly visible to normal software. When EJTAG is implemented, the ISA mode is reflected in the *EJTAG TAP Control* register.

Mode switching between MIPS and microMIPS uses the same mechanism used by MIPS16e, namely, the JALX, JR, JR.HB, JALR, and JALR.HB instructions, as described below.

The JALX instruction executes a JAL and switches to the other mode.

- The JR and JALR instructions interpret bit 0 of the source registers as the target ISA mode (0=MIPS, 1=micro-MIPS) and therefore set the ISA Mode bit according to the contents of bit 0 of the source register. For the actual jump operation, the PC is loaded with the value of the source register with bit 0 set to 0. The same applies to JR.HB and JALR.HB. The instructions JALR and JALR.HB save the ISA mode into bit 0 of the destination register.
- When exceptions or interrupts occur and the processor writes to *EPC*, *DEPC*, or *ErrorEPC*, the ISA Mode bit is saved into bit 0 of these registers. Then the ISA Mode bit is set according to the *Config3<sub>ISA</sub>* register field. On return from an exception, the processor loads the ISA Mode bit based on the value from either *EPC*, *DEPC*, or *ErrorEPC*.

If only one ISA mode exists (either MIPS or microMIPS), then this mode switch mechanism does not exist, and the ISA mode has a fixed value (0=MIPS, 1=microMIPS). Executing the JALX instruction will cause a Reserved Instruction exception. JR and JALR instructions cause an Address exception on the target instruction fetch when bit 0 of the source register is different from the ISA mode. The same applies to JR.HB and JALR.HB. Exception handlers must be encoded in the instruction format supported by the processor.

## 13.1.6 Branch and Jump Offsets

In the MIPS architecture, because instructions are always 32 bits in size, the jump and branch target addresses are word (32-bit) aligned. Jump/branch offset fields are shifted left by two bits to create a word-aligned effective address.

In the microMIPS architecture, because instructions can be either bits in size, the jump and branch target addresses are halfword (16-bit) aligned. Branch/jump offset fields are shifted left by only one bit to create halfword-aligned effective addresses.

To maintain the existing MIPS ABIs, link unit/object file entry points are restricted to 32-bit word alignments. In the future, a microMIPS-only ABI can be created to remove this restriction.

### 13.1.7 Coprocessor Unusable Behavior

If an instruction associated with a non-implemented coprocessor is executed, it is implementation-specific whether a processor executing in microMIPS mode raises an RI exception or a coprocessor unusable exception. While in microMIPS mode, the microAptiv UP has the same behavior as in MIPS32 mode; coprocessor unusable exceptions will be raised.

# **13.2 Instruction Formats**

This section defines the formats of microMIPS instructions.

The 6-bit major opcode is left-aligned within the instruction encoding. Instructions can have 0 to 4 register fields. For 32-bit instructions, the register field width is 5 bits, while for most 16-bit instructions, the register field width is 3 bits, utilizing instruction-specific register encoding. All 5-bit register fields are located at a constant position within the instruction encoding.

The immediate field is right-aligned in the following instructions:

- some 16-bit instructions with 3-bit register fields
- 32-bit instructions with 16-bit or 26-bit immediate field

The name 'immediate field' as used here includes the address offset field for branches and load/store instructions as well as the jump target field.

Other instruction-specific fields are typically located between the immediate and minor opcode fields. Instructions that have multiple "other" fields are listed in alphabetical order according to the name of the field, with the first name of the order located at the lower bit position. An empty bit field that is not explicitly shown in the instruction format is located next to the minor opcode field.

Figure 13.1 and Figure 13.2 show the 16-bit and 32-bit instruction formats.

|        | 15 |              | 10 | 9 |       |       |     |       |        |       |       |       | 0  |
|--------|----|--------------|----|---|-------|-------|-----|-------|--------|-------|-------|-------|----|
| S3R0   |    | Major Opcode |    |   |       |       | Mir | nor C | )pc/lr | nm    |       |       |    |
|        | 15 |              | 10 | 9 |       | 7     | 6   |       |        |       |       |       | 0  |
| S3R117 |    | Major Opcode |    |   | rs1/d |       |     | I     | Vinor  | • Оро | :/Imm |       |    |
|        |    |              |    |   |       |       |     |       |        | -     |       |       |    |
|        | 15 |              | 10 | 9 |       |       | 6   | 5     |        | 3     | 2     |       | 0  |
| S3R2I0 |    | Major Opcode |    |   | Minor | Оро   | ;   |       | rs2/d  |       |       | rs1   |    |
|        |    |              |    |   |       |       |     |       |        |       |       |       |    |
|        | 15 |              | 10 | 9 |       | 7     | 6   |       | 4      | 3     |       | 1     | 0  |
| S3R2I3 |    | Major Opcode |    |   | rs2/d |       |     | rs1   |        |       | Imm   |       | М  |
|        |    |              |    |   |       |       |     |       |        |       |       |       |    |
|        | 15 |              | 10 | 9 |       | 7     | 6   |       | 4      | 3     |       |       | 0  |
| S3R2I4 |    | Major Opcode |    |   | rs2/d |       |     | rs1   |        | Mlı   | nor O | pc/Ir | nm |
|        |    |              |    |   |       |       |     |       |        |       |       |       |    |
|        | 15 |              | 10 | 9 |       | 7     | 6   |       | 4      | 3     |       | 1     | 0  |
| S3R3I0 |    | Major Opcode |    |   | rd    |       |     | rs2   |        |       | rs1   |       | М  |
|        |    |              |    |   |       |       |     |       |        |       |       |       |    |
|        | 15 |              | 10 | 9 |       |       |     | 5     | 4      |       |       |       | 0  |
| S5R1I0 |    | Major Opcode |    |   | Mir   | nor o | рс  |       |        |       | rs1/d |       |    |
|        |    |              |    |   |       |       |     | _     |        |       |       |       |    |
|        | 15 |              | 10 | 9 |       |       |     | 5     | 4      |       |       |       | 0  |
| S5R1I5 |    | Major Opcode |    |   |       | rd    |     |       | N      | Aino  | r Opc | /Imn  | า  |
|        | 45 |              | 40 | 0 |       |       |     | F     |        |       |       |       | 0  |
| S5R2I0 | 15 | Major Opendo | 10 | 9 |       | rd    |     | 5     | 4      |       | rs1   |       | 0  |
| 33K2IU |    | Major Opcode |    |   |       | IU    |     |       |        |       | 151   |       |    |

#### Figure 13.1 16-Bit Instruction Formats

|    | 31 |              | 26 | 25 |             |    |    |            |    |      |               |      |          |            |        |       |                 | 0  |
|----|----|--------------|----|----|-------------|----|----|------------|----|------|---------------|------|----------|------------|--------|-------|-----------------|----|
| R0 |    | Major Opcode |    |    |             |    |    |            | In | nmed | iate/Minor Op | code | /Other   |            |        |       |                 |    |
|    | 31 |              | 26 | 25 |             | 21 | 20 |            | 16 | 15   |               |      |          |            |        |       |                 | 0  |
| R1 |    | Major Opcode |    |    | Imm/Other   |    |    | rs/fs/base |    |      |               | In   | nmediate | e/Minor Op | code   | /Othe | r               |    |
|    | 31 |              | 26 | 25 |             | 21 | 20 |            | 16 | 15   |               |      |          |            |        |       |                 | 0  |
| R2 |    | Major Opcode |    |    | rt/ft/index |    |    | rs/fs/base |    |      |               | In   | nmediate | e/Minor Op | code   | /Othe | r               |    |
|    | 31 |              | 26 | 25 |             | 21 | 20 |            | 16 | 15   |               | 11   | 10       |            |        |       |                 | 0  |
| R3 |    | Major Opcode |    |    | rt/ft/index |    |    | rs/fs/base |    |      | rd/fd         |      |          | Immedia    | ite/Mi | nor O | pcode/Other     |    |
|    | 31 |              | 26 | 25 |             | 21 | 20 |            | 16 | 15   |               | 11   | 10       |            | 6      | 5     |                 | 0  |
| R4 |    | Major Opcode |    |    | rt/ft       |    |    | rs/fs      |    |      | rd/fd         |      |          | rr/fr      |        | M     | inor Opcode/Oth | er |

#### Figure 13.2 32-Bit Instruction Formats

## Figure 13.3 Immediate Fields within 32-Bit Instructions

32-bit instruction formats with 26-bit immediate fields:

|       | 31                  | 26           | 25                      |            |    |          |         |    |           | 0 |
|-------|---------------------|--------------|-------------------------|------------|----|----------|---------|----|-----------|---|
| R0I26 | Major Op            | code         |                         |            |    | Im       | mediate | ;  |           |   |
|       | 31                  | 26           | 25                      |            | 16 | 15       |         |    |           | 0 |
| R0I16 | Major Op            | code         | Minor Op                | code/Other |    |          |         |    | Immediate |   |
|       | 32-bit instruc      | tion formats | s with 16-bit immediate | fields:    |    |          |         |    |           |   |
|       | 31                  | 26           | 25 21                   | 20         | 16 | 15       |         |    |           | 0 |
| R1I16 | Major Op            | code         | Minor Opcode/Other      | rs/fs      |    |          |         |    | Immediate |   |
|       | 31                  | 26           | 25 21                   | 20         | 16 | 15       |         |    |           | 0 |
| R2I16 | Major Op            | code         | rt/ft                   | rs/fs      |    |          |         |    | Immediate |   |
| 32-   | bit instruction for | mats with 1  | 2-bit immediate fields: |            |    |          |         |    |           |   |
|       | 31                  | 26           | 25 21                   | 20         | 16 | 15       | 12      | 11 |           | 0 |
| R1I12 | Major Op            | code         | Other                   | rs/fs      |    | Minor Op | ocode   |    | Immediate |   |
|       | 31                  | 26           | 25 21                   | 20         | 16 | 15       | 12      | 11 |           | 0 |
| R2I12 | Major Op            | code         | rt/ft                   | rs/fs      |    | Minor Op | ocode   |    | Immediate |   |

The instruction size can be completely derived from the major opcode. For 32-bit instructions, the major opcode also defines the position of the minor opcode field and whether or not the immediate field is right-aligned.

Instructions formats are named according to the number of the register fields and the size of the immediate field. The names have the structure R < x > I < y >. For example, an instruction based on the format R2I16 has 2 register fields and a 16-bit immediate field.

### 13.2.1 Instruction Stream Organization and Endianness

16-bit instructions are placed within the 32-bit (or 64-bit) memory element according to system endianness.

- On a 32-bit processor in big-endian mode, the first instruction is read from bits 31..16, and the second instruction is read from bits 15..0.
- On a 32-bit processor in little-endian mode, the first instruction is read from bits 15..0, and the second instruction is read from bits 31..16.

The above rule also applies to the halfwords of 32-bit instructions. This means that a 32-bit instruction is not treated as a word data type; instead, the halfwords are treated in the same way as individual 16-bit instructions. The halfword containing the major opcode is always the first in the sequence.

| Example:       |                              |                       |                           |       |       |       |        |
|----------------|------------------------------|-----------------------|---------------------------|-------|-------|-------|--------|
| SRL r1, r1, 7  | binary opcod<br>hex represen |                       | 000000 00001<br>0021 3840 | 00001 | 00111 | 00001 | 000000 |
| Little Endian: | Address: 3<br>Data: 38       |                       |                           |       |       |       |        |
| Big Endian:    |                              | ) 1 2 3<br>) 21 38 40 |                           |       |       |       |        |

Instructions are placed in memory such that they are in-order with respect to the address.

# 13.3 microMIPS Re-encoded Instructions

In the 16-bit category:

• Frequent MIPS instructions and macros, re-encoded as 16-bit. Register and immediate fields are reduced in size by using encodings of frequently occurring values.

In the 32-bit category:

- Opcode space for user-defined instructions (UDIs).
- New instructions designed primarily to reduce code size.

ADD16, ADD32, ADD32.PS

If these suffixes are omitted, the assembler automatically chooses the smallest instruction size.

For each instruction, the tables in this chapter provide all necessary information about the bit fields. The formats of the instructions are defined in Section 11.1 "CPU Instruction Formats". Together with the major and minor opcode encodings, which can be derived from the tables in Section 12.2 "microAptiv<sup>™</sup> UP Core Opcode Map", the complete instruction encoding is provided.

Most register fields have a width of 5 bits. 5-bit register fields use linear encoding (r0='00000', r1='00001', etc.). For 16-bit instructions, whose register field size is variable, the register field width is explicitly stated in the instruction table (Table 13.1 and Table 13.2), and the individual register and immediate encodings are shown in Table 13.3. The 'other fields' are defined by the respective column, with the order of these fields in the instruction encoding defined by the order in the tables.

### 13.3.1 16-Bit Category

#### 13.3.1.1 Frequent MIPS Instructions

These are frequent MIPS instructions with reduced register and immediate fields containing frequently used registers and immediate values.

MOVE is a very frequent instruction. It therefore supports full 5-bit unrestricted register fields for maximum efficiency. In fact, MOVE used to be a simplified macro of an existing MIPS instruction.

There are 2 variants of the LW and SW instructions. One variant implicitly uses the SP register to allow for a larger offset field. The value in the offset field is shifted left by 2 before it is added to the base address.

There are four variants of the ADDIU instruction:

- 1. A variant with one 5-bit register specifier that allows any GPR to be the source and destination register
- 2. A variant that uses the stack pointer as the implicit source and destination register
- 3. A variant that has separate 3-bit source and destination register specifiers
- 4. A variant that has the stack pointer as the implicit source register and one 3-bit destination register specifier

A 16-bit NOP instruction is needed because of the new 16-bit instruction alignment and the need in specific cases to align instructions on a 32-bit boundary. It can save code size as well. NOP is not shown in the table because it is realized as a macro (as is NEGU).

NOP16 = MOVE16 r0, r0 NEGU16 rt, rs = SUBU16 rt, r0, rs

Because microMIPS instructions are 16-bit aligned, the 16-bit branch instructions support 16-bit aligned branch target addresses. The offset field is left shifted by 1 before it is added to the PC.

The compact instruction JRC is to be used instead of JR, when the jump delay slot after JR cannot be filled. This saves code size. Because JRC may execute as fast as JR with a NOP in the delay slot, JR is preferred if the delay slot can be filled.

The breakpoint instructions, BREAK and SDBBP, include a 16-bit variant that allows a breakpoint to be inserted at any instruction address without overwriting more than a single instruction.

| Instruction | Major<br>Opcode<br>Name | Number of<br>Register<br>Fields | Immediate<br>Field Size<br>(bit) | Register<br>Field<br>Width<br>(bit) | Total<br>Size of<br>Other<br>Fields | Empty 0<br>Field Size<br>(bit) | Minor<br>Opcode<br>Size (bit) | Comment                                                              |
|-------------|-------------------------|---------------------------------|----------------------------------|-------------------------------------|-------------------------------------|--------------------------------|-------------------------------|----------------------------------------------------------------------|
| ADDIUS5     | POOL16D                 | 5bit:1                          | 4                                | 5                                   |                                     | 0                              | 1                             | Add Immediate<br>Unsigned Word Same<br>Register                      |
| ADDIUSP     | POOL16D                 | 0                               | 9                                | 0                                   |                                     | 0                              | 1                             | Add Immediate<br>Unsigned Word to<br>Stack Pointer                   |
| ADDIUR2     | POOL16E                 | 2                               | 3                                | 3                                   |                                     | 0                              | 1                             | Add Immediate<br>Unsigned Word<br>Two Registers                      |
| ADDIUR1SP   | POOL16E                 | 1                               | 6                                | 3                                   |                                     | 0                              | 1                             | Add Immediate<br>Unsigned Word<br>One Registers and<br>Stack Pointer |
| ADDU16      | POOL16A                 | 3                               | 0                                | 3                                   |                                     | 0                              | 1                             | Add Unsigned Word                                                    |
| AND16       | POOL16C                 | 2                               | 0                                | 3                                   |                                     | 0                              | 4                             | AND                                                                  |
| ANDI16      | ANDI16                  | 2                               | 4                                | 3                                   |                                     | 0                              | 0                             | AND Immediate                                                        |
| B16         | B16                     | 0                               | 10                               |                                     |                                     | 0                              | 0                             | Branch                                                               |
| BREAK16     | POOL16C                 | 0                               | 0                                |                                     | 4                                   | 0                              | 6                             | Cause Breakpoint<br>Exception                                        |
| JALR16      | POOL16C                 | 1                               | 0                                | 5                                   |                                     | 0                              | 5                             | Jump and Link<br>Register, 32-bit<br>delay-slot                      |
| JALRS16     | POOL16C                 | 1                               | 0                                | 5                                   |                                     | 0                              | 5                             | Jump and Link<br>Register, 16-bit<br>delay-slot                      |
| JR16        | POOL16C                 | 1                               | 0                                | 5                                   |                                     | 0                              | 5                             | Jump Register                                                        |
| LBU16       | LBU16                   | 2                               | 4                                | 3                                   |                                     | 0                              | 0                             | Load Byte Unsigned                                                   |
| LHU16       | LHU16                   | 2                               | 4                                | 3                                   |                                     | 0                              | 0                             | Load Halfword                                                        |
| LI16        | LI16                    | 1                               | 7                                | 3                                   |                                     | 0                              | 0                             | Load Immediate                                                       |
| LW16        | LW16                    | 2                               | 4                                | 3                                   |                                     | 0                              | 0                             | Load Word                                                            |
| LWGP        | LWGP16                  | 1                               | 7                                | 3                                   |                                     | 0                              | 0                             | Load Word GP                                                         |
| LWSP        | LWSP16                  | 5bit:1                          | 5                                | 5                                   |                                     | 0                              | 0                             | Load Word SP                                                         |
| MFHI16      | POOL16C                 | 1                               | 0                                | 5                                   |                                     | 0                              | 5                             | Move from<br>HI Register                                             |
| MFLO16      | POOL16C                 | 1                               | 0                                | 5                                   |                                     | 0                              | 5                             | Move from<br>LO Register                                             |
| MOVE16      | MOVE16                  | 2                               | 0                                | 5                                   |                                     | 0                              | 0                             | Move                                                                 |
| NOT16       | POOL16C                 | 2                               | 0                                | 3                                   |                                     | 0                              | 4                             | NOT                                                                  |
| OR16        | POOL16C                 | 2                               | 0                                | 3                                   |                                     | 0                              | 4                             | OR                                                                   |

Table 13.1 16-Bit Re-encoding of Frequent MIPS Instructions

| Instruction | Major<br>Opcode<br>Name | Number of<br>Register<br>Fields | Immediate<br>Field Size<br>(bit) | Register<br>Field<br>Width<br>(bit) | Total<br>Size of<br>Other<br>Fields | Empty 0<br>Field Size<br>(bit) | Minor<br>Opcode<br>Size (bit) | Comment                             |
|-------------|-------------------------|---------------------------------|----------------------------------|-------------------------------------|-------------------------------------|--------------------------------|-------------------------------|-------------------------------------|
| SB16        | SB16                    | 2                               | 4                                | 3                                   |                                     | 0                              | 0                             | Store Byte                          |
| SDBBP16     | POOL16C                 | 0                               | 0                                |                                     | 4                                   | 0                              | 6                             | Cause Debug<br>Breakpoint Exception |
| SH16        | SH16                    | 2                               | 4                                | 3                                   |                                     | 0                              | 0                             | Store Halfword                      |
| SLL16       | POOL16B                 | 2                               | 3                                | 3                                   |                                     | 0                              | 1                             | Shift Word Left<br>Logical          |
| SRL16       | POOL16B                 | 2                               | 3                                | 3                                   |                                     | 0                              | 1                             | Shift Word Right<br>Logical         |
| SUBU16      | POOL16A                 | 3                               | 0                                | 3                                   |                                     | 0                              | 1                             | Sub Unsigned                        |
| SW16        | SW16                    | 2                               | 4                                | 3                                   |                                     | 0                              | 0                             | Store Word                          |
| SWSP        | SWSP16                  | 5bit:1                          | 5                                | 5                                   |                                     | 0                              | 0                             | Store Word SP                       |
| XOR16       | POOL16C                 | 2                               | 0                                | 3                                   |                                     | 0                              | 4                             | XOR                                 |

Table 13.1 16-Bit Re-encoding of Frequent MIPS Instructions (Continued)

### **13.3.1.2 Frequent MIPS Instruction Sequences**

These 16-bit instructions are equivalent to frequently-used short sequences of MIPS instructions. The instruction-specific register and immediate value selection are shown in Table 13.3.

| Instruction | Major<br>Opcode<br>Name | Number of<br>Register<br>Fields | Immediate<br>Field Size<br>(bit) | Register<br>Field<br>Width<br>(bit) | Total<br>Size of<br>Other<br>Fields | Empty 0<br>Field Size<br>(bit) | Minor<br>Opcode<br>Size (bit) | Comment                     |
|-------------|-------------------------|---------------------------------|----------------------------------|-------------------------------------|-------------------------------------|--------------------------------|-------------------------------|-----------------------------|
| BEQZ16      | BEQZ16                  | 1                               | 7                                | 3                                   |                                     | 0                              | 0                             | Branch on Equal Zero        |
| BNEZ16      | BNEZ16                  | 1                               | 7                                | 3                                   |                                     | 0                              | 0                             | Branch on<br>Not Equal Zero |
| JRADDIUSP   | POOL16C                 | 0                               | 5                                |                                     |                                     |                                | 5                             | Jump Register;<br>ADDIU SP  |
| JRC         | POOL16C                 | 1                               | 0                                | 5                                   |                                     | 0                              | 5                             | Jump Register Com-<br>pact  |
| LWM16       | POOL16C                 | 0                               | 4                                |                                     | 2                                   | 0                              | 4                             | Load Word Multiple          |
| SWM16       | POOL16C                 | 0                               | 4                                |                                     | 2                                   | 0                              | 4                             | Store Word Multiple         |

### 13.3.1.3 Instruction-Specific Register Specifiers and Immediate Field Encodings

| Instruction | Number<br>of<br>Register<br>Fields | Immediate<br>Field Size<br>(bit) | Register 1<br>Decoded<br>Value | Register 2<br>Decoded<br>Value | Register 3<br>Decoded<br>Value | Immediate Field Decoded<br>Value                                    |  |
|-------------|------------------------------------|----------------------------------|--------------------------------|--------------------------------|--------------------------------|---------------------------------------------------------------------|--|
| ADDIUS5     | 5bit:1                             | 4                                | rd: 5-bit field                |                                |                                | -807                                                                |  |
| ADDIUSP     | 0                                  | 9                                |                                |                                |                                | (-2583, 2257) << 2                                                  |  |
| ADDIUR2     | 2                                  | 3                                | rs1:2-7,16, 17                 | rd:2-7,16, 17                  |                                | -1, 1, 4, 8, 12, 16, 20, 24                                         |  |
| ADDIUR1SP   | 1                                  | 6                                | rd:2-7,16, 17                  |                                |                                | (063) << 2                                                          |  |
| ADDU16      | 3                                  | 0                                | rs1:2-7,16, 17                 | rs2:2-7,16, 17                 | rd:2-7,16, 17                  |                                                                     |  |
| AND16       | 2                                  | 0                                | rs1:2-7,16, 17                 | rd:2-7,16, 17                  |                                |                                                                     |  |
| ANDI16      | 2                                  | 4                                | rs1:2-7,16, 17                 | rd:2-7,16, 17                  |                                | 1, 2, 3, 4, 7, 8, 15, 16, 31, 32, 63,<br>64, 128, 255, 32768, 65535 |  |
| B16         | 0                                  | 10                               |                                |                                |                                | (-512511) << 1                                                      |  |
| BEQZ16      | 1                                  | 7                                | rs1:2-7,16, 17                 |                                |                                | (-6463) << 1                                                        |  |
| BNEZ16      | 1                                  | 7                                | rs1:2-7,16, 17                 |                                |                                | (-6463) << 1                                                        |  |
| BREAK16     | 0                                  | 4                                |                                |                                |                                | 015                                                                 |  |
| JALR16      | 5bit:1                             | 0                                | rs1:5-bit field                |                                |                                |                                                                     |  |
| JALRS16     | 5bit:1                             | 0                                | rs1:5-bit field                |                                |                                |                                                                     |  |
| JRADDIUSP   | 0                                  | 5                                |                                |                                |                                | (031) << 2                                                          |  |
| JR16        | 5bit:1                             | 0                                | rs1:5 bit field                |                                |                                |                                                                     |  |
| JRC         | 5bit:1                             | 0                                | rs1:5 bit field                |                                |                                |                                                                     |  |
| LBU16       | 2                                  | 4                                | rb:2-7,16,17                   | rd:2-7,16, 17                  |                                | -1,014                                                              |  |
| LHU16       | 2                                  | 4                                | rb:2-7,16,17                   | rd:2-7,16, 17                  |                                | (015) << 1                                                          |  |
| LI16        | 1                                  | 7                                | rd:2-7,16, 17                  |                                |                                | -1,0126                                                             |  |
| LW16        | 2                                  | 4                                | rb:2-7,16,17                   | rd:2-7,16, 17                  |                                | (015) << 2                                                          |  |
| LWM16       | 2bit list:1                        | 4                                |                                |                                |                                | (015)<<2                                                            |  |
| LWGP        | 1                                  | 7                                | rd:2-7,16,17                   |                                |                                | (-6463)<<2                                                          |  |
| LWSP        | 5bit:1                             | 5                                | rd:5-bit field                 |                                |                                | (031)<<2                                                            |  |
| MFHI16      | 5bit:1                             | 0                                | rd:5-bit field                 |                                |                                |                                                                     |  |
| MFLO16      | 5bit:1                             | 0                                | rd:5-bit field                 |                                |                                |                                                                     |  |
| MOVE16      | 5bit:2                             | 0                                | rd:5-bit field                 | rs1:5-bit field                |                                |                                                                     |  |
| NOT16       | 2                                  | 0                                | rs1:2-7,16, 17                 | rd:2-7,16, 17                  |                                |                                                                     |  |
| OR16        | 2                                  | 0                                | rs1:2-7,16, 17                 | rd:2-7,16, 17                  |                                |                                                                     |  |
| SB16        | 2                                  | 4                                | rb:2-7,16,17                   | rs1:0, 2-7, 17                 |                                | 015                                                                 |  |
| SDBBP16     | 0                                  | 0                                |                                |                                |                                | 015                                                                 |  |
| SH16        | 2                                  | 4                                | rb:2-7,16,17                   | rs1:0, 2-7, 17                 |                                | (015) << 1                                                          |  |
| SLL16       | 2                                  | 3                                | rs1:2-7,16, 17                 | rd:2-7,16, 17                  |                                | 18 (see encoding tables)                                            |  |

# Table 13.3 Instruction-Specific Register Specifiers and Immediate Field Values

| Instruction | Number<br>of<br>Register<br>Fields | Immediate<br>Field Size<br>(bit) | Register 1<br>Decoded<br>Value | Register 2<br>Decoded<br>Value | Register 3<br>Decoded<br>Value | Immediate Field Decoded<br>Value |
|-------------|------------------------------------|----------------------------------|--------------------------------|--------------------------------|--------------------------------|----------------------------------|
| SRL16       | 2                                  | 3                                | rs1:2-7,16, 17                 | rd:2-7,16, 17                  |                                | 18 (see encoding tables)         |
| SUBU16      | 3                                  | 0                                | rs1:2-7,16, 17                 | rs2:2-7,16, 17                 | rd:2-7,16, 17                  |                                  |
| SW16        | 2                                  | 4                                | rb:2-7,16,17                   | rs1:0, 2-7, 17                 |                                | (015) << 2                       |
| SWSP        | 5bit:1                             | 5                                | rs1: 5 bit field               |                                |                                | (031) << 2                       |
| SWM16       | 2- bit list:1                      | 4                                |                                |                                |                                | (015)<<2                         |
| XOR16       | 2                                  | 0                                | rs1:2-7,16, 17                 | rd:2-7,16, 17                  |                                |                                  |

| Table 13.3 Instruction-Specific Register Specifiers and Immediate Field Values (Continued) |
|--------------------------------------------------------------------------------------------|
|--------------------------------------------------------------------------------------------|

# 13.3.2 16-bit Instruction Register Set

Many of the 16-bit instructions use 3-bit register specifiers in their binary encodings. The register set used for most of these 3-bit register specifiers is listed in Table 13.4. The register set used for SB16, SH16, SW16 source register is listed in Table 13.5. These register sets are a true subset of the register set available in 32-bit mode; the 3-bit register specifiers can directly access 8 of the 32 registers available in 32-bit mode (which uses 5-bit register specifiers).

In addition, specific instructions in the 16-bit instruction set implicitly reference the stack pointer register (*sp*), global pointer register (*gp*), the return address register (*ra*), the integer multiplier/divider output registers (*HI/LO*) and the program counter (*PC*). Of these, Table 13.6 lists *sp*, *gp* and *ra*. Table 13.7 lists the microMIPS special-purpose registers, including *PC*, *HI* and *LO*.

The microMIPS also contains some 16-bit instructions that use 5-bit register specifiers. Such 16-bit instructions provide access to all 32 general-purpose registers.

| 16-Bit<br>Register<br>Encoding <sup>1</sup> | 32-Bit MIPS<br>Register<br>Encoding <sup>2</sup> | Symbolic Name<br>(From<br><i>ArchDefs.h</i> ) | Description              |  |
|---------------------------------------------|--------------------------------------------------|-----------------------------------------------|--------------------------|--|
| 0                                           | 16                                               | s0                                            | General-purpose register |  |
| 1                                           | 17                                               | s1                                            | General-purpose register |  |
| 2                                           | 2                                                | v0                                            | General-purpose register |  |
| 3                                           | 3                                                | v1                                            | General-purpose register |  |
| 4                                           | 4                                                | a0                                            | General-purpose register |  |
| 5                                           | 5                                                | al                                            | General-purpose register |  |
| 6                                           | 6                                                | a2                                            | General-purpose register |  |
| 7                                           | 7                                                | a3                                            | General-purpose register |  |

Table 13.4 16-Bit Instruction General-Purpose Registers - \$2-\$7, \$16, \$17

- 1. "0-7" correspond to the register's 16-bit binary encoding and show how that encoding relates to the MIPS registers. "0-7" never refer to the registers, except within the binary microMIPS instructions. From the assembler, only the MIPS names (\$16, \$17, \$2, etc.) or the symbolic names (s0, s1, v0, etc.) refer to the registers. For example, to access register number 17 in the register file, the programmer references \$17 or s1, even though the micro-MIPS binary encoding for this register is 001.
- General registers not shown in the above table are not accessible through the 16-bit instruction using 3-bit register specifiers. The Move instruction can access all 32 general-purpose registers.

| 16-Bit<br>Register<br>Encoding <sup>1</sup> | 32-Bit MIPS<br>Register<br>Encoding <sup>2</sup> | Symbolic Name<br>(From<br><i>ArchDefs.h</i> ) | Description              |
|---------------------------------------------|--------------------------------------------------|-----------------------------------------------|--------------------------|
| 0                                           | 0                                                | zero                                          | Hard-wired Zero          |
| 1                                           | 17                                               | s1                                            | General-purpose register |
| 2                                           | 2                                                | v0                                            | General-purpose register |
| 3                                           | 3                                                | v1                                            | General-purpose register |
| 4                                           | 4                                                | a0                                            | General-purpose register |
| 5                                           | 5                                                | al                                            | General-purpose register |
| 6                                           | 6                                                | a2                                            | General-purpose register |
| 7                                           | 7                                                | a3                                            | General-purpose register |

Table 13.5 SB16, SH16, SW16 Source Registers - \$0, \$2-\$7, \$17

1. "0-7" correspond to the register's 16-bit binary encoding and show how that encoding relates to the MIPS registers. "0-7" never refer to the registers, except within the binary microMIPS instructions. From the assembler, only the MIPS names (\$16, \$17, \$2, etc.) or the symbolic names (s0, s1, v0, etc.) refer to the registers. For example, to access register number 17 in the register file, the programmer references \$17 or s1, even though the micro-MIPS binary encoding for this register is 001.

2. General registers not shown in the above table are not accessible through the 16-bit instructions using 3-bit register specifier. The Move instruction can access all 32 general-purpose registers.

| 16-Bit<br>Register<br>Encoding | 32-Bit MIPS<br>Register<br>Encoding | Symbolic Name<br>(From<br><i>ArchDefs.h</i> ) | Description             |
|--------------------------------|-------------------------------------|-----------------------------------------------|-------------------------|
| Implicit                       | 28                                  | gp                                            | Global pointer register |
| Implicit                       | 29                                  | sp                                            | Stack pointer register  |
| Implicit                       | 31                                  | ra                                            | Return address register |

| Symbolic Name | Purpose                                                                        |
|---------------|--------------------------------------------------------------------------------|
| РС            | Program counter. The PC-relative ADDIU can access this register as an operand. |
| HI            | Contains high-order word of multiply or divide result.                         |
| LO            | Contains low-order word of multiply or divide result.                          |

Table 13.7 16-Bit Instruction Special-Purpose Registers

## 13.3.3 32-Bit Category

#### 13.3.3.1 New 32-bit instructions

The following table lists the 32-bit instructions introduced in the microMIPS ISA.

| Instruction | Major<br>Opcode<br>Name | Number of<br>Register<br>Fields | Immediate<br>Field Size<br>(bit) | Register<br>Field<br>Width<br>(bit) | Total<br>Size of<br>Other<br>Fields | Empty 0<br>Field Size<br>(bit) | Minor<br>Opcode<br>Size (bit) | Comment                                                                |
|-------------|-------------------------|---------------------------------|----------------------------------|-------------------------------------|-------------------------------------|--------------------------------|-------------------------------|------------------------------------------------------------------------|
| ADDIUPC     | ADDIUPC                 | 1                               | 23                               | 3                                   |                                     | 0                              | 0                             | ADDIU PC-Relative                                                      |
| BEQZC       | POOL32I                 | 2:5 bit                         | 16                               | 5                                   |                                     |                                | 0                             | Branch on<br>Equal to Zero, No<br>Delay Slot                           |
| BNEZC       | POOL32I                 | 2:5 bit                         | 16                               | 5                                   |                                     |                                | 0                             | Branch on<br>Not Equal to Zero, No<br>Delay Slot                       |
| JALRS       | POOL32A                 | 2:5 bit                         | 0                                | 5                                   |                                     |                                | 16                            | Jump and Link Regis-<br>ter, Short Delay Slot                          |
| JALRS.HB    | POOL32A                 | 2:5 bit                         | 0                                | 5                                   |                                     |                                | 16                            | Jump and Link Regis-<br>ter with Hazard Bar-<br>rier, Short Delay Slot |
| JALS        | JALS32                  | 0                               | 26                               |                                     |                                     |                                | 0                             | Jump and Link, Short<br>Delay Slot                                     |
| JALX        | JALX                    |                                 | 26                               | 5                                   |                                     | 0                              | 5                             | Jump and Link<br>Exchange                                              |
| LWP         | POOL32B                 | 2:5 bit                         | 12                               |                                     | 5                                   | 0                              | 4                             | Load Word Pair                                                         |
| LWXS        | POOL32A                 | 3:5 bit                         | 0                                | 5                                   | 0                                   | 1                              | 10                            | Load Word Indexed,<br>Scale                                            |
| LWM32       | POOL32B                 | 1:5bit                          | 12                               |                                     | 5                                   | 0                              | 4                             | Load Word Multiple                                                     |
| SWP         | POOL32B                 | 2:5 bit                         | 12                               |                                     |                                     | 0                              | 4                             | Load Word Pair                                                         |
| SWM32       | POOL32B                 | 1:5bits                         | 12                               |                                     | 5                                   | 0                              | 4                             | Store Word Multiple                                                    |

Table 13.8 32-bit Instructions introduced within microMIPS

microMIPS<sup>™</sup> Instruction Set Architecture

Appendix A

# References

This appendix lists other publications available from MIPS Technologies, Inc. that are referenced in this document. These documents may be included in the \$MIPS\_PROJECT/doc area of a typical microAptiv UP soft or hard core release, or in some cases may be available on the MIPS web site https://www.mips.com.

- 1. MIPS32<sup>®</sup> microAptiv<sup>™</sup> UP Processor Core Family Data Sheet MIPS Document: MD00939
- 2. MIPS32<sup>®</sup> microAptiv<sup>™</sup> UP Processor Core Family Integrator's Guide MIPS Document: MD00941
- 3. MIPS32<sup>®</sup> microAptiv<sup>™</sup> UP Processor Core Family Implementor's Guide MIPS Document: MD00940
- MIPS32<sup>®</sup> microAptiv<sup>™</sup> UP Processor Core Family System Package & Simulation Flow User's Manual MIPS Document: MD00943
- 5. *MIPS*® *Architecture For Programmers*, Volume I: Introduction to the MIPS32® Architecture MIPS Document: MD0082
- 6. *MIPS*® *Architecture For Programmers*, Volume I: Introduction to the microMIPS32<sup>™</sup> Architecture MIPS Document: MD0741
- 7. *MIPS*® *Architecture For Programmers*, Volume II: The MIPS32® Instruction Set MIPS Document: MD0086
- 8. *MIPS*® *Architecture For Programmers*, Volume II: The microMIPS32<sup>™</sup> Instruction Set MIPS Document: MD0582
- MIPS<sup>®</sup> Architecture For Programmers Volume III: The MIPS32<sup>®</sup> and microMIPS32<sup>™</sup> Privileged Resource Architecture MIPS Document: MD00090
- MIPS® Architecture for Programmers Volume IV-h: The MCU Application-Specific Extension to the MIPS32® Architectures MIPS Document: MD00834
- MIPS<sup>®</sup> Architecture for Programmers Volume IV-h: The MCU Application-Specific Extension to the microMIPS32<sup>™</sup> Architectures MIPS Document: MD00838
- 12. MIPS® EJTAG Specification MIPS Document: MD00047

- 13. MIPS® cJTAG Adapter User's Manual MIPS Document: MD00862
- 14. MIPS® Architecture Reference Manual Volume IV-e: The MIPS® DSP Module of the MIPS32® Architecture MIPS Document: MD00372
- MIPS® Architecture Reference Manual Volume IV-e: The MIPS® DSP Module of the microMIPS32® Architecture MIPS Document: MD00762
- 16. Five Methods of Utilizing the MIPS® DSP Module MIPS Document: MD00783
- 17. Efficient DSP Module Programming in C: Tips and Tricks MIPS Document: MD00485
- 18. Accelerating DSP Filter Loops with MIPS® CorExtend® Instructions MIPS Document: MD00303

# Revision History

I

Change bars (vertical lines) in the margins of this document indicate significant changes in the document since its last release. Change bars are removed for changes that are more than one revision old.

This document may refer to Architecture specifications (for example, instruction set descriptions and EJTAG register definitions), and change bars in these sections indicate changes since the previous version of the relevant Architecture document.

| Revision | Date              | Description                                  |
|----------|-------------------|----------------------------------------------|
| 01.00    | July 31, 2013     | • Initial 3_0_0 General Availability release |
| 01.01    | November 12, 2013 | Updates to CACHE instruction                 |
| 01.02    | July 30, 2014     | • Changes to timer interrupt input.          |