

**PROGRAMMER'S GUIDE** 

# Applicable to Apollo4 Family SoCs: Apollo4 / Apollo4 Blue, Apollo4 Plus / Apollo4 Blue Plus, Apollo4 Lite / Apollo4 Blue Lite

Ultra-low Power Apollo SoC Family Doc. ID: PG-A4-8p0 Doc. Revision: 8.0, July 2023



# Legal Information and Disclaimers

AMBIQ MICRO INTENDS FOR THE CONTENT CONTAINED IN THE DOCUMENT TO BE ACCURATE AND RELI-ABLE. THIS CONTENT MAY, HOWEVER, CONTAIN TECHNICAL INACCURACIES, TYPOGRAPHICAL ERRORS OR OTHER MISTAKES. AMBIQ MICRO MAY MAKE CORRECTIONS OR OTHER CHANGES TO THIS CONTENT AT ANY TIME. AMBIQ MICRO AND ITS SUPPLIERS RESERVE THE RIGHT TO MAKE CORRECTIONS, MODIFI-CATIONS, ENHANCEMENTS, IMPROVEMENTS AND OTHER CHANGES TO ITS PRODUCTS, PROGRAMS AND SERVICES AT ANY TIME OR TO DISCONTINUE ANY PRODUCTS, PROGRAMS, OR SERVICES WITHOUT NOTICE.

THE CONTENT IN THIS DOCUMENT IS PROVIDED "AS IS". AMBIQ MICRO AND ITS RESPECTIVE SUPPLIERS MAKE NO REPRESENTATIONS ABOUT THE SUITABILITY OF THIS CONTENT FOR ANY PURPOSE AND DIS-CLAIM ALL WARRANTIES AND CONDITIONS WITH REGARD TO THIS CONTENT, INCLUDING BUT NOT LIM-ITED TO, ALL IMPLIED WARRANTIES AND CONDITIONS OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, TITLE AND NON-INFRINGEMENT OF ANY THIRD PARTY INTELLECTUAL PROPERTY RIGHT.

AMBIQ MICRO DOES NOT WARRANT OR REPRESENT THAT ANY LICENSE, EITHER EXPRESS OR IMPLIED, IS GRANTED UNDER ANY PATENT RIGHT, COPYRIGHT, MASK WORK RIGHT, OR OTHER INTELLECTUAL PROP-ERTY RIGHT OF AMBIQ MICRO COVERING OR RELATING TO THIS CONTENT OR ANY COMBINATION, MACHINE, OR PROCESS TO WHICH THIS CONTENT RELATE OR WITH WHICH THIS CONTENT MAY BE USED.

USE OF THE INFORMATION IN THIS DOCUMENT MAY REQUIRE A LICENSE FROM A THIRD PARTY UNDER THE PATENTS OR OTHER INTELLECTUAL PROPERTY OF THAT THIRD PARTY, OR A LICENSE FROM AMBIQ MICRO UNDER THE PATENTS OR OTHER INTELLECTUAL PROPERTY OF AMBIQ MICRO.

INFORMATION IN THIS DOCUMENT IS PROVIDED SOLELY TO ENABLE SYSTEM AND SOFTWARE IMPLE-MENTERS TO USE AMBIQ MICRO PRODUCTS. THERE ARE NO EXPRESS OR IMPLIED COPYRIGHT LICENSES GRANTED HEREUNDER TO DESIGN OR FABRICATE ANY INTEGRATED CIRCUITS OR INTE-GRATED CIRCUITS BASED ON THE INFORMATION IN THIS DOCUMENT. AMBIQ MICRO RESERVES THE RIGHT TO MAKE CHANGES WITHOUT FURTHER NOTICE TO ANY PRODUCTS HEREIN. AMBIQ MICRO MAKES NO WARRANTY, REPRESENTATION OR GUARANTEE REGARDING THE SUITABILITY OF ITS PRODUCTS FOR ANY PARTICULAR PURPOSE, NOR DOES AMBIQ MICRO ASSUME ANY LIABILITY ARISING OUT OF THE APPLICATION OR USE OF ANY PRODUCT OR CIRCUIT, AND SPECIFICALLY DISCLAIMS ANY AND ALL LIABIL-ITY, INCLUDING WITHOUT LIMITATION CONSEQUENTIAL OR INCIDENTAL DAMAGES. "TYPICAL" PARAME-TERS WHICH MAY BE PROVIDED IN AMBIQ MICRO DATA SHEETS AND/OR SPECIFICATIONS CAN AND DO VARY IN DIFFERENT APPLICATIONS AND ACTUAL PERFORMANCE MAY VARY OVER TIME. ALL OPERATING PARAMETERS, INCLUDING "TYPICALS" MUST BE VALIDATED FOR EACH CUSTOMER APPLICATION BY CUS-TOMER'S TECHNICAL EXPERTS. AMBIQ MICRO DOES NOT CONVEY ANY LICENSE UNDER NEITHER ITS PAT-ENT RIGHTS NOR THE RIGHTS OF OTHERS. AMBIQ MICRO PRODUCTS ARE NOT DESIGNED, INTENDED, OR AUTHORIZED FOR USE AS COMPONENTS IN SYSTEMS INTENDED FOR SURGICAL IMPLANT INTO THE BODY, OR OTHER APPLICATIONS INTENDED TO SUPPORT OR SUSTAIN LIFE, OR FOR ANY OTHER APPLICA-TION IN WHICH THE FAILURE OF THE AMBIQ MICRO PRODUCT COULD CREATE A SITUATION WHERE PER-SONAL INJURY OR DEATH MAY OCCUR. SHOULD BUYER PURCHASE OR USE AMBIQ MICRO PRODUCTS FOR ANY SUCH UNINTENDED OR UNAUTHORIZED APPLICATION, BUYER SHALL INDEMNIFY AND HOLD AMBIQ MICRO AND ITS OFFICERS, EMPLOYEES, SUBSIDIARIES, AFFILIATES, AND DISTRIBUTORS HARM-LESS AGAINST ALL CLAIMS, COSTS, DAMAGES, AND EXPENSES, AND REASONABLE ATTORNEY FEES ARIS-ING OUT OF, DIRECTLY OR INDIRECTLY, ANY CLAIM OF PERSONAL INJURY OR DEATH ASSOCIATED WITH SUCH UNINTENDED OR UNAUTHORIZED USE, EVEN IF SUCH CLAIM ALLEGES THAT AMBIQ MICRO WAS NEGLIGENT REGARDING THE DESIGN OR MANUFACTURE OF THE PART.

# **Table of Content**

| 1. Introduction                                              | 12         |
|--------------------------------------------------------------|------------|
| 2. Document Revision History                                 | 13         |
| 3. SoC Architecture                                          | 16         |
| 3.1 Interrupts                                               | 16         |
| 3.2 Memory Map                                               | 21         |
| 3.3 Power Management Programming                             | 27         |
| 3.4 Instrumentation Trace Macrocell (ITM)                    | 29         |
| 3.5 SoC Control                                              | 29         |
| 3.6 Memory Controller Programming                            | 29         |
| 4. Clock Generation (CLKGEN)                                 | 33         |
| 4.1 HFRC Auto-adjustment                                     | 33         |
| 4.2 Generating 100 Hz                                        | 33         |
| 4.3 Clock Enablement for Modules with a Special Mux          | 34         |
| 4.4 External 32.768 kHz Clock Input on X0                    | 40         |
| 5. Real Time Clock (RTC)                                     | 41         |
| 5.1 Calendar Counters                                        | 41         |
| 5.2 Calendar Counter Reads                                   | 41         |
| 5.3 Alarms                                                   | 41         |
| 5.4 Century Control and Leap Year Management                 | 42         |
| 5.5 Weekday Function                                         | 42         |
| 6 Counter/Timer Module (TIMER)                               | 13         |
| 6.1 Counter/Timer Functions                                  | <b>4</b> 3 |
| 6.2 Triggering Functions                                     | 40<br>53   |
| 6.2 Clocking Timor/Counters with Other Counter/Timor Outputs | 55         |
| 6.4 Clobal Timer/Counter Enable                              | 54         |
| 6.5 Concrating the Sample Pate for the ADC                   | 55         |
| 6.6 Concrating the Sample Rate for the Audio ADC             | 55         |
| 6.7 CLP and EN Details                                       | 55         |
| 6.2 NOSYNC Eurotion                                          | 55         |
| 6.0 Counter Functions                                        | 56         |
| 6.10 Interconnecting Timers                                  | 56         |
|                                                              | 50         |
| 7. System Timer (STIMER)                                     | 57         |
| 8. Watch Dog Timer (WDT)                                     | 58         |
| 8.1 Basic Operation                                          | 58         |
| 8.2 Register Functions                                       | 58         |
| 9. General Purpose Input/Output (GPIO)                       | 59         |
| 9.1 General Purpose I/O (GPIO) Functions                     | 59         |
| 9 2 Pad Connection Summary                                   | 61         |
| 9.3 Module-specific Pad Configuration                        | 64         |
|                                                              | υ.         |

| 10. General Purpose ADC and Temperature Sensor Module (GPADC)  | 87         |
|----------------------------------------------------------------|------------|
| 10.1 Clock Source and Divider                                  | 87         |
| 10.2 Eleven Selectable Analog Input Channels                   | 87         |
| 10.3 Triggering and Trigger Sources                            | 88         |
| 10.4 Voltage Reference Source                                  | 88         |
| 10.5 Eight Automatically Managed Conversion Slots              | 88         |
| 10.6 Automatic Sample Accumulation and Scaling                 | 89         |
| 10.7 Sixteen Entry Result FIFO                                 | 90         |
| 10.8 Window Comparator                                         | 92         |
| 10.9 Operating Modes and the Mode Controller                   | 93         |
| 10.10 Interrupts                                               | 96         |
| 10.11 Generating the Sample Rate for the GPADC                 | 97         |
| 11. Multi-bit Serial Peripheral Interface Master Module (MSPI) | 98         |
| 11.1 Configuration                                             | 98         |
| 11.2 PIO Operation                                             | 99         |
| 11.3 DMA Operations                                            | 99         |
| 11.4 Execute in Place (XIP) Operations                         | 100        |
| 11.5 Command Queueing (CQ)                                     | 102        |
| 11 6 Data Scrambling                                           | 106        |
| 11.7 Auto Power Down                                           | 106        |
| 11.8 Board/Package Considerations for MSPI Pin Timing          | 107        |
|                                                                |            |
| 12. I2C/SPI Master (IOM) 1                                     | 111        |
| 12.1 Programmer's Reference                                    | 111        |
| 12.2 Interface Clock Generation                                | 111        |
| 12.3 Command Operation                                         | 112        |
| 12.4 FIFO                                                      | 113        |
| 12.5 I2C Interface                                             | 113        |
| 12.6 SPI Operations                                            | 118        |
| 12.7 Bit Orientation                                           | 121        |
| 12.8 SPI Flow Control                                          | 121        |
| 12.9 Minimizing Power                                          | 123        |
| 13. I2C/SPI Slave (IOS) 1                                      | 24         |
| 13.1 Local RAM Allocation                                      | 124        |
| 13.2 Direct Area Functions                                     | 125        |
| 13.3 FIFO Area Functions                                       | 129        |
| 13.4 Rearranging the FIFO                                      | 130        |
| 13.5 Interface Interrupts                                      | 131        |
| 13.6 Command Completion Interrupts                             | 132        |
| 13.7 Host Address Space and Registers                          | 132        |
| 13.8 I2C Interface                                             | 133        |
| 13.9 SPI Interface                                             | 136        |
| 13.10 Bit Orientation                                          | 138        |
| 13.11 Wakeup Using the I2C/SPI Slave                           | 138        |
| 13.12 Host Side Address Space and Register                     | 140        |
| 14 Universal Asynchronous Receiver/Transmitter (UAPT)          | 145        |
| 14.1 Enabling and Selecting the LIAPT Cleak                    | 1/5        |
| 14.1 Enabling and Selecting the UART Clock                     | 140<br>175 |
| 14.2 Configuration                                             | 140        |
|                                                                | 140        |

| 15. Universal Serial Bus (USB)                                   | 146          |
|------------------------------------------------------------------|--------------|
| 15.1 Functional Overview                                         | . 146        |
| 15.2 USB Reset                                                   | . 146        |
| 15.3 Soft Connect/Disconnect                                     | . 147        |
| 15.4 High-speed Mode                                             | . 147        |
| 15.5 USB Interrupt Handling                                      | . 147        |
| 15.6 Index 0 Register Fields                                     | . 149        |
| 15.7 Response to USB Conditions or Host Actions                  | . 151        |
| 15.8 Suspend/Resume                                              | . 152        |
| 15.9 Start of Frame Packets                                      | . 153        |
| 15.10 Dynamic FIFO Sizing                                        | . 154        |
| 15.11 Endpoint 0 Handling                                        | . 155        |
| 15.12 IN Endpoint Packet Handling                                | . 166        |
| 15.13 OUT Endpoint Packet Handling                               | . 170        |
| 15.14 Bulk Transactions                                          | . 175        |
| 15.15 Isochronous Transactions                                   | . 180        |
| 15.16 Transaction Flows                                          | . 187        |
| 15.17 Test Modes                                                 | . 198        |
|                                                                  | 400          |
| 16. Secure Digital Input Output (SDIO)                           | 199          |
| 16.1 Functional Description                                      | . 199        |
| 16.2 Clocks                                                      | . 202        |
| 16.3 Advanced DMA                                                | . 204        |
| 16.4 Driver flow sequence                                        | . 207        |
| 16.5 Using the AmbiqSuite SDK to Program and Use the SDIO Module | . 214        |
| 17 Diaplay Controllar (DC)                                       | 240          |
| 17. Display Controller (DC)                                      | 219          |
| 17.1 Software Support                                            | . 219        |
| 17.2 Timing Generator                                            | . 221        |
| 17.3 Layer Overlays                                              | . 223        |
| 17.4 Dienaing Modes                                              | . 220        |
| 17.5 Palette/Gamma Correction                                    | . 221<br>220 |
| 17.0 Diuleining                                                  | . 220        |
| 17.7 COIDI MOUES                                                 | 229          |
| 17.0 TSC Framebuller Decompression                               | 234          |
| Tr.5 Display Formats                                             | . 200        |
| 18 Display Sorial Interface (DSI)                                | 248          |
|                                                                  | 240          |
| 19 Graphics and the GEV Library on the Apollo/ Family MCUs       | 210          |
| 10.1 Introduction to Graphics                                    | 243          |
| 19.1 Introduction to Graphics                                    | 249          |
| 19.2 GFU and Graphics Soltware Support                           | 201          |
| 19.5 GFA LIDIALY AICHRECIULE                                     | 252          |
| 19.4 Graphilos Fipeline of the GFU                               | 204          |
| 19.6 CFX Library ΔPI Guidelines                                  | 204          |
| 19.7 Color Modes and Binding Textures                            | 201          |
| 19.8 Geometry Primitives                                         | 277          |
| 19 9 Blending                                                    | 278          |
| 19 10 Fonts                                                      | 285          |
| 19 11 GEX Library Platform Porting                               | 287          |
| 19.12 GFX Library Functions                                      | . 298        |
| •                                                                |              |

| 20. PDM-to-PCM Converter (PDM)                    | 359 |
|---------------------------------------------------|-----|
| 20.1 PDM Clock Configuration                      | 359 |
| 20.2 Operating Modes                              | 361 |
| 20.3 Supported Data Format                        | 361 |
| 20.4 Digital Volume Control                       | 362 |
| 20.5 Low Pass Filter (LPF)                        | 363 |
| 20.6 High Pass Filter (HPF)                       | 364 |
| 21. Low Power Analog Audio Interface              | 365 |
| 21.1 Automatic Sample Accumulation and Scaling    | 365 |
| 21.2 Sixteen Entry Result FIFO                    | 366 |
| 21.3 DMA                                          | 368 |
| 21.4 Window Comparator                            | 369 |
| 21.5 Operating Modes and the Mode Controller      | 370 |
| 21.6 Interrupts                                   | 373 |
| 21.7 Generating the Sample Rate for the Audio ADC | 374 |
| 22. Inter-IC Sound (I2S)                          | 375 |
| 22.1 I2S Clock Management                         | 375 |
| 22.2 DMA                                          | 376 |
| 22.3 Interrupts                                   | 377 |
| 22.4 Data Configurations                          | 377 |
| 22.5 Configuration and Control                    | 379 |
| 22.6 Serial Audio Interface                       | 381 |
| 23. Ordering Information                          | 385 |

# List of Figures

| Figure 1. Clock Source Selection Flowchart                               | 37  |
|--------------------------------------------------------------------------|-----|
| Figure 2. Timer Edge Mode (TMRnFN = 0x1) - CMP0 > CMP1                   | 46  |
| Figure 3. Timer Edge Mode (TMRnFN = 0x1) - CMP0 < CMP1                   | 47  |
| Figure 4. Timer Repeated Pulse Up-counter Compare Mode (TMRnFN = 0x2)    | 48  |
| Figure 5. Timer PWM Mode (TMRnFN = 0x4)                                  | 49  |
| Figure 6. Timer Repeated Pulse Down-counter Compare Mode (TMRnFN = 0x6)  | 50  |
| Figure 7. Timer Single 32-bit Pattern Output (TMRnFN = 0xC)              | 51  |
| Figure 8. Timer Single 64-bit Pattern Output (TMRnFN = 0xC)              | 51  |
| Figure 9. Single 32-bit Pattern Output                                   | 52  |
| Figure 10. Single 64-bit Pattern Output                                  | 52  |
| Figure 11. Timer Event Counter (TMRnFN = 0xE)                            | 53  |
| Figure 12. Pad Connection Details                                        | 62  |
| Figure 13. Scan Flowchart                                                | 95  |
| Figure 14. XIP Block Diagram                                             | 101 |
| Figure 15. MSPI TX Interface Timing at SCLK = 48 MHz                     | 108 |
| Figure 16. MSPI RX Interface Timing at SCLK = 48 MHz                     | 109 |
| Figure 17, MSPI RX Interface Timing at SCLK = 96 MHz                     | 109 |
| Figure 18, I2C/SPI Master Clock Generation                               | 112 |
| Figure 19 Basic I2C Conditions                                           | 113 |
| Figure 20, I2C Acknowledge                                               | 114 |
| Figure 21, I2C 7-bit Address Operation                                   | 115 |
| Figure 22, 12C 10-bit Address Operation                                  | 115 |
| Figure 23, I2C Offset Address Transmission                               | 115 |
| Figure 24, I2C Write Operation with Address Offset                       | 116 |
| Figure 25, I2C Read Operation with Address Offset                        | 116 |
| Figure 26, I2C Write Operation with No Address Offset                    | 116 |
| Figure 27, 12C Read Operation with No Address Offset                     | 117 |
| Figure 28 SPI Normal Write Operation (Single-byte Offset Address)        | 118 |
| Figure 29 SPI Normal Read Operation                                      | 119 |
| Figure 30 SPI Raw Write Operation                                        | 119 |
| Figure 31 SPI Raw Read Operation                                         | 120 |
| Figure 32 SPI Combined Operation                                         | 120 |
| Figure 33 SPI CPOL and CPHA                                              | 121 |
| Figure 34 Flow Control at Beginning of a Write Transfer                  | 122 |
| Figure 35. Flow Control at Beginning of a Raw Read Transfer              | 122 |
| Figure 36. Flow Control in the Middle of a Write Transfer                | 123 |
| Figure 37. Flow Control in the Middle of a Read Transfer                 | 123 |
| Figure 38, I2C/SPI Slave Module LRAM Addressing in Standard Address Mode | 125 |
| Figure 39, I2C/SPI Slave Module FIFO                                     | 130 |
| Figure 40. Basic I2C Conditions                                          | 133 |
| Figure 41, I2C Acknowledge                                               | 134 |
| Figure 42, I2C 7-bit Address Operation                                   | 134 |
| Figure 43, I2C 10-bit Address Operation                                  | 135 |
| Figure 44, I2C Offset Address Transmission                               | 135 |
| Figure 45, I2C Write Operation                                           | 135 |
| Figure 46. I2C Read Operation                                            | 136 |
| Figure 47. SPI Write Operation                                           | 137 |
| Figure 48. SPI Read Operation                                            | 137 |
| Figure 49. SPI CPOL and CPHA                                             | 138 |
| Figure 50. USB Interrupt Service Routine                                 | 148 |
| Figure 51, Endpoint 0 States                                             | 158 |
| Figure 52. Endpoint 0 Service Routine                                    | 160 |
|                                                                          |     |

| Figure 53. | SETUP Phase of Control Transfer                                               | 161 |
|------------|-------------------------------------------------------------------------------|-----|
| Figure 54. | IN Data Phase for Control Transfer                                            | 162 |
| Figure 55. | OUT Data Phase for Control Transfer                                           | 164 |
| Figure 56. | High-speed Isochronous IN Endpoint Transmission                               | 168 |
| Figure 57. | Packets Sent per Microframe                                                   | 172 |
| Figure 58. | High-Speed High-Bandwidth IN Endpoint                                         | 184 |
| Figure 59. | High-speed High-bandwidth OUT Endpoint                                        | 185 |
| Figure 60. | Setup Phase Transaction                                                       | 187 |
| Figure 61  | IN Data Phase Transaction                                                     | 188 |
| Figure 62  | Status Phase Following IN Data Phase Transaction                              | 189 |
| Figure 63  | OUT Data Phase Transaction                                                    | 190 |
| Figure 64  | Status Phase Following OUT Data Phase Transaction                             | 191 |
| Figure 65  | Bulk/Interrunt IN Transaction                                                 | 192 |
| Figure 66  | Bulk OUT Transaction                                                          | 102 |
| Figure 67  | Full speed leachronous IN Transaction                                         | 10/ |
| Figure 67  | Full-speed isochronous IN Transaction                                         | 104 |
| Figure 60  | High handwidth leashrangua IN Transaction                                     | 190 |
| Figure 69. | High-bandwidth leastrongue OUT Transaction                                    | 190 |
| Figure 70. | High-bandwidth isochronous OUT Transaction                                    | 197 |
| Figure 71. |                                                                               | 199 |
| Figure 72. | SDIO Module Clock Derivation                                                  | 203 |
| Figure 73. | ADMA2 Block Diagram                                                           | 204 |
| Figure 74. | Example Descriptor Table                                                      | 205 |
| Figure 75. | 32-bit Addressing Descriptor Table                                            | 206 |
| Figure 76. | Descriptor Table Length Field Definitions                                     | 206 |
| Figure 77. | Non-DMA Transfer Flow Sequence                                                | 208 |
| Figure 78. | DMA Transfer Flow Sequence                                                    | 210 |
| Figure 79. | ADMA Transaction Flow Sequence                                                | 212 |
| Figure 80. | Display Controller Timing                                                     | 221 |
| Figure 81. | Display Controller RGBA Background Color                                      | 223 |
| Figure 82. | Display Controller First Layer                                                | 223 |
| Figure 83. | Display Controller Second Layer                                               | 224 |
| Figure 84. | Display Controller Third and Fourth Layers                                    | 225 |
| Figure 85. | Gamma Correction of RGB Values                                                | 227 |
| Figure 86. | Dithering on Limited Color Palette                                            | 228 |
| Figure 87. | FTSC <sup>™</sup> 4 /TSC <sup>™</sup> 6 Framebuffer Compression Module        | 234 |
| Figure 88. | DBI-Type B Write Sequence                                                     | 235 |
| Figure 89. | DBI-Type B Read Sequence                                                      | 236 |
| Figure 90  | SPI 3- or 4-wire Interface (Dashed Line Used in 4-wire) - Data Out Only       | 241 |
| Figure 91  | SPI 3- or 4-wire Interface (Dashed Line Used in 4-wire) - Bi-directional Data | 241 |
| Figure 92  | SPI 3-wire Serial Write Transmission                                          | 242 |
| Figure 93  | SPI 4-wire Serial Write Transmission                                          | 242 |
| Figure 0/  | Single Line Undate Mode                                                       | 242 |
| Figure 05  | Multiple Lines Undate Mode                                                    | 242 |
| Figure 95. | OuadSPI Interface                                                             | 240 |
| Figure 90. |                                                                               | 240 |
| Figure 97. | Pendering Elow                                                                | 241 |
| Figure 96. |                                                                               | 250 |
| Figure 40  | GFA LIDIALY AICHILECIULE                                                      | 202 |
| Figure 100 |                                                                               | 254 |
| Figure 10  |                                                                               | 264 |
| ⊢igure 102 | 2. NEIMA® PIX-Presso State atter a Conversion                                 | 265 |
| ⊢igure 103 | 3. Original Empty Framebuffer                                                 | 2/1 |
| ⊢igure 104 | 1. Image Background                                                           | 272 |
| Figure 10  | 5. Final Output of the Drawing Process                                        | 272 |
| Figure 106 | ö. Rendered Scene with Two Icons                                              | 274 |

| Figure 107. Scene Textures                             | 275 |
|--------------------------------------------------------|-----|
| Figure 108. Predefined Blending Modes                  | 280 |
| Figure 109. Original Framebuffer before Blending       | 280 |
| Figure 110. Scene Textures                             | 281 |
| Figure 111. User-defined Blending Modes                | 282 |
| Figure 112. Source Textures                            | 284 |
| Figure 113. Additional Operations Example              | 284 |
| Figure 114. Vector and Bitmap Fonts                    | 285 |
| Figure 115. Scan Flowchart                             | 371 |
| Figure 116. I2S Block Diagram                          | 375 |
| Figure 117. DMA Configurations for 16 Bits and 24 Bits | 378 |
| Figure 118. DMA Configurations for 8 Bits and 32 Bits  | 378 |
| Figure 119. Programmable Frame Period and Width        | 381 |
| Figure 120. Data Delay                                 | 381 |
| Figure 121. Dual-phase Frame Example                   | 382 |
| Figure 122. Right Justification                        | 382 |
| Figure 123. I2S Formatted Audio Frame                  | 383 |
| Figure 124. Left-justified Audio Frame                 | 383 |
| Figure 125. Right-justified Audio Frame                | 384 |

# **List of Tables**

| Table 1: Document Revision History                              | 13 |
|-----------------------------------------------------------------|----|
| Table 2: ARM Cortex-M4 Vector Table for the Apollo4 Family      | 16 |
| Table 3: SoC Interrupt Assignments                              | 19 |
| Table 4: High-level Apollo4 / Apollo4 Blue Memory Map           | 21 |
| Table 5: High-level Apollo4 Plus / Apollo4 Blue Plus Memory Map | 21 |
| Table 6: High-level Apollo4 Lite / Apollo4 Blue Lite Memory Map | 22 |
| Table 7: Detailed Apollo4 / Apollo4 Blue Memory Map             | 22 |
| Table 8: Detailed Apollo4 Plus / Apollo4 Blue Plus Memory Map   | 23 |
| Table 9: Detailed Apollo4 Lite / Apollo4 Blue Lite Memory Map   | 24 |
| Table 10: SoC Peripheral Device Memory Map                      | 25 |
| Table 11: CPU Power Mode Transitions                            | 27 |
| Table 12: Clock Usage Table                                     | 38 |
| Table 13: Alarm RPT Function                                    | 42 |
| Table 14: Timer Modes                                           | 43 |
| Table 15: Interrupt Trigger Options                             | 60 |
| Table 16: IO Master 0 I2C Configuration                         | 64 |
| Table 17: 10 Master 0 4-wire SPI Configuration                  | 64 |
| Table 18: 10 Master 0.3-wire SPI Configuration                  | 65 |
| Table 19: 10 Slave I2C Configuration                            | 65 |
| Table 20: 10 Slave 4-wire SPI Configuration                     | 65 |
| Table 21: 10 Slave 3-wire SPI Configuration                     | 66 |
| Table 21: IC Clave & wild Clifforniguration                     | 67 |
| Table 23: LIARTO TX Configuration                               | 68 |
| Table 24: LIARTO RX Configuration                               | 68 |
| Table 25: LIARTO RTS Configuration                              | 69 |
| Table 26: UARTO CTS Configuration                               | 60 |
| Table 27: LIART1 TX Configuration                               | 70 |
| Table 28: LIART1 RX Configuration                               | 70 |
| Table 20: UARTI TAX Configuration                               | 70 |
| Table 29: UARTERING Configuration                               | 71 |
| Table 30: UARTECTS Configuration                                | 71 |
| Table 31: UART2 TA Configuration                                | 72 |
| Table 32: UART2 RA Configuration                                | 72 |
| Table 33. UART2 RTS Configuration                               | 73 |
| Table 34. UART2 CTS Configuration                               | 73 |
| Table 35. UARTS TA Configuration                                | 74 |
| Table 30. UARTS RA Configuration                                | 74 |
| Table 37: UART3 RTS Configuration                               | 10 |
| Table 38: UART3 CTS Configuration                               | 10 |
| Table 39: PDM GLK Configuration                                 | 70 |
| Table 40: PDM DATA Configuration                                | 70 |
| Table 41: 125 CLK Configuration                                 | 76 |
|                                                                 | 11 |
|                                                                 | 11 |
| Table 43: I2S DATA Configuration                                | 11 |
| Table 45: I2S SDOUT Configuration                               | 78 |
| Table 46: SDIO/SDIF Configuration                               | 78 |
| Table 47: CLKOUT Configuration                                  | 79 |
| Table 48: 32 kHz CLKOUT Configuration                           | 79 |
| I able 49: CLKOUT_32M Configuration                             | 80 |
| Table 50: ADC Analog Input Configuration                        | 80 |
| Table 51: ADC Trigger Input Configuration                       | 80 |
| Table 52: Voltage Comparator Reference Configuration            | 82 |

| Table 53: Voltage Comparator Input Configuration                                        | 83  |
|-----------------------------------------------------------------------------------------|-----|
| Table 54: Voltage Comparator Output Configuration                                       | 83  |
| Table 55: SWO Configuration                                                             | 84  |
| Table 56: SW Trace Configuration                                                        | 85  |
| Table 57: One SLOT Configuration Register                                               | 89  |
| Table 58: Per Slot Sample Accumulator                                                   | 90  |
| Table 59: Accumulator Scaling                                                           | 90  |
| Table 60: FIFO Register                                                                 | 90  |
| Table 61: 12-bit FIFO Data Format                                                       | 91  |
| Table 62: 10-bit FIFO Data Format                                                       | 91  |
| Table 63: 8-bit FIFO Data Format                                                        | 91  |
| Table 64: Window Comparator Lower Limit Register                                        | 93  |
| Table 65: Window Comparator Upper Limit Register                                        | 93  |
| Table 66: GPADC Power Modes                                                             | 95  |
| Table 67: Command Queue Example                                                         | 103 |
| Table 68: CQFLAGS                                                                       | 104 |
| Table 69: Mapping of Direct Area Access Interrupts and Corresponding REGACCINTSTAT Bits | 128 |
| Table 70: I/O Interface Interrupt Control                                               | 131 |
| Table 71: HOST IER Register                                                             | 140 |
| Table 72: HOST IER Register Bits                                                        | 140 |
| Table 73: HOST ISR Register                                                             | 141 |
| Table 74: HOST ISR Register Bits                                                        | 141 |
| Table 75: HOST WCR Register                                                             | 141 |
| Table 76: HOST WCR Register Bits                                                        | 142 |
| Table 77: HOST WCS Register                                                             | 142 |
| Table 78: HOST WCS Register Bits                                                        | 142 |
| Table 79: FIFOCTRLO Register                                                            | 143 |
| Table 80: FIFOCTRLO Register Bits                                                       | 143 |
| Table 81: FIFOCTRUP Register                                                            | 143 |
| Table 82: FIFOCTRUP Register Bits                                                       | 143 |
| Table 83: FIFO Register                                                                 | 144 |
| Table 84: FIFO Register Bits                                                            | 144 |
| Table 85: Index 0 (IDX0) Register Multi-function Fields                                 | 149 |
| Table 86: USB Controller Register Map Additions for Dynamic FIFO Sizing                 | 154 |
| Table 87: Unloaded Packet Sizes < MAXPAYLOAD Requiring Manual Clear of OutPktRdv        | 170 |
| Table 88: PID Errors and IncompRx Responses in Peripheral Mode                          | 173 |
| Table 89: IDX0 Register Field Settings for Bulk IN Endpoint                             | 175 |
| Table 90: IDX0 Register Field Settings for Bulk OUT Endpoint                            | 177 |
| Table 91: IDX0 Register Field Settings for Isochronous IN Endpoint                      | 180 |
| Table 92: IDX0 Register Field Settings for Isochronous OUT Endpoint                     | 182 |
| Table 93: PID Error or IncompRx Reporting in Peripheral Mode                            | 186 |
| Table 94: Non-DMA Transfer Flow Sequence Steps                                          | 209 |
| Table 95: DMA Transfer Flow Sequence Steps                                              | 210 |
| Table 96: ADMA Transaction Flow Sequence Steps                                          | 213 |
| Table 97: Laver 0 Example                                                               | 224 |
| Table 98: Blending Modes                                                                | 226 |
| Table 99: DestA Operand                                                                 | 255 |
| Table 100: SUBOP0-Instructions                                                          | 255 |
| Table 101: A0 Operand                                                                   | 255 |
| Table 102: A1 Operand                                                                   | 256 |
| Table 103: A2 Operand                                                                   | 256 |
| Table 104: SUBOP1 Instructions                                                          | 257 |
| Table 105: DestRGB Operand                                                              | 257 |
| Table 106: RGB0 Operand                                                                 | 258 |
|                                                                                         |     |

| Table 107: RGB1 Operand                                                 | 258 |
|-------------------------------------------------------------------------|-----|
| Table 108: RGB2 Operand                                                 | 259 |
| Table 109: SUBOP2 Instructions                                          | 259 |
| Table 110: Pixel Operand                                                | 260 |
| Table 111: Addressing Coords Operand                                    | 260 |
| Table 112: TEXnBASE Coords Operand                                      | 260 |
| Table 113: SUBOP3 Instructions                                          | 261 |
| Table 114: LHS, RHS Operands                                            | 261 |
| Table 115: Supported Formats                                            | 273 |
| Table 116: Shader conventions                                           | 274 |
| Table 117: Predefined Blending Modes                                    | 279 |
| Table 118: Blend Factors                                                | 282 |
| Table 119: ops Arguments                                                | 284 |
| Table 120: PDMA_CKO and OSR Settings for Different Sampling Frequencies | 360 |
| Table 121: PDM-to-PCM Converter Operating Mode                          | 361 |
| Table 122: 24-bit Unpacked Data                                         | 362 |
| Table 123: PGA Gain Control                                             | 362 |
| Table 124: SOFTMUTE Register Configuration                              | 363 |
| Table 125: LPF Parameters                                               | 363 |
| Table 126: Per Slot Sample Accumulator                                  | 365 |
| Table 127: Accumulator Scaling                                          | 366 |
| Table 128: FIFO Register                                                | 366 |
| Table 129: 12-bit FIFO Data Format                                      | 366 |
| Table 130: 10-bit FIFO Data Format                                      | 367 |
| Table 131: 8-bit FIFO Data Format                                       | 367 |
| Table 132: Window Comparator Lower Limit Register                       | 369 |
| Table 133: Window Comparator Upper Limit Register                       | 369 |
| Table 134: Audio ADC Power Modes                                        | 372 |
| Table 135: Configurations Supported by the I2S Module                   | 377 |
| Table 136: Apollo4 SoC Ordering Information                             | 385 |
| Table 137: Apollo4 Plus SoC Ordering Information                        | 385 |
| Table 138: Apollo4 Lite SoC Ordering Information                        | 385 |
|                                                                         |     |

# 1. Introduction

The purpose of this Programmer's Guide is to provide developers using an Apollo4 family SoC with additional information to that found in the datasheets, specifically as it relates to programming the device for intended operation. The current SoCs that make up the Apollo4 family include the following:

- Apollo4
- Apollo4 Blue
- Apollo4 Plus
- Apollo4 Blue Plus
- Apollo4 Lite
- Apollo4 Blue Lite

The Programmer's Guide and the relevant datasheet(s) are intended to supplement each other and the user should have ready access to one when referencing the other. This Programmer's Guide also refers to the AmbiqSuite SDK extensively, as the SDK is the programmers' primary source of software to use and control Apollo4 operation properly.

Be sure to check the latest errata list document for the SoC of interest to understand any issues which might affect operation or features specified in this document.

Unless stated otherwise, references in this guide to "Apollo4 SoC" pertain to all members of the Apollo4 family.

# 2. Document Revision History

| Revision | Date     | Description                                                                                                                                                                                                                                                                                                                                                                          |
|----------|----------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 1.0      | Apr 2020 | Document initial release                                                                                                                                                                                                                                                                                                                                                             |
| 2.0      | Apr 2020 | Added chapters: WDT, UART, DC, BLE Controller (appendix)<br>Updated chapters:<br>- GPIO: Updated Pin Muxing Table (CSP package pins)<br>- PDM: Added "Supported Data Formats" section                                                                                                                                                                                                |
| 3.0      | Jul 2020 | <ul> <li>Initial version for silicon revision B</li> <li>Added chapters: USB</li> <li>Updated chapters:</li> <li>- GPIO: Pin Mapping, Pad Function Color Code, and Special Pad Types tables updated.</li> <li>- DC: Corrected pin names in "Serial Formats" section.</li> <li>- BLE Controller Appendix: Minor edits to transaction diagrams for clarity.</li> </ul>                 |
| 3.1      | Oct 2020 | <ul> <li>CLKGEN: Removed digital calibration procedures for XT/LFRC and auto-calibration of LFRC, which have been deprecated.</li> <li>RTC: Removed references to deprecated features (12-hour clock mode, read error status (CKERR).</li> <li>GPIO: Removed unavailable external clocks in Pin Mapping Tables.</li> <li>DC: Display Formats content added from datasheet</li> </ul> |
| 4.0      | Oct 2020 | - I2C/SPI Slave Module: Direct Area Functions section updated to include Address Pointer Wrap Mode.                                                                                                                                                                                                                                                                                  |
| 4.1      | Dec 2020 | <ul> <li>CLKGEN: HFRC Auto-adjustment section updated.</li> <li>TIMER: References to deprecated CONTINUOUS mode removed.</li> <li>GPIO: Added replicated I2S0/I2S1 functions for sharing I2S ports on different devices in Rev B Pin Mapping Table. IO Master 0 4-wire SPI Connection section updated.</li> <li>IOM: SPI Configuration section updated.</li> </ul>                   |
| 5.0      | Jun 2021 | <ul> <li>SoC Architecture: "DAXI Considerations" section added.</li> <li>CLKGEN: "Clock Enablement for Modules with a Special Mux" section added.</li> <li>TIMER: Updated modes of operation.</li> <li>IOSLAVE: Added detail in "Wakeup Using the I2C/SPI Slave" section.</li> </ul>                                                                                                 |
| 6.0      | Sep 2021 | <ul> <li>SoC Architecture: Removed unsupported entries in "ARM Cortex-M4 Vector Table for Apollo4 SoC" and<br/>"SoC Interrupt Assignments"</li> <li>TIMER: Removed references to STIMER selection as a timer's trigger.</li> </ul>                                                                                                                                                   |

# Table 1: Document Revision History

| Revision | Date     | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
|----------|----------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 7.0      | Oct 2022 | <ul> <li>All:</li> <li>Apollo4 and Apollo4 Plus Programmer's Guides have been consolidated to include information/support for Apollo4 derivatives Apollo4, Apollo4 Blue, Apollo4 Plus, Apollo4 Blue Plus.</li> <li>SoC Architecture:</li> <li>Added note about mode transitioning requirements described in ERR076.</li> <li>Updated "ARM Cortex-M4 Vector Table for Apollo4 SoC" to remove Secure Fault ISR and exception numbers above 99.</li> <li>Updated TDAXI Buffers and Flush/Invalidate Operations" section.</li> <li>TIMER:</li> <li>Updated Edge mode operation in the "Timer Modes" table.</li> <li>Corrected pattern lengths of SINGLEPATTERN and REPEATPATTERN.</li> <li>Corrected 'Terminating a Repeat Operation' section'.</li> <li>GPIO:</li> <li>Added note specifying that the DPI-2 interface is not supported.</li> <li>"Module-specific Pad Configuration" section updated to cover all Apollo4 devices.</li> <li>GPADC:</li> <li>Updated "Clock Source and Divider" section and specified that only 24 MHz ADC clock is supported and noted that a minimum of 37 sampling/tracking cycles should be used.</li> <li>Added note specifying that a delay is required after CNVCMP ISR is asserted and before reading the FIFO.</li> <li>MSPI:</li> <li>Added section "Clocking and Other Limitations"</li> <li>On Apollo4 Blue the maximum clock rate for MSPI2 for quad and octal data widths has been updated to 24MHz SDR and 12MHz DDR.</li> <li>Added note specifying limitations for High Speed mode.</li> <li>SDIO:</li> <li>Added note specifying limitations for High Speed mode.</li> <li>SDIO:</li> <li>Added note specifying limitations for High Speed mode.</li> <li>Isplay Controller:</li> <li>"Video Timing Generator" section renamed and updated.</li> <li>Izs:</li> <li>Added.</li> <li>UART:</li> <li>Clock selection updated to include 48 MHz.</li> </ul> |

# Table 1: Document Revision History

| Revision | Date     | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
|----------|----------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 8.0      | Jul 2023 | <ul> <li>All:</li> <li>Additions and updates to include information on specific features and operation of Apollo4 Lite and Blue Lite SoC throughout.</li> <li>SoC_Architecture:</li> <li>Note added about SysTick not operational in sleep mode.</li> <li>Updated "Detailed Apollo4 Lite / Apollo4 Blue Lite Memory Maps.</li> <li>Updated "Detailed Apollo0 Plus / Apollo4 Blue Plus Memory Map" table.</li> <li>Corrected 0x10060000 - 0x07FFFFFF range Executable to 'X'.</li> <li>Corrected 0x10160000 - 0x101BFFFF range Executable to 'N'.</li> <li>Corrected 0x10160000 - 0x101BFFFF range Executable to 'N'.</li> <li>Corrected 0x10100000 - 0x17FFFFFF range Executable to 'N'.</li> <li>Corrected 0x1000000 - 0x17FFFFFFF range Executable to 'N'.</li> <li>Corrected 0x1000000 - 0x17FFFFFFF range Executable to 'N'.</li> <li>Corrected 0x1000000 - 0x17FFFFFFF range Executable to 'N'.</li> <li>Corrected 0x1000000 - 0x1015FFFF range Executable to 'N'.</li> <li>SoC Architecture: Updated "Detailed Apollo4 Plue Alue Memory Map" table.</li> <li>Corrected 0x1000000 - 0x1015FFFF range Executable to 'X'.</li> </ul> TIMER: <ul> <li>Note added that on the Apollo4 Plus and Apollo4 Blue Plus SoCs, the TIMERn registers are readonly.</li> <li>Updated note in Section 6.2 regarding triggering behavior.</li> </ul> ADC: <ul> <li>Restructured sections in chapter; updated "Automatic Sample Accumulation and Scaling" on page 89</li> </ul> Graphics: <ul> <li>Note added describing interface restrictions when using RGBA4444 or ARGB4444 input color modes.</li> </ul> PDM: <ul> <li>Corrected "Supported Data Format" section to specify only 24-bit</li></ul> |

# Table 1: Document Revision History

# 3. SoC Architecture

# 3.1 Interrupts

Within the SoC, multiple peripherals can generate interrupts. In some cases, a single peripheral may be able to generate multiple different interrupts. Each interrupt signal generated by a peripheral is connected back to the M4 core in two places. First, the interrupts are connected to the Nested Vectored Interrupt Controller, NVIC, in the core. This connection provides the standard changes to program flow associated with interrupt processing. Additionally, they are connected to the WIC outside of the core, allowing the interrupt sources to wake the M4 core when it is in a deep sleep (SRPG) mode.

The SoC supports the M4 NMI as well as the normal interrupt types. For details on the Interrupt model of the M4, please see the **"Cortex-M4 Devices Generic User Guide,**" document number DUI0553A.

Below is the M4 Vector Table for the Apollo4 family. Note that not all modules listed in the table are on all family derivatives.

| Exception Number | IRQ   | Offset      | Vector      | Description      |  |
|------------------|-------|-------------|-------------|------------------|--|
| 99               | 83    | 0x18C       | IRQ83       | CPU (cache/DAXI) |  |
| 98               | 82    | 0x168       | IRQ82       | Timer 15         |  |
| 97               | 81    | 0x184       | IRQ81       | Timer 14         |  |
| 96               | 80    | 0x180       | IRQ80       | Timer 13         |  |
| 95               | 79    | 0x17C       | IRQ79       | Timer 12         |  |
| 94               | 78    | 0x178       | IRQ78       | Timer 11         |  |
| 93               | 77    | 0x174       | IRQ77       | Timer 10         |  |
| 92               | 76    | 0x170       | IRQ76       | Timer 9          |  |
| 91               | 75    | 0x16C       | IRQ75       | Timer 8          |  |
| 90               | 74    | 0x168       | IRQ74       | Timer 7          |  |
| 89               | 73    | 0x164       | IRQ73       | Timer 6          |  |
| 88               | 72    | 0x160       | IRQ72       | Timer 5          |  |
| 87               | 71    | 0x15C       | IRQ71       | Timer 4          |  |
| 86               | 70    | 0x158       | IRQ70       | Timer 3          |  |
| 85               | 69    | 0x154       | IRQ69       | Timer 2          |  |
| 84               | 68    | 0x150       | IRQ68       | Timer 1          |  |
| 83               | 67    | 0x14C       | IRQ67       | Timer 0          |  |
| 82               | 66    | 0x148       | IRQ66       | Reserved         |  |
| 81               | 65    | 0x144       | IRQ65       | Reserved         |  |
| 80               | 64    | 0x140       | IRQ64       | Reserved         |  |
| 72-79            | 56-63 | 0x120-0x13C | IRQ56-IRQ63 | GPIO             |  |
| 71               | 55    | 0x11C       | IRQ55       | Reserved         |  |
| 70               | 54    | 0x118       | IRQ54       | Reserved         |  |
| 69               | 53    | 0x114       | IRQ53       | Reserved         |  |
| 68               | 52    | 0x110       | IRQ52       | Reserved         |  |
| 67               | 51    | 0x10C       | IRQ51       | PDM3             |  |
| 66               | 50    | 0x108       | IRQ50       | PDM2             |  |

## Table 2: ARM Cortex-M4 Vector Table for the Apollo4 Family

| Exception Number | IRQ   | Offset    | Vector      | Description                                    |  |
|------------------|-------|-----------|-------------|------------------------------------------------|--|
| 65               | 49    | 0x104     | IRQ49       | PDM1                                           |  |
| 64               | 48    | 0x100     | IRQ48       | PDM0                                           |  |
| 63               | 47    | 0xFC      | IRQ47       | Reserved                                       |  |
| 62               | 46    | 0xF8      | IRQ46       | Reserved                                       |  |
| 61               | 45    | 0xF4      | IRQ45       | I2S1                                           |  |
| 60               | 44    | 0xF0      | IRQ44       | 12S0                                           |  |
| 59               | 43    | 0xEC      | IRQ43       | Reserved                                       |  |
| 58               | 42    | 0xE8      | IRQ42       | LP-ADC                                         |  |
| 57               | 41    | 0xE4      | IRQ41       | Reserved                                       |  |
| 56               | 40    | 0xE0      | IRQ40       | Stimer Capture/Overflow                        |  |
| 48-55            | 32-39 | 0xC0-0xDC | IRQ32-IRQ39 | Stimer Compare[0:7]                            |  |
| 47               | 31    | 0xBC      | IRQ31       | Reserved                                       |  |
| 46               | 30    | 0xB8      | IRQ30       | DSI                                            |  |
| 45               | 29    | 0xB4      | IRQ29       | Display                                        |  |
| 44               | 28    | 0xB0      | IRQ28       | Graphics                                       |  |
| 43               | 27    | 0xAC      | IRQ27       | USB                                            |  |
| 42               | 26    | 0xA8      | IRQ26       | SDIO                                           |  |
| 41               | 25    | 0xA4      | IRQ25       | Reserved                                       |  |
| 40               | 24    | 0xA0      | IRQ24       | CRYPTO Secure                                  |  |
| 39               | 23    | 0x9C      | IRQ23       | Clock Control                                  |  |
| 38               | 22    | 0x98      | IRQ22       | MSPI2                                          |  |
| 37               | 21    | 0x94      | IRQ21       | MSPI1                                          |  |
| 36               | 20    | 0x90      | IRQ20       | MSPI0                                          |  |
| 35               | 19    | 0x8C      | IRQ19       | GP-ADC                                         |  |
| 34               | 18    | 0x88      | IRQ18       | UART3                                          |  |
| 33               | 17    | 0x84      | 1RQ17       | UART2                                          |  |
| 32               | 16    | 0x80      | IRQ16       | UART1                                          |  |
| 31               | 15    | 0x7C      | IRQ15       | UART0                                          |  |
| 30               | 14    | 0x78      | IRQ14       | Counter/Timers (combined - also see IRQ 67-82) |  |
| 29               | 13    | 0x74      | IRQ13       | I2C/SPI Master7                                |  |
| 28               | 12    | 0x70      | IRQ12       | I2C/SPI Master6                                |  |
| 27               | 11    | 0x6C      | IRQ11       | I2C/SPI Master5                                |  |
| 26               | 10    | 0x68      | IRQ10       | I2C/SPI Master4                                |  |
| 25               | 9     | 0x64      | IRQ9        | I2C/SPI Master3                                |  |
| 24               | 8     | 0x60      | IRQ8        | I2C/SPI Master2                                |  |
| 23               | 7     | 0x5C      | IRQ7        | I2C/SPI Master1                                |  |
| 22               | 6     | 0x58      | IRQ6        | I2C/SPI Master0                                |  |
| 21               | 5     | 0x54      | IRQ5        | I2C/SPI Slave Register Access                  |  |
| 20               | 4     | 0x50      | IRQ4        | I2C/SPI Slave                                  |  |
| 19               | 3     | 0x4C      | IRQ3        | Voltage Comparator                             |  |

Table 2: ARM Cortex-M4 Vector Table for the Apollo4 Family

| Exception Number | IRQ | Offset     | Vector         | Description             |
|------------------|-----|------------|----------------|-------------------------|
| 18               | 2   | 0x48       | IRQ2           | RTC                     |
| 17               | 1   | 0x44       | IRQ1           | Watchdog Timer          |
| 16               | 0   | 0x40       | IRQ0           | Brownout Detection      |
| 15               | -1  | 0x3C       | Systick_S      |                         |
| 14               | -2  | 0x38       | PendSV_S       |                         |
| 13               | -   | 0x34       | Reserved       |                         |
| 12               | -4  | 0x30       | DebugMonitor   |                         |
| 11               | -5  | 0x2C       | SVCall_S       |                         |
| 10               | -   | 0x28       |                |                         |
| 9                | -   | 0x24       | Deserved       |                         |
| 8                | -   | 0x20       | Reserved       |                         |
| 7                | -   | 0x1C       |                |                         |
| 6                | -10 | 0x18       | UsageFault_S   | Usage Fault             |
| 5                | -11 | 0x14       | BusFault_S     | Bus Fault               |
| 4                | -12 | 0x10       | MemoryManage_S | Memory Management Fault |
| 3                | -13 | 0xC        | HardFault_S    | Hard Fault              |
| 2                | -14 | 0x8        | NMI_S          | Unused                  |
| 1                | -   | 0x4        | Reset          |                         |
| 0x0 Initial SP   |     | Initial SP |                |                         |

## Table 2: ARM Cortex-M4 Vector Table for the Apollo4 Family

#### NOTE

Not all members of the Apollo4 family include all peripherals (and their interrupts) as shown in Table 2. Interrupts for modules not included on an SoC can be ignored.

# NOTE

MSPI1 is not available on the Apollo4 Blue SoC or on the KBR package of the Apollo4 Blue Plus SoC.

#### NOTE

Various aspects of the Arm Cortex-M4 core, including operation and interrupt handling of the system counter, SysTick, are SoC specific. For Apollo4 family SoCs during Deep Sleep Mode, SysTick is completely powered down, and in normal Sleep Mode the clocks are gated off. In either case, the SysTick counter is not running and therefore cannot be used to wake the CPU.

When the CPU enters a sleep mode, the CPU clock is gated off. Normally there is a free-running clock to keep the Wake-Up Interrupt Controller (WIC) and/or the Nested Vectored Interrupt Controller (NVIC) running. But for power saving purposes, this "free-running" clock is gated off in both sleep modes, and can be resumed when external interrupts are received.

In the case of SysTick, its interrupt is generated by the SysTick counter, which is clocked by this "free-running" clock. Since this clock is gated off, the counter stops and cannot generate an interrupt.

However, when a debugger is attached, the clock is running and the SysTick interrupt can be generated.

Hardware interrupts are assigned in the SoC to the M4 NVIC as shown below.

| IRQ         | Peripheral/Description                       |
|-------------|----------------------------------------------|
| NMI         | Unused                                       |
| IRQ0        | Brownout Detection                           |
| IRQ1        | Watchdog Timer                               |
| IRQ2        | RTC                                          |
| IRQ3        | Voltage Comparator                           |
| IRQ4        | I <sup>2</sup> C / SPI Slave                 |
| IRQ5        | I <sup>2</sup> C / SPI Slave Register Access |
| IRQ6-IRQ13  | I <sup>2</sup> C / SPI Master0-7             |
| IRQ14       | Counter/Timers                               |
| IRQ15-IRQ18 | UART0-UART3                                  |
| IRQ19       | GP-ADC                                       |
| IRQ20-IRQ22 | MSPI0-MSPI2                                  |
| IRQ23       | Clock Control                                |
| IRQ24       | CRYPTO Secure                                |
| IRQ25       | Reserved                                     |
| IRQ26       | SDIO                                         |
| IRQ27       | USB                                          |

#### Table 3: SoC Interrupt Assignments

| IRQ         | Peripheral/Description  |
|-------------|-------------------------|
| IRQ28       | Graphics                |
| IRQ29       | Display                 |
| IRQ30       | DSI                     |
| IRQ31       | Reserved                |
| IRQ32-IRQ39 | Stimer Compare[0:7]     |
| IRQ40       | Stimer Capture/Overflow |
| IRQ41       | Reserved                |
| IRQ42       | LP-ADC                  |
| IRQ43       | Reserved                |
| IRQ44-IRQ45 | I2S0-I2S1               |
| IRQ46-IRQ47 | Reserved                |
| IRQ48-IRQ51 | PDM0-PDM3               |
| IRQ52       | Reserved                |
| IRQ53       | Reserved                |
| IRQ54       | Reserved                |
| IRQ55       | Reserved                |
| IRQ56-IRQ63 | GPIO                    |
| IRQ64       | Reserved                |
| IRQ65       | Reserved                |
| IRQ66       | Reserved                |
| IRQ67-IRQ82 | Timer0-Timer15          |
| IRQ83       | CPU (cache/DAXI)        |

# Table 3: SoC Interrupt Assignments

## NOTE

Not all members of the Apollo4 family include all peripherals (and their interrupts) as shown in Table 3. Interrupts for modules not included on an SoC can be ignored.

# 3.2 Memory Map

The high-level SoC-specific memory map is as shown in the following tables.

| Address                 | Name            | Executable <sup>a</sup> | Description                                                         |  |
|-------------------------|-----------------|-------------------------|---------------------------------------------------------------------|--|
| 0x00000000 – 0x001FFFFF | MRAM            | Y                       | Internal Non-volatile MRAM memory                                   |  |
| 0x00200000 - 0x07FFFFFF | Reserved        | х                       | No device at this address range                                     |  |
| 0x08000000 – 0x08000FFF | Boot Loader ROM | Y                       | Execute Only Boot Loader and MRAM Helper<br>Functions.              |  |
| 0x08001000 – 0x0FFFFFF  | Reserved        | х                       | No device at this address range                                     |  |
| 0x10000000 – 0x101D7FFF | SRAM            | Y/N                     | SRAM Memory (execution depends on which memory is being referenced) |  |
| 0x101D8000 – 0x13FFFFFF | Reserved        | х                       | Reserved                                                            |  |
| 0x14000000 – 0x1FFFFFFF | External NVM    | Y                       | External Memory                                                     |  |
| 0x20000000 – 0x3FFFFFFF | Reserved        | Х                       | Reserved                                                            |  |
| 0x40000000 – 0x4FFFFFF  | Peripheral      | N                       | Peripheral devices                                                  |  |
| 0x50000000 – 0xDFFFFFF  | Reserved        | х                       | No device at this address range                                     |  |
| 0xE0000000 – 0xE00FFFFF | РРВ             | N                       | NVIC, System timers, System Control Block                           |  |
| 0xE0100000 – 0xFFFFFFFF | Reserved        | Х                       | No device at this address range                                     |  |

Table 4: High-level Apollo4 / Apollo4 Blue Memory Map

a. "Y" = Yes, "N" = No, "X" = Does not apply

| Address                  | Name            | Executable <sup>a</sup> | Description                                         |  |
|--------------------------|-----------------|-------------------------|-----------------------------------------------------|--|
| 0x00000000 – 0x001FFFFF  | MRAM            | Y                       | Internal Non-volatile MRAM memory                   |  |
| 0x00200000 - 0x07FFFFFFF | Reserved        | Х                       | No device at this address range                     |  |
| 0x08000000 – 0x08000FFF  | Boot Loader ROM | Y                       | Execute Only Boot Loader and MRAM Helper Functions. |  |
| 0x08001000 – 0x0FFFFFFF  | Reserved        | Х                       | No device at this address range                     |  |
| 0x14000000 – 0x1FFFFFFF  | External NVM    | Y                       | External Memory                                     |  |
| 0x20000000 – 0x3FFFFFFF  | Reserved        | х                       | Reserved                                            |  |
| 0x40000000 – 0x4FFFFFFF  | Peripheral      | N                       | Peripheral devices                                  |  |
| 0x50000000 – 0xDFFFFFFF  | Reserved        | х                       | No device at this address range                     |  |
| 0xE0000000 – 0xE00FFFFF  | PPB             | N                       | NVIC, System timers, System Control Block           |  |
| 0xE0100000 – 0xFFFFFFFF  | Reserved        | Х                       | No device at this address range                     |  |

# Table 5: High-level Apollo4 Plus / Apollo4 Blue Plus Memory Map

a. "Y" = Yes, "N" = No, "X" = Does not apply

| Address Name Executable <sup>a</sup> |                 | Description |                                                                     |
|--------------------------------------|-----------------|-------------|---------------------------------------------------------------------|
| 0x00000000 – 0x001FFFFF              | MRAM            | Y           | Internal Non-volatile MRAM memory                                   |
| 0x00200000 - 0x07FFFFFF              | Reserved        | Х           | No device at this address range                                     |
| 0x08000000 – 0x08003FFF              | Boot Loader ROM | Y           | Execute Only Boot Loader and MRAM Helper Functions.                 |
| 0x08004000 – 0x0FFFFFF               | Reserved        | Х           | No device at this address range                                     |
| 0x10000000 – 0x102BFFFF              | SRAM            | Y/N         | SRAM Memory (execution depends on which memory is being referenced) |
| 0x102C0000 - 0x13FFFFFF              | Reserved        | Х           | Reserved                                                            |
| 0x14000000 – 0x1FFFFFF               | External NVM    | Y           | External Memory                                                     |
| 0x20000000 – 0x3FFFFFFF              | Reserved        | Х           | Reserved                                                            |
| 0x40000000 – 0x5FFFFFFF              | Peripheral      | N           | Peripheral devices                                                  |
| 0x60000000 – 0xDFFFFFF               | Reserved        | Х           | No device at this address range                                     |
| 0xE0000000 – 0xE00FFFFF              | РРВ             | Ν           | NVIC, System timers, System Control Block                           |
| 0xE0100000 – 0xFFFFFFFF              | Reserved        | Х           | No device at this address range                                     |

Table 6: High-level Apollo4 Lite / Apollo4 Blue Lite Memory Map

a. "Y" = Yes, "N" = No, "X" = Does not apply

A more detailed view of the device-specific memory map which breaks out the various memory types and address ranges for each SoC, is as shown in the following tables.

| Address                 | Name            | Executable <sup>a</sup> | Cacheable <sup>a</sup> | Description                                         |
|-------------------------|-----------------|-------------------------|------------------------|-----------------------------------------------------|
| 0x00000000 – 0x001FFFFF | MRAM            | Y                       | I & D                  | Internal Non-volatile MRAM Memory                   |
| 0x00200000 - 0x07FFFFFF | Reserved        | Х                       | Х                      | No device at this address range                     |
| 0x08000000 – 0x08000FFF | Boot Loader ROM | Y                       | Ν                      | Execute Only Boot Loader and MRAM Helper Functions. |
| 0x08001000 – 0x0FFFFFF  | Reserved        | Х                       | Х                      | No device at this address range                     |
| 0x10000000 – 0x1005FFFF | CPU SRAM (TCM)  | Y                       | Ν                      | Low-power / Low Latency SRAM (TCM)                  |
| 0x10060000 – 0x1015FFFF | System SRAM     | Х                       | I only                 | Shared System SRAM (SSRAM)                          |
| 0x10160000 – 0x101D7FFF | Extended SRAM   | Ν                       | I only                 | Extended Memory                                     |
| 0x101D8000 – 0x13FFFFFF | Reserved        | Х                       | Х                      | Reserved                                            |
| 0x14000000 – 0x17FFFFFF | MSPI0           | Ν                       | I only                 | Memory Mapped MSPI 0 Aperture                       |
| 0x18000000 – 0x1BFFFFFF | MSPI1           | N                       | I only                 | Memory Mapped MSPI 1 Aperture                       |

Table 7: Detailed Apollo4 / Apollo4 Blue Memory Map

| Address                 | Name       | Executable <sup>a</sup> | Cacheable <sup>a</sup> | Description                                  |
|-------------------------|------------|-------------------------|------------------------|----------------------------------------------|
| 0x1C000000 – 0x1FFFFFF  | MSPI2      | N                       | I only                 | Memory Mapped MSPI 2 Aperture                |
| 0x20000000 – 0x3FFFFFFF | Reserved   | Х                       | Х                      | Reserved                                     |
| 0x40000000 – 0x4FFFFFFF | Peripheral | N                       | N                      | Peripheral devices                           |
| 0x50000000 – 0xDFFFFFF  | Reserved   | Х                       | Х                      | Reserved                                     |
| 0xE0000000 – 0xE00FFFFF | PPB        | N                       | N                      | NVIC, System timers, System Control<br>Block |
| 0xE0100000 – 0xFFFFFFFF | Reserved   | Х                       | Х                      | No device at this address range              |

## Table 7: Detailed Apollo4 / Apollo4 Blue Memory Map

a. "Y" = Yes, "N" = No, "X" = Does not apply

# Table 8: Detailed Apollo4 Plus / Apollo4 Blue Plus Memory Map

| Address                 | Name            | Executable | Cacheable | Daxiable | Description                                         |
|-------------------------|-----------------|------------|-----------|----------|-----------------------------------------------------|
| 0x00000000 – 0x001FFFFF | MRAM            | Y          | I & D     | Х        | Internal Non-volatile MRAM Memory                   |
| 0x00200000 - 0x07FFFFFF | Reserved        | Х          | Х         | Х        | No device at this address range                     |
| 0x08000000 – 0x08000FFF | Boot Loader ROM | Y          | N         | х        | Execute-only Boot Loader and MRAM Helper Functions. |
| 0x08001000 – 0x0FFFFFF  | Reserved        | Х          | Х         | Х        | No device at this address range                     |
| 0x10000000 – 0x1005FFFF | CPU SRAM (TCM)  | Y          | N         | х        | Low-power / Low Latency SRAM (TCM) [384<br>kB]      |
| 0x10060000 – 0x1015FFFF | System SRAM     | Х          | l only    | D only   | Shared System SRAM (SSRAM0) [1 MB]                  |
| 0x10160000 – 0x101BFFFF | Extended Memory | N          | l only    | D only   | Extended SRAM [384 kB]                              |
| 0x101C0000 - 0x102BFFFF | System SRAM     | Х          | l only    | D only   | Shared System SRAM (SSRAM1) [1 MB]                  |
| 0x102C0000 - 0x13FFFFF  | Reserved        | Х          | Х         | Х        | Reserved                                            |
| 0x14000000 – 0x17FFFFFF | MSPI0           | N          | l only    | D only   | Memory Mapped MSPI 0 Aperture [64 MB]               |
| 0x18000000 – 0x1BFFFFFF | MSPI1           | N          | l only    | D only   | Memory Mapped MSPI 1 Aperture [64 MB]               |
| 0x1C000000 – 0x1FFFFFF  | MSPI2           | N          | l only    | D only   | Memory Mapped MSPI 2 Aperture [64 MB]               |
| 0x20000000 – 0x3FFFFFFF | Reserved        | Х          | Х         | Х        | Reserved                                            |
| 0x40000000 – 0x4FFFFFFF | Peripheral      | N          | N         | N        | Peripheral devices                                  |
| 0x50000000 – 0xDFFFFFF  | Reserved        | Х          | Х         | Х        | Reserved                                            |
| 0xE0000000 – 0xE00FFFFF | PPB             | N          | N         | N        | NVIC, System timers, System Control Block           |
| 0xE0100000 – 0xFFFFFFF  | Reserved        | Х          | Х         | Х        | No device at this address range                     |

| Address                 | Name            | Executable | Cacheable | Daxiable | Description                                         |  |
|-------------------------|-----------------|------------|-----------|----------|-----------------------------------------------------|--|
| 0x00000000 – 0x001FFFFF | MRAM            | Y          | I & D     | Х        | Internal Non-volatile MRAM Memory                   |  |
| 0x00200000 - 0x07FFFFFF | Reserved        | Х          | Х         | Х        | No device at this address range                     |  |
| 0x08000000 – 0x08003FFF | Boot Loader ROM | Y          | Ν         | Х        | Execute-only Boot Loader and MRAM Helper Functions. |  |
| 0x08004000 – 0x0FFFFFF  | Reserved        | Х          | Х         | Х        | No device at this address range                     |  |
| 0x10000000 – 0x1005FFFF | CPU SRAM (TCM)  | Y          | Ν         | Х        | Low-power / Low Latency SRAM (TCM) [384 kB]         |  |
| 0x10060000 – 0x1015FFFF | System SRAM     | Х          | I only    | D only   | Shared System SRAM (SSRAM0) [1 MB]                  |  |
| 0x10160000 – 0x13FFFFFF | Reserved        | Х          | Х         | Х        | Reserved                                            |  |
| 0x14000000 – 0x17FFFFFF | MSPI0           | N          | I only    | D only   | Memory Mapped MSPI 0 Aperture [64 MB]               |  |
| 0x18000000 – 0x1BFFFFFF | MSPI1           | N          | I only    | D only   | Memory Mapped MSPI 1 Aperture [64 MB]               |  |
| 0x1C000000 – 0x1FFFFFFF | MSPI2           | N          | I only    | D only   | Memory Mapped MSPI 2 Aperture [64 MB]               |  |
| 0x20000000 – 0x3FFFFFFF | Reserved        | Х          | Х         | Х        | Reserved                                            |  |
| 0x40000000 – 0x4FFFFFF  | Peripheral      | N          | Ν         | N        | Peripheral devices                                  |  |
| 0x50000000 – 0xDFFFFFFF | Reserved        | Х          | Х         | Х        | Reserved                                            |  |
| 0xE0000000 – 0xE00FFFFF | PPB             | N          | Ν         | N        | NVIC, System timers, System Control Block           |  |
| 0xE0100000 – 0xFFFFFFFF | Reserved        | Х          | Х         | Х        | No device at this address range                     |  |

# Table 9: Detailed Apollo4 Lite / Apollo4 Blue Lite Memory Map

## NOTE

The Apollo4 Lite and Blue Lite SoCs have less system memory than that of the Apollo4 Plus. In particular, the Apollo 4 Plus's second System SRAM block (0x101C0000 – 0x102BFFFF) and the extended memory space (0x10160000 – 0x101BFFFF) are not offered on the Lite SoCs. If migrating from Plus to Lite, the (re)location of variables and other resources mapped in system memory need to be remapped according to the Lite's reduced memory space.

## NOTE

MSPI1 is not available on the Apollo4 Blue SoC or on the KBR package of the Apollo4 Blue Plus SoC.

Peripheral devices within the memory map are allocated on 4 kB boundaries, allowing each device up to 1024 32-bit control and status registers. Peripherals will return undefined read data when an attempt to access a register which does not exist occurs. Peripherals, whether accessed via the APB or the AHB, will always accept any write data sent to their registers without attempting to return an ERROR response. Specifically, a write to a read-only register would just become a don't-care write.

Table 10: SoC Peripheral Device Memory Map Address Device 0x4000000 - 0x400003FF Reset/BoD Control 0x40000400 - 0x40003FFF Reserved 0x40004000 - 0x400041FF **Clock Generator** 0x40004200 - 0x400047FF Reserved 0x40004800 - 0x40004BFF RTC 0x40004C00 - 0x40007FFF Reserved 0x40008000 - 0x400083FF Timers 0x40008400 - 0x4000BFFF Reserved 0x4000C000 - 0x4000C3FF Voltage Comparator 0x4000C400 - 0x4000FFFF Reserved 0x40010000 - 0x400107FF **GPIO** Control 0x40010800 - 0x40010FFF Fast GPIO Control 0x40011000 - 0x400113FF Reserved 0x40011400 - 0x40013FFF Reserved 0x40014000 - 0x400143FF NVM Cache Control 0x40014400 - 0x4001BFFF Reserved 0x4001C000 - 0x4001C3FF UART0 0x4001C400 - 0x4001CFFF Reserved UART1 0x4001D000 - 0x4001D3FF 0x4001D400 - 0x4001DFFF Reserved 0x4001E000 - 0x4001E3FF UART2 0x4001E400 - 0x4001EFFF Reserved 0x4001F000 - 0x4001F3FF UART3 0x4001F400 - 0x4001FFFF Reserved 0x40020000 - 0x400203FF **Miscellaneous Control** 0x40020400 - 0x40020FFF Reserved Power Control 0x40021000 - 0x400213FF 0x40021400 - 0x40023FFF Reserved 0x40024000 - 0x400243FF Watchdog Timer 0x40024400 - 0x4002FFFF Reserved 0x40030000 - 0x400303FF Security 0x40030400 - 0x40033FFF Reserved 0x40034000 - 0x400343FF I2C / SPI Slave 0x40034400 - 0x40037FFF Reserved GP-ADC 0x40038000 - 0x400383FF 0x40038400 - 0x4003FFFF Reserved

Table 10 shows the address mapping for the peripheral devices of the Base Platform.

|                         | · · · · · · · · · · · · · · · · · · · |
|-------------------------|---------------------------------------|
| Address                 | Device                                |
| 0x40040000 - 0x400403FF | Miscellaneous Control - NonCV         |
| 0x40040400 - 0x4004FFFF | Reserved                              |
| 0x40050000 - 0x40050FFF | I2C / SPI Master0                     |
| 0x40051000 - 0x40051FFF | I2C / SPI Master1                     |
| 0x40052000 - 0x40052FFF | I2C / SPI Master2                     |
| 0x40053000 - 0x40053FFF | I2C / SPI Master3                     |
| 0x40054000 - 0x40054FFF | I2C / SPI Master4                     |
| 0x40055000 - 0x40055FFF | I2C / SPI Master5                     |
| 0x40056000 - 0x40056FFF | I2C / SPI Master6                     |
| 0x40057000 - 0x40057FFF | I2C / SPI Master7                     |
| 0x40058000 - 0x4005FFFF | Reserved                              |
| 0x40060000 - 0x400603FF | MSPI Master0                          |
| 0x40060400 - 0x40060FFF | Reserved                              |
| 0x40061000 - 0x400613FF | MSPI Master1                          |
| 0x40061400 - 0x40061FFF | Reserved                              |
| 0x40062000 - 0x400623FF | MSPI Master2                          |
| 0x40062400 - 0x4006FFFF | Reserved                              |
| 0x40070000 - 0x4007FFFF | SDIO                                  |
| 0x40080000 - 0x40087FFF | Debug Subsystem                       |
| 0x40088000 - 0x400AFFFF | Reserved                              |
| 0x40090000 - 0x4009FFFF | Graphics Subsystem                    |
| 0x400A0000 - 0x400A7FFF | Display Controller                    |
| 0x400A8000 - 0x400AFFFF | Display PHY                           |
| 0x400B0000 - 0x400B3FFF | USB                                   |
| 0x400B4000 - 0x400B7FFF | USB PHY                               |
| 0x400B8000 - 0x400BFFFF | Reserved                              |
| 0x400C0000 – 0x400C3FFF | Crypto                                |
| 0x400C4000 - 0x40200FFF | Reserved                              |
| 0x40201000 – 0x402013FF | PDM0                                  |
| 0x40201400 - 0x40201FFF | Reserved                              |
| 0x40202000 – 0x402023FF | PDM1                                  |
| 0x40202400 – 0x40202FFF | Reserved                              |
| 0x40203000 – 0x402033FF | PDM2                                  |
| 0x40203400 – 0x40203FFF | Reserved                              |
| 0x40204000 – 0x402043FF | PDM3                                  |
| 0x40204400 – 0x40207FFF | Reserved                              |
| 0x40208000 – 0x402083FF | I2S0                                  |

Table 10: SoC Peripheral Device Memory Map

| Address                  | Device    |
|--------------------------|-----------|
| 0x40208400 – 0x40208FFF  | Reserved  |
| 0x40209000 - 0x402093FF  | I2S1      |
| 0x40209400 – 0x4020FFFF  | Reserved  |
| 0x40210000 - 0x4024103FF | Audio ADC |
| 0x40210400 – 0x41FFFFFF  | Reserved  |
| 0x42000000 – 0x4200FFFF  | NVM OTP   |
| 0x42010000 – 0x4FFFFFFF  | Reserved  |

#### Table 10: SoC Peripheral Device Memory Map

## NOTE

Not all members of the Apollo4 family include all modules as shown in Table 10. Address ranges for modules not included on an SoC should be regarded as reserved.

## 3.2.1 Memory Access Restriction

There is a restriction when a DMA write crosses the boundary between SRAM and Extended RAM. The previously shown Detailed SoC-specific Memory Map in Table 9 lists the memory ranges for Shared System SRAM (SSRAM0) from 0x10060000 to 0x1015FFFF and Extended SRAM starting at 0x10160000. When doing a single, continuous DMA write across the SSRAM0 / Extended SRAM boundary at 0x10160000, the DMA start address must be no closer than 32 bytes before the boundary (0x1015FFE0).

#### WARNING

Failure to adhere to the above boundary-crossing access restriction may result in an SoC hang.

# 3.3 **Power Management Programming**

The transition between power modes is largely directed by software and the workload/task requirements.

# 3.3.1 CPU

Transitioning to different CPU power modes is initiated through a couple methods depending on the software intent and performance/power/latency requirements. The different mode transitions are described in Table 11.

| Mode        | Deep Sleep | Sleep   | Active - LP | Active - HP |
|-------------|------------|---------|-------------|-------------|
| Sleep       | Х          | х       | WAKE+LP     | WAKE+HP     |
| Active - LP | WFI (DS)   | WFI (S) | Х           | HP          |

| Mode        | Deep Sleep            | Sleep   | Active - LP | Active - HP |
|-------------|-----------------------|---------|-------------|-------------|
| Active - HP | WFI (DS) <sup>a</sup> | WFI (S) | LP          | Х           |

#### Table 11: CPU Power Mode Transitions

a. Transitioning from Active-HP mode to Deep Sleep mode requires an intermediate transition to Active-LP mode.

WFI (DS): Issue WFI instruction Deep Sleep

WFI (S): Issue WFI instruction Sleep

LP: Write to PWRCTRL\_MCUPERFREQ\_MCUPERFREQ field indicating "LP". Check PWRCTRL\_MCUPER-FREQ\_MCUPERFSTATUS to check if mode switch completed and if performance mode is available. Continue execution.

HP: Write to PWRCTRL\_MCUPERFREQ\_MCUPERFREQ field indicating "HP". Check PWRCTRL\_MCUPER-FREQ\_MCUPERFSTATUS to check if mode switch completed and if performance mode is available. Continue execution.

#### NOTE

In the Apollo4/Apollo4 Blue Errata Lists, ERR076 addresses an issue that the Power Controller in the Apollo4 and Apollo4 Blue does not allow adequate time for voltages to settle before re-enabling clocks when switching between high performance and low power modes. The AmbiqSuite SDK uses a TIMER interrupt (TIMER13) configured as the highest priority (0) interrupt to prevent unintentional break out by other interrupts. In order for this to work reliably, it is required that all other interrupts in the system be set at a lower priority, reserving the highest priority interrupt exclusively for this workaround.

# 3.3.2 I/O

The I/O power modes are determined by how the I/O controller is configured. For example:

- OFF: Controller power domain is OFF, "device enable" bit not set in power controller. This transition can also be made by the I/O PMU if auto-poweroff is supported.
- Sleep: Controller is "device enabled" but the controller interface has not been enabled/activated (AKA IDLE).
- Active: Controller is "active", meaning it is enabled and configured for active transactions.

## 3.3.3 Power Control

The Power Control block provides control and status for the power state of all the power domains and voltage regulators in the SoC. Software can control these blocks via power control registers within this block.

The power control block controls the power sequence to power up or power down a particular peripheral device and memory power domain. Status of each of these can be monitored in the respective power control status register. The power controller also supports event notification to indicate peripheral power transition completion. Event notification is the preferred power-optimized method in lieu of status polling.

The power controller is also the primary control block for the SIMO Buck converter as well as the LDO regulator when the SIMO Buck is disabled. Similarly, event notification is supported to provide the appropriate handshake to software as needed as well as status register indicators.

This block handles all power sequencing during initial power on and all power mode transitions.

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

# 3.4 Instrumentation Trace Macrocell (ITM)

For system trace the processor integrates an Instrumentation Trace Macrocell (ITM) alongside data watchpoints and a profiling unit. To enable simple and cost-effective profiling of the system events these generate, a Serial Wire Viewer (SWV) can export a stream of software-generated messages, data trace, and profiling information through a single pin.

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

# 3.5 SoC Control

High-level functionality configuration and status of the SoC are offered within an extensive set of SoC Control registers. Among the various device settings and status are the following functions:

- Chip identification and revision information
- Debug control
- Brown-out detection and power-up configuration
- ADC and Audio ADC configuration
- Programmable Gain Amplifier (PGA) configuration
- Crystal (XTAL) clock trimming
- Boot loader and secure boot loader enablement
- MRAM read/write protection
- SRAM write protection from DMA transfers

Please refer to the MCUCTRL register set which is included in the AmbiqSuite SDK.

# 3.6 Memory Controller Programming

The Cortex-M4 supports the ARMv7 ISA. Details on the instruction set and overall programming model can be referenced in the "ARM Cortex-M4 Devices Generic User Guide" and the "ARM Cortex-M4 Processor Technical Reference Manual".

The following subsections describe the set of CPU registers to interface with and control memory. The CPU registers are CM-4 Complex registers which set up and control the use of cache, TCM and DAXI (CM-4's local AHB-D interface to the primary AXI crossbar for accesses outside of the NVM and local TCM interfaces).

Please refer to the CPU register set which is included in the AmbiqSuite SDK.

# 3.6.1 DAXI Considerations

## 3.6.1.1 DAXI Controls

The DAXI operates fairly autonomously with little programming intervention after the initial setup. The only control provided is the ability for the CPU to flush and invalidate the DAXI's line buffers via operations on the system bus to the APB interface. Mode bits allow for fine-tuning the DAXI settings.

A couple registers are available for configuring for, and controlling, the DAXI operation.

**CPU\_DAXICTRL Register:** This register provides controls to flush and invalidate buffers in the DAXI interface.

DAXIFLUSHWRITE:

• Writing a 1 to this write-only single-bit field forces a flush of all buffers in the WRITE or MODIFIED state, which is useful for making modified data visible to the rest of the system.

## DAXIINVALIDATE:

• Writing a 1 to this write-only single-bit field invalidates all buffers in the SHARED state, which is useful for making visible data modified by other parts of the system.

#### NOTE

Setting these bits in the register just initiates the respective operation. The operation is not atomic and blocking, and could take a variable amount of time depending on the content of the DAXI buffers at that time. The SDK HAL implementation incorporates additional steps to ensure the flush/invalidate has finished.

CPU\_DAXICFG Register: This register allows configuration of the aging counter, buffers and flush level.

## AGINGCOUNTER:

 An 8-bit R/W field which specifies the time in CPU clock cycles that DAXI buffers may remain unused before being flushed. The setting indicates the number of clock cycles between increments of the aging counter. For Apollo4, buffers will generally be flushed in 1-2 AGINGCOUNTER time steps. The Apollo4 Plus allows for up to 65,535 steps per age with a default value of 2048.

#### BUFFERENABLE:

 A 2-bit (Apollo4) or 4-bit (Apollo4 Plus) R/W field that enables a number of DAXI buffers from 1 to 4 (Apollo4) or from 1 to 32 (Apollo4 Plus). The default is single-buffer mode (Apollo4) or 4 buffers (Apollo4 Plus).

## FLUSHLEVEL:

- For Apollo4, a single-bit R/W field that sets the number of buffers to try to keep free. When set to 0 (default), the DAXI will attempt to maintain two free buffers at all times. When set to 1, the DAXI will attempt to keep only a single buffer free.
- For Apollo4 Plus, a single-bit R/W field that sets the number of buffers to try to keep free. When set to 0 (default), flushing out dirty buffers occurs if 3 or more buffers are enabled and less than two are free, or if 2 buffers are enabled and none are free. When set to 1, flushing out dirty buffers occurs if 3 or more buffers are enabled and less than three are free, or if 2 buffers are enabled and less than three are free, or if 2 buffers are enabled and less then two are free.

#### NOTE

Due to an existing limitation on Apollo4, the DAXI must be used with either 2 buffers or 1 buffer enabled and, in either case, the flush level must be set to 0.

# 3.6.1.2 DAXI Buffers and Flush/Invalidate Operations

A few considerations need to be followed in software when working with AXI memory (SSRAM, Extended RAM and XIP RAM) in order to maintain RAM coherency or when going into a sleep mode. RAM

coherency must be maintained when other masters access or change data that is in the DAXI buffer. Other masters include DMA to/from any peripheral such as IOM (SPI/I2C), MSPI, ADC, I2S, PDM, SDIO, CRC Engine, Crypto, GPU and Display Controller.

The DAXI subsystem requires that a few guidelines must be followed in order to ensure software coherency and proper functionality of the DAXI buffers. In user applications the following actions must be taken.

- 1. For DMA Reads from a shared access memory written by the CPU:
  - Flush the DAXI buffer before starting the DMA transfer:

am\_hal\_daxi\_control(AM\_HAL\_DAXI\_CONTROL\_FLUSH, 0);

- Practical integration point in software: When the CPU writes/modifies data in a buffer which will be read by another master, the flush can be performed at the point where the CPU completes its writing before signaling the other master read to start.
- 2. For DMA Writes to a shared access memory which needs to be consumed by the CPU:
  - Invalidate the DAXI buffer after the DMA completes, before the CPU reads it:

am\_hal\_daxi\_control(AM\_HAL\_DAXI\_CONTROL\_INVALIDATE, 0);

- Practical integration point in software: When the DMA write transfer completes, a DAXI buffer invalidate must be performed before signaling to the CPU that the data is available.
- 3. Alignment restrictions:
  - All DMA buffers which are to be accessed by non-CPU masters need to be padded to 128 bits.
  - The buffers should be aligned such that the CPU and an external agent are never in control of the same 128-bit segment of the buffer at any point in time.
- 4. Entering normal sleep or deepsleep:
- A DAXI flush must be performed prior to entering sleep.

When using the AmbiqSuite HAL implementation to manage DAXI, in addition to the above requirements, user applications need to make a call as part of initialization.

For Apollo4 and Apollo4 Blue, the call is:

am\_hal\_daxi\_control(AM\_HAL\_DAXI\_CONTROL\_AXIMEM, pBuf);

For Apollo4 Plus and Apollo4 Blue Plus, the call is:

am\_hal\_daxi\_control(AM\_HAL\_DAXI\_CONTROL\_ENABLE, pBuf);

The pBuf parameter in both cases is a 16-byte aligned memory of at least 64 bytes in SSRAM/Extended SRAM.

It is expected that if any of the AXI mapped memory is in use, this memory is enabled. For example, if pBuf is in SSRAM memory, there is never a case where MSPI XIP or Extended SRAM is being used but SSRAM is turned off. This function provides scratch pad space in the user application for proper execution of DAXI flushing, and keeps the HAL independent of linker configurations and customer memory layout.

Transitions to sleep, as well as all of the internal transition points of buffer hand-off, are taken care of inside of the HAL. For any instance not handled internal to the HAL, such as ADC, AUDADC, PDM, Crypto, and Display Controller (if Display Controller is using buffers composed by the CPU directly, bypassing the GPU), the application must follow the above DAXI management guidelines.

For ADC, AUDADC and PDM, before reading the received DMA data the application must directly handle the interrupts and call:

am\_hal\_daxi\_control(AM\_HAL\_DAXI\_CONTROL\_INVALIDATE, 0);

For Crypto, when operating on non-TCM buffers, the application needs to call:

am\_hal\_daxi\_control(AM\_HAL\_DAXI\_CONTROL\_FLUSH, 0);

or

am\_hal\_daxi\_control(AM\_HAL\_DAXI\_CONTROL\_INVALIDATE, 0);

For Display Controller, when operating on non-TCM buffers and in scenarios where using buffers that are composed by the CPU directly (and not through the GPU), applications must call flush:

am\_hal\_daxi\_control(AM\_HAL\_DAXI\_CONTROL\_FLUSH, 0);

As described in section "4.3.1 DAXI" of the Apollo4 Datasheet version 0.9.0, these flush and invalidate operations help ensure software coherency in the end application. In the case of the flush, it ensures that any data meant for non-CPU agents has made it to the actual destination before signaling the agent to retrieve it. Similarly, the invalidate operation ensures that no stale data is read when fetching a buffer from an external agent, by invalidating any DAXI buffer contents and re-fetching the data from memory.

The following considerations must be followed in software when using XIP memory:

- Ensure all XIP transactions have completed before disabling XIP, and before disabling or powering off the corresponding MSPI.
  - The HAL implementation calls DAXI flush before any of these operations. However, this just ensures that the transaction has been flushed out of DAXI.
  - To ensure any outstanding Writes to XIPMM have completed, it must be ensured that they have made it to the external device.
  - Additional delays need to be provisioned between last usage of XIP Write, and disabling of XIP/MSPI.

# 4. Clock Generation (CLKGEN)

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

# 4.1 HFRC Auto-adjustment

In some applications it is important that the High Frequency RC Oscillator (HFRC) frequency be more accurate than the  $\pm 2\%$  variation typically seen, particularly in cases where the temperature may vary widely. A good example of this is in cases where the MCU communicates with another device via the UART. The frequency matching with the other device in the connection is an important factor in the reliability of the connection. In order to support a highly accurate HFRC, a function called Auto-adjustment is provided.

It should be noted that Auto-adjustment is dependent on an accurate clock source such as the crystal. The min/max variation of the HFRC frequency with and without adjustment is different. During auto-adjustment, the number of counted 48 MHz cycles which occur in one 32.768 kHz crystal oscillator (XT) cycle is compared to a target value. If the count is different from the target, an HFRC tuning value is modified to change the HFRC frequency. The target count is held in the REG\_CLKGEN\_HFADJ\_HFXTADJ field, and is initially set to a default value of 0x5B8 (48 MHz / 32.768 kHz).

Auto-adjustment works by periodically enabling the HFRC and the XT, counting the number of cycles for a divided-down HFRC clock which occur in a single XT cycle, subtracting that value from HFXTADJ and adding the resulting difference to the actual HFRC tuning value which is stored in an internal register field. Auto-adjustment is enabled in the REG\_CLKGEN\_HFADJ Register by loading the repeat frequency value into the HFADJCK field and then setting the HFADJEN bit.

Auto-adjustment cycles will occur continuously if both the XT and the HFRC are currently requested as reference clocks by the modules. If either oscillator is disabled, Auto-adjustment cycles will then occur at intervals determined by the REG\_CLKGEN\_HFADJ\_HFADJCK field, as shown in the register description. Shorter repeat intervals will result in more accurate HFRC frequencies, especially if the temperature is changing rapidly, but will result in higher power consumption. When an Auto-adjustment cycle occurs, if the XT was disabled it is enabled and then a delay occurs to allow the XT to stabilize. This delay is defined by the REG\_CLKGEN\_HFADJ\_HFWARMUP field as defined in the register documentation. Once the HFRC is stable, the HFRC is enabled and several Auto-adjustments occur, each of which results in a refinement of the tuning value. Once those adjustments are complete, the HFRC and XT are powered down unless they are in use by other functions.

# 4.2 Generating 100 Hz

The Real Time Clock (RTC) module requires a 100 Hz clock which is provided by the Clock Generator. This clock may come either from the LFRC or the XT oscillators, as determined by the REG\_CLKGEN\_OCTRL\_OSEL bit. Since 100 Hz is not a simple power of two division of either of these oscillators, special functions are used to create it.

For both the XT and LFRC, the 512 Hz signal is divided by 5.12 (128/25) to produce a maximum jitter of less than 4 ms.

# 4.3 Clock Enablement for Modules with a Special Mux

NOTE This section and sub-sections apply specifically to Apollo4 and Apollo4 Blue.

Module clock selection should be performed with system-wide coordination of enabling only clock sources that are needed while keeping unneeded clock sources powered off. Note that not all modules are pinned out on all Apollo4 packages.

The I2Sn, IOMn, ADC and AUDADC modules use a special clock source selection mux which requires a specific selection procedure to avoid causing clock glitches that may affect clocking/operation of any module using the same clock. These modules may include up to three possible clock sources - HFRC, HFRC2 and high-speed crystal oscillator (XTHS). However, only the usage (enabling/disabling) of HFRC2 and XTHS clock sources is required to be monitored and controlled, as HFRC is enabled always and does not require or support software enabling/disabling.

Throughout this section, reference to the FSEL clock source means the source clock selection when FSEL = 0. The FSEL = 0 clock source is the clock that samples FSEL when changing the clock for a module. Note that for the AUDADC module, the FSEL = 0 clock source selection, as indicated in the CFG\_CLKSEL field, is clock off. In this case, the HFRC clock is used as the FSEL sampling clock.

Clock source selection for the IOMn (and ADC modules for Apollo4) is included in this discussion because each IOM has the special mux and requires a 40 µs delay after switching the clock source frequency selection FSEL. However, the only clock source option for these modules is HFRC which, as mentioned above, does not require any special software action to enable or disable this clock source.

The PDMn modules do not have this special mux, but clock selection for these modules is discussed here since they can be clocked with the HFRC2 or XTHS clock source and therefore should be considered when changing clock sources for any of these modules.

For the affected modules, when switching from the currently selected clock source to a new clock source, up to 3 clocks must be running during the clock switch – the currently selected clock source, the new clock source and the FSEL/CLKSEL sample clock source, which may or may not be the default selection, and may or may not be either the currently selected clock source or the new clock source. The FSEL sample clock source for a module is always the clock source of FSEL = 0 in the clock select register, and that clock source must be enabled to sample the changing FSEL clock selection. If the FSEL = 0 clock source is not enabled, then switching the clock source cannot occur.

The final step in the procedure for selecting a (new) clock for a module is to power down the replaced clock source and the FSEL sample clock source if:

- 1. Either is HFRC2 or XTHS
- 2. Either is *not* the new clock
- 3. Neither is used elsewhere

To make the determination as to whether any clock source is not used elsewhere and can be powered off, it is necessary to either maintain a Clock Usage Table to keep track of clocks currently being used by all modules, or to individually check the clock source setting of each module which can use either HFRC2 or XTHS as its clock source to see if in fact either is being used as a clock source.

When switching between clocks, it is important to allow a settling delay after enabling and after switching to the clock source, and before disabling the replaced clock source to ensure the new clock has settled and is clocking properly.

Settings required to use the XTHS clock include the adjustment of several trim settings, whether using the internal high-speed crystal oscillator or an external high-speed oscillator. This sequence of settings, as

well as a number of field settings in the MCUCTRL\_XTALHSCTRL register, follow the procedure found in the AmbiqSuite SDK's am\_hal\_mcuctrl\_control () function of the am\_hal\_mcuctrl.c file. In particular, the EXTCLK32M\_KICK\_START case of that function is used to enable the 32 MHz circuit/clock, and the EXTCLK32M\_DISABLE case is used to disable this clock.

#### NOTE

The XTHS clock requires that the VDDAUDA supply is provided. Consult the Electrical Characteristics in the Apollo4 Datasheet for supply specifications.

## 4.3.1 Clock Switching Procedure

Selecting or changing the clock source for a module requires enabling up to three clock sources - the currently selected clock source, the FSEL sample clock source and the new clock source.

# 1. Enable each of the three possible clock sources as follows.

#### HFRC clock:

• No software-triggered actions required.

#### HFRC2 clock:

• Set CLKGEN\_MISC\_FRCHFRC2 = 1 if not already set.

#### XTHS clock:

- Set MCUCTRL\_XTALHSTRIMS register to 0x0FFF8D2C, resulting in the individual field settings:
  - XTALHSCAP2TRIM = 44
  - XTALHSCAPTRIM = 4
  - XTALHSDRIVETRIM = 3
  - XTALHSDRIVERSTRENGTH = 0
  - XTALHSIBIASCOMP2TRIM = 3
  - XTALHSIBIASCOMPTRIM = 15
  - XTALHSIBIASTRIM = 127
  - XTALHSRSTRIM = 0
  - XTALHSSPARE = 0
- Set MCUCTRL\_XTALHSCTRL register to 0x0A2, resulting in the individual field settings:
  - XTALHSCOMPPDNB = 1
  - XTALHSPDNPNIMPROVE = 1
  - XTALHSPADOUTEN = 1
- Set MCUCTRL\_XTALHSCTRL\_XTALHSPDNB = 1
- Set MCUCTRL\_XTALHSCTRL\_XTALHSINJECTIONENABLE = 1
- Set MCUCTRL\_XTALHSCTRL\_XTALHSIBSTENABLE = 1
- Delay 5 µs.
- Set MCUCTRL\_XTALHSCTRL\_XTALHSINJECTIONENABLE = 0
- Delay AM\_HAL\_MCUCTRL\_CRYSTAL\_IBST\_DURATION µs.
- Set MCUCTRL\_XTALHSTRIMS register to 0x17118D1C, resulting in the individual field settings:
  - XTALHSCAP2TRIM = 28
  - XTALHSCAPTRIM = 4
  - XTALHSDRIVETRIM = 3
  - XTALHSDRIVERSTRENGTH = 0
  - XTALHSIBIASCOMP2TRIM = 3
  - XTALHSIBIASCOMPTRIM = 8
- XTALHSIBIASTRIM = 56
- XTALHSRSTRIM = 1
- XTALHSSPARE = 0
- If using an external oscillator, set the MCUCTRL\_XTALHSCTRL register fields:
  - XTALHSIBSTENABLE = 0
  - XTALHSEXTERNALCLOCK = 1
  - XTALHSPDNB = 0
- Else if using internal crystal oscillator, set the MCUCTRL\_XTALHSCTRL register fields:
  - XTALHSPDNPNIMPROVE = 0
  - XTALHSIBSTENABLE = 1
- 2. Delay(s) for clock power-up when required. If the FSEL sample clock source or the new clock source (if different) requires power-up, delay the following times after enable/power-up.

#### HFRC2 clock:

Delay 10 µs.

#### XTHS using internal crystal oscillator:

Delay at least 520 µs.

#### XTHS using an external oscillator:

- Delay the specified start-up time required for the external oscillator plus some additional margin (> 10 µs).
- 3. Set the module's clock select field (FSEL) to the desired new clock/divider selection.
- 4. Delay 40 µs to ensure new clock is running.
- 5. Disable unused clock sources if not used anywhere else by referencing the Clock Usage Table.

#### HFRC clock:

• No software-triggered actions required.

#### HFRC2 clock:

• Set CLKGEN\_MISC\_FRCHFRC2= 0.

#### XTHS clock:

- Set MCUCTRL\_XTALHSTRIMS register to 0x0311F12C, resulting in the individual field settings:
  - XTALHSCAP2TRIM = 44
  - XTALHSCAPTRIM = 4
  - XTALHSDRIVETRIM = 0
  - XTALHSDRIVERSTRENGTH = 7
  - XTALHSIBIASCOMP2TRIM = 3
  - XTALHSIBIASCOMPTRIM = 8
  - XTALHSIBIASTRIM = 24
  - XTALHSRSTRIM = 0
  - XTALHSSPARE = 0
- Set MCUCTRL\_XTALHSCTRL register to 0x0A2, resulting in the individual field settings:
  - XTALHSEXTERNALCLOCK = 0
  - XTALHSPADOUTEN = 1
  - XTALHSPDNPNIMPROVE = 1
  - XTALHSIBSTENABLE = 0
  - XTALHSCOMPPDNB = 1
  - XTALHSPDNB = 0

## 4.3.2 Clock Switching Process Flowchart

Figure 1 shows a flowchart of the clock source selection process, including clock source enabling, necessary delays and unused clock source disabling.



\*References to FSEL clock source means the source clock selection when FSEL = 0. Note that for the AUDADC module, the FSEL = 0 source clock (actually called CFG\_CLKSEL) is clock off. In this case the module uses the HFRC clock as the FSEL sampling clock, and for HFRC there is no FSEL clock enabling required.

## Figure 1. Clock Source Selection Flowchart

# 4.3.3 Clock Usage Table

The following table may be used as a clock selection reference and/or a software lookup table to keep track of HFRC2 and XTHS source clock usage by module so that a used clock type is not inadvertently disabled and any unused clock type can be disabled to save power. Where there are multiple instances of a module, each module has its own clock source setting. They are represented in the table on a single line for brevity. In addition to these modules which have the option to use one or more of the clock sources, the table also includes pads which support CLKOUT function selection for which these clock sources can be selected.

| Madula (Dad                                                                                                    | Selectable Clock Source <sup>a</sup>                                   |                                                                                                    |      |  |
|----------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------|------|--|
| Module/Pad                                                                                                     | HFRC2                                                                  | HFRC                                                                                               | XTHS |  |
| I2S0 - I2S1                                                                                                    | X <sup>s</sup>                                                         | X                                                                                                  | X    |  |
| AUDADC                                                                                                         | x                                                                      | xs                                                                                                 | x    |  |
| ADC                                                                                                            |                                                                        | x <sup>s</sup>                                                                                     |      |  |
| IOM0 – IOM7                                                                                                    |                                                                        | x <sup>s</sup>                                                                                     |      |  |
| PDM0 – PDM3                                                                                                    | x                                                                      | x                                                                                                  | x    |  |
| CLKOUT pads <sup>b</sup><br>(GPIO33, GPIO63,<br>GPIO66, GPIO67,<br>GPIO71, GPIO72,<br>GPIO80 and/or<br>GPIO81) | X<br>When CLKGEN_<br>CLKOUT_CKSEL =<br>0x3A - 0x3C (HFRC2<br>divisors) | <b>X</b><br>When CLKGEN_<br>CLKOUT_CKSEL =<br>0x19 – 0x20, 0x2F –<br>0x33, 0x39 (HFRC<br>divisors) |      |  |
| CLKOUT_32M pad:<br>GPIO46 <sup>c</sup>                                                                         |                                                                        |                                                                                                    | x    |  |

| Table | 12. | Clock | Usage | Table |
|-------|-----|-------|-------|-------|
| Table | 14. | Olock | USuge | Table |

a. X<sup>S</sup> indicates the clock source which samples FSEL when changing the clock for a module. Referred to as "FSEL sample clock source"; applies only to modules with the special muxes. Modules not using the special mux (but may select these clock sources) are also listed here to note where each clock source may be in use in order to prevent inadvertent power-down of a clock source in use.

b. Valid when CLKGEN\_CLKOUT\_CKEN = 1 and GPIO\_PINCFGn\_FNCSELn = CLKOUT for the pad.

c. Valid when GPIO\_PINCFG46\_FNCSEL46 = CLKOUT\_32M, the XTHS enable settings of the Clock Switching Procedure are set.

## 4.3.4 Clock Selection Examples

Refer to the latest register set for register field selections.

- 1. Change I2S0 clock selection from HFRC\_375kHz to XTHS\_EXTREF\_CLK.
  - A. HFRC is the currently selected clock source and is presently enabled (I2S0\_CLKCFG\_FSEL = HFRC\_375kHz (0x0e).
  - B. If HFRC2, the FSEL sample clock source, requires power up, then enable HFRC2 per the Clock Switching Procedure and wait 10  $\mu$ s.
  - C. If XTHS, the new clock source, requires power up, then enable XTHS per the **Clock Switching Procedure** above and wait the specified start-up time for the external oscillator.
  - D. Set I2S0\_CLKCFG\_FSEL = XTHS\_EXTREF\_CLK (0x0f) and wait 40 µs.
  - E. If the HFRC2 clock source is no longer required per the **Clock Usage Table**, disable the HFRC2 clock source per the **Clock Switching Procedure**.
- 2. Change AUDADC clock selection from HFRC2\_48MHz to XTALHS\_24MHz.
  - HFRC2 is the currently selected clock source and is presently enabled (AUDADC\_CFG\_CLKSEL = HFRC2\_48MHz (0x3).
  - 2. HFRC is the FSEL sample clock source by default (even though AUDADC\_CFG\_CLKSEL = OFF for CLKSEL = 0x0). HFRC does not require software power up.
  - 3. If XTHS, the new clock source, requires power up, then enable XTHS per the **Clock Switching Procedure** and wait 800 µs for the internal crystal oscillator to settle.
  - 4. Set AUDADC\_CFG\_CLKSEL = XTALHS\_24MHz (0x3) and wait 40 μs.
  - 5. If the HFRC2 clock source is no longer required per the **Clock Usage Table**, disable the HFRC2 clock source per the **Clock Switching Procedure**.

# 4.4 External 32.768 kHz Clock Input on X0

An external clock can be used in lieu of using a 32.768 kHz crystal as the low frequency clock reference. An external signal of approximately 32 kHz with an amplitude of VDDH/2 to 1.9 V should be connected between XO and ground. The duty cycle of the input signal, which can be a minimum of 10% and a maximum of 90%, determines the duty cycle of the resulting internal clock. Therefore, if a 50% duty cycle for the internal clock is intended, then the external clock's duty cycle should be 50%.

When using an external clock, fields in the MCUCTRL\_XTALCTRL register need to be set as follows:

- XTALPDNB= 'b0
- XTALCOMPPDNB = 'b0
- XTALCOMPBYPASS= 'b1

# 5. Real Time Clock (RTC)

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

# 5.1 Calendar Counters

The real time is held in a set of eight Calendar Counters, which hold the current 1/100<sup>th</sup> of a second (RTC CTRLOW CTR100), the current second (RTC CTRLOW CTRSEC), the minute (RTC\_CTRLOW\_CTRMIN), the hour (RTC\_CTRLOW\_CTRHR), the current day of the month (RTC\_CTRUP\_CTRDATE), the current day of the week (RTC\_CTRUP\_CTRWKDY), the current month (RTC\_CTRUP\_CTRMO), the current year (RTC\_CTRUP\_CTRYR) and the current century (RTC CTRUP CB), all in BCD format. In order to insure that the RTC starts precisely, the timer chain which generates the 100 Hz clock is reset to 0 whenever any of the Calendar Counter Registers is written. Since unintentional modification of the Calendar Counters is a serious problem, the RTC RTCCTL WRTC bit must be set in order to write any of the counters, and should be reset by software after any load of the Calendar Counters.

Software may stop the clock to the Calendar Counters by setting the RTC\_RTCCTL\_RSTOP bit. This may be used in modes like Stopwatch to precisely start and stop the Calendar Counters.

# 5.2 Calendar Counter Reads

The RTC includes special logic to help insure that the Calendar Counters may be read reliably, i.e. that no rollover has occurred. Because two 32-bit reads are required to read the complete set of counters, it is possible that a delay occurs between the two reads which causes a rollover to occur. An interrupt is the most likely reason this could occur. Software should read the RTC counters multiple times to insure that the value is correct in case a rollover occurs.

## 5.3 Alarms

There are seven Alarm Registers which may be used to generate an Alarm interrupt at a specific time. These registers correspond to the 100<sup>th</sup> of a second (RTC\_ALMLOW\_ALM100), second (RTC\_ALMLOW\_ALMSEC), minute (RTC\_ALMLOW\_ALMMIN), hour (RTC\_ALMLOW\_ALMHR), day of the month (RTC\_ALMUP\_ALMDATE), day of the week (RTC\_ALMUP\_ALMWKDY) and month (RTC\_ALMUP\_ALMMO) Calendar Counters. The comparison is controlled by the RTC\_RTCCTL\_RPT field and the RTC\_ALMLOW\_ALM100 Register as shown in 24-hour mode. In the ALM100 Register, n indicates any digit 0-9. When all selected Counters match their corresponding Alarm Register, the ALM interrupt flag is set (see the Clock Generator section for the ALM interrupt control).

| RPT Value | Interval     | Comparison                                           |
|-----------|--------------|------------------------------------------------------|
| 000       | Disabled     | None                                                 |
| 001       | Every year   | 100 <sup>th</sup> , second, minute, hour, day, month |
| 010       | Every month  | 100 <sup>th</sup> , second, minute, hour, day        |
| 011       | Every week   | 100 <sup>th</sup> , second, minute, hour, weekday    |
| 100       | Every day    | 100 <sup>th</sup> , second, minute, hour             |
| 101       | Every hour   | 100 <sup>th</sup> , second, minute                   |
| 110       | Every minute | 100 <sup>th</sup> , second                           |
| 111       | Every second | 100 <sup>th</sup>                                    |

| Table 13: Alarm RPT Function |
|------------------------------|
|------------------------------|

All alarm interrupts are asserted on the next 100 Hz clock cycle after the counters match the alarm register, except for 100ths of a second. To get an interrupt that occurs precisely at a certain time, the comparison value in the corresponding alarm register should be set 10 ms (one 100 Hz count) earlier than the desired interrupt time.

For the 100ths of a second interrupt, the first 100 Hz clock sets the comparison with the alarm register and the next clock asserts the interrupt. Therefore, the first 100ths interrupt will be asserted after 20 ms, not 10 ms. This occurs each and every time the 100ths of a second counter with interrupts is enabled if the RTC is stopped. If the RTC is already running when configured, then the first interrupt will occur between 10 and 20 ms after configuration.

# 5.4 Century Control and Leap Year Management

The RTC\_CTRUP\_CB bit indicates the current century. A value of 0 indicates the 20<sup>th</sup> century, and a value of 1 indicates the 19<sup>th</sup> or 21<sup>st</sup> century. The CB value will toggle when the Years counter rolls over from 99 to 0 if the RTC\_CTRUP\_CEB bit is set, and will remain constant if CEB is clear. The century value is used to control the Leap Year functions, which create the correct insertion of February 29 in years which are divisible by 4 and not divisible by 100, and also the year 2000.

## 5.5 Weekday Function

The Weekday Counter is simply a 3-bit counter which counts up to 6 and then resets to 0. It is the responsibility of software to assign particular days of the week to each counter value.

# 6. Counter/Timer Module (TIMER)

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

# 6.1 Counter/Timer Functions

The Timer Module contains sixteen 32-bit counters that can be used for a number of generic counting or waveform generation functions as described in Table 14 and the following sub-sections.

| Mode                                                                                                                                                                         | Outputs                                                                                                                       | Description/Uses                                                                                                                                                                                                                                                                              |
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                                                                                                                                                                              |                                                                                                                               | Edge generation (0x1): Counts up from zero and stops when CMP0 s reached.                                                                                                                                                                                                                     |
|                                                                                                                                                                              |                                                                                                                               | TMRnLMT: Has no effect in this mode.                                                                                                                                                                                                                                                          |
| EDGE                                                                                                                                                                         | OUT0 transitions when TIMERn = CMP0.<br>OUT1 transitions when TIMERn = CMP1 (if<br>CMP1 < CMP0).                              | TIMERn counts up to CMP0 and stops. A single edge is<br>generated on OUT0 when TIMERn reaches CMP0.<br>If CMP1 < CMP0, then a single edge also is generated<br>on OUT1 when TIMERn reaches CMP1.<br>If CMP0 < CMP1, then no edge is generated on OUT1<br>(because TIMERn never reaches CMP1). |
|                                                                                                                                                                              |                                                                                                                               | Trigger: Timer does not start until trigger occurs (if enabled). Subsequent triggers ignored.                                                                                                                                                                                                 |
| UPCOUNT<br>OUT0 pulses for 1 clock cycle when TIMERn =<br>CMP0; counter resets to 0; repeats per<br>TMRnLMT setting.<br>OUT1 pulses for 1 clock cycle when TIMERn =<br>CMP1. | OUT0 pulses for 1 clock cycle when TIMERn =<br>CMP0; counter resets to 0; repeats per                                         | Repeatable Up-counter (0x2): Counts up from zero to<br>CMP0 and stops (TMRnLMT = 1), repeats N times<br>(TMRnLMT = 2 - 255), or repeats indefinitely (TMRnLMT<br>= 0).                                                                                                                        |
|                                                                                                                                                                              | Timer outputs will be a pulse of one source clock period<br>on the output when the TIMER reaches the associated<br>CMP value. |                                                                                                                                                                                                                                                                                               |
|                                                                                                                                                                              |                                                                                                                               | Trigger: Timer does not start until trigger occurs (if enabled). Subsequent triggers ignored.                                                                                                                                                                                                 |
|                                                                                                                                                                              | OUT0 transitions when TIMERn = CMP1, then                                                                                     | Repeatable PWM (0x4): Counts up from zero to CMP0<br>and stops (TMRnLMT=1), repeat N times (TMRnLMT = 2<br>- 255), or repeat indefinitely (TMRnLMT = 0).                                                                                                                                      |
| PWM                                                                                                                                                                          | transitions again when TIMERn = CMP0;<br>repeats per TMRnLMT setting.                                                         | CMP0 specifies the period and CMP1 specifies number<br>of TIMERn counts for the initial phase of the output<br>waveform.                                                                                                                                                                      |
|                                                                                                                                                                              |                                                                                                                               | Trigger: Timer does not start until trigger occurs (if enabled). Subsequent triggers ignored.                                                                                                                                                                                                 |

### Table 14: Timer Modes

| Mode                                           | Outputs                                                                                                                                                       | Description/Uses                                                                                                                                                                                                                                                                                      |
|------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                                                | OUT0 pulses for 1 clock cycle when TIMERn =                                                                                                                   | Repeatable Down-counter (0x6): Counts down from<br>CMP0 to zero and stops (TMRnLMT = 1), repeat N times<br>(TMRnLMT = 2 - 255), or repeat indefinitely (TMRnLMT<br>= 0).<br>Timer outputs will be a pulse of one source clock period                                                                  |
| DOWNCOUNT<br>(Apollo4 / Apollo4<br>Blue only)  | 0; repeats per TMRnLMT setting.<br>OUT1 pulses for 1 clock cycle when TIMERn =<br>CMP1 (if CMP1 < CMP0).                                                      | on the outputs will be a puse of one source clock period<br>on the output when the TIMER reaches the associated<br>CMP value. The down-count version can be useful for<br>irregular intervals since software can reprogram the<br>CMP0 value after each interrupt to change the next<br>reload value. |
|                                                |                                                                                                                                                               | Trigger: Timer does not start until trigger occurs (if enabled). Subsequent triggers ignored.                                                                                                                                                                                                         |
|                                                |                                                                                                                                                               | Single-run Pattern Generation (0xC): CMP0 and CMP1 bit-shifted to form output pattern on OUT0 and OUT1, or concatenation of CMP1:CMP0 bit shifted to form output pattern on both OUT0 and OUT1.                                                                                                       |
| SINGLEPATTERN                                  | For TMRnLMT < 32 and for TIMERn count = 0<br>to TMRnLMT:<br>OUT0 = CMP0[TIMERn].<br>OUT1 = CMP1[TIMERn].                                                      | The TIMER count is an up-counter that indexes into the bits of CMP0 (TIMER values from 1-31) and CMP1 (TIMER values 32-63). TMRnLMT value defines the length of the pattern minus 1 (0 = 1-bit pattern, $63 = 64$ -bit pattern).                                                                      |
|                                                | For TMRnLMT between 32 and 63 (maximum<br>64-bit pattern consisting of CMP1:CMP0) and<br>for TIMERn count = 0 to TMRnLMT:<br>OUT0 = OUT1 = CMP1:CMP0[TIMERn]. | OUT1 is the same as OUT0, but can be inverted for<br>motor control applications.<br>INT0 is triggered when all TMRnLMT+1 bits have been<br>streamed out.<br>INT1 is triggered only when TMRnLMT is set to 31 and<br>all 32 bits of CMP0 have been streamed out.                                       |
|                                                |                                                                                                                                                               | Trigger: Timer does not start until trigger occurs (if<br>enabled). After clearing and reconfiguring the timer for<br>this mode, a subsequent trigger restarts the pattern<br>replay.                                                                                                                 |
|                                                | For TMRnLMT < 32 and for TIMERn count = 0<br>to TMRnLMT:<br>OUT0 = CMP0[TIMERn].<br>OUT1 = CMP1[TIMERn].                                                      | Repeated Pattern Generation (0xD): CMP0/CMP1 bit-<br>shifted to form output pattern on OUT0, reset at<br>TMRnLMT back to 0 and repeat pattern.                                                                                                                                                        |
| REPEATPATTERN                                  | For TMRnLMT between 32 and 63 (maximum<br>64-bit pattern consisting of CMP1:CMP0) and<br>for TIMERn count = 0 to TMRnLMT:<br>OUT0 = OUT1 = CMP1:CMP0[TIMERn]. | Same as SINGLEPATTERN with the exception that the timer repeats the pattern after TMRnLMT+1 bits have been streamed out. Since TMRnLMT is used for the repeat pattern length, there is no option to repeat the pattern N times.                                                                       |
|                                                |                                                                                                                                                               | Single-run Edge Event Timer (0xE): Counts up from zero upon trigger event, stops at first clock edge.                                                                                                                                                                                                 |
| EVENTTIMER<br>(Apollo4 / Apollo4<br>Blue only) | OUT0/OUT1: Unused                                                                                                                                             | TIMERn clock = bus clock. Start counter at trigger event<br>and stop counter at first transition of edge source<br>(CTRLn_TMRnCLK).                                                                                                                                                                   |
|                                                |                                                                                                                                                               | Trigger: Timer does not start until trigger occurs (if enabled). Subsequent triggers ignored.                                                                                                                                                                                                         |

| Table | 14: | Timer | Modes |
|-------|-----|-------|-------|
|-------|-----|-------|-------|

### NOTE

For the Apollo4 and Apollo4 Blue SoCs, please refer to ERR059 in the errata list for a comprehensive list of Counter/Timer errata limiting functionality or timer selection.

#### NOTE

Unlike with the Apollo4 and Apollo4 Blue SoCs, on the Apollo4 Plus and Apollo4 Blue Plus SoCs the TIMER\_TIMERn registers are read-only and directly writing to these registers have no effect.

### NOTE

On the Apollo4 and Apollo4 Blue SoCs, stopping the timer by de-asserting the TIMER\_CTRLn\_TMRnEN bit after the timer has completed a mode operation may not always retain/clear the timer count (TIMERn register). The counter should be read before disabling the timer if the count is of interest.

On the Apollo4 Plus and Apollo4 Blue Plus SoCs, stopping the timer by deasserting the TIMER\_CTRLn\_TMRnEN bit after the timer has completed a mode operation will *always* clear the timer count (TIMERn register). The counter should be read before disabling the timer if the count is of interest.

#### NOTE

On the Apollo4 Plus and Apollo4 Blue Plus SoCs, for EDGE, UPCOUNT and PWM timer modes, the TMRnmINT interrupt (n = 0 to 15, m = 0 or 1) is triggered, if enabled, when TMRnCMPm value is equal to the TIMERn count value. For SINGLEPATTERN and REPEATPATTERN modes, the TMRn0INT interrupt (n = 0 to 15) is triggered, if enabled, when the TMRnLMT value is reached. If TMRnLMT was initially set to 31 then TMRn1INT will also be triggered if enabled.

### NOTE

On the Apollo4 Plus and Apollo4 Blue Plus SoCs,CMP0/CMP1 values should not be changed when the TIMER is running to prevent corrupting the counter.

#### NOTE

On the Apollo4 Plus and Apollo4 Blue Plus SoCs, the TMRnLMTVAL register holds the instantaneous value of a counter for TMRnLMT and can be read to get either (1) the number of executed iterations of the mode in EDGE, UPCOUNT or PWM mode, or (2) the number of bits already streamed out of the TMRnLMT+1 total bits in SINGLEPATTERN OR REPEATPATTERN mode.

## 6.1.1 Edge-generating Single Run Up-counter (TMRnFN = 0x1)

This timer mode is selected for TIMERn by setting the CTRLn\_TMRnFN field to 0x1 (n = 0 to 15).

- When the timer is enabled, the pin outputs, OUT0 and OUT1, are at the level selected by the CTRLn\_T-MRnPOLm bit (m = 0 or 1).
- TIMERn is at zero because CTRLn\_TMRnCLR has been asserted previously.
- The timer will not start until a trigger occurs when enabled by the CTRLn\_TMRnTMODE and as designated by the MODEn\_TMRnTRIGSEL field.
- TIMERn counts up on each clock, the source of which is selected by CTRLn\_TMRnCLK field, until TIM-ERn is equal to the value of the TMRnCMP0 register.
- The state of OUT0 is switched when the TIMERn count hits the value of TMRnCMP0 register, and maintains that level until TIMERn is cleared. TIMERn stops after the successful compare.
- If TMRnCMP1 is set lower than TMRnCMP0 (shown in Figure 2), then the state of OUT1 is switched when the TIMERn count hits the value of TMRnCMP1 register, while TIMERn continues counting. Otherwise, OUT1 does not transition (shown in Figure 3).
- An interrupt, if enabled by setting the INTEN\_TMRnmINT bit (n = 0 to 15, m = 0 or 1), is triggered when TMRnCMPm value is equal to the TIMERn count value. It is cleared by writing a one to the corresponding INTCLR\_TMRnmINT bit.
- If the polarity for an output is set to NORMAL (CTRLn\_TMRnPOLm = 0x0), then OUTm goes high when TMRnCMPm = TIMERn. When an output is set to INVERTED, then the transitions occur at the same times but are just the opposite in direction.
- After the successful compare and the clearing of the timer (asserting CTRLn\_TMRnCLR), subsequent triggers start TIMERn counting again.



#### Figure 2. Timer Edge Mode (TMRnFN = 0x1) - CMP0 > CMP1





# 6.1.2 Repeatable Pulse-generating Up-counter (TMRnFN = 0x2)

This timer mode is selected for TIMERn by setting the CTRLn\_TMRnFN field to 0x2 (n = 0 to 15).

- When the timer is enabled, the pin outputs, OUT0 and OUT1, are at the level selected by the CTRLn\_T-MRnPOLm bit (m = 0 or 1).
- TIMERn is at zero because CTRLn\_TMRnCLR has been asserted previously.
- The timer will not start until a trigger occurs when enabled by the CTRLn\_TMRnTMODE and as designated by the MODEn\_TMRnTRIGSEL field.
- TIMERn counts up on each clock, the source of which is selected by CTRLn\_TMRnCLK field, until TIM-ERn is equal to the value of the TMRnCMP0 register.
- When TIMERn is equal to the value of either of the TMRnCMPm registers, a pulse of polarity opposite of the start state of OUTm is generated, after which OUTm returns to its original state.
  - The pulse is of 1 clock cycle in duration.
- TMRnCMPm must be at least 1 so that the repeat interval is two clock cycles.
- If the polarity for an output is set to NORMAL (CTRLn\_TMRnPOLm = 0), then OUTm starts off in the low state and pulses high when TMRnCMPm = TIMERn. When an output is set to INVERTED (TMRnPOLm = 1), then OUT0/OUT1 starts out in the high state and pulse low for 1 source clock pulse when CMP0/CMP1 value is reached by TIMERn. The generated pulses occur at the same times but are the opposite in polarity.
- TIMERn continues counting after the successful compare and pulses are generated on OUT0 or OUT1 in the above manner, creating a stream of pulses or interrupts at a fixed interval until TIMERn is stopped.
- When stopped or disabled, TIMERn will stop counting but will not be cleared. Asserting CTRLn\_TMRn-CLR will reset TIMERn to zero.
- The CTRLn\_TMRnLMT field can be set from 0 to 255 to specify the number of timer counter cycles before TIMERn stops counting. If this field is set to 0, then TIMERn counts indefinitely until it is disabled. This operation is as shown in Figure 4.
- An interrupt, if enabled by setting the INTEN\_TMRnmINT bit (n = 0 to 15, m = 0 or 1), is triggered when TMRnCMPm value is equal to the TIMERn count value. It is cleared by writing a one to the corresponding INTCLR\_TMRnmINT bit.



• After the successful compare and the clearing of the timer (asserting CTRLn\_TMRnCLR), subsequent triggers start TIMERn counting again.

Figure 4. Timer Repeated Pulse Up-counter Compare Mode (TMRnFN = 0x2)

## 6.1.3 Repeatable PWM-generating Up-counter (TMRnFN = 0x4)

This timer mode is selected for TIMERn by setting the CTRLn\_TMRnFN field to 0x4 (n = 0 to 15).

- When the timer is enabled, the pin outputs, OUT0 and OUT1, are at the level selected by the CTRLn\_T-MRnPOLm bit (m = 0 or 1).
- TIMERn is at zero because CTRLn\_TMRnCLR has been asserted previously.
- The CTRLn\_TMRnLMT field can be set from 0 to 255 to specify the number of timer counter cycles before TIMERn stops counting. If this field is set to 0, then the timer counts until it is disabled.
- The timer will not start until a trigger occurs when enabled by the CTRLn\_TMRnTMODE and as designated by the MODEn\_TMRnTRIGSEL field.
- TIMERn counts up on each clock, the source of which is selected by CTRLn\_TMRnCLK field, until TIM-ERn is equal to the value of the TMRnCMP0 register (output period).
- When TIMERn is equal to the value of the TMRnCMP1 register, OUT0 transitions to the opposite polarity (high if CTRLn\_TMRnPOLm = 0).
- TIMERn continues counting and when it is equal to the value of the TMRnCMP0 register, OUT0 transitions to the original polarity (low if CTRLn\_TMRnPOLm = 0).
  - The complementary signal of OUT0 is output on OUT1.
  - TIMERn clears and repeats the counting-and-compare cycle until the TMRnLMT field value is reached or TIMERn is disabled. This operation is as shown in Figure 5.
- An interrupt, if enabled by setting the INTEN\_TMRnmINT bit (n = 0 to 15, m = 0 or 1), is triggered when TMRnCMPm value is equal to the TIMERn count value. It is cleared by writing a one to the corresponding INTCLR\_TMRnmINT bit.



Figure 5. Timer PWM Mode (TMRnFN = 0x4)

# 6.1.4 Repeatable Pulse-generating Down-counter (TMRnFN = 0x6)

### Supported on the Apollo4 and Apollo4 Blue SoCs only

This timer mode is selected for TIMERn by setting the CTRLn\_TMRnFN field to 0x6 (n = 0 to 15).

- When the timer is enabled, the pin outputs, OUT0 and OUT1, are at the level selected by the CTRLn\_T-MRnPOLm bit (m = 0 or 1).
  - If the polarity for an output is set to NORMAL (CTRLn\_TMRnPOLm = 0x0), then OUTm pulses high when TIMERn = TMRnCMPm or 0. When an output is set to INVERTED, then the generated pulses occur at the same times but are the opposite in polarity.
- TIMERn is set to the value in the TMRnCMP0 register when CTRLn\_TMRnCLR is asserted after the TMRnFN field is set for this mode.
- The timer will not start until a trigger occurs when enabled by the CTRLn\_TMRnTMODE and as designated by the MODEn\_TMRnTRIGSEL field.
- TIMERn counts down on each clock, the source of which is selected by CTRLn\_TMRnCLK field.
- When TIMERn =0, a pulse of polarity opposite of the start state of OUT0 is generated, after which OUT0 returns to its original state.
  - The pulse is of 1 clock cycle in duration.
- TMRnCMP0 must be at least 1 so that the repeat interval is two clock cycles.
- If TMRnCMP1 is set to a value less than TMRnCMP0, then when the counter count-down hits the value in TMRnCMP1, a pulse with a polarity opposite of the start state of OUT1 is generated, after which OUT1 returns to its original state. Again, the pulse is of 1 clock cycle in duration.
- The CTRLn\_TMRnLMT field can be set from 0 to 255 to specify the number of timer counter cycles before TIMERn stops counting. If this field is set to 0, then TIMERn counts until it is disabled. When set for at least one repeat (CTRLn\_TMRnLMT >= 2),
- TIMERn continues counting after reaching zero by loading the value of TMRnCMP0 in TIMERn and continuing counting down from there. This results in pulses being generated on OUT0 (and OUT1 if TMRnC-MP1 < TMRnCMP0), creating a stream of pulses or interrupts at a fixed interval until CTRLn\_TMRnLMT is reached or TIMERn is stopped. This operation is as shown in Figure 6.
- When stopped or disabled, TIMERn will stop counting but will not be cleared. Asserting CTRLn\_TMRn-CLR will reset TIMERn to zero.
- The interrupt, if enabled by setting the INTEN\_TMRn0INT bit, may be cleared by writing a one to the corresponding INTCLR\_TMRn0INT bit.



Figure 6. Timer Repeated Pulse Down-counter Compare Mode (TMRnFN = 0x6)

## 6.1.5 Single Run Pattern-generating Up-counter (TMRnFN = 0xC)

This timer mode is selected for TIMERn by setting the CTRLn\_TMRnFN field to 0xC (n = 0 to 15).

There are two general cases for this mode as follows.

- CTRLn\_TMRnLMT <= 31 (0x1F):
  - OUT0 and OUT1 are independent output patterns derived from (up to) 32 bits stored in TMRnCMP0 and TMRnCMP1, respectively.
  - As TIMERn counts up with value x, OUT0 = TMRnCMP0[x] bit and OUT1 = TMRnCMP1[x] bit for a clock cycle.
  - After CTRLn\_TMRnLMT cycles, the outputs terminate. See Figure 7.
- CTRLn\_TMRnLMT between 32 and 63:
  - OUT0 is a pattern consisting of 33 64 bits comprised of the concatenated TMRnCMP1:TMRnCMP0 value.
  - CMP0 holds TIMERn values from 0-31 and CMP1 holds TIMERn values 32-63.
  - As TIMERn counts up with value x, OUT0 = TMRnCMP1:TMRnCMP0[x] bit for a clock cycle and for CTRLn\_TMRnLMT total cycles.
  - At the same time OUT1 gets the same values as OUT0, but can be inverted for motor control applications with (CTRLn\_TMRnPOL1 = 0x1). See Figure 8.
- When the timer is enabled, the pin outputs, OUT0 and OUT1, are at the zero level because CTRLn\_TM-RnCLR has been asserted previously.
- The timer will not start until a trigger occurs if enabled by the CTRLn\_TMRnTMODE and as designated by the MODEn\_TMRnTRIGSEL field.
- TIMERn counts up on each clock, the source of which is selected by CTRLn\_TMRnCLK field.
- Each successive TIMERn counter value is the index for reading and outputting the bits of the 32-bit values stored in TMRnCMP0 and TMRnCMP1.
- The INT0 interrupt, if enabled by setting the INTEN\_TMRn0INT bit, is triggered when the TIMERn count = TMRnLMT value. It is cleared by writing a one to the corresponding INTCLR\_TMRn0INT bit.

• After pattern output and the clearing of the timer by asserting CTRLn\_TMRnCLR followed by reconfiguring the timer, a subsequent trigger starts TIMERn counting again.









### 6.1.5.1 Single 32-bit Pattern Output Example

- TMRnLMT = 24
- Since value of TMRnLMT < 32, OUT0 and OUT1 are independent.</li>
  - OUT0 is at the level of each successive bit of CMP0 indexed from LSB to MSB.
  - OUT1 is at the level of each successive bit of CMP1 indexed from LSB to MSB.
  - 25 bits (TMRnLMT+1) are indexed.

The 25 bits for the output waveforms are as shown in Figure 9.



# Figure 9. Single 32-bit Pattern Output

#### 6.1.5.2 Single 64-bit Pattern Output Example

- TMRnLMT = 45
- CMP0 and CMP1 are both 0x55555555
- Since value of TMRnLMT >= 32, OUT0 and OUT1 have the same waveform.
  - OUT0/OUT1 is at the level of each successive bit of CMP0:CMP1 indexed starting with the LSB of CMP0 to the MSB of CMP1.
  - A maximum of 64 bits (TMRnLMT+1) are indexed.

The 45 bits for the output waveforms are as shown in Figure 10.



### Figure 10. Single 64-bit Pattern Output

### 6.1.6 Repeated Pattern-generating Up-counter (TMRnFN = 0xD)

This timer mode is selected for TIMERn by setting the CTRLn\_TMRnFN field to 0xD (n = 0 to 15).

The functionality and timing of this mode are exactly the same as Single Pattern Output (TMRnFN = 0xC) except that it repeats the pattern in a continuous manner. Refer to "Single Run Pattern-generating Up-counter" for details.

- Since the CTRLn\_TMRnLMT field is used to specify the number of bits used in CMP0 and CMP1 to form the output pattern, it cannot be used to specify the number of times the pattern is repeated (as in other repeatable modes).
- This mode behaves as if the CTRLn\_TMRnLMT field is set to 0 and repeats indefinitely until TIMERn is disabled.

### 6.1.7 Single Run Edge Event Timer Up-counter (TMRnFN = 0xE)

#### Supported on the Apollo4 and Apollo4 Blue SoCs only

This timer mode is selected for TIMERn by setting the CTRLn\_TMRnFN field to 0xE (n = 0 to 15).

- TIMERn outputs OUT0 and OUT1 are not used in this mode and are not driven.
- The timer will not start until a trigger occurs if enabled by the CTRLn\_TMRnTMODE and as designated by the MODEn\_TMRnTRIGSEL field.
- TIMERn counts up on each *bus* clock.

- TIMERn stops counting upon detection of an input clock edge, the source of which is as specified by the setting of the CTRLn\_TMRnCLK field. See Figure 11.
- An interrupt, if enabled by setting the INTEN\_TMRn0INT bit, may be cleared by writing a one to the corresponding INTCLR\_TMRn0INT bit.
- After event capture and the clearing of the timer by asserting CTRLn\_TMRnCLR, subsequent triggers start TIMERn counting again.

This timer mode is typically used to measure time between transitions on the same GPIO input or different inputs, as any GPIO can be used to trigger the event counter and any GPIO can be used as the timed event source (GPIO or clock).

| Trigger     |      |              |         |
|-------------|------|--------------|---------|
| Event Input | <br> |              |         |
| Counter     | 0    | Incrementing | Stopped |
| EN          |      |              |         |

Figure 11. Timer Event Counter (TMRnFN = 0xE)

# 6.2 Triggering Functions

All timer functions optionally can be enabled with a trigger, which can be a GPIO input or an output of another TIMER. Triggers are enabled by using the CTRLn\_TMRnTMODE field, which can be configured to trigger on the rising, falling or either edge of the trigger source. The trigger source is selected in the MODEn\_TMRnTRIGSEL field as either output of any TIMER (32 selections) or any of the GPIO inputs.

#### NOTE

Due to an existing erratum against Apollo4, Apollo4 Blue, Apollo4 Plus and Apollo4 Blue Plus SoCs, when using a HW trigger to initiate timer start in any mode on any timer, a specific trigger, selected by MODEn\_TMRnTRIGSEL, can be used only once before requiring a reset of the Timer module in order to use the trigger source again. A Timer module reset stops all existing timer operations of all timers and requires the reconfiguration and restart of all desired timer operations.

Please refer to ERR123 in the Apollo4 or Apollo4 Plus Errata Lists for more information about this issue, including application impact and suggested workarounds.

On the Apollo4 Lite and Apollo4 Blue Lite SoCs, this triggering issue is fixed and when using a HW trigger to initiate timer start in any mode on any timer, a subsequent timer trigger does occur after a previous timer trigger/execution without having to reset the timer before the trigger.

#### NOTE

Due to an Apollo4 device erratum, there is no support for an STIMER capture/ compare event to be the trigger for the start of a timer. This functionality has been deprecated on Apollo4 Plus and Apollo4 Lite.

### 6.2.1 Initiating a One-shot Operation

If the mode of a TIMER is as a one-shot (EDGE, SINGLEPATTERN and EVENTTIMER modes on Apollo4 / Apollo4 Blue; EDGE and SINGLEPATTERN on Apollo4 Plus / Apollo4 Blue Plus), when TMRnEN is asserted one cycle of the operation will be executed. At that point, an edge specified by CTRLn\_TMRnTMODE on the trigger signal selected by MODEn\_TMRnTRIGSEL will cause the operation to be executed again. This allows the creation of complex operations with a single configuration.

### 6.2.2 Terminating a Repeat Operation

The Repeat Pattern mode and the repeatable modes (UPCOUNT, PWM, and DOWNCOUNT on Apollo4 / Apollo4 Blue only) are terminated by asserting the TMRnCLR bit. When this is done, all the settings in the timer are cleared and they must be set up again to do a new operation.

# 6.3 Clocking Timer/Counters with Other Counter/Timer Outputs

There are cases where it is useful to use the output of a TIMER as the clock of another TIMER. The TMRnCLK field includes choices which implement this function, in addition to the normal clocks taken from the internal oscillators. If the clock of the timer which produces the TRIG signal were taken from the OUT0 output of the first timer, the CMP0 value used for the trigger generator would be trivially calculated as 2, and would be independent of the actual clock used to generate the OUT0 signal.

# 6.4 Global Timer/Counter Enable

There are times when it is very important to be able to start multiple Timer/Counters at precisely the same time, particularly in cases where one output is used as the trigger of another. The CTIMER\_GLOBEN register contains one enable bit for each Timer/Counter, which is ANDed with the local CTRLn\_TMRnEN bit of the timer. The GLOBEN register normally has all bits set to 1, so that the local CTRLn\_TMRnEN bits control the timers. For synchronized enabling, the GLOBEN\_ENBn register bits to be synchronized are set to 0, and then the local EN bits of those timers are set to 1. At that point a single write to the GLOBEN register will enable all of the selected timers at once.

In addition, a field in the GLOBEN Register, ENABLEALLINPUTS, is a global enable for all selected *GPIO* inputs to the timers. This global GPIO input enable bit affects only the enabling of timer clocks and triggers from a GPIO. If GLOBEN\_ENABLEALLINPUTS is low, then ALL of the timers' GPIO clocks and triggers are disabled. This does not affect any timer which uses a clock other than a GPIO, e.g., HFRC, LFRC, XT, RTC, BUCK or output of another timer output. Nor does it affect any timer which uses a trigger other than a GPIO, e.g., another timer output. This global setting requires, and does not take the place of, setting the CTRLn\_TMRnEN bit (and the CTRLn\_TMRnTMODE field if used) for any timer intended to be used.

For the synchronous starting of timers by concurrently setting the GLOBEN\_ENBn bits as described above, note that if GPIO is selected as the clock for any timers, then the ENABLEALLINPUTS field must be set before (or at the same time of) initiating the synchronous starting of the timers. The ENABLEALLINPUTS field is not an override for the individual ENBn bits, as they serve different functions. It is also important to note that this field does not affect the enabling or use of any GPIO set up as a timer *output*.

The steps to use the GLOBEN\_ENABLEALLINPUTS when GPIO are used as clock or trigger sources are as follows:

- 1. Write to the GLOBEN register to clear the ENABLEALLINPUTS bit (cleared by default).
- 2. Write to the MODEn register to set the TMRnTRIGSEL field and TMRnASYNC field (if used).
- 3. Write to the CTRLn register to set TMRnCLK, TMRnEN and all other intended settings in this register.
- 4. Write to the GLOBEN register to set the ENABLEALLINPUTS bit.

# 6.5 Generating the Sample Rate for the ADC

TIMER7 has a special function which allows it to function as the sample trigger generator for the ADC. If the TIMER\_GLOBEN\_ADCEN bit is set, the output of the timer is sent to the ADC which uses it as a trigger. Typically, TIMER7 is configured in Repeatable Up-counter Compare (FN =2) mode. INTEN\_TMR70INT may be set to generate an interrupt whenever the trigger occurs, but typically the ADC interrupt will be used for this purpose.

## 6.6 Generating the Sample Rate for the Audio ADC

As with the ADC mentioned above, TIMER6 has a special function which allows it to function as the sample trigger generator for the Audio ADC. If the TIMER\_GLOBEN\_AUDADCEN bit is set, the output of the timer is sent to the Audio ADC which uses it as a trigger. Typically, TIMER6 is configured in Repeatable Up-counter Compare (FN = 2) mode. INTEN\_TMR60INT may be set to generate an interrupt whenever the trigger occurs, but typically the Audio ADC interrupt will be used for this purpose.

## 6.7 CLR and EN Details

The overall operation of each TIMER is controlled by two configuration bits, TMRnCLR and TMRnEN, in the CTRLn register. When TMRnEN is set to 1, TIMERn is immediately set to all zeros and will remain there independent of any other configuration. TMRnCLR is typically used (writing a 1 to the bit) to initialize TIMERn to a value specific to the selected mode before use.

TMRnEN is used to enable (when 1) and disable (when 0) the counting function of the TIMER. However, TMRnEN is synchronized to the selected clock, which must be accounted for when used. When TMRnEN is set to 0, the counter will increment on the next clock and then hold its current value. When TMRnEN is set to 1, the Counter will resume counting on the second following edge.

Since the operation of the processor is essentially asynchronous to the selected clock, the synchronization introduces an uncertainty as to when the counter will begin counting. If the frequency of the selected clock is high relative to the processor clock, the impact of the synchronization will be negligible. However, for low frequency clocks, external pin clocks and the buck clocks the effective delay caused by the synchronization may be significant.

## 6.8 NOSYNC Function

Each TIMERn clock is supplied directly by the clock selected in the TIMER\_CTRLn\_TMRnCLK field. Timers run synchronously on the bus clock based on sampling the source clock, which means the edges will have about 20ns of jitter.

# 6.9 Counter Functions

A TIMER operates in counter mode when the TMRnCLK field selects either an external pad input (if 0x00-0xFF) or a buck pulse input (if 0x1C-0x1F). The different clock selections provide different functions.

### 6.9.1 Counting External Edges

If the TMRnCLK field is 0x80-0xFF, the TIMER clock input comes from an external pad. This allows the TIMER to monitor pulses or edges on an external signal.

### 6.9.2 Counting Buck Converter Edges

The MCU includes four separate buck converter inputs which provide a clock source from Buck VDDC TON, Buck VDDF TON, Buck VDDS TON, or Buck VDDC\_LV TON pulses. Each TIMER may be connected to a pulse stream from any of these sources. One pulse is generated each time the Buck Converter inserts charge into the capacitor, and therefore the number of pulses is a good indication of the amount of energy used by the corresponding power domain in a particular time period.

A possible option to determine energy consumption is as follows. Two counters could be configured for repeated UP-counter mode (FN = 2, TMRnLMT=0) so that they count continuously. One is supplied a Buck Converter pulse stream as its clock, and the other is supplied with a divided version of the LFRC clock to avoid creating extra power consumption due to the power measurement. Once configured such, the two counters should be enabled simultaneously, and after some period of system operation they should be disabled and read. The LFRC count value would now define how much real time has elapsed, and the Buck Converter count value would define how much energy was consumed in that time.

## 6.10 Interconnecting Timers

The OUT0 or OUT1 output of any TIMER may be used as either the trigger or the clock of any other TIMER. The selection of the clock or trigger source is made within each TIMERn configuration in the CTRLn\_TMRnCLK field or the MODEn\_TMRnTRIGSEL field, respectively.

# 7. System Timer (STIMER)

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

# 8. Watch Dog Timer (WDT)

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

#### NOTE

The Apollo4 Lite and Apollo4 Blue Lite SoCs do not include DSP watch dog timers.

# 8.1 Basic Operation

The basic operation of the watchdog timers is to configure the clock, program the two 8-bit registers that are used as counter thresholds for the timer (INTVAL, RESVAL), and set the WDTEN to enable the timer. As the watchdog counter increments and reaches the interrupt value (INTVAL), the watchdog timer will issue an interrupt to the respective CPU (assuming INTEN is set). The counter will then continue counting to the RESVAL value unless the CPU writes the RESTART KEY to the RSTART register, which will reset the counter to zero and continue counting (the CPU cannot otherwise influence the counter). If the CPU fails to respond to the interrupt and reset the timer, the timer will reach the reset (RESVAL) value. At this point, the watchdog timer will issue a reset to the appropriate reset controller to reset the CPU.

# 8.2 Register Functions

A summary of the registers is listed below.

- CFG: The config register contains the INTVAL, RESVAL, and the CLKSEL (ARM) fields. It also contains the WDTEN to enable the timer, INTEN to enable the interrupt comparator, and RESEN to enable the reset comparator.
- RSTRT: The "restart" register allows the processor to reset its watchdog counter to zero by writing the 8bit key. Failure to restart the counter before the RESVAL will cause a reset.
- TLOCK: The "timer lock" register allows the CPU to irrevocably enable its timer by writing the LOCK key
  to this register. Locking the timer will automatically enable the timer and prevent subsequent writes to the
  associated CFG register, therefore it is important that software ensure that it is programmed properly
  before locking the timer.
- COUNT: The count register returns the current counter value.

The watchdog timer also contains a standard Ambiq interrupt register block for interrupts for each of the processors. The interrupt registers have been split into separate register banks to avoid issues with contention between processors when servicing interrupts.

# 9. General Purpose Input/Output (GPIO)

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

# 9.1 General Purpose I/O (GPIO) Functions

The following sub-sections refer to function selections offered on various pins. Please consult the devicespecific Pin Mapping table in the GPIO section of the applicable device's datasheet for specific function mapping for each GPIO.

For each pad, if the FNCSEL0n field is set to 0x3 the pad is connected to the corresponding GPIO signal. This section describes the configuration functions specific to GPIO pads.

### 9.1.1 Configuring the GPIO Functions

Each GPIO must be configured in the REG\_GPIO\_PINCFGn (n = GPIO number) Registers as an input and/or output before using. Note that the PADKEY Register must be set to the value 0x73 in order to write the REG\_GPIO\_PINCFGn Registers. Each output may be push-pull, open drain, disabled, or tri-stated as selected by the REG\_GPIO\_PINCFGn\_OUTCFGn field. If the output is configured as push-pull, the pad will be driven with the corresponding bit in the REG\_GPIO\_WTn (n = 0-3) Register. If the output is configured as open drain, the pad will be pulled low if the corresponding bit in the WTn Register is a 0, and will be floating if the corresponding bit in the WTn Register is a 1. If the output is configured as tri-state, the pad will be driven with the corresponding bit in the WTn Register if the corresponding bit in the REG\_GPIO\_ENn Register is a 1. If the bit in ENn is a 0, the output will be floating.

If the FNCSELn field is set to "NCEn" (n = GPIO number), the pin functions as the chip enable for the communication channel specified by the NCESRCn field in that PINCFGn register.

### 9.1.2 Reading from a GPIO Pad

All GPIO inputs are readable at all times provided the INPENn bit is enabled and the RDZEROn bit is disabled, even if the pad is not configured as a GPIO. The current values of pads 0 to 31 are read in the REG\_GPIO\_RD0 Register, the current values of pads 32 to 63 are read in the REG\_GPIO\_RD1 Register, the current values of pads 64 to 95 are read in the REG\_GPIO\_RD2 Register, and the current values of pads 96 to 127<sup>1</sup> are read in the REG\_GPIO\_RD3 Register.

### 9.1.3 Writing to a GPIO Pad

The GPIO pad outputs are controlled by the REG\_GPIO\_WT0-WT3 Registers and the REG\_GPIO\_EN0-EN3 Registers. Each of these registers may be directly written and read. Because each GPIO is often an independent function, the capability also exists to set or clear one or more bits without having to perform a read-modify-write operation. If the REG\_GPIO\_WTS0-WTS3 Register is written, the corresponding bit in WT0-WT3 will be set if the write data is 1, otherwise the WT0-WT3 bit will not be changed. If the REG\_GPIO\_WTC0-WTC3 Register is written, the corresponding bit in WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 will be cleared if the write data is 1, otherwise the WT0-WT3 bit will not be changed.

If a GPIO pad is configured for tri-state output mode, the EN0-EN3 Register controls the enabling of each bit. These registers may be directly written, and individual bits may be set or cleared by writing the ENS0-ENS3 or ENC0-ENC3 Registers with a 1 in the desired bit position.

<sup>1.</sup> GPIO105-127 are reserved and unavailable for use.

### 9.1.4 GPIO Interrupts

Each GPIO pad can be configured to generate an interrupt on a high-to-low transition or a low-to-high transition (or either), as selected by setting the REG\_GPIO\_PINCFGn\_IRPTENn bit. This interrupt will be generated even if the pad is not configured as a GPIO in the Pad Configuration logic.

Table 15 below describes the interrupt trigger options.

| IRPTENn | Interrupt                                    |
|---------|----------------------------------------------|
| 00      | Disabled                                     |
| 01      | High -> low transition                       |
| 10      | Low -> high transition                       |
| 11      | Either low -> high or high -> low transition |

#### Table 15: Interrupt Trigger Options

Each interrupt is enabled, disabled, cleared or set with a standard set of interrupt registers REG\_GPIO\_MCUN0INTnEN, REG\_GPIO\_MCUN0INTnSTAT, REG\_GPIO\_MCUN0INTnCLR and REG\_GPIO\_MCUN0INTnSET, where n = 0 to 3 for GPIO pads 0 to 31, 32 to 63, 64 to 95 and 96 to 127, respectively. The N0 designation in these registers indicates that these are interrupt register set 0. A duplicate set of registers, with a designation N1, is available and these registers are similarly named REG\_GPIO\_MCUN1INTnEN, REG\_GPIO\_MCUN1INTnSTAT, REG\_GPIO\_MCUN1INTnCLR and REG\_GPIO\_MCUN1INTnEN, REG\_GPIO\_MCUN1INTnSTAT, REG\_GPIO\_MCUN1INTnCLR and REG\_GPIO\_MCUN1INTnSET.

The purpose of this dual set of registers is to enable the segregation of an interrupt or a small set of interrupts which may have higher importance and thus should be handled at a higher priority level and/or with minimal latency. If only one interrupt is enabled in one of these sets, then there is no need to determine which GPIO caused the interrupt in this bank by reading and processing the Status Register bits. Note that these interrupts get mapped to different IRQs and hence respective interrupts need to be enabled in NVIC and serviced accordingly.

# 9.2 Pad Connection Summary

Figure 12 shows the detailed implementation of each pad. Each element will be described in detail.

### 9.2.1 Output Selection

There is a multiplexer which selects the module signal to be driven to the output based on the REG\_GPIO\_PINCFGn\_FNCSELn (n = GPIO number) field. This implements the multiplexing shown in the device-specific Pin Mapping Table for output pads. For all pads, a FNCSELn value of 0x3 selects the value in the corresponding GPIO\_WTn register bit.

Certain functional groups, Timer and MSPI in particular, have additional pre-muxing configuration as noted.

### 9.2.2 Output Control

The pad driver for each pad has a data input and an output enable input. Each of these controls is selected from among several alternatives based on the OUTDATSEL and OUTENSEL signals which are controlled by the selection of the output type as shown in the Special Pad Types table in the GPIO section of the applicable datasheet, as set in the REG\_GPIO\_PINCFGn\_PULLCFGn field.

OUTDATSEL normally selects the data from the output multiplexer, but if the pad is configured as Open Drain the data input is selected to be low.



Figure 12. Pad Connection Details

OUTENSEL normally selects a ground signal to keep the pad driver enabled. If the pad is configured to be Open Drain, the pad enable is driven with the data from the output multiplexer. If the pad is configured as a GPIO (PADnFNCSEL = 0x3) and the GPIO drive type is tri-state (OUTCFGn = 0x3), the pad enable is driven with the inverse of the corresponding GPIO bit in the GPIO\_ENn register. If the pad is not configured as an output, the pad enable is forced high to turn the driver off.

The drive strength of each pad driver is configured as described in Section 9.1 on page 59.

### 9.2.3 Input Control

The input circuitry of the pad may be disabled by clearing the PINCFGn\_INPENn bit. This configuration should always be set if the pad input is not being used, as it prevents unnecessary current consumption if

the pad voltage happens to float to a level between VDD and Ground. If PADnINPEN is 0, the pad will always read as a 0.

If INPENn is set, the pad input then goes to two places. It is driven to the selected module signal as selected in the device-specific Pin Mapping Table. In addition, the pad input can always be read from the GPIO\_RDn register unless the pad is configured as a GPIO (FNCSELn = 0x3) and RDZEROn is high, which will force the GPIO\_RD input to be a zero. The ability to always read the pad value is very useful in some diagnostic cases.

The pad input is always sent to the GPIO interrupt logic, and a pad transition in the direction selected by IRPTENn will set the MCUNnINTmSTAT flip-flop when the corresponding MCUNnINTmEN bit is set. Note that this interrupt will be set even if the pad is not configured as a GPIO, which may be useful in detecting functions. As an example, this could be used to generate an interrupt when the I<sup>2</sup>C/SPI Slave nCE signal is driven low by the Interface Host.

### 9.2.4 Pull-up/Pull-down Control

If PULLCFGn is set to a pull-up or pull-down value, a pull-up/pull-down resistor is connected between the pad and VDDH/VSSH.

#### 9.2.5 Analog Pad Configuration

Pads which may have analog connections (all pads with ADC or VCOMP functions shown in the devicespecific Pin Mapping Table) include the circuitry shown with the dotted lines of Figure 12. If the pad is configured in analog mode (reference the analog input function selections in the device-specific Pin Mapping Table), the pad is connected directly to the particular analog module signal. In addition, OUTENSEL is forced high to disable the pad output, and the input of the pad is disabled independent of the value of INPENn.

# 9.3 Module-specific Pad Configuration

The following sections describe in detail how to configure the pads for each module function. Reference the device-specific Pin Mapping Table for pads which are available and can be used for the signals described in the following sub-sections.

#### 9.3.1 Implementing IO Master Connections

The eight IO Master modules must be correctly connected to the appropriate pads in order to operate. Reference the device-specific Pin Mapping Table for pads which can used for the various signals for each IO Master instance: MnSCK, MnMOSI and MnMISO for SPI operation, and MnSCL and MnSDAWIRE3 for  $I^2C$  operation, where n = instance number 0 - 7. Any GPIO pad brought out to a pin on the package can be configured as the SPI Chip enable for any of the eight IOM channels. If the FNCSELn field is set to "NCEn" (n = GPIO number), the pin functions as the chip enable for the communication channel specified by the

### 9.3.1.1 IO Master I<sup>2</sup>C Connection

NCESRCn field in that PINCFGn register.

I<sup>2</sup>C mode of IO Master 0 uses pad 5 as SCL and pad 6 as SDA. This mode is configured by setting the FNCSELn fields as shown in Table 16. If the internal I<sup>2</sup>C pullup resistors are to be used, PULLCFG5 and PULLCFG6 should be set to select the desired pullup resistor. If external pullup resistors are used, PULLCFG5 and PULLCFG5 and PULLCFG6 should be cleared.

#### Table 16: IO Master 0 I<sup>2</sup>C Configuration

| Field   | Value |
|---------|-------|
| FNCSEL5 | 0     |
| FNCSEL6 | 0     |

This same setup for I<sup>2</sup>C mode is done for each of the other seven instances of the IO Master as needed. Reference the device-specific Pin Mapping Table for pads which are used for the MnSCL (SCL) and MnSDAWIR3 (SDA) signals for each IO Master instance.

#### 9.3.1.2 IO Master 4-wire SPI Connection

Four-wire SPI mode of IO Master 0 uses pad 5 as SCK, pad 6 as MOSI and pad 7 as MISO. This mode is configured by setting the FNCSELn fields as shown in Table 17. PULLCFG5, PULLCFG6 and PULLCFG7 should be cleared. Any GPIO can be configured as the CE for any of the Master IO instances. This same setup for 4-wire SPI mode is done for each of the other seven instances of the IO Master as needed. Reference the device-specific Pin Mapping Table for pads which are used for the MnSCK, MnMOSI and MnMISO signals for each IO Master instance.

It should also be noted that in 4-Wire mode, the MnMOSI pin should be configured with the FOENn bit set to force output enable active on the pin.

| Field   | Value |
|---------|-------|
| FNCSEL5 | 1     |
| FNCSEL6 | 1     |
| FNCSEL7 | 0     |

#### Table 17: IO Master 0 4-wire SPI Configuration

## 9.3.1.3 IO Master 0 3-wire SPI Connection

Three-wire SPI mode of IO Master 0 uses pad 5 as SCK and pad 6 as MOSI/MISO. This mode is configured by setting the FNCSELn fields as shown in Table 18. PULLCFG5 and PULLCFG6 should be cleared. Pad 7 may be used for other functions.

| Table 18: | <b>IO Master</b> | 0 3-wire SPI | Configuration |
|-----------|------------------|--------------|---------------|
|-----------|------------------|--------------|---------------|

| Field   | Value |
|---------|-------|
| FNCSEL5 | 1     |
| FNCSEL6 | 0     |

A variety of pads may be used for up to four nCE signals to select up to four separate slaves. The INPENn and PULLCFGn bits of any pad used for nCE should be cleared.

Any GPIO pad brought out to a pin can be configured as the CE for any of the Master IO instances.

This same setup for 3-wire SPI mode is done for each of the other seven instances of the IO Master as needed. Reference the device-specific Pin Mapping Table for pads which are used for the MnSCK (SCK) and MnSDAWIR3 (MOSI/MISO) signals for each IO Master instance.

### 9.3.1.4 SPI Flow Control Connections

SPI Flow Control in interrupt mode requires an external pin to be specified as the interrupt pin. This is accomplished by configuring the desired pin in the IOMxIRQ register (x = 0 to 7).

#### 9.3.1.5 Implementing IO Slave Connections

The IO Master module must be correctly connected to the appropriate pads in order to operate.

# 9.3.1.6 IO Slave I<sup>2</sup>C Connection

I<sup>2</sup>C mode of the IO Slave uses pad 0 as SCL and pad 1 as SDA. This mode is configured by setting the FNCSELn fields as shown in Table 19. The INPEN0 and INPEN1 bits must be set. PULLCFG0 and PULLCFG1 should be cleared.

| Table | 19: | 10 | Slave | l <sup>2</sup> C | Configuration |
|-------|-----|----|-------|------------------|---------------|
|-------|-----|----|-------|------------------|---------------|

| Field   | Value |
|---------|-------|
| FNCSEL0 | 1     |
| FNCSEL1 | 1     |

#### 9.3.1.7 IO Slave 4-wire SPI Connection

Four-wire SPI mode of the IO Slave uses pad 0 as SCK, pad 1 as MOSI, pad 2 as MISO and pad 3 as nCE. This mode is configured by setting the FNCSELn fields as shown in Table 20. The INPEN0, INPEN1 and INPEN3 bits must be set. PULLCFG0, PULLCFG1, PULLCFG2 and PULLCFG3 should be cleared.

| Table | 20: | 10 | Slave | 4-wire | SPI | Configuration |
|-------|-----|----|-------|--------|-----|---------------|
|-------|-----|----|-------|--------|-----|---------------|

| Field   | Value |
|---------|-------|
| FNCSEL0 | 2     |
| FNCSEL1 | 2     |

| Field   | Value |
|---------|-------|
| FNCSEL2 | 1     |
| FNCSEL3 | 1     |

#### Table 20: IO Slave 4-wire SPI Configuration

### 9.3.1.8 IO Slave 3-wire SPI Connection

Three-wire SPI mode of the IO Slave uses pad 0 as SCK, pad 1 as MISO/MOSI and pad 3 as nCE. This mode is configured by setting the FNCSELn fields as shown in Table 21. The INPEN0, INPEN1 and INPEN3 bits must be set. PULLCFG0, PULLCFG1 and PULLCFG3 should be cleared. Pad 2 may be used for other functions.

| Table 21: | IO Slave | 3-wire SPI | Configuration |
|-----------|----------|------------|---------------|
|-----------|----------|------------|---------------|

| Field   | Value |
|---------|-------|
| FNCSEL0 | 1     |
| FNCSEL1 | 1     |
| FNCSEL3 | 1     |

#### 9.3.1.9 IO Slave Interrupt Connection

The IO Slave can be configured to generate an interrupt output under a variety of internal conditions. If this function is used, the interrupt will be generated on pad 4. FNCSEL4 must be set to 1, and INPEN4 and PULLCFG4 should be cleared.

#### NOTE

The BLE Controller in the Apollo4 Blue Plus KXR package and the Apollo4 Blue Lite uses GPIO04 for its 32 kHz clock. This pad cannot be configured for the IO Slave interrupt or any other function.

#### 9.3.2 Implementing Display Controller Connections

The Display Controller (DC) supports connection to displays via two serial ports. One set of pads is available for each of these interfaces, and these pads need to be correctly configured for proper display control. Reference the device-specific Pin Mapping Table for pads which can used for the DC signals.

NOTE

The Display Controller is not included on the Apollo4 Lite or the Apollo4 Blue Lite.

The functions of the DC signals are as follows, grouped by interface. See Display Controller chapter for description of signals.

SPI

- DISP\_SPI\_SCK
- DISP\_SPI\_SD

- DISP\_SPI\_DCX
- DC\_SPI\_CS\_N\*

#### Quad SPI

- DISP\_QSPI\_D3 DISP\_QSPI\_D0
- DISP\_QSPI\_SCK
- DC\_QSPI\_CS\_N\*

\*Selectable from any GPIO pad brought out to a pin to be configured as the CE for the DC SPI or QSPI interface

INPENn and PULLCFGn should be cleared for output signals. For input and bi-directional signals, INPENn and PULLCFGn should be set.

#### NOTE

Use of the DPI-2 interface, which includes pad functions DISP\_D0 - DISP\_D23, DISP\_VS, DISP\_HS, DISP\_DE, DISP\_PCLK, DISP\_SD and DISP\_CM, is not recommended or supported.

| Field    | Value | Signal Direction            |                | Pad |
|----------|-------|-----------------------------|----------------|-----|
| FNCSEL78 | 9     | DISP_SPI_SCK                | Output         | 78  |
| FNCSEL74 | 9     | DISP_SPI_SD                 | Bi-directional | 74  |
| FNCSEL75 | 9     | DISP_SPI_DCX Output         |                | 75  |
| FNCSEL74 | 2     | DISP_QSPI_D0                | Bi-directional | 74  |
| FNCSEL75 | 2     | DISP_QSPI_D1                | Bi-directional | 75  |
| FNCSEL76 | 2     | DISP_QSPI_D2                | Bi-directional | 76  |
| FNCSEL77 | 2     | DISP_QSPI_D3 Bi-directional |                | 77  |
| FNCSEL78 | 2     | DISP_QSPI_SCK               | Output         | 78  |

#### Table 22: DISP Interface Configuration

### 9.3.3 Implementing Counter/Timer Connections

Each Counter/Timer can optionally count pulses from an input pad, or generate pulses on an output pad. A pad's FNCSELn field is set to CTn (0x6) to connect a Counter/Timer to that pad. If the pad is used as an input, the INPENn bit should be set, otherwise it should be cleared. The PULLCFGn bit may be set if the input signal is open drain.

Refer to the "Pad Connections from the Timer/Counter" section in the Counter/Timer Module chapter for more information on configuring a timer's input/output pad.

### 9.3.4 Implementing UART Connections

The UART signals can be connected to a variety of pads.

### 9.3.4.1 UART0 TX/RX Connections

The UART0 data signals TX and RX may each be connected to several pads. Note that TX and RX are selected independently. Table 23 shows the connections for TX, which should have the corresponding INPENn and PULLCFGn fields clear. Table 24 shows the connections for RX, which must have the corresponding INPENn field set and should have the corresponding PULLCFGn field clear.

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 0   | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 12  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 20  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 30  | 4      | ~       | ~               | ~               | ~                           | ~                           |                 |                      |
| 34  | 4      | ~       |                 | ~               |                             |                             | ~               |                      |
| 41  | 4      | ~       |                 | ~               |                             | ~                           | ~               | ~                    |
| 45  | 4      | ~       |                 | ~               |                             | ~                           | ~               | ~                    |
| 53  | 4      | ~       | ~               | ~               | ~                           |                             | ~               |                      |
| 60  | 4      | ~       | ~               | ~               | ~                           |                             |                 |                      |
| 66  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 72  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 78  | 4      | ✓       | ✓               | $\checkmark$    | ✓                           | ✓                           | ✓               | ✓                    |

Table 23: UART0 TX Configuration

### Table 24: UART0 RX Configuration

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 2   | 4      | ✓       | ✓               | ✓               | ✓                           | $\checkmark$                | $\checkmark$    | $\checkmark$         |
| 22  | 4      | ~       | ~               | ~               | ~                           | ~                           | ✓               | ✓                    |
| 32  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ✓                    |
| 36  | 4      | ~       |                 | ~               |                             |                             | ~               |                      |
| 43  | 4      | ~       |                 | ~               |                             | ~                           | ~               | ✓                    |
| 47  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 55  | 4      | ~       | ~               | ~               | ~                           |                             | ~               |                      |
| 62  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ✓                    |
| 68  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ✓                    |
| 74  | 4      | ~       | ~               | ~               | ~                           | ~                           | $\checkmark$    | ✓                    |

# 9.3.4.2 UART0 RTS/CTS Connections

The UART modem control signals RTS and CTS may each be connected to one of two pads. Note that RTS and CTS are selected independently. Table 25 shows the connections for RTS, which should have the corresponding INPENn and PULLCFGn fields clear. Table 26 shows the connections for CTS, which must have the corresponding INPENn field set and should have the corresponding PULLCFGn field clear.

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 4   | 4      | ~       | ~               | ✓               | ~                           |                             | ~               |                      |
| 24  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 38  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 49  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 57  | 4      | ~       | ~               | ~               | ~                           | ~                           |                 |                      |
| 58  | 4      | ~       | ~               | ~               | ~                           | ~                           |                 |                      |
| 64  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 70  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 76  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |

Table 25: UART0 RTS Configuration

### Table 26: UART0 CTS Configuration

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 6   | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | $\checkmark$         |
| 18  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ✓                    |
| 40  | 4      | ~       |                 | ~               |                             | ~                           | ~               | ✓                    |
| 51  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | $\checkmark$         |
| 59  | 4      | ~       | ~               | ~               | ~                           |                             |                 |                      |
| 65  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | $\checkmark$         |
| 71  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ✓                    |
| 77  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ✓                    |

## 9.3.4.3 UART1 TX/RX Connections

The UART data signals TX and RX may each be connected to several pads. Note that TX and RX are selected independently. Table 27 shows the connections for TX, which should have the corresponding INPENn and PULLCFGn fields clear. Table 28 shows the connections for RX, which must have the corresponding INPENn field set and should have the corresponding PULLCFGn field clear.

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 0   | 5      | ~       | ~               | $\checkmark$    | $\checkmark$                | ~                           | $\checkmark$    | $\checkmark$         |
| 12  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ✓                    |
| 20  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ✓                    |
| 25  | 5      |         |                 |                 |                             |                             | ~               | ✓                    |
| 41  | 5      |         |                 |                 |                             |                             | ~               | ✓                    |
| 45  | 5      |         |                 |                 |                             |                             | $\checkmark$    | ✓                    |
| 53  | 5      | ~       | ~               | ~               | $\checkmark$                |                             | $\checkmark$    |                      |
| 66  | 5      |         |                 |                 |                             |                             | ~               | ✓                    |
| 72  | 5      |         |                 |                 |                             |                             | $\checkmark$    | ✓                    |
| 78  | 5      |         |                 |                 |                             |                             | ~               | ~                    |

### Table 27: UART1 TX Configuration

#### Table 28: UART1 RX Configuration

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 2   | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 14  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 22  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 26  | 5      |         |                 |                 |                             |                             | ~               | ~                    |
| 36  | 5      | ~       |                 | ~               |                             |                             | ~               |                      |
| 43  |        |         |                 |                 |                             |                             | ~               | ~                    |
| 47  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 55  | 5      | ~       | ~               | ~               | ~                           |                             | ~               | ~                    |
| 62  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 68  | 5      |         |                 |                 |                             |                             | ~               | ~                    |

## 9.3.4.4 UART1 RTS/CTS Connections

The UART modem control signals RTS and CTS may each be connected to one of two pads. Note that RTS and CTS are selected independently. Table 29 shows the connections for RTS, which should have the corresponding INPENn and PULLCFGn fields clear. Table 30 shows the connections for CTS, which must have the corresponding INPENn field set and should have the corresponding PULLCFGn field clear.

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 4   | 5      | ~       | ~               | ~               | ~                           |                             | ~               |                      |
| 16  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 24  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ✓                    |
| 49  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 57  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 70  | 5      |         |                 |                 |                             |                             | ~               | ~                    |
| 76  | 5      |         |                 |                 |                             |                             | ~               | ~                    |
| 79  | 5      |         |                 |                 |                             |                             | ~               | ~                    |

### Table 29: UART1 RTS Configuration

#### Table 30: UART1 CTS Configuration

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 6   | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 18  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ✓                    |
| 27  | 5      |         |                 |                 |                             |                             | ~               | $\checkmark$         |
| 29  | 4      | ~       | ~               | ~               | ~                           | ~                           |                 |                      |
| 40  |        |         |                 |                 |                             |                             | ~               | ✓                    |
| 51  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ✓                    |
| 59  | 5      | ~       | ~               | ~               | ~                           |                             |                 |                      |
| 65  | 5      |         |                 |                 |                             |                             | ~               | ✓                    |
| 77  | 5      |         |                 |                 |                             |                             | ~               | ~                    |

### 9.3.4.5 UART2 TX/RX Connections

The UART data signals TX and RX may each be connected to several pads. Note that TX and RX are selected independently. Table 31 shows the connections for TX, which should have the corresponding INPENn and PULLCFGn fields clear. Table 32 shows the connections for RX, which must have the corresponding INPENn field set and should have the corresponding PULLCFGn field clear.

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 1   | 4      | ~       | ~               | ✓               | $\checkmark$                | $\checkmark$                | ✓               | ✓                    |
| 13  | 4      | ~       | ~               | $\checkmark$    | $\checkmark$                | ✓                           | ✓               | ✓                    |

#### Table 31: UART2 TX Configuration
| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 21  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 31  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 35  | 4      | ~       |                 | ~               |                             |                             | ~               |                      |
| 42  | 4      | ~       |                 | ~               |                             | ~                           | ~               | ~                    |
| 46  | 4      | ~       |                 | ~               |                             |                             | ~               |                      |
| 54  | 4      | ~       | ~               | ~               | ~                           |                             | ~               |                      |
| 61  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 67  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 73  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |

### Table 31: UART2 TX Configuration

### Table 32: UART2 RX Configuration

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 3   | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 11  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 23  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 33  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 34  | 5      |         |                 |                 |                             |                             | ~               |                      |
| 37  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 44  | 4      | ~       |                 | ~               |                             | ~                           | ~               | ~                    |
| 48  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 56  | 4      | ~       | ~               | ~               | ~                           | ~                           |                 |                      |
| 63  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 69  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 75  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |

## 9.3.4.6 UART2 RTS/CTS Connections

The UART modem control signals RTS and CTS may each be connected to one of two pads. Note that RTS and CTS are selected independently. Table 33 shows the connections for RTS, which should have the corresponding INPENn and PULLCFGn fields clear. Table 34 shows the connections for CTS, which must have the corresponding INPENn field set and should have the corresponding PULLCFGn field clear.

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 5   | 4      | ~       | ~               | $\checkmark$    | ~                           | ~                           | $\checkmark$    | ~                    |
| 39  | 4      | ~       |                 | ~               |                             | ~                           | ~               | ~                    |
| 50  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |

## Table 33: UART2 RTS Configuration

### Table 34: UART2 CTS Configuration

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 7   | 4      | ✓       | ~               | ✓               | ~                           | ~                           | ~               | ~                    |
| 19  | 4      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 28  | 4      | ~       | ~               | ~               | ~                           | ~                           |                 |                      |
| 31  | 5      |         |                 |                 |                             |                             | ~               | ~                    |
| 52  | 4      | ~       | ~               | ~               | ~                           |                             | ~               | ~                    |
| 64  | 5      |         |                 |                 |                             |                             | ~               | ~                    |
| 80  | 5      |         |                 |                 |                             |                             | ~               | ~                    |

## 9.3.4.7 UART3 TX/RX Connections

The UART data signals TX and RX may each be connected to several pads. Note that TX and RX are selected independently. Table 35 shows the connections for TX, which should have the corresponding INPENn and PULLCFGn fields clear. Table 36 shows the connections for RX, which must have the corresponding INPENn field set and should have the corresponding PULLCFGn field clear.

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 1   | 5      | ~       | ~               | ~               | ~                           | ~                           | $\checkmark$    | ~                    |
| 13  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 21  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 35  | 5      | ~       |                 | ~               |                             |                             | ~               |                      |
| 42  | 5      |         |                 |                 |                             |                             | ~               | ~                    |
| 46  | 5      | ~       |                 | ~               |                             |                             | ~               |                      |
| 54  | 5      | ~       | ~               | ~               | ~                           |                             | ~               |                      |
| 61  | 5      | ~       | ✓               | ~               | ~                           | ~                           | ~               | ✓                    |
| 67  | 5      |         |                 |                 |                             |                             | $\checkmark$    | ~                    |
| 73  | 5      |         |                 |                 |                             |                             | ~               | ~                    |

### Table 35: UART3 TX Configuration

#### Table 36: UART3 RX Configuration

| Pad | Value | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|-------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 3   | 5     | ~       | ~               | ✓               | ~                           | ✓                           | ~               | ~                    |
| 11  | 5     | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 15  | 5     | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 23  | 5     | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 37  | 5     |         |                 |                 |                             |                             | ~               | ~                    |
| 44  | 5     |         |                 |                 |                             |                             | ~               | ~                    |
| 48  | 5     | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 56  | 5     | ~       | ~               | ~               | ~                           | ~                           |                 |                      |
| 63  | 5     | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 69  | 5     |         |                 |                 |                             |                             | ~               | ~                    |
| 75  | 5     |         |                 |                 |                             |                             | ~               | ~                    |

## 9.3.4.8 UART3 RTS/CTS Connections

The UART modem control signals RTS and CTS may each be connected to one of two pads. Note that RTS and CTS are selected independently. Table 37 shows the connections for RTS, which should have the corresponding INPENn and PULLCFGn fields clear. Table 38 shows the connections for CTS, which must have the corresponding INPENn field set and should have the corresponding PULLCFGn field clear.

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 5   | 5      | ✓       | ~               | ~               | ~                           | ~                           | ~               | ✓                    |
| 17  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 39  | 5      |         |                 |                 |                             |                             | ~               | ~                    |
| 50  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ~                    |
| 58  | 5      | ~       | ~               | ~               | ~                           |                             |                 | ✓                    |
| 71  | 5      |         |                 |                 |                             |                             | ~               | ✓                    |
| 83  | 5      |         |                 |                 |                             |                             | ✓               | ✓                    |

### Table 37: UART3 RTS Configuration

#### Table 38: UART3 CTS Configuration

| Pad | FNCSEL | Apollo4 | Apollo4<br>Blue | Apollo4<br>Plus | Apollo4<br>Blue Plus<br>KBR | Apollo4<br>Blue Plus<br>KXR | Apollo4<br>Lite | Apollo4<br>Blue Lite |
|-----|--------|---------|-----------------|-----------------|-----------------------------|-----------------------------|-----------------|----------------------|
| 7   | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | $\checkmark$         |
| 19  | 5      | ~       | ~               | ~               | ~                           | ~                           | ~               | ✓                    |
| 32  | 5      |         |                 |                 |                             |                             |                 | ✓                    |
| 52  | 5      | ~       | ~               | ~               | ~                           |                             | ~               | ✓                    |
| 60  | 5      | ~       | ~               | ~               | ~                           |                             | ~               | ✓                    |
| 74  | 5      |         |                 |                 |                             |                             |                 | ✓                    |
| 82  | 5      |         |                 |                 |                             |                             |                 | $\checkmark$         |

### 9.3.5 Implementing Audio Connections

The audio signals of the PDM and I2S modules can be connected to a number of pads as described in the sub-sections below.

### 9.3.5.1 PDM Connections

PDM CLK and DATA signals for each available PDM are connected to separate GPIO pads. Note that CLK and DATA are enabled independently. Table 39 shows the connections for PDM CLK, which should have the corresponding INPENn and PULLCFGn fields clear. Table 40 shows the pins for PDM DATA, which must have the corresponding INPENn field set and the corresponding PULLCFGn field clear.

NOTE

The KXR package of the Apollo4 Blue Plus includes two PDMs, PDM0 and PDM3. The Apollo4 Lite and the Apollo4 Blue Lite include a single PDM, PDM0.

| PDM Instance | Field    | FNCSEL Value | Pad |
|--------------|----------|--------------|-----|
| PDM0         | FNCSEL50 | 0            | 50  |
| PDM1         | FNCSEL52 | 0            | 52  |
| PDM2         | FNCSEL54 | 0            | 54  |
| PDM3         | FNCSEL56 | 0            | 56  |

### Table 39: PDM CLK Configuration

#### Table 40: PDM DATA Configuration

| PDM Instance | Field    | FNCSEL Value | Pad |
|--------------|----------|--------------|-----|
| PDM0         | FNCSEL51 | 0            | 51  |
| PDM1         | FNCSEL53 | 0            | 53  |
| PDM2         | FNCSEL54 | 0            | 55  |
| PDM3         | FNCSEL57 | 0            | 57  |

### 9.3.5.2 I2S Connections

For each available I2S modules, the BCLK, WCLK and DAT signals may each be connected to one of multiple pads. Note that CLK, WS and DATA are selected independently.

- Table 41 shows the connections for I2S CLK, which should have the corresponding PADnINPEN set and the corresponding PULLCFGn field clear.
- Table 42 shows the connections for I2S WS, which should have the corresponding INPENn set and the PULLCFGn field clear.
- Table 43 shows the connections for I2S DATA, which must have the corresponding INPENn and PULL-CFGn fields clear.
- Table 44 shows the connections for I2S SDIN, which must have the corresponding INPENn and PULL-CFGn fields clear.
- Table 45 shows the connections for I2S SDOUT, which must have the corresponding INPENn and PULL-CFGn fields clear.

#### NOTE

The Apollo4 Lite and the Apollo4 Blue Lite include a single I2S module, I2S0. Disregard all I2S1 function selections in the tables below for these devices.

| I2S Instance | Field    | FNCSEL Value | Pad |
|--------------|----------|--------------|-----|
| 1280         | FNCSEL5  | 2            | 5   |
| 1280         | FNCSEL11 | 2            | 11  |
| 1280         | FNCSEL28 | 2            | 28  |
| 1280         | FNCSEL47 | 10           | 47  |
| I2S1         | FNCSEL5  | 10           | 5   |

### Table 41: I2S CLK Configuration

| I2S Instance | Field    | FNCSEL Value | Pad |
|--------------|----------|--------------|-----|
| I2S1         | FNCSEL16 | 2            | 16  |
| I2S1         | FNCSEL47 | 2            | 47  |
| I2S1         | FNCSEL61 | 2            | 61  |

## Table 41: I2S CLK Configuration

### Table 42: I2S WS Configuration

| Instance | Field    | FNCSEL Value | Pad |
|----------|----------|--------------|-----|
| 1280     | FNCSEL7  | 2            | 7   |
| 1280     | FNCSEL13 | 2            | 13  |
| 1280     | FNCSEL30 | 2            | 30  |
| 1280     | FNCSEL49 | 10           | 49  |
| I2S1     | FNCSEL7  | 10           | 7   |
| I2S1     | FNCSEL18 | 2            | 18  |
| I2S1     | FNCSEL49 | 2            | 49  |
| I2S1     | FNCSEL63 | 2            | 63  |

## Table 43: I2S DATA Configuration

| I2S Instance | Field    | FNCSEL Value | Pad |
|--------------|----------|--------------|-----|
| 1280         | FNCSEL6  | 2            | 6   |
| 1280         | FNCSEL12 | 2            | 12  |
| 1280         | FNCSEL29 | 2            | 29  |
| I2S1         | FNCSEL17 | 2            | 17  |
| I2S1         | FNCSEL48 | 2            | 48  |
| I2S1         | FNCSEL62 | 2            | 62  |

### Table 44: I2S SDIN Configuration

| I2S Instance | Field    | FNCSEL Value | Pad             |
|--------------|----------|--------------|-----------------|
| 12S0         | FNCSEL4  | 9            | 4 <sup>1</sup>  |
| I2S0         | FNCSEL14 | 9            | 14              |
| I2S0         | FNCSEL27 | 9            | 27              |
| 1250         | FNCSEL46 | 10           | 46 <sup>2</sup> |
| I2S1         | FNCSEL4  | 10           | 4 <sup>1</sup>  |
| I2S1         | FNCSEL19 | 9            | 19              |
| I2S1         | FNCSEL46 | 9            | 46 <sup>2</sup> |
| I2S1         | FNCSEL64 | 9            | 64              |

- 1. The BLE Controller in the Apollo4 Blue Plus KXR package and the Apollo4 Blue Lite uses GPIO04 for its 32 kHz clock. This pad cannot be configured for the I2S0/I2S1 SDIN function or any other function.
- 2. The BLE Controller in the Apollo4 Blue Plus KXR package and the Apollo4 Blue Lite uses GPIO46 for its 32 MHz clock. This pad cannot be configured for the I2S0/I2S1 SDIN function or any other function.

| I2S Instance | Field    | FNCSEL Value | Pad |
|--------------|----------|--------------|-----|
| 12S0         | FNCSEL6  | 9            | 6   |
| I2S0         | FNCSEL12 | 10           | 12  |
| I2S0         | FNCSEL29 | 9            | 29  |
| I2S0         | FNCSEL48 | 10           | 48  |
| I2S1         | FNCSEL6  | 10           | 6   |
| I2S1         | FNCSEL17 | 9            | 17  |
| I2S1         | FNCSEL48 | 9            | 48  |
| I2S1         | FNCSEL62 | 9            | 62  |

#### Table 45: I2S SDOUT Configuration

### 9.3.6 Implementing Secure Digital IO Connections

The signals for the Secure Digital Input/Output Interface (SDIO) for connection to SD/SDIO/MMC devices can be connected to one set of pads on the device. Table 46 shows the pads used for SDIO data lines, clock and command signals. The pads for the clock (CLKOUT) and command (CMD) signal should have the corresponding INPENn and PULLCFGn fields clear. The pads for the data lines should have the corresponding INPENn field set and should have the corresponding PULLCFGn field clear.

| Signal      | Field    | FNCSEL Value | Pad             |
|-------------|----------|--------------|-----------------|
| SDIF_DATA0  | FNCSEL84 | 2            | 84              |
| SDIF_DATA1  | FNCSEL85 | 2            | 85              |
| SDIF_DATA2  | FNCSEL86 | 2            | 86              |
| SDIF_DATA3  | FNCSEL87 | 2            | 87              |
| SDIF_DATA4  | FNCSEL79 | 2            | 79              |
| SDIF_DATA5  | FNCSEL80 | 2            | 80              |
| SDIF_DATA6  | FNCSEL81 | 2            | 81              |
| SDIF_DATA7  | FNCSEL82 | 2            | 82              |
| SDIF_CMD    | FNCSEL83 | 2            | 83 <sup>1</sup> |
| SDIF_CLKOUT | FNCSEL88 | 2            | 88              |

#### Table 46: SDIO/SDIF Configuration

1. SDIF\_CMD is also offered on FNCSEL 2 of pad 91 on Apollo4 Lite and Apollo4 Blue Lite

### 9.3.7 Implementing GPIO Connections

Each pad of the device can be configured as a GPIO port by setting FNCSELn to 3. INPENn and PULLCFGn must be set appropriately depending on the specific GPIO function.

### 9.3.8 Implementing CLKOUT Connections

The flexible clock output of the Clock Generator module, CLKOUT, may be configured on several pads as shown in Table 47. INPENn and PULLCFGn should be cleared in each case.

### Table 47: CLKOUT Configuration

### 9.3.9 Implementing 32 kHz CLKOUT Connections

In addition to the CLKOUT mux output, there is also a dedicated 32 kHz clock output. This clock is primarily for leveraging the 32 kHz oscillator clock from the device. This clock output may be configured on several pads as shown in Table 48. INPENn and PULLCFGn should be cleared in each case.

| Field    | FNCSEL Value | Pad             |
|----------|--------------|-----------------|
| FNCSEL4  | 2            | 4 <sup>1</sup>  |
| FNCSEL37 | 2            | 37              |
| FNCSEL45 | 2            | 45 <sup>2</sup> |
| FNCSEL64 | 1            | 64              |
| FNCSEL65 | 1            | 65              |
| FNCSEL69 | 1            | 69              |
| FNCSEL70 | 1            | 70              |
| FNCSEL75 | 1            | 75              |
| FNCSEL76 | 1            | 76              |
| FNCSEL82 | 1            | 82              |
| FNCSEL83 | 1            | 83              |

### Table 48: 32 kHz CLKOUT Configuration

- The BLE Controller in the Apollo4 Blue Plus KXR package and the Apollo4 Blue Lite uses GPIO04 for its 32 kHz clock. This pad cannot be configured for the 32KHzXT clock output or any other function.
- 2. GPIO45 is not pinned out on the Apollo4 Blue Plus KBR package and cannot be used as the 32KHzXT clock output.

### 9.3.10 Implementing 32 MHz CLKOUT Connections

There is also a dedicated 32 MHz clock output (CLKOUT\_32M). This clock is primarily for leveraging the high-speed 32 MHz oscillator clock from the device. This clock output may be configured on one pad as shown in Table 49. INPENn and PULLCFGn should be cleared in this case.

#### Table 49: CLKOUT\_32M Configuration

| Field    | Value | Pad             |
|----------|-------|-----------------|
| FNCSEL46 | 2     | 46 <sup>1</sup> |

1. GPIO46 is not pinned out on either package of the Apollo4 Blue Plus or the Apollo4 Blue Lite.

### 9.3.11 Implementing ADC Connections

Three types of pad connections may be made for the ADC module. Up to eight pads may be selected from and configured as the analog inputs, as shown in Table 50. The ADCREF reference voltage input is supplied on a dedicated input pin.

If an external digital trigger is desired, up to forty selectable pad choices may be selected from and configured, as shown in Table 51. For the trigger inputs, INPENn must be set. For other inputs, INPENn should be cleared. PULLCFGn should be cleared except in the case of an open drain trigger input.

| Field    | FNCSEL Value | Input  | Pad |
|----------|--------------|--------|-----|
| FNCSEL19 | 0            | ADCSE0 | 19  |
| FNCSEL18 | 0            | ADCSE1 | 18  |
| FNCSEL17 | 0            | ADCSE2 | 17  |
| FNCSEL16 | 0            | ADCSE3 | 16  |
| FNCSEL15 | 0            | ADCSE4 | 15  |
| FNCSEL14 | 0            | ADCSE5 | 14  |
| FNCSEL13 | 0            | ADCSE6 | 13  |
| FNCSEL12 | 0            | ADCSE7 | 12  |

#### Table 50: ADC Analog Input Configuration

#### Table 51: ADC Trigger Input Configuration

| Field    | FNCSEL Value | Input | Pad |
|----------|--------------|-------|-----|
| FNCSEL7  | 1            | TRIG0 | 7   |
| FNCSEL11 | 1            | TRIG0 | 11  |
| FNCSEL15 | 1            | TRIG0 | 15  |

| Field    | FNCSEL Value | Input | Pad              |
|----------|--------------|-------|------------------|
| FNCSEL27 | 1            | TRIG0 | 27               |
| FNCSEL29 | 0            | TRIG0 | 29               |
| FNCSEL36 | 1            | TRIG0 | 36 <sup>1</sup>  |
| FNCSEL41 | 1            | TRIG0 | 41               |
| FNCSEL49 | 1            | TRIG0 | 49               |
| FNCSEL50 | 1            | TRIG0 | 50               |
| FNCSEL54 | 1            | TRIG0 | 54               |
| FNCSEL59 | 1            | TRIG0 | 59 <sup>2</sup>  |
| FNCSEL2  | 2            | TRIG1 | 2                |
| FNCSEL8  | 1            | TRIG1 | 8                |
| FNCSEL12 | 1            | TRIG1 | 12               |
| FNCSEL16 | 1            | TRIG1 | 16               |
| FNCSEL20 | 1            | TRIG1 | 20               |
| FNCSEL30 | 0            | TRIG1 | 30               |
| FNCSEL37 | 1            | TRIG1 | 37               |
| FNCSEL40 | 1            | TRIG1 | 40 <sup>3</sup>  |
| FNCSEL44 | 1            | TRIG1 | 44 <sup>4</sup>  |
| FNCSEL51 | 1            | TRIG1 | 51               |
| FNCSEL55 | 1            | TRIG1 | 55 <sup>5</sup>  |
| FNCSEL60 | 1            | TRIG1 | 60 <sup>6</sup>  |
| FNCSEL9  | 1            | TRIG2 | 9                |
| FNCSEL13 | 1            | TRIG2 | 13               |
| FNCSEL17 | 1            | TRIG2 | 17               |
| FNCSEL21 | 1            | TRIG2 | 21               |
| FNCSEL38 | 1            | TRIG2 | 38               |
| FNCSEL42 | 1            | TRIG2 | 42 <sup>7</sup>  |
| FNCSEL45 | 1            | TRIG2 | 45 <sup>8</sup>  |
| FNCSEL52 | 1            | TRIG2 | 52 <sup>9</sup>  |
| FNCSEL56 | 1            | TRIG2 | 56 <sup>10</sup> |
| FNCSEL10 | 1            | TRIG3 | 10               |
| FNCSEL14 | 1            | TRIG3 | 14               |
| FNCSEL24 | 1            | TRIG3 | 24               |
| FNCSEL39 | 1            | TRIG3 | 39 <sup>11</sup> |
| FNCSEL43 | 1            | TRIG3 | 43 <sup>12</sup> |
| FNCSEL46 | 1            | TRIG3 | 46 <sup>13</sup> |
| FNCSEL53 | 1            | TRIG3 | 53 <sup>14</sup> |

Table 51: ADC Trigger Input Configuration

#### Table 51: ADC Trigger Input Configuration

#### 9.3.12 Implementing Voltage Comparator Connections

Two types of pad connections may be made for the Voltage Comparator (VCOMP) module. Three reference voltages may be used for the comparator negative input as shown in Table 52. The voltage to be applied to the comparator positive input are shown in Table 53. In each case INPENn and PULLCFGn should be cleared. The voltage to be output from the comparator to an output pin are shown in Table 54. In each case INPENn and PULLCFGn should be cleared.

| Field    | FNCSEL Value | Input  | Pad |
|----------|--------------|--------|-----|
| FNCSEL9  | 0            | CMPRF0 | 9   |
| FNCSEL8  | 0            | CMPRF1 | 8   |
| FNCSEL12 | 9            | CMPRF2 | 12  |

#### Table 52: Voltage Comparator Reference Configuration

| Field    | FNCSEL Value | Input  | Pad |
|----------|--------------|--------|-----|
| FNCSEL10 | 0            | CMPIN0 | 10  |
| FNCSEL11 | 0            | CMPIN1 | 11  |

### Table 53: Voltage Comparator Input Configuration

### Table 54: Voltage Comparator Output Configuration

| Field    | FNCSEL Value | Output | Pad              |
|----------|--------------|--------|------------------|
| FNCSEL0  | 9            | VCMPO  | 0                |
| FNCSEL1  | 9            | VCMPO  | 1                |
| FNCSEL2  | 9            | VCMPO  | 2                |
| FNCSEL22 | 9            | VCMPO  | 22               |
| FNCSEL23 | 9            | VCMPO  | 23               |
| FNCSEL26 | 9            | VCMPO  | 26               |
| FNCSEL28 | 1            | VCMPO  | 28 <sup>1</sup>  |
| FNCSEL29 | 1            | VCMPO  | 29 <sup>2</sup>  |
| FNCSEL30 | 1            | VCMPO  | 30 <sup>3</sup>  |
| FNCSEL31 | 9            | VCMPO  | 31               |
| FNCSEL34 | 9            | VCMPO  | 34 <sup>4</sup>  |
| FNCSEL35 | 9            | VCMPO  | 35 <sup>5</sup>  |
| FNCSEL44 | 9            | VCMPO  | 44 <sup>6</sup>  |
| FNCSEL52 | 9            | VCMPO  | 52 <sup>7</sup>  |
| FNCSEL57 | 9            | VCMPO  | 57 <sup>8</sup>  |
| FNCSEL72 | 9            | VCMPO  | 72               |
| FNCSEL90 | 9            | VCMPO  | 90 <sup>9</sup>  |
| FNCSEL91 | 9            | VCMPO  | 91               |
| FNCSEL92 | 9            | VCMPO  | 92 <sup>10</sup> |
| FNCSEL93 | 9            | VCMPO  | 93               |
| FNCSEL94 | 9            | VCMPO  | 94 <sup>11</sup> |

1. GPIO28 is not pinned out on the Apollo4 Lite or Apollo4 Blue Lite.

2. GPIO29 is not pinned out on the Apollo4 Lite or Apollo4 Blue Lite.

3. GPIO30 is not pinned out on the Apollo4 Lite or Apollo4 Blue Lite.

4. GPIO34 is not pinned out on the Apollo4 Blue, either package of the

Apollo4 Blue Plus or the Apollo4 Blue Lite. 5. GPIO35 is not pinned out on the Apollo4 Blue, either package of the

Apollo4 Blue Plus or the Apollo4 Blue Lite. 6. GPIO44 is not pinned out on the Apollo4 Blue or the KBR package

 GP1044 is not plinned out on the Apollo4 Blue or the KBR package of the Apollo4 Blue Plus.

7. GPI052 is not pinned out on the Apollo4 Blue Plus KXR package or the Apollo4 Blue Lite.

- 8. GPIO57 is not pinned out on the Apollo4 Lite or Apollo4 Blue Lite.
- 9. GPIO90 is not pinned out on the Apollo4 Blue Plus KXR package or the Apollo4 Blue Lite.
- 10.GPIO92 is not pinned out on either package of the Apollo4 Blue Plus, the Apollo4 Lite or the Apollo4 Blue Lite.
- 11.GPIO94 is not pinned out on either package of the Apollo4 Blue
  - Plus, the Apollo4 Lite or the Apollo4 Blue Lite.

### 9.3.13 Implementing the Software Debug Port Connections

The software debug clock (SWDCK) and data (SWDIO) must be connected on pads 20 and 21 respectively. FNCSEL20 and FNCSEL21 must be set to 0, INPEN20 and INPEN21 must be set, and PULLCFG20 and PULLCFG21 must be set, which results in a default state of SWDCK low and SWDIO high.

The optional continuous output signal SWO may be configured on a variety of pads as shown in Table 55, and INPENn and PULLCFGn should be cleared for the selected pad.

| Field    | FNCSEL Value | Pad             |
|----------|--------------|-----------------|
| FNCSEL3  | 2            | 3               |
| FNCSEL22 | 2            | 22              |
| FNCSEL23 | 2            | 23              |
| FNCSEL24 | 2            | 24              |
| FNCSEL28 | 0            | 28 <sup>1</sup> |
| FNCSEL34 | 2            | 34 <sup>2</sup> |
| FNCSEL35 | 2            | 35 <sup>3</sup> |
| FNCSEL36 | 2            | 36 <sup>4</sup> |
| FNCSEL41 | 9            | 41 <sup>5</sup> |
| FNCSEL44 | 2            | 44 <sup>6</sup> |
| FNCSEL56 | 2            | 56 <sup>7</sup> |
| FNCSEL57 | 2            | 57 <sup>8</sup> |
| FNCSEL64 | 2            | 64              |
| FNCSEL65 | 2            | 65              |
| FNCSEL66 | 2            | 66              |
| FNCSEL67 | 2            | 67              |
| FNCSEL68 | 1            | 68              |
| FNCSEL69 | 2            | 69              |
| FNCSEL79 | 4            | 79              |

#### Table 55: SWO Configuration

1. GPIO28 is not pinned out on the Apollo4 Lite or Apollo4 Blue Lite.

2. GPIO34 is not pinned out on the Apollo4 Blue, either package of the Apollo4 Blue Plus or the Apollo4 Blue Lite.

- 3. GPIO35 is not pinned out on the Apollo4 Blue, either package of the Apollo4 Blue Plus or the Apollo4 Blue Lite.
- 4. GPIO36 is not pinned out on the Apollo4 Blue, either package of the Apollo4 Blue Plus or the Apollo4 Blue Lite.
- 5. GPIO41 is not pinned out on the Apollo4 Blue or the KBR package of the Apollo4 Blue Plus.
- 6. GPIO44 is not pinned out on the Apollo4 Blue or the KBR package of the Apollo4 Blue Plus.
- 7. GPIO56 is not pinned out on the Apollo4 Lite or the Apollo4 Blue Lite.
- 8. GPI057 is not pinned out on the Apollo4 Llte or the Apollo4 Blue Lite.

Optional debug trace output signals, SWTRACE0-SWTRACE3, as well as the trace clock (SWTRACECLK) and trace control signal (SWTRACECTL), may be configured on a variety of pads as shown in Table 56. INPENn and PULLCFGn should be cleared for the selected pads.

| Field    | FNCSEL Value | Signal     | Pad             |
|----------|--------------|------------|-----------------|
| FNCSEL1  | 0            | SWTRACE0   | 1               |
| FNCSEL39 | 2            | SWTRACE0   | 39 <sup>1</sup> |
| FNCSEL51 | 2            | SWTRACE0   | 51              |
| FNCSEL70 | 2            | SWTRACE0   | 70              |
| FNCSEL80 | 4            | SWTRACE0   | 80              |
| FNCSEL2  | 0            | SWTRACE1   | 2               |
| FNCSEL40 | 2            | SWTRACE1   | 40 <sup>2</sup> |
| FNCSEL52 | 2            | SWTRACE1   | 52 <sup>3</sup> |
| FNCSEL71 | 2            | SWTRACE1   | 71              |
| FNCSEL81 | 4            | SWTRACE1   | 81              |
| FNCSEL3  | 0            | SWTRACE2   | 3               |
| FNCSEL41 | 2            | SWTRACE2   | 41 <sup>4</sup> |
| FNCSEL53 | 2            | SWTRACE2   | 53 <sup>5</sup> |
| FNCSEL72 | 2            | SWTRACE2   | 72              |
| FNCSEL82 | 4            | SWTRACE2   | 82              |
| FNCSEL4  | 0            | SWTRACE3   | 4 <sup>6</sup>  |
| FNCSEL42 | 2            | SWTRACE3   | 42 <sup>7</sup> |
| FNCSEL54 | 2            | SWTRACE3   | 54              |
| FNCSEL73 | 2            | SWTRACE3   | 73              |
| FNCSEL83 | 4            | SWTRACE3   | 83              |
| FNCSEL0  | 0            | SWTRACECLK | 0               |
| FNCSEL38 | 2            | SWTRACECLK | 38              |

#### Table 56: SW Trace Configuration

| Field    | FNCSEL Value | Signal     | Pad             |
|----------|--------------|------------|-----------------|
| FNCSEL50 | 2            | SWTRACECLK | 50              |
| FNCSEL43 | 0            | SWTRACECTL | 43 <sup>8</sup> |
| FNCSEL55 | 0            | SWTRACECTL | 55 <sup>9</sup> |

#### Table 56: SW Trace Configuration

1. GPIO39 is not pinned out on the Apollo4 Blue or the KBR package of the Apollo4 Blue Plus.

- 2. GPIO40 is not pinned out on the Apollo4 Blue or the KBR package of the Apollo4 Blue Plus.
- 3. GPI052 is not pinned out on the Apollo4 Blue Plus KXR package or the Apollo4 Blue Lite.
- 4. GPIO41 is not pinned out on the Apollo4 Blue or the KBR package of the Apollo4 Blue Plus.
- 5. GPI053 is not pinned out on the Apollo4 Blue Plus KXR package or the Apollo4 Blue Lite.
- 6. The BLE Controller in the Apollo4 Blue Plus KXR package and the Apollo4 Blue Lite uses GPIO04 for its 32 kHz clock. This pad cannot be configured for the 32KHzXT clock output or any other function.
- 7. GPIO42 is not pinned out on the Apollo4 Blue or the KBR package of the Apollo4 Blue Plus.
- 8. GPIO43 is not pinned out on the Apollo4 Blue or the KBR package of the Apollo4 Blue Plus.
- 9. GPI055 is not pinned out on the Apollo4 Blue Plus KXR package or the Apollo4 Blue Lite.

## 9.3.14 Fast GPIO

### 9.3.14.1 Description

Access to GPIO pin registers on the device can be multiple CPU cycles to complete. To support certain functions that require shorter latency access, a fast GPIO interface is supported. The fast GPIO is accessed via the fast GPIO registers shown in the next section.

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

# **10. General Purpose ADC and Temperature Sensor Module (GPADC)**

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

## **10.1 Clock Source and Divider**

When the GPADC block is enabled and has an active scan in progress, it requests a clock source. There is an automatic hardware hand shake between the clock generator and the GPADC. If the GPADC is the only block requesting an HFRC based clock, then the HFRC will be automatically started. The GPADC can be configured to completely power down the HFRC between scans if the startup latency is acceptable or it can leave the HFRC powered on between scans if the application requires low latency between successive conversions.

The GPADC on all Apollo4 family SoCs offers four options for the reference clock source and frequency via the GPADC's CFG\_CLKSEL field:

- 1. HFRC at 48 MHz (default setting)
- 2. Inverted HFRC at 48 MHz
- 3. HFRC at 24 MHz
- 4. HFRC2 at 48 MHz

NOTE

Due to errata ERR091 and ERR113, only the 24 MHz HFRC setting (CLKSEL = 0x2) is supported. The other three clock options should not be used.

Also, setting additional input signal sampling/tracking time of at least 32 ADC clock cycles (SLnCFG\_TRKCYCn = 32), for a total minimum of 37 sampling/ tracking cycles, should be used.

## **10.2 Eleven Selectable Analog Input Channels**

The GPADC block contains an 11 channel analog multiplexer on the input port to the analog to digital converter. Eight (8) of the GPIO pins on the MCU can be selected as analog inputs to the GPADC through a combination of settings in the PAD configuration registers in the GPIO block and settings in the configuration registers described below.

The analog mux channels are connected as follows:

- 1. ADC\_EXT0 external GPIO pin connection.
- 2. ADC\_EXT1 external GPIO pin connection.
- 3. ADC\_EXT2 external GPIO pin connection.
- 4. ADC\_EXT3 external GPIO pin connection.
- 5. ADC EXT4 external GPIO pin connection.
- 6. ADC EXT5 external GPIO pin connection.
- 7. ADC EXT6 external GPIO pin connection.
- 8. ADC\_EXT7 external GPIO pin connection.
- 9. ADC\_TEMP internal temperature sensor.
- 10. ADC\_DIV3 internal voltage divide by 3 connection to the input power rail.
- 11. ADC\_VSS internal ground connection.

Refer to the detailed register information below for the exact coding of the channel selection bit field. Also the use of the voltage divider and switchable load resistor are detailed below.

## **10.3 Triggering and Trigger Sources**

The GPADC block can be initially triggered from one of six sources. Once triggered, it can be repetitively triggered from timer 7 in the Timer Module or via the GPADC's internal repeating trigger timer. Four of the GPIO pins on the MCU can be selected as trigger inputs to the GPADC through a combination of settings in the PAD configuration registers in the GPIO block and settings in SLOT configuration registers described below. trigger sources are as follows:

- 0. ADC\_EXT0 (TRIG0) external GPIO pin connection.
- 1. ADC\_EXT1 (TRIG1) external GPIO pin connection.
- 2. ADC\_EXT2 (TRIG2) external GPIO pin connection.
- 3. ADC\_EXT3 (TRIG3) external GPIO pin connection.
- 4. VCOMP Voltage Comparator trigger.
- 5. <Reserved>
- 6. <Reserved>
- 7. ADC\_SWT software trigger.

Refer to the CFG Register in the GPADC register set for the Apollo4. The initial trigger source is selected in the TRIGSEL field. In addition, one can select a trigger polarity in this register applicable for any of the trigger sources except the software trigger. A number of GPIO pin trigger sources are provided to allow pin configuration flexibility at the system definition and board layout phases of development.

The software trigger is effected by writing 0x37 to the to the SWT field of the Software Trigger Register in the GPADC block. Note that writing 0x37 to the SWT field will initiate a scan regardless of which trigger source is selected. However, a hardware trigger source will not initiate a scan if the software trigger has been selected (in the TRIGSEL field of the CFG Register).

When the GPADC is configured for repeat mode, the initial trigger must be initiated by a software trigger and subsequent scans will be initiated at a repeating rate set by the TIMER7 or the GPADC-internal repeating trigger timer. The discussion of the use of TIMER7 or GPADC-internal timer as a source for repetitive triggering is deferred until later in this chapter.

NOTE

A trigger event applies to all enabled slots as a whole. Individual slots cannot be separately triggered.

## **10.4 Voltage Reference Source**

The GPADC supports one internal reference source to be used for the analog to digital conversion step. The reference voltage is 1.19 V and is not user settable. GPADC input voltages > 1.19 V exceed the GPADC range and return full scale code, but will not damage GPADC inputs.

## 10.5 Eight Automatically Managed Conversion Slots

The GPADC block contains eight conversion slot control registers, one for each of the eight slots. These can be thought of as time slots in the conversion process. When a slot is enabled, it participates in a conversion cycle. The GPADC's mode controller cycles through up to eight time slots each time it is triggered. For each slot that is enabled, a conversion cycle is performed based on the settings in the slot

configuration register for that slot. Slots are enabled when the LSB of the slot configuration is set to one. See "One SLOT Configuration Register" on page 89.

| 3 | 3 | 2    | 2   | 2 | 2  | 2                       | 2         | 2 | 2 | 2 | 2 | 1 | 1    | 1    | 1 | 1 | 1 | 1 | 1 | 1 | 1         | 0          | 0       | 0 | 0 | 0    | 0    | 0 | 0 | 0           | 0           |
|---|---|------|-----|---|----|-------------------------|-----------|---|---|---|---|---|------|------|---|---|---|---|---|---|-----------|------------|---------|---|---|------|------|---|---|-------------|-------------|
| 1 | 0 | 9    | 8   | 7 | 6  | 5                       | 4         | 3 | 2 | 1 | 0 | 9 | 8    | 7    | 6 | 5 | 4 | 3 | 2 | 1 | 0         | 9          | 8       | 7 | 6 | 5    | 4    | 3 | 2 | 1           | 0           |
|   | R | eser | ved |   | Sa | #<br>ampl<br>to<br>ccur | les<br>m. |   |   |   |   | F | Rese | erve | d |   |   |   |   | C | HA<br>SEL | NNE<br>EC1 | iL<br>T |   | F | Rese | erve | d |   | WINDOW_COMP | SLOT_ENABLE |

### Table 57: One SLOT Configuration Register

The window comparator enable is discussed in a subsequent section below. See "Window Comparator" on page 92. The number of samples to accumulate is explained in a subsequent section. See "Automatic Sample Accumulation and Scaling" on page 89.

As described above, the channel select bit field specifies which one of the analog multiplexer channels will be used for the conversions requested for an individual slot. See "Eleven Selectable Analog Input Channels" on page 87.

Each of the eight conversion slots can independently specify:

- Analog Multiplexer Channel Selection
- Participation in Window Comparisons
- Automatic Sample Accumulation

## **10.6 Automatic Sample Accumulation and Scaling**

The GPADC block offers a facility for the automatic accumulation of samples without requiring core involvement. Thus up to 128 samples per slot can be accumulated without waking the core. This facilitates averaging algorithms to smooth out the data samples. Each slot can request from 1 to 128 samples to be accumulated before producing a result in the FIFO.

| NOTE                                                                  |
|-----------------------------------------------------------------------|
| Each slot can independently specify how many samples to accumulate so |
| results can enter the FIFO from different slots at different rates.   |

All slots write their accumulated results to the FIFO in exactly the same format regardless of how many samples were accumulated to produce the results. The precision mode for each determines the format for the FIFO data. 12-bit, 10-bit and 8-bit precision modes correspond to 12.6, 10.6 and 8.6 formats, respectively.

NOTE

If the accumulation control for a slot is set for two samples with 8-bit precision, then the 8-bit average integer value will be placed in bits 6 through 13, the 1 bit fractional number is placed in bit 5 and the lower 5 fractional bits are zero'd.

Each slot contains a 21-bit accumulator as shown in Table 58. When the GPADC is triggered for the last sample of an accumulation, the accumulator is cleared and the FIFO will be written with the final average value. When each active slot obtains a sample from the GPADC, it is added to the value in its accumulator.

If a slot is set to accumulate 128 samples per result then the accumulator could reach a maximum value of:

 $128^{(2^{14}-1)} = 128^{(16383)} = 2097024 = 2^{21} - 128$ , hence the 21 bit accumulator.

| 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1    | 1   | 0     | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|---|---|---|---|---|---|---|---|---|------|-----|-------|---|---|---|---|---|---|---|---|---|
| 0 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1    | 0   | 9     | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
|   |   |   |   |   |   |   |   |   | Acci | umu | lator | • |   |   |   |   |   |   |   |   |

#### Table 58: Per Slot Sample Accumulator

Table 59 shows the maximum possible 10-bit accumulated values. Note that for the 128 sample accumulation case, the LSB of the accumulator is discarded when the results are written to the FIFO.

| # Samples | 1<br>6 | 1<br>5 | 1<br>4 | 1<br>3 | 1<br>2     | 1<br>1 | 1<br>0 | 0<br>9 | 0<br>8 | 0<br>7 | 0<br>6 | 0<br>5 | 0<br>4 | 0<br>3 | 0<br>2 | 0<br>1 | 0<br>0 |  |  |  |  |  |  |  |
|-----------|--------|--------|--------|--------|------------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--|--|--|--|--|--|--|
| 128       |        |        |        |        |            |        |        | 10     | .6     |        |        |        |        |        |        |        | 0      |  |  |  |  |  |  |  |
| 64        | Х      |        |        |        |            |        |        |        | 10     | ).6    |        |        |        |        |        |        |        |  |  |  |  |  |  |  |
| 32        | Х      | Х      |        | 10.5   |            |        |        |        |        |        |        |        |        |        |        |        |        |  |  |  |  |  |  |  |
| 16        | Х      | Х      | Х      |        |            |        |        |        |        | 10     | ).4    |        |        |        |        |        |        |  |  |  |  |  |  |  |
| 8         | Х      | Х      | Х      | Х      |            |        |        |        |        |        | 10.3   |        |        |        |        |        |        |  |  |  |  |  |  |  |
| 4         | Х      | Х      | Х      | Х      | Х          |        |        |        |        |        | 10     | ).2    |        |        |        |        |        |  |  |  |  |  |  |  |
| 2         | Х      | Х      | Х      | Х      | X X X 10.1 |        |        |        |        |        |        | X 10.1 |        |        |        |        |        |  |  |  |  |  |  |  |
| 1         | Х      | Х      | Х      | Х      | Х          | X X 10 |        |        |        |        |        |        |        |        |        |        |        |  |  |  |  |  |  |  |

#### Table 59: Accumulator Scaling

## **10.7 Sixteen Entry Result FIFO**

All results written to the FIFO have exactly the same format as shown in Table 60. The properly scaled accumulation results are written the lower half word in the aforementioned format. Since each slot can produce results at a different rate, the slot number generating the result is also written to the FIFO along with the total valid entry count within the FIFO.

#### Table 60: FIFO Register



| # Samples | 1<br>9 | 1<br>8 | 1<br>7 | 1<br>6 | 1<br>5 | 1<br>4 | 1<br>3 | 1<br>2 | 1<br>1 | 1<br>0 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
|-----------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|---|---|---|---|---|---|---|---|---|---|
| 128       | 0      | 0      |        |        |        |        |        | 12     | 2.6    |        |   |   |   |   |   |   |   |   |   |   |
| 64        | 0      | 0      | 12.6   |        |        |        |        |        |        |        |   |   |   |   |   |   |   |   |   |   |
| 32        | 0      | 0      | 12.5   |        |        |        |        |        |        |        |   |   |   |   |   |   |   |   | Х |   |
| 16        | 0      | 0      |        | 12.4   |        |        |        |        |        |        |   |   |   |   |   |   |   | Х | Х |   |
| 8         | 0      | 0      |        |        |        |        |        | 12     | 2.3    |        |   |   |   |   |   |   |   | Х | Х | Х |
| 4         | 0      | 0      | 12.2   |        |        |        |        |        |        |        |   |   |   | Х | Х | Х | Х |   |   |   |
| 2         | 0      | 0      |        |        |        |        |        | 12     | 2.1    |        |   |   |   |   |   | Х | Х | Х | Х | Х |
| 1         | 0      | 0      | 12     |        |        |        |        |        |        |        |   |   | х | х | Х | Х | Х | Х |   |   |

### Table 61: 12-bit FIFO Data Format

Table 62: 10-bit FIFO Data Format

| # Samples | 1<br>9 | 1<br>8 | 1<br>7 | 1<br>6 | 1<br>5 | 1<br>4 | 1<br>3 | 1<br>2 | 1<br>1 | 1<br>0 | 9  | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |  |
|-----------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|----|---|---|---|---|---|---|---|---|---|--|
| 128       | 0      | 0      | 0      | 0      |        | 10     |        |        |        |        |    |   |   |   |   |   | 6 | 6 |   |   |  |
| 64        | 0      | 0      | 0      | 0      | 0 10   |        |        |        |        |        | 10 |   |   |   |   |   |   |   |   |   |  |
| 32        | 0      | 0      | 0      | 0      |        | 10     |        |        |        |        |    |   |   |   |   |   | 5 |   |   | Х |  |
| 16        | 0      | 0      | 0      | 0      |        |        |        |        | 1      | 0      |    |   |   |   |   | 4 |   |   | Х | Х |  |
| 8         | 0      | 0      | 0      | 0      |        |        |        |        | 1      | 0      |    |   |   |   |   | 3 |   | Х | Х | Х |  |
| 4         | 0      | 0      | 0      | 0      |        | 10     |        |        |        |        |    |   |   |   |   | 2 | Х | Х | Х | Х |  |
| 2         | 0      | 0      | 0      | 0      |        |        |        |        | 1      | 0      |    |   |   |   | 1 | Х | Х | Х | Х | Х |  |
| 1         | 0      | 0      | 0      | 0      |        | 10     |        |        |        |        |    |   |   |   | Х | Х | Х | Х | Х | Х |  |

### Table 63: 8-bit FIFO Data Format

| # Samples | 1<br>9 | 1<br>8 | 1<br>7 | 1<br>6 | 1<br>5 | 1<br>4 | 1<br>3    | 1<br>2 | 1<br>1 | 1<br>0 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
|-----------|--------|--------|--------|--------|--------|--------|-----------|--------|--------|--------|---|---|---|---|---|---|---|---|---|---|
| 128       | 0      | 0      | 0      | 0      | 0      | 0      | 8.6       |        |        |        |   |   |   |   |   |   |   |   |   |   |
| 64        | 0      | 0      | 0      | 0      | 0      | 0      | 8.6       |        |        |        |   |   |   |   |   |   |   |   |   |   |
| 32        | 0      | 0      | 0      | 0      | 0      | 0      | 8.5       |        |        |        |   |   |   |   |   | Х |   |   |   |   |
| 16        | 0      | 0      | 0      | 0      | 0      | 0      | 8.4       |        |        |        |   |   | Х | Х |   |   |   |   |   |   |
| 8         | 0      | 0      | 0      | 0      | 0      | 0      |           | 8.3    |        |        |   |   |   |   |   | Х | Х | Х |   |   |
| 4         | 0      | 0      | 0      | 0      | 0      | 0      |           | 8.2    |        |        |   |   |   |   |   | Х | Х | Х | Х |   |
| 2         | 0      | 0      | 0      | 0      | 0      | 0      | 8.1 X X X |        |        |        |   |   | Х | Х |   |   |   |   |   |   |
| 1         | 0      | 0      | 0      | 0      | 0      | 0      | 8 X X X   |        |        |        |   |   |   | Х | Х | Х |   |   |   |   |

Software accesses the contents of the FIFO through the ADCFIFO register. This register will be written by the GPADC digital controller simultaneous with the conversion complete interrupt (if enabled) after accumulating the number of samples to average configured for the slot. The ADCFIFO register contains the earliest written data, the number of valid entries within the FIFO and the slot number associated with the FIFO data. Thus the interrupt handler servicing GPADC interrupts can easily distribute results to different RTOS tasks by simply looking up the target task using the slot number from the FIFO register.

#### NOTE

After a conversion complete ISR (CNVCMP) is asserted, a minimum time delay of 30  $\mu s$  is required before reading the FIFO, otherwise the FIFO data count may be incorrect.

Three other features greatly simplify the task faced by firmware developers of interrupt service routines for the GPADC block:

- 1. The FIFO count bit field is not really stored in the FIFO. Instead it is a live count of the number of valid entries currently residing in the FIFO. If the interrupt service routine was entered because of a conversion then this value will be at least one. When the interrupt routine is entered it can pull successive sample values from the FIFO until this bit field goes to zero. Thus avoiding wasteful re-entry of the interrupt service routine. Note that no further I/O bus read is required to determine the FIFO depth.
- 2. This FIFO has no read side effects. This is important to firmware for a number of reasons. One important result is that the FIFO register can be freely read repetitively by a debugger without affecting the state of the FIFO. In order to pop this FIFO and look at the next result, if any, one simply writes any value to this register. Any time the FIFO is read, then the compiler has gone to the trouble of generating an address for the read. To pop the FIFO, one simply writes to that same address with any value. This give firmware a positive handshake mechanism to control exactly when the FIFO pops.
- 3. When a conversion completes resulting in hardware populating the 12th valid FIFO entry, the FIFOOVR1 (FIFO 75% full) interrupt status bit will be set. When a conversion completes resulting in hardware populating the 16th valid FIFO entry, the FIFOOVR2 interrupt status bit will be set. In a FIFO full condition with 16 valid entries, the GPADC will not overwrite existing valid FIFO contents. Before subsequent conversions will populate the FIFO with conversion data, software must free an open FIFO entry by writing to the FIFO Register or by resetting the GPADC by disabling and enabling the GPADC using the ADC\_CFG register.

## **10.8 Window Comparator**

A window comparator is provided which can generate an interrupt whenever a sample is determined to be inside the window limits or outside the window limits. These are two separate interrupts with separate interrupt enables. Thus one can request an interrupt any time a specified slot makes an excursion outside the window comparator limits.

The window comparison function has an option for comparing the contents of the limits registers directly with the FIFO data (default) or for scaling the limits register depending on the precision mode selected for the slots.

Firmware has to participate in the determination of whether an actual excursion occurred. The window comparator interrupts set their corresponding interrupt status bits continuously whenever the inside or outside condition is true. Thus if one enables and receives an "*excursion*" interrupt then the status bit can't be usefully cleared while the GPADC slot is sampling values outside the limits. That is, if one receives an excursion interrupt and clears the status bit, it will immediately set again if the next GPADC sample is still outside the limits. Thus firmware should reconfigure the interrupt enables upon receiving an excursion interrupt so that the next interrupt will occur when an GPADC sample ultimately goes back inside the

window limits. Firmware may also want to change the windows comparator limit at that time to utilize a little hysteresis in these window comparator decisions.

| 1<br>9 | 1<br>8 | 1<br>7 | 1 6 | 1<br>5 | 1<br>4 | 1 3 | 1 2 | 1  | 1   | 09    | 0 8 | 0<br>7 | 0 | 0<br>5 | 0<br>4 | 03 | 0 2 | 0 | 0<br>0 |
|--------|--------|--------|-----|--------|--------|-----|-----|----|-----|-------|-----|--------|---|--------|--------|----|-----|---|--------|
|        |        |        |     |        |        |     |     | Lo | owe | r Lin | nit |        |   |        |        |    |     |   |        |

Table 64: Window Comparator Lower Limit Register

#### Table 65: Window Comparator Upper Limit Register

| 1 | 1           | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|---|-------------|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9 | 8           | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
|   | Upper Limit |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |   |

The determination of whether a sample is *inside* or *outside* of the window limits is made by comparing the data format of the slot result written to the FIFO with the 20 bit window limits. An GPADC sample is inside if the following relation is true:

14.6 Lower Limit <= GPADC SAMPLE <= 14.6 Upper Limit

Thus setting both limits to the same value, say 700.0 (0x2BC << 6 = 0xAF00), will only produce an inside interrupt when the GPADC sample is exactly 700.0 (0xAF00). Furthermore, note that if the lower limit is set to zero (0x00000) and the upper limit is set to 0xFFFFF then all accumulated results from the GPADC will be inside the window limits and no excursion interrupts can ever by generated. In fact, in this case, the incursion interrupt status bit will be set for every sample from any active slot with its window comparator bit enabled. If the incursion interrupt is enabled then an interrupt will be generated for every such sample written to the FIFO.

The window comparator limits are a shared resource and apply to all active slots which have their window comparator bits enabled. If window limits are enabled for multiple enabled slots with different precision modes, the window comparison function can be configured to automatically scale the 14.6 upper and lower limits value to match the corresponding precision mode format for the enabled slots through the ADCSCWLIM register.

## **10.9 Operating Modes and the Mode Controller**

The mode controller is a sophisticated state machine that manages not only the time slot conversions but also the power state of the GPADC analog components and the hand shake with the clock generator to start the HFRC clock source if required. Thus once the various control registers are initialized, the core can go to sleep and only wake up when there are valid samples in the FIFO for the interrupt service routine to distribute. Firmware does not have to keep track of which block is using the HFRC clock source since the devices in conjunction with the clock generator manage this automatically. The GPADC block's mode controller participates in this clock management protocol.

From a firmware perspective, the GPADC mode controller is controlled from bit fields in the GPADC configuration register and from the various bit fields in the eight slot configuration registers.

The most over-riding control is the GPADC enable bit in the PWRCTRL\_DEVPWREN register of the power control block. This bit must be set to '1' to enable power to the GPADC subsystem. Furthermore, the PWRENADC bit in the GPADC configuration register is a global functional enable bit for general GPADC operation. Setting this bit to zero has many of the effects of a software reset, such as reseting the FIFO

pointers. Setting this bit to one enables the mode controller to examine its inputs and proceed to autonomously handle analog to digital conversions.

A GPADC scan is the process of sampling the analog voltages at each input of the GPADC following a trigger event. If the GPADC is enabled and one or more slots are enabled, a scan is initiated after the GPADC receives a trigger through one of the configured trigger sources. The scan flowchart diagram can be found in Figure 13

A GPADC conversion is the process of averaging measurements following one or more scans for each slot that is enabled.



### Figure 13. Scan Flowchart

### 10.9.1 Single Mode

In single mode, one trigger event produces one scan of all enabled slots. Depending on the settings of the accumulate and scale bit field for the active slots, this may or may not result in writing a result to the FIFO. When the trigger source is an external pin then one external pin transition of the proper polarity will result in one complete scan of all enabled slots. If the external pin is connected to a repetitive pulse source then repeating scans of all enabled slots are run at the input trigger rate.

### 10.9.2 Repeat Mode

Timing for repeat triggers in Repeat Mode can be selected between the GPADC-internal repeat trigger timer and Timer 7 of the Timer Module. Selection is made via the RPTTRIGSEL bit of the GPADC's CFG register. If the GPADC-internal timer is selected for the repeating trigger timing, then the GPADC's INTTRIGTIMER register is used to enable this timer (TIMEREN), select the timer's clock divider (CLKDIV), and set the trigger count (TIMERMAX).

To use The Timer Module's Timer7 for repeat trigger timing, there is a bit in the Timer module's Global Register, ADCEN, which allows Counter/Timer 7 to be a source of repetitive triggers for the GPADC. If counter/timer 7 is initialized for this purpose then one only needs to turn on the RPTEN bit in the GPADC CFG register to enable this mode in the GPADC.

#### NOTE

The mode controller does **not** process these repetitive triggers from the counter/ timer until a first triggering event occurs from the normal trigger sources. Thus one can select software triggering in the TRIGSEL field and set up all of the other GPADC registers for the desired sample acquisitions. Then one can write to the software trigger register and the mode controller will enter REPEAT mode. In repeat mode, the mode controller waits only for each successive counter/ timer 7 input to launch a scan of all enabled slots.

#### 10.9.3 Low Power Modes

An application may use the GPADC in one of three power modes. Each mode has different implications from overall energy perspective relative to the startup latency from trigger-to-data as well as the standby power consumed. The table below is intended to provide guidance on which mode may be more effective based on latency tolerance. This table should only be used as a reference.

| LPMODE | Definition                                                                                                                                 | Entry Latency                              |
|--------|--------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------|
| 0      | GPADC is kept active continuously (used in continuous sam-<br>pling scenarios)                                                             | 0<br>(requires initial<br>calibration)     |
| 1      | GPADC is mostly powered off between samples, HFRC is<br>duty cycled between samples. No calibration required after<br>initial calibration) | <70μs<br>(shorter for lower<br>resolution) |
| 2      | GPADC is completely powered off between samples, HFRC is duty cycled between samples. Requires recalibration for each conversion.          | <660µs                                     |

#### Table 66: GPADC Power Modes

### 10.9.3.1 Low Power Mode 0

Low Power Mode 0 (LPMODE0) enables the lowest latency from trigger to conversion data available. This mode leaves the reference buffer powered on between scans to bypass any startup latency between triggers<sup>1</sup>.

### 10.9.3.2 Low Power Mode 1

Low power mode 1 (LPMODE1) is a power mode whereby the GPADC Digital Controller will automatically power off the GPADC clocks, analog GPADC and reference buffer between scans while maintaining GPADC calibration data. This mode may operate autonomously without CPU interaction, even while the CPU is in sleep or deepsleep mode for repeat mode triggers or hardware triggers. While operating in this mode, the GPADC Digital Controller may be used to burst through multiple scans enabling max sample rate data collection if the triggers are running at a rate at least 2x the maximum sample rate until the final scan has completed. When a scan completes without a pending trigger latched, the GPADC subsystem will enter a low power state until the next trigger event.

### 10.9.3.3 Low Power Mode 2

If desirable, for applications requiring infrequent conversions, software may choose to operate the GPADC in LPMODE2, whereby the full GPADC Analog and Digital subsystem remains completely powered off between samples. In this use case, the software configures the power control GPADC enable register followed by configuring the GPADC slots and the GPADC configuration register between conversion data collections, followed by disabling the GPADC in the power control GPADC enable register. Although this mode provides extremely low power operation, using the GPADC in this mode will result in a cold start latency including reference buffer stabilization delay and a calibration sequence 100's of microseconds, nominally. In this mode, the GPADC must be reconfigured prior to any subsequent GPADC operation.

### 10.10Interrupts

The GPADC has 8 interrupt status bits with corresponding interrupt enable bits, as follows:

- 1. Conversion Complete Interrupt
- 2. Scan Complete Interrupt
- 3. FIFO Overflow Level 1
- 4. FIFO Overflow Level 2
- 5. Window Comparator Excursion Interrupt (a.k.a. outside interrupt)
- 6. Window Comparator Incursion Interrupt (a.k.a. inside interrupt)
- 7. DMA Complete (DCMP)
- 8. DMA Error (DERR)

The window comparator interrupts are discussed above. See "Window Comparator" on page 92.

There are two interrupts based on the *fullness* of the FIFO. When the respective interrupts are enabled, Overflow 1 fires when the FIFO reaches 75% full, viz. 6 entries. Overflow 2 fires when the FIFO is completely full.

When enabled, the conversion complete interrupt fires when a single slot completes its conversion and the resulting conversion data is pushed into the FIFO.

When enabled, the scan complete interrupt indicates that all enabled slots have sampled their respective channels following a trigger event.

<sup>1.</sup> The reference buffer will not be powered on when the GPADC is configured for external reference

When a single slot is enabled and programmed to average over exactly one measurement and the scan complete and conversion complete interrupts are enabled, a trigger event will result in the conversion complete and scan complete interrupts firing simultaneously upon completion of the GPADC scan. Again, if both respective interrupts are enabled and a single slot is enabled and programmed to average over 128 measurements, 128 trigger events result in 128 scan complete interrupts and exactly one conversion complete interrupt following the 128 GPADC scans. When multiple slots are enabled with different settings for the number of measurements to average, the conversion complete interrupt signifies that one or more of the conversions have completed and the FIFO contains valid data for one or more of the slot conversions.

NOTE

After a conversion complete ISR (CNVCMP) is asserted, a minimum time delay of 30  $\mu$ s is required before reading the FIFO, otherwise the FIFO data count may be incorrect.

## **10.11Generating the Sample Rate for the GPADC**

TIMER7 of the Timer Module has a special function which allows it to function as the sample trigger generator for the GPADC. If the TIMER\_GLOBEN\_ADCEN bit is set, the output of the timer is sent to the GPADC which uses it as a trigger. Typically, TIMER7 is configured in Repeatable Up-counter Compare (UPCOUNT - FN =2) mode. INTEN\_TMR70INT may be set to generate an interrupt whenever the trigger occurs, but typically the GPADC interrupt will be used for this purpose.

# **11. Multi-bit Serial Peripheral Interface Master Module (MSPI)**

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

#### NOTE

Only two MSPI instances are pinned out on the Apollo4 Blue and the KBR package of the Apollo4 Blue Plus, as MSPI1 is used for internal communication between the Apollo4 MCU and the BLE Controller module. Consult the datasheet for possible operating limitations of the individual MSPI instances.

#### NOTE

Use of Device1 is not supported on the MCU. Disregard any reference to or inference of multiple MSPI devices.

NOTE

Check the datasheet of the applicable device for MSPI clocking and functional restrictions.

## 11.1 Configuration

Each MSPIn module should be configured to match the transfer characteristics of the external device(s) on the bus. Generally, the configuration sequence would proceed as follows:

- Configure MSPIn clock divider (CLKDIV0 field in the DEV0CFG register). The MSPI's reference frequency is 96 MHz, so the resulting clock frequency is 96/CLKDIV value.
- Configure MSPIn transfer characteristics (DEV0CFG register) to initialize the device (usually mode 0, serial transfers).
- Configure MSPIn PADOUTEN to enable the desired bits on the MSPIn bus (clock plus relevant data bits).

#### NOTE

Enabling unused data lines will impact the values present on those pads even if the GPIO function select is not set to MSPI.

- Program external flash device to the appropriate mode, either dual, quad, octal or hex (if supported).
- Update DEV0CFG register to new settings (in cases of a transfer mode or addressing change).
- Write DEV0XIP register to set read/write instructions and transfer characteristics for DMA/XIP operations (and optionally enable XIP mode).

Each MSPIn's DEV0CFG register contains the controller's settings when communicating with any given device and it is expected that these values will be static after initial configuration of the external memory devices. The MSPIn\_DEV0CFG\_DEVCFG0 field specifies both the transfer mode (serial, dual, quad, etc) as well as which chip enable is used to access the device. The MSPIn\_DEV0CFG\_ISIZE0 and MSPIn\_DEV0CFG\_ASIZE0 fields indicate the number of bytes transmitted for the instruction and address phases, but individual operations can select whether to transmit these or not. The TURNAROUND0 field indicates the number of cycles between the TX of instruction/address and reception of the first RX byte

(the flash device must be programmed to use the same count). The same register also includes a WRITELATENCY0 field which can be used to insert wait states between write address and write data in a manner similar to the TURNAROUND for read operations.

## 11.2 PIO Operation

Software can issue general PIO (Programmed Input/Output) operations to devices on the MSPI bus using the INSTR, ADDR, and CTRL registers. Software should first write the instruction to be sent to the INSTR register and the address to be sent to the ADDR register (if required) followed by a write to the CTRL register to start the transfer. The CTRL\_TXRX bit indicates whether data should flow to or from the device and CTRL\_XFERBYTES indicates the number of bytes to transfer. CTRL\_SENDI and CTRL\_SENDA can be used to enable or disable the instruction or address phases and the CTRL\_ENTURN is used to enable the turnaround phase. The transfer will only commence if the CTRL\_START bit is set. Software may read the BUSY and STATUS fields to check on transaction status, otherwise the CMDCPL interrupt (INTEN\_CMDCPL) can be used to indicate completion.

AM\_REG(MSPIn,INSTR) = instr; AM\_REG(MSPIn,CTRL) = AM\_REG\_MSPIn\_CTRL\_XFERBYTES(bytes) | AM\_REG\_MSPIn\_CTRL\_SENDI\_M | AM\_REG\_MSPIn\_CTRL\_TXRX(1) | AM\_REG\_MSPIn\_CTRL\_START\_M;

Write latency, the time between the address and first data byte, can be set and controlled by setting the CTRL\_ENWLAT bit to enable the Write Latency Counter, which sets the latency from 0 to 63 counter clocks with the setting in the DEV0CFG\_WRITELATENCY0 field.

For write (TX) operations, data should be written to the TXFIFO after the transaction has been started. Software should read TXENTRIES before writing to ensure that space is available in the FIFO before writing new TX data. For read operations, software should read the RXENTRIES to determine the number of words available and then read the data from the RXFIFO register.

```
// Example TX data write loop
for (i = 0; i < count; ) {
    temp1=AM_REG(MSPI,TXENTRIES);
    for(;(temp1<16) && (i<count);temp1++,i++) {
        AM_REG(MSPIn,TXFIFO) = data[i];
     }
}</pre>
```

## 11.3 DMA Operations

Each MSPI controller tightly integrates the DMA controller with the transfer interface and automatically handles sequencing of instructions and address to serial flash device and the subsequent transfer of data to/from system memory. Before starting DMA operations, software should have already configured the DEV0CFG register (to specify device configuration) and the DEV0XIP register (to specify the template used for DMA operations). Software should first set up the static DMA parameters which specify the DMA burst parameters:

MSPIn(mspiModule)->DMATHRESH=8; // Issue MSPIn(mspiModule)->DMABCOUNT=32; // burst

// Issue new DMA at FIFO half empty/full condition
// burst count=32 bytes (8 words)

The MSPI implements a single FIFO for both TX and RX transfers as well as separate threshold values for RX/TX operations. In most cases, the DMATHRESH\_DMATXTHRESH field should be set at 8 to indicate that a TX DMA (read from SRAM) will be triggered when the FIFO drops below eight entries and will trigger an RX DMA (write to SRAM) when the FIFO level reaches eight entries. The DMABCOUNT\_BCOUNT field indicates the number of words that will be transferred each time that DMA is triggered. The DMA will

also trigger automatically to flush or fill the FIFOs at the end of transfer if the total count is not a multiple of 32 bytes.

To initiate a DMA transfer, software should issue the following register operations:

| MSPIn(mspiModule)->DMADEVADDR=(uint32_t) addr;  | // set device address              |
|-------------------------------------------------|------------------------------------|
| MSPIn(mspiModule)->DMATARGADDR=(uint32_t) data; | // set address in system memory    |
| MSPIn(mspiModule)->DMATOTCOUNT=(count<<2);      | // set total number of bytes       |
| MSPIn(mspiModule)->DMACFG=                      | // enable DMA peripheral to memory |
| AM_REG_MSPI_DMACFG_DMAEN   AM_REG_MSPI_         | DMACFG_DMADIR_P2M;                 |

When complete, the MSPIn will issue the DMACPL interrupt or software can monitor the status by reading the DMASTAT\_DMATIP bit. For writes to peripherals, software should also check the INTSTAT\_CMDCMP bit to ensure the transaction has finished. Transfers to the flash device are initiated by setting the DMACFG\_DMADIR field to M2P (Memory to Peripheral).

Each controller will use the template in the DEV0XIP register to determine whether to send the instruction and address phases (XIPSENDI0, XIPSENDA0) and whether to insert turnaround cycles (DEV0XIP\_XIPENTURNn). Instruction and address lengths are determined by the settings in the DEV0CFG register and the address and transfer count are set by the DMADEVADDR and DMATOTCOUNT registers. The instruction sent for read (RX) operations is specified in the READINSTR field of the DEV0INSTR register and likewise the WRITEINSTR field is used when transmitting data to the flash device.

If the AUTO DMA cannot be used because the device's characteristics don't fit into the template, software can issue PIO operations to initiate a more complex transfer setup and then enable DMA for just the bulk DMA portions using the DMAEN\_EN instead of DMAEN\_AUTO.

Optionally, the MSPI can turn off it's power domain at the end of a DMA transfer if the DMAPWROFF bit is set in the DMACFG register. The domain will only power off once the entire DMA is complete (i.e. writes have been committed to system memory or have completed to the external flash device).

### 11.3.1 Configuring MSPI as a DMA Target and a DMA Client Concurrently

A DMA deadlock may occur when there is heavy traffic of concurrent DMA accesses such as when the MSPI is used as both a "DMA client", where MSPI is sourcing or sinking data through the DMA, and a "DMA target", where MSPI is a memory-mapped source or destination for other peripheral DMA. For example, a situation may exist when the ADC is targeting a memory device through the MSPI XIPMM aperture as a the ADC sample "DMA target" at the same time that the MSPI is using DMA itself to target SSRAM or TCM. This condition may result in a DMA deadlock due to a circular dependency when the APBDMA-AXI, MSPI-XIPDMA and APBDMA-ARBITOR states are blocking or waiting for DMA resources.

To avoid this potential problem, software should control the DMA's configuration to alternate between MSPI as a DMA client and as a DMA target so as not to allow overlap of these DMA accesses. Note that there should not be a threat of this deadlock situation when short CPU accesses such as XIP or memory mapped MSPI are occurring, due to the location of arbitration against DMA traffic.

## 11.4 Execute in Place (XIP) Operations

The XIP mode of operation allows devices on the MSPI interface to be mapped into the flash cache's address space and appear as an extension to the internal flash array(s). Once enabled by the XIPEN bit in the DEV0XIP register, the flash/cache module will decode the address region and forward operations to the MSPI interface for completion. XIP mode uses the same configuration information as DMA mode and will automatically execute a cache line read fetch from the attached device and return it to the cache controller.



Figure 14. XIP Block Diagram

XIP and DMA/PIO operations can all be interleaved since the MSPI controller will allow the current operation to complete before performing the XIP operation. XIP may be interleaved within an ongoing DMA transfer as configured by either the DEV0BOUNDARY\_DMATIMELIMIT0 transaction time or the DEV0BOUNDARY\_DMABOUND0 address boundary.

Generally DMA read operations can safely be interleaved with XIP, however, XIP mode may have to be disabled during flash programming operations since the flash array within the device may not be available during program or erase operations and thus would return invalid data.

## 11.4.1 XIP Address Mapping and Accesses

The MSPI additionally supports a memory-mapped XIP mode (XIP) that enables full read/write mapping of an MSPI device such as a PSRAM to the CPU's peripheral address map at offset 0x1400000-0x1FFFFFF. This is mapped to offset 0 of the device on the MSPI bus and is not cached (unlike XIP space) and thus can be used as an extension to system SRAM. MSPI1 and MSPI2 are similarly mapped at 0x18000000 and 0x1C000000, respectively. The MSPI device can be accessed by both XIP and XIP accesses (the regions overlap), but it is recommended that XIP used for static data/instructions and that a separate area of the MSPI device is used for read/write operations to avoid having stale data visible in the cache.

XIP seamlessly supports word, halfword, and byte read and write accesses, however, there are a few restrictions and caveats:

- As mentioned above, writes to XIP do not flush cached data to the same address.
- For scrambled regions, XIP can only be written safely by writing words (byte and halfword writes will corrupt the scrambled data at that location). Byte, halfword, and word reads may all be performed to scrambled regions.
- Read/Write performance to the XIP region will be significantly slower than accesses to internal SRAM since there are multiple cycles of command, addressing, and data transfer overhead. For this reason, internal SRAM should be used for frequently accessed data and XIP should be used for infrequently used data.

## 11.4.2 Optimized XIP Addressing

Some SPI flash devices support an optimized XIP mode that minimizes the number of instruction/address cycles that must be transmitted in order to reduce overall fetch latency. To activate this mode, software should program the flash device's registers to enter the device XIP mode, and then update the DEV0CFG\_DEVCFG0 field to the specified number of address bytes and then disable the DEV0XIP\_XIPSENDI0 field (assuming that no instruction needs to be sent). To exit the device's XIP mode, software should reconfigure the MSPI interface in order to send the required XIP exit sequence to the device.

### 11.4.3 Micron XIP Support

Micron flash devices support an XIP mode that does not require the instruction byte to be transmitted, which minimizes the access time to the device. In order to transition in and out of this mode, the MSPI controller must issue an acknowledgment of XIP mode during the first turnaround cycle for each XIP access. When transitioning into and out of XIP mode, software must set the DEV0XIP\_XIPACK0 field appropriately.

Under normal operation, the XIPACK0 should be set to NOACK (0x0), indicating that no acknowledgment should be sent. To transition into XIP mode, software should perform the following actions:

- 1. Activate XIP in the Micron device by writing the Volatile Configuration register
- 2. Set the DEV0XIP\_XIPACK0 bit field to ACK (0x2)
- 3. Perform a memory read from the Micron device (instruction must be sent). This access will allow the MSPI controller to acknowledge switching into the XIP mode
- 4. Set the DEV0XIP\_XIPSENDI0 bit field to 0 to indicate that the instruction byte no longer needs to be sent.

The MSPI will now transmit just the address to the Micron device and drive a 0 onto the data lines on the first turnaround cycle to remain in XIP mode. It is important that software ONLY perform read operations to the flash device until XIP mode has been exited.

To terminate XIP mode, software should perform the following sequence:

- 1. Set the DEV0XIP\_XIPACK0 bit field to TERMINATE (0x3)
- 2. Issue a memory read to the Micron device. This will allow the MSPI controller to signal termination of XIP mode by driving the data lines high during the first turnaround cycle.
- 3. Set the DEV0XIP\_XIPACK0 bit field to NOACK (0x0) and the DEV0XIP\_XIPSENDI0 bit field to 1.

After this sequence has completed, software can erase, program, or send any other instructions to the Micron flash again.

## 11.5 Command Queueing (CQ)

The MSPI's command queuing (CQ) interface is similar to command queuing implementation in the IOM and BLE modules. To utilize the command queue, software basically constructs a series of register operations that would be issued to the MSPI device, but instead places them in an array in system SRAM (or internal flash). The start of this buffer is then written to the CQADDR register and the commands can issued by enabling the CQEN bit in the CQCFG register. The CQ logic then reads the address/data pairs via DMA operations and will continue executing them until the end of the command queue, which is denoted as a write to the STOP bit in the CQPAUSE register. As the CQ logic issues register operations, it will automatically pause fetching new operations while the transfer module is busy or can be paused to wait for external events based on the status of the CQPAUSE and CQFLAGS registers.

The primary limitation of CQ operations is that all addresses must reside within the MSPI module since the operations are executed internally by the MSPI module (i.e. it cannot write register in other modules, etc).

## 11.5.1 Command Queue Data Format

As the command queue resides in system memory, the general format is pairs of words that form the register address to write as well as the data to write. Assuming the CQ base address is 0x10000, system SRAM might look like the following table:

| Address | Data       | Description                                                     |
|---------|------------|-----------------------------------------------------------------|
| 0x10000 | 0x50014258 | DMATARGADDR register address                                    |
| 0x10004 | 0x00002800 | Data to write to DMATARGADDR (i.e. 0x2800 is the target buffer) |
| 0x10008 | 0x5001425C | DMADEVADDR register address                                     |
| 0x1000C | 0x00304000 | Address within flash device                                     |
| 0x10010 | 0x50014260 | DMATOTCOUNT register address                                    |
| 0x10014 | 0x00000100 | Transfer 256 bytes of data                                      |
| 0x10018 | 0x50014250 | DMACFG register address                                         |
| 0x1001C | 0x0000003  | AUTO DMA enable on peripheral to memory transfer                |
| 0x10020 | 0x50014288 | CQPAUSE register address                                        |
| 0x10024 | 0x00008000 | End of Command Queue (write to STOP bit)                        |

#### Table 67: Command Queue Example

The AM\_REG macros can be used to construct the CQ table in a manner similar to below:

## 11.5.2 CQ Interrupts

The MSPI CQ module provides several interrupts via the INTEN register to provide feedback to software as the MSPI works through its command queue.

- CQERR: Indicates that the command queue encountered an error when fetching the command queue instructions. This can be caused by an invalid CQ pointer that points to an invalid flash or SRAM address (SRAM powered down, etc).
- CQPAUSED: Indicates that the command queue has encountered a pause condition. This can be triggered by an index match or when the CQ is waiting on a software or hardware flag.
- CQCPL: Indicates that the command queue has completed operations. This is typically used when the command queue is executing a single-shot set of commands which end with the CQ writing the STOP bit in the CQPAUSE register.

• CQTIP: indicate that a CQ transfer is active and this will remain active even when paused waiting for external event.

Software can generate a CQTIP interrupt at any point during command queue operation by setting bit[0] of the register address of the command to a 1 (basically OR 0x1 with the address portion of a CQ entry. This can be useful when software would like intermediate interrupts as operations complete such as after each CQ index is updated.

## 11.5.3 Pausing CQ Operations

While the basic operation of the CQ functionality is pretty straightforward, constructing more complex scenarios such as queuing of multiple operations requires additional logic to accommodate handshaking with the software managing the queue and other modules within the chip. The MSPI accomplishes both of these by providing the ability to pause the CQ processing using a pause mask (CQPAUSE register) and software and hardware pause flags.

After the MSPI executes a CQ write operation, it will check all bits specified in the CQPAUSE register against their CQFLAGS status, and will pause operation if all of the associated CQFLAGS bits are set. Since all registers are available to be written by both CPU software and CQ commands, there are numerous ways these can be used, but two common scenarios are

- Software can initially set a mask in CQPAUSE and CQ operation will continue until the matching CQFLAGS condition is encountered.
- The CQ command stream can set the CQPAUSE register during execution and pause until the status in FLAGS changes to indicate that it should restart.

The CQFLAGS register contains 8 soft flags (register bits that can be controlled by either the CPU or the QC operation) and an additional 812 hard flags, which are hardware status flags tied to logic in the MSPI module or other modules in the chip. The lowest two soft flags are also exported to the IOM SPI modules and the other two MSPI modules to facilitate communication between an IOM and the MSPI to enable management of common MSPI/IOM buffers via the command queues. The table below lists the flags available in the MSPI:

| Bit | Туре | Mnemonic                 | Description/Use                                                                                                                                                                                                |
|-----|------|--------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 15  | Hard | STOP                     | CQ Stop Flag. When set to 1, CQ processing will terminate and the CQCPL interrupt will be generated.                                                                                                           |
| 14  | Hard | CQIDX                    | CQ Index Pointer Match. Will be set to 1 when the CURIDX and ENDIDX pointers match. Generally used by software when forming a request queue.                                                                   |
| 13  | Hard | BUF1XOREN                | Buffer 1 Ready Status (from selected IOM/MSPI).<br>This status is the result of XOR'ing the IOM1START with the incoming status<br>from the IOM. When high, MSPI can transfer the buffer.                       |
| 12  | Hard | BUF0XOREN                | Buffer 0 Ready Status (from selected IOM/MSPI).<br>This status is the result of XOR'ing the IOM0START with the incoming status<br>from the IOM. When high, MSPI can transfer the buffer.                       |
| 11  | Hard | DMACPL                   | DMA Complete Status (hardwired DMACPL bit in DMASTAT)                                                                                                                                                          |
| 10  | Hard | CMDCPL                   | PIO Operation completed (STATUS bit in CTRL register)                                                                                                                                                          |
| 9   | Hard | IOM1READY/<br>BUF1XNOREN | Buffer 1 Ready Status. IOM Buffer 1 Ready Status (from selected IOM).<br>This status is the result of XNOR'ing the IOM0START with the incoming status<br>from the IOM. When high, MSPI can send to the buffer. |
| 8   | Hard | IOM0READY/<br>BUF0XNOREN | Buffer 0 Ready Status. IOM Buffer 0 Ready Status (from selected IOM).<br>This status is the result of XNOR'ing the IOM0START with the incoming status<br>from the IOM. When high, MSPI can send to the buffer. |

#### Table 68: CQFLAGS

| Bit | Туре | Mnemonic  | Description/Use                                                                                                                                                |
|-----|------|-----------|----------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 7   | Soft | SWFLAG7   | Software flag                                                                                                                                                  |
| 6   | Soft | SWFLAG6   | Software flag                                                                                                                                                  |
| 5   | Soft | SWFLAG5   | Software flag                                                                                                                                                  |
| 4   | Soft | SWFLAG4   | Software flag                                                                                                                                                  |
| 3   | Soft | SWFLAG3   | Software flag                                                                                                                                                  |
| 2   | Soft | SWFLAG2   | Software flag                                                                                                                                                  |
| 1   | Soft | IOM1START | Flag wired to IOM devices as a hard flag for intercommunication. Typically indi-<br>cates that buffer 1 has been filled by MSPI and can be emptied by the IOM. |
| 0   | Soft | IOM0START | Flag wired to IOM devices as a hard flag for intercommunication. Typically indi-<br>cates that buffer 0 has been filled by MSPI and can be emptied by the IOM. |

#### Table 68: CQFLAGS

The soft flags can be set/cleared/toggled via writes to the CQSETCLEAR register and their status can be read by software by reading the CQFLAGS register directly. The CQPAUSE mask bits are enumerated in the same manner.

In order to minimize the need to pause for individual operations, the CQ will automatically pause any time the MSPI's transfer block is active (for PIO, DMA, or XIP operations). Thus, whenever the CQ enables a DMA operation, there is an implicit pause until the operation completes, and then the CQ will resume fetching additional commands. To terminate the CQ processing, the CQ or software should set the most significant (unreserved) CQPAUSE register bit (STOP), which will cause the MSPI to terminate processing of the command queue and issue a CQCPL interrupt.

## 11.5.4 Using the CQ Index registers

The MSPI command queuing implementation also includes a pair of registers that allow software to manage a list of outstanding operations: CQCURIDX and CQENDIDX. When initializing the command queue software can set both of these registers to the same value, which indicate an index or reference into the position of the command queue. The CQPAUSE can then be set to CQIDX and the command queue enabled. Since the CQCURIDX equals the CQENDIDX, the command queue will immediately pause and wait for them to be not equal again triggering resumption of CQ processing.

For each group of commands in the command queue, software can place a write to the CQCURIDX after each DMA operation in the command queue and then directly write the CQENDIDX register with the index of the last operation in the queue. Since the CQENDIDX now mismatches the CQCURIDX, the command queue will begin processing commands and start working its way through the queue. After completing the first operation, the command queue will include a write to the CQCURIDX to indicate that the operation has completed, and the CQ logic will check to see if the CQCURIDX equals the CQENDIDX and either pause or continue processing until the two are equal again.

This mechanism allows software to asynchronously post additional operations to the command queue by simply writing the new commands to memory and then updating the CQENDIDX to the index of the last operation. Because the MSPI CQ hardware simply looks for a match between the registers, software may roll over from 0xFF to 0x00 or use the indices in any manner they see fit as long as the end index value is not found elsewhere in the command queue.

Software can monitor the progress of the MSPI's CQ processing by enabling the DMACPL interrupt, which will generate an interrupt after each DMA completion. The interrupt routine can read the CQCURIDX register to determine which operations have completed in order to return the proper status to the application.

## 11.5.5 MSPI and IOM Intercommunication

The MSPI modules and IOM modules can be linked through the command queue flags to allow a simple form of handshaking to facilitate data flow between the two modules. The MSPI only has a single pair of hardware flags dedicated to IOM communication so software must write the IOMSEL field in the MSPICFG register to select which IOM or MSPI device is paired with the MSPI.

Since MSPI is memory mapped, IOM can directly perform DMA reads/writes to an MSPI device instead of using the internal SRAM buffers. However, using an internal buffer allows for longer DMA bursts, which may be more efficient than direct addressing via DMA. Also MSPI devices can be set up to DMA or handshake with other MSPI devices in a manner similar to the IOM handshaking.

With the introduction of three MSPI modules on the MCU, the MSPI/IOM handshaking has been extended to supports MSPI-to-MSPI handshaking as well. The target MSPI is selected in the same manner as an IOM in the IOMSEL field. The CQPAUSE flags on the MSPI have also been updated to include a BUFnXNOR function, which is required for an MSPI that is a consumer of data.

A typical use model for this feature is for transmitting blocks of data stored in external flash to a device (such as a display) on the IOM interface. In this scenario, software would allocate two buffers in SRAM which would be filled by the MSPI and emptied by the IOM/MSPI device. At the beginning of the operation, software would clear the IOM0START and IOM1START flags and initialize the MSPI command queue with two read operations to load data into buffer 0 and buffer 1. Software would also initialize the corresponding flags in the other device and set up the command queue to point to begin reading at buffer 0, but pause until it sees the buffer0 status is ready.

When the MSPI command queue is enabled, it will check the IOM0READY flag (which will be zero since the incoming bit is zero and the IOM0START flag is zero) and begin processing the operation which would DMA data from the external flash to fill buffer 0. At the end of the operation, the CQ would write the CQPAUSE register with the mask for IOM1READY. The status of IOM1READY will also be zero, so it will continue processing to fill buffer 1. At the end of this operation, the CQ will write the CQPAUSE register to IOM0READY again, but this time it will likely pause because the IOM is still reading data out of buffer 0. Once the IOM finishes its reads from buffer 0, it's CQ will set the flag for buffer 0, which will in turn cause the IOM0READY hardware flag to become zero and allow the MSPI to continue processing (which would fill buffer 0 again). In this manner, software would only need to continue adding commands to the MSPI command queue in order to continuously feed data frames to the IOM device.

## 11.6 Data Scrambling

In order to protect customer data stored on external flash devices, the MSPI module supports a data scrambling algorithm to obfuscate data on the MSPI bus. Scrambling can be enabled by programming the DEV0SCRAMBLING\_SCRSTART0 and DEV0SCRAMBLING\_SCREND0 fields to correspond to the address range to be encrypted and setting the SCRENABLEn bit in the DEV0SCRAMBLING register. Scrambling is enabled for all DMA and XIP operations that fall within the scrambling window.

Accesses to the scrambling region must always be to an aligned, four-byte boundary (i.e. device address must always end in 0x0, 0x4, 0x8, 0xC). Accesses through the XIP region are always aligned to cache lines, but software must ensure that DMA operations are properly aligned. In the case of a mis-aligned DMA access, the MSPI will issue the SCRERR interrupt (SCRambling ERRor).

## 11.7 Auto Power Down

The MSPI module has the ability to power itself down at the end of a DMA or CQ operation. This would usually be done while the system is going into deep sleep but desires the MSPI to transfer data to or from a flash device during the beginning of the sleep period. To enable auto-power down, software should enable the DMA with the DMACFG\_DMAPWROFF bit set or command queuing with the CQCFG\_CQPWROFF bit set.

## 11.8 Board/Package Considerations for MSPI Pin Timing

The MSPI logic contains controls to adjust I/O timings to accommodate differences in board or device timings through the RXCAP0, RXNEG0, and TXNEG0 bits in the DEV0CFG register. The discussion below assumes SPI mode 0 (CPHA=0, CPOL=0) and that in dual/quad/octal modes that MOSI refers to all pins in transmission mode and MISO refers to all pins when in receive mode.

## NOTE An MSPI timing window scan function is available to determine optimum values for MSPI timing parameters such as RXDQSDELAY, RXNEG, RXCAP, etc., and other timing settings on the Apollo4 and Apollo4 Plus SoCs. For Apollo4, it is recommended to run the window function and capture the timing values on a "per device" basis, meaning that these values should be ascertained and used for each MSPI instance, per board.

For Apollo4 Plus, timing values can be determined on a "per design" basis using a population of boards of the same design. Once the timing values are determined for each MSPI instance in the design, they can be applied to all boards without having to derive the timing settings on each board. The exception to this guidance is when an MSPI interface is configured for non-DQS mode using any data width. In this case, the window scan should be used on a "per device" basis.

Consult the Apollo4 or the Apollo4 Plus datasheet for timing and mode limitations for each MSPI instance on the applicable device

In an ideal model where there are no delays present, data is launched on the negative edge of the clock and captured on the positive edge of the clock at both the master (MSPI) and target (flash) device. However, the presence of delays in the system complicates the timing, and the timing diagram shown in Figure 15 demonstrates how transmission (TX) delays are accommodated in the MSPI interface design with a SCLK = 48 MHz or lower.

CLK (int) refers to the internal 96 MHz clock used by the MSPI, and SCLK (int) and MOSI (int) are the internal chip timings for the outgoing clock and MOSI lines. Likewise, the @ Dev signals indicate the SCLK and MOSI timing at the target device's pins. (Delays shown are just representative and may not reflect actual device timings.)


Figure 15. MSPI TX Interface Timing at SCLK = 48 MHz

Bit transmission from the MSPI to the target is fairly straightforward since both the SCLK and MOSI are delayed by similar amounts (red arrows). Depending on which pins are used, there may be some skew between the SCLK and MOSI, however, it should be relatively small compared to the half-cycle of setup time. If required, setting the TXNEG bit to 1 launches MOSI a half cycle (~5 ns) later, which is indicated by the "TXNEG = 1" dashed green line.

#### NOTE

The option to set TXNEG to 0 does not apply when SCLK = 96 MHz. Clocking at this SCLK rate requires TXNEG to be set to 1.

Target to master (MISO) timings on the MSPI interface are a bit more difficult to handle because of the cumulative round trip delay that consists of the clock delay from master to target, the access time at the target itself, and the return delay MISO path (red arrows). For this reason, read timings often dictate the frequency of the SPI bus.

When using a SCLK of 48 MHz or lower, the RXCAP and RXNEG bits may be used together to determine the incoming RX data capture point. In an ideal world (zero delays), the MSPI would capture data at the rising edge of the internal SCLK, which would correspond to the setting of RXCAP = 0 / RXNEG = 0 (the first vertical green dashed bar in Figure 16). It is useful, however, to push out the RX capture point to accommodate the late arrival of MISO. A setting of RXCAP = 0 / RXNEG = 1 delays the capture point by about 5 ns (one half period of the internal 96 MHz clock) as indicated by the second dashed green line.

The MSPI also supports RXCAP =1 / RXNEG = 0 and RXCAP= 1 / RXNEG=1 combinations, which delay capture of data by 10 ns and 15 ns, respectively.



Figure 16. MSPI RX Interface Timing at SCLK = 48 MHz

With SCLK = 96 MHz (for an MSPI instance that supports that rate), the RXCAP delay is not available and the RXNEG bit is the single setting to lengthen the sampling delay time. The MSPI captures data at the rising edge of the internal SCLK with a setting of RXNEG = 0 (the first vertical green dashed line in Figure 20). To push out the RX capture point to accommodate a later arrival of MISO, a setting of RXNEG = 1 is used to delay the capture point by about 5 ns (one half period of the internal 96 MHz clock) as indicated by the second dashed green line.





# 12. I<sup>2</sup>C/SPI Master (IOM)

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

# 12.1 Programmer's Reference

An example register sequence to initiate an operation is shown below (note this does not show the data portion of the operation, only the command):

SPI SAMPLE OPERATION: // Enable clock for 24MHz SPI operation AM\_REG(IOM,CLKFG) =( 0 << AM\_REG\_IOM\_CLKCFG\_LOWPER\_S) | ( 0 << AM\_REG\_IOM\_CLKCFG\_TOTPER\_S) | ( 0 << AM\_REG\_IOM\_CLKCFG\_DIVEN\_S ) | ( 1 << AM\_REG\_IOM\_CLKCFG\_DIV3\_S ) | ( 1 << AM\_REG\_IOM\_CLKCFG\_FSEL\_S ) | ( 1 << AM\_REG\_IOM\_CLKCFG\_IOCLKEN\_S );

// Setup the SPI configuration register.MSB first, no flow control, not full duplex, mode 0

AM\_REG(IOM, SPICFG) = ((0 << AM\_REG\_IOM\_MSPICFG\_MSPIRST\_S) & AM\_REG\_IOM\_MSPICFG\_MSPIRST\_M)| ((0 << AM\_REG\_IOM\_MSPICFG\_DOUTDLY\_S) & AM\_REG\_IOM\_MSPICFG\_DOUTDLY\_M)| ((0 << AM\_REG\_IOM\_MSPICFG\_DINDLY\_S) & AM\_REG\_IOM\_MSPICFG\_DINDLY\_M) | ((0 << AM\_REG\_IOM\_MSPICFG\_SPILSB\_S) & AM\_REG\_IOM\_MSPICFG\_SPILSB\_M) | ((0 << AM\_REG\_IOM\_MSPICFG\_RDFCPOL\_S) & AM\_REG\_IOM\_MSPICFG\_RDFCPOL\_M)| ((0 << AM\_REG\_IOM\_MSPICFG\_WTFCPOL\_S) & AM\_REG\_IOM\_MSPICFG\_WTFCPOL\_M)| ((0 << AM\_REG\_IOM\_MSPICFG\_WTFCIRQ\_S) & AM\_REG\_IOM\_MSPICFG\_WTFCPOL\_M)| ((0 << AM\_REG\_IOM\_MSPICFG\_MOSIINV\_S) & AM\_REG\_IOM\_MSPICFG\_MOSIINV\_M)| ((0 << AM\_REG\_IOM\_MSPICFG\_RDFC\_S) & AM\_REG\_IOM\_MSPICFG\_RDFC\_M)| ((0 << AM\_REG\_IOM\_MSPICFG\_RDFC\_S) & AM\_REG\_IOM\_MSPICFG\_RDFC\_M)| ((0 << AM\_REG\_IOM\_MSPICFG\_WTFC\_S) & AM\_REG\_IOM\_MSPICFG\_FULLDUP\_M)| ((0 << AM\_REG\_IOM\_MSPICFG\_FULLDUP\_S) & AM\_REG\_IOM\_MSPICFG\_FULLDUP\_M)| ((0 << AM\_REG\_IOM\_MSPICFG\_SPHA\_S) & AM\_REG\_IOM\_MSPICFG\_SPHA\_M)| ((0 << AM\_REG\_IOM\_MSPICFG\_SPOL\_S) & AM\_REG\_IOM\_MSPICFG\_SPOL\_M);

# 12.2 Interface Clock Generation

The I<sup>2</sup>C/SPI Master (IOM) can generate a wide range of I/O interface clocks, as shown in Figure 18. The source clock is a scaled version of the HFRC 96 MHz clock, selected by IOMn\_CLKCFG\_FSEL. A divideby-3 circuit may be selected by IOMn\_CLKCFG\_DIV3, which is particularly important in creating a useful SPI frequency of 32 MHz. The output of the divide-by-3 circuit may then be divided by an 8-bit value, IOMn\_CLKCFG\_TOTPER + 1, to produce the interface clock. This structure allows very precise specification of the interface frequency, and produces a minimum available interface frequency of 1.2 kHz.

If TOTPER division is enabled by IOMn\_CLKCFG\_DIVEN, the length of the low period of the clock is specified by IOMn\_CLKCFG\_LOWPER + 1. Otherwise, the clock will have a 50% duty cycle.



Figure 18. I<sup>2</sup>C/SPI Master Clock Generation

# 12.3 Command Operation

In order to minimize the amount of time the CPU must be awake during I<sup>2</sup>C/SPI Master operations, the architecture of the I<sup>2</sup>C/SPI Master is organized around processing commands which transfer data to and from an internal 64-byte FIFO.

The IOMn\_CMD register is used for command operations for both the SPI and I2C communication channels.

For writes to the interface, software writes data to the FIFO (IOMn\_FIFO) and then sends a single command to the IOMn\_CMD Register. Unless the TSIZE field of the CMD is zero, at least one word (4 bytes) of data must be written into the FIFO prior to writing the CMD Register or an ICMD interrupt will be generated and the operation will be terminated. The Command includes either the I<sup>2</sup>C slave address or the SPI channel select, the desired address offset and the length of the transfer. At that point the I<sup>2</sup>C/SPI Master executes the entire transfer, so the CPU can go to sleep. If more than 64 bytes are to be transferred, the Master will generate a THR interrupt when the FIFOSIZ value, IOMn\_FIFOPTR\_FIFOSIZ, drops below the write threshold IOMn\_FIFOTHR\_FIFOWTHR so the CPU can wake up and refill the FIFO. The I<sup>2</sup>C/SPI Master will generate the CMDCMP interrupt when the command is complete. In each case, the total number of bytes transferred in each operation is specified in the LENGTH field of the CMD Register. If software executes a write to the FIFO when it is full (FIFOSIZ is greater than 124) the FOVFL interrupt will be generated and the transfer will be terminated.

For reads, the CMD Register is first written with the command and the CPU can go to sleep. The Master initiates the read and transfers read data to the FIFO. If the FIFOSZ value exceeds the read threshold IOMn\_FIFOTHR\_FIFORTHR, a THR interrupt is generated so the CPU can wake up and empty the FIFO. A CMDCMP interrupt is also generated when the Command completes. If software executes a read from the FIFO when it has less than a word of data the FUNDFL interrupt will be generated and the transfer will be terminated. FUNDFL will not be generated if the read transfer has already completed, so that software can read the last FIFO word even if it is incomplete.

If the FIFO empties on a write or fills on a read, the I<sup>2</sup>C/SPI Master will simply pause the interface clock until the CPU has read or written a byte from the FIFO. This avoids the requirement that the thresholds be set conservatively so that the processor can wake up fewer times on long transfers without a risk of an underflow or overflow aborting a transfer in progress.

If software initiates an incorrect operation, such as attempting to read the FIFO on a write operation or when it is empty, or write the FIFO on a read operation or when it is full, the Master will generate an IACC error interrupt. If software attempts to write the Command Register when another Command is underway

or write the CMD register with a write command when the FIFO is empty (unless the LENGTH field in the CMD is zero), the Master will generate an ICMD error interrupt.

# 12.4 FIFO

The I<sup>2</sup>C/SPI Master includes a 64-byte local RAM (LRAM) for data transfers. The LRAM functions as a FIFO. Only 32-bit word accesses are supported to the FIFO from the CPU. When a write operation is underway, a word written to the FIFO will increment the IOMn\_FIFOPTR\_FIFOnSIZ register by 4 and decrement the IOMn\_FIFOPTR\_FIFOnREM register by 4. Reading a byte from the FIFO via the I/O interface decrements FIFOnSIZ by 1 and increments FIFOnREM by 1. When a read operation is underway, a word read from the FIFO decrements FIFOnSIZ by 4 and increments FIFOnREM by 4. A byte read from the I/O interface into the FIFO increments FIFOnSIZ by 1 and decrements FIFOnREM by 1. If FIFOnSIZ becomes one during a write operation or 0x40 on a read operation and there is more data to be transferred, the clock of the I/O interface is paused until software accesses the FIFO.

Two threshold registers, FIFORTHR and FIFOWTHR indicate when a THR interrupt should be generated to signal the processor that data should be transferred.

# 12.5 I<sup>2</sup>C Interface

The I<sup>2</sup>C/SPI Master supports a flexible set of Commands to implement a variety of standard I<sup>2</sup>C operations. The I<sup>2</sup>C interface consists of two lines: one bi-directional data line (SDA) and one clock line (SCL). Both the SDA and the SCL lines must be connected to a positive supply voltage via a pull-up resistor. By definition, a device that sends a message is called the "transmitter", and the device that accepts the message is called the "receiver". The device that controls the message transfer by driving SCL is called "master". The devices that are controlled by the master are called "slaves". The I<sup>2</sup>C Master is always a master device.

The following protocol has been defined:

- Data transfer may be initiated only when the bus is not busy.
- During data transfer, the data line must remain stable whenever the clock line is high.
- Changes in the data line while the clock line is high will be interpreted as control signals.

A number of bus conditions have been defined (see Figure 19) and are described in the following sections



Figure 19. Basic I<sup>2</sup>C Conditions

I2C operations may transfer up to 512 bytes in a single transfer.

### 12.5.1 Bus Not Busy

Both SDA and SCL remain high.

# 12.5.2 Start Data Transfer

A change in the state of SDA from high to low, while SCL is high, defines the START condition. A START condition which occurs after a previous START, but before a STOP, is called a RESTART condition, and functions exactly like a normal STOP followed by a normal START.

### 12.5.3 Stop Data Transfer

A change in the state of SDA from low to high, while SCL is high, defines the STOP condition.

### 12.5.4 Data Valid

After a START condition, SDA is stable for the duration of the high period of SCL. The data on SDA may be changed during the low period of SCL. There is one clock pulse per bit of data. Each data transfer is initiated with a START condition and terminated with a STOP condition. The number of data bytes transferred between the START and STOP conditions is not limited. The information is transmitted byte-wide and each receiver acknowledges with a ninth bit.

### 12.5.5 Acknowledge

Each byte of eight bits is followed by one acknowledge (ACK) bit as shown in Figure 20. This acknowledge bit is a low level driven onto SDA by the receiver, whereas the master generates an extra acknowledge related SCL pulse. A slave receiver which is addressed is obliged to generate an acknowledge after the reception of each byte. Also, on a read transfer, a master receiver must generate an acknowledge after the reception of each byte that has been clocked out of the slave transmitter. The device that acknowledges must pull down the SDA line during the acknowledge related SCL pulse. A master receiver must signal an end-of-data to the slave transmitter by not generating an acknowledge (a NAK) on the last byte that has been clocked out of the state must leave the data line high to enable the master to generate the STOP condition. If I/O Host attempts an I<sup>2</sup>C operation but no slave device generates an ACK, or if a slave fails to generate an ACK on a data byte before the transfer is complete, a NAK interrupt will be generated.





# 12.5.6 *I*<sup>2</sup>C Slave Addressing

For normal I<sup>2</sup>C reads and writes, the Command specifies the address to be sent on the interface. Both 7bit and 10-bit addressing are supported, as selected by 10BIT in the Command. The address is specified in the ADDRESS field.

Figure 21 shows the operation in 7-bit mode in which the master addresses the slave with a 7-bit address configured as 0xD0 in the lower 7 bits of the ADDRESS field. After the START condition, the 7-bit address is transmitted MSB first. If this address matches the lower 7 bits of an attached slave device, the eighth bit indicates a write (RW = 0) or a read (RW = 1) operation and the slave supplies the ACK. If no slave acknowledges the address, the transfer is terminated and a NAK error interrupt is generated.



Figure 21. I<sup>2</sup>C 7-bit Address Operation

Figure 22 shows the operation with which the master addresses the slave with a 10-bit address configured at 0x536. After the START condition, the 10-bit preamble 0b11110 is transmitted first, followed by the upper two bits of the ADDRESS field and the eighth bit indicating a write (RW = 0) or a read (RW = 1) operation. If the upper two bits match the address of an attached slave device, it supplies the ACK. The next transfer includes the lower 8 bits of the ADDRESS field, and if these bits also match I2CADDR the slave again supplies the ACK. If no slave acknowledges either address byte, the transfer is terminated and a NAK error interrupt is generated.



Figure 22. I<sup>2</sup>C 10-bit Address Operation

# 12.5.7 I<sup>2</sup>C Offset Address Transmission

If the OFFSETCNT field of the CMD register specifies that there is at least one byte of address offset for either a read or write command, then the I<sup>2</sup>C/SPI Master will first send one or more 8-bit Offset Address bytes, where the offset is specified in the OFFSETLO field of the CMD register, and the OFFSETHI field of the OFFSETHI register if multiple offset bytes have been specified.

This transfer is shown in Figure 23. The Offset Address is loaded into the Address Pointer of the slave.



Figure 23. I<sup>2</sup>C Offset Address Transmission

# 12.5.8 *I*<sup>2</sup>C Write Operation with Address Offset

In a write operation the I<sup>2</sup>C/SPI Master transmits to a slave receiver. The Address Operation has a RW value of 0, and the second byte contains the Offset Address, as in Figure 23. The next byte is written to the slave register selected by the Address Pointer (which was loaded with the Offset Address) and the Address Pointer is incremented. Subsequent transfers write bytes into successive registers until a STOP condition is received, as shown in Figure 24.



Figure 24. I<sup>2</sup>C Write Operation with Address Offset

### 12.5.9 I<sup>2</sup>C Read Operation with Address Offset

If a read operation with address offset is selected, the I<sup>2</sup>C/SPI Master first executes an Offset Address Transmission to load the Address Pointer of the slave with the desired Offset Address.

A subsequent operation will again issue the address of the slave but with the RW bit as a 1 indicating a read operation. As shown in Figure 25, this transaction begins with a RESTART condition so that the interface will be held in a multi-master environment. After the address operation, the slave becomes the transmitter and sends the register value from the location pointed to by the Address Pointer, and the Address Pointer is incremented. Subsequent transactions produce successive register values, until the  $I^2C/SPI$  Master receiver responds with a NAK and a STOP to complete the operation.





# 12.5.10 I<sup>2</sup>C Write Operation with No Address Offset

If a write with no address offset is selected in the CMD and OFFSETCNT fields of the CMD register, the I<sup>2</sup>C/SPI Master does not execute the Offset Address Transmission, but simply begins transferring bytes as shown in Figure 26. This provides support for slave devices which do not implement the standard offset address architecture. The OFFSETHI and OFFSETLO fields are not used in this case.



### Figure 26. I<sup>2</sup>C Write Operation with No Address Offset

# 12.5.11 I<sup>2</sup>C Read Operation with No Address Offset

If a read with no address offset is selected in the CMD and OFFSETCNT fields of the CMD register, the I<sup>2</sup>C/SPI Master does not execute the Offset Address Transmission, but simply begins transferring bytes with a read as shown in Figure 27. This is important for slave devices which do not support an Address Pointer architecture. For devices which do include an Address Pointer, multiple reads with no address offset may be executed after a read with address offset to access subsequent registers as the Address Pointer increments, without having to execute the Offset Address Transmission for each access.



Figure 27. I<sup>2</sup>C Read Operation with No Address Offset

### 12.5.12 Holding the Interface with CONT

In all of the previously described transactions, the  $I^2C/SPI$  Master terminates the  $I^2C$  operation with a STOP sequence. In environments where there are other masters connected to the  $I^2C$  interface, it may be necessary for the MCU to hold the interface between Commands to insure that another master does not inadvertently access the same slave that the MCU is accessing. In order to implement this functionality, the CONT bit should be set in the CMD Register. This will cause the  $I^2C/SPI$  Master to keep SDA high at the end of the transfer so that a STOP does not occur, and the next transaction begins with a RESTART instead of a START. Note that for a Normal Read the interface is held between the Offset Address Transmission and the actual read independent of the state of CONT, but it CONT is set the read transaction will not terminate with a STOP.

# 12.5.13 I<sup>2</sup>C Multi-master Arbitration

The  $I^2C/SPI$  Master supports multi-master arbitration in  $I^2C$  mode. There are two cases which must be handled.

The first is the case where another master initiates an  $I^2C$  operation when the MCU Master is inactive. In this case the  $I^2C/SPI$  Master will detect an  $I^2C$  START operation on the interface and the START interrupt will be asserted, which tells the software not to generate any IO operations (which will not be executed in any case). Software then waits for the STOP interrupt, which reenables operation.

The second case is where another master initiates an operation at the same time as the MCU. In this case there will be a point where one master detects that it is not driving SDA low but the bus signal is low, and that master loses the arbitration to the other master. If the MCU I<sup>2</sup>C/SPI Master detects that it has lost arbitration, it will assert the ARB interrupt and immediately terminate its operation. Software must then wait for the STOP interrupt and re-execute the current Command.

# 12.6 SPI Operations

### 12.6.1 SPI Configuration

The I<sup>2</sup>C/SPI Master supports all combinations of the polarity (CPOL) and phase (CPHA) modes of SPI using the IOMn\_MSPICFG\_SPOL and IOMn\_MSPICFG\_SPHA bits. It also may be configured in either 3-wire or 4-wire mode.

In 4-wire mode, the MOSI and MISO interface signals use separate IO pins. In this mode the GPIO used for the MOSI signal should be configured with the GPIO\_PINCFGn\_FOENn bit set to force output enable active.

In 3-wire mode, MOSI and MISO are multiplexed on a single IO pin for more efficient pin utilization. The 3/ 4 wire configuration is selected in the mapping function of the PINCFG module.

SPI operations may transfer up to 4095 bytes in a single transfer, as the TSIZE field in the CMD register provides a 12-bit length specification.

### 12.6.2 SPI Slave Addressing

In SPI mode, the Command specifies the slave channel to be used in the CMDSEL field. The I<sup>2</sup>C/SPI Master supports up to four slaves, each of which has its own nCE signal which can be configured on an IO pin. Additional slaves may be supported using GPIO pins and external decoding.

### 12.6.3 SPI Write with Address Offset

Figure 28 shows the case of a SPI Write with a one-byte address offset operation, whereby a write operation is selected in the CMD field. The operation is initiated when the I<sup>2</sup>C/SPI Master pulls one of the four nCE signals low. At that point the I<sup>2</sup>C/SPI Master begins generating the clock on SCK and the offset address is transmitted from the master on the MOSI line, with the upper R/W bit of the offset field indicating read (if 0) or write (if 1). In this example the R/W bit is a one selecting a write operation. The entire multi-byte offset, the length of which is specified by the OFFSETCNT field, is taken from the OFFSETLO field of the CMD and, depending on the value in OFFSETCNT, the OFFSETHI field in the OFFSETHI register. The MSB of the entire OFFSET should be set to 1 if the slave expects a RW bit. If the slave does not expect a RW bit, this allows the first byte of a write to be completely specified in the OFFSET field, and a single byte write in that case can be executed without requiring any data to be loaded in to the FIFO.

Each subsequent byte is read from the FIFO and transmitted. The operation is terminated when the  $I^2C/SPI$  Master brings the nCE signal high. Note that the MISO line is not used in a write operation and is held in the high impedance state by the  $I^2C/SPI$  Master.





### 12.6.4 SPI Read with Address Offset

Figure 29 shows the case of a Read with a one-byte address offset operation, whereby a read operation is selected in the CMD field. The operation is initiated when the I<sup>2</sup>C/SPI Master pulls one of the four nCE signals low. At that point the I<sup>2</sup>C/SPI Master begins driving the clock onto SCK and the address is transferred from the master to the slave just as it is in a write operation, but in this case the R/W bit is a 0 indicating a read. After the transfer of the last address bit (bit 0), the I<sup>2</sup>C/SPI Master stops driving the MOSI line and begins loading the FIFO with the data on the MISO line. The transfer continues until the I<sup>2</sup>C/SPI Master brings the nCE line high.



#### Figure 29. SPI Normal Read Operation

As with a Normal Write, the Offset Address byte including the R/W bit is taken from the offset field(s) of CMD. If the slave expects a R/W bit, the msb of the offset must be set accordingly. This allows reads from devices which have different formats for the address byte.

### 12.6.5 SPI Write with No Address Offset

If a write with no address offset is selected in the CMD field, the Offset Address byte is not sent and all data comes directly from the FIFO as shown in Figure 30. The OFFSET field is not used in this case.



### Figure 30. SPI Raw Write Operation

### 12.6.6 SPI Read with No Address Offset

If a read with no address offset is selected in the CMD field, data goes directly to the FIFO as shown in Figure 31. The OFFSET field is not used in this case.





### 12.6.7 SPI 3-wire Mode

In 3-wire mode, the MOSI and MISO lines are shared on a single pin. As described in the previous sections, the MISO and MOSI lines are not driven at the same time, so 3-wire mode is equivalent to simply tying them together external to the MCU. 3-wire mode is configured by selecting the MxWIR3 alternative (x = 0 to 5 selecting the I2C/SPI Master) in the GPIO Pad Multiplexor rather than the MxMOSI and MxMISO alternatives. Detailed configuration information is supplied in the GPIO and Pad Configuration Module chapter.

### 12.6.8 Complex SPI Operations

In some cases peripheral devices require more complex transaction sequences than those supported by a single Command. In order to support these transactions, the CONT bit may be set in the Command. In this case, the nCE pin selected by the Channel will remain asserted low at the end of the transaction, so that the next SPI operation will be seen as part of the same transaction. For example, there are peripheral devices which require both a Function and an Address Offset to be transmitted at the beginning of a read. Implementing this can be done in several ways. One example as shown in Figure 32 is:

- 1. Execute a Raw SPI write of length 2, with the data bytes being the Function and Offset. Set the CONT bit in this Command so nCE remains asserted low.
- 2. Execute a Raw SPI Read of the desired transfer length. The data will then be read into the FIFO. The CONT bit is not set in this Command.



#### Figure 32. SPI Combined Operation

### 12.6.9 SPI Polarity and Phase

The MCU supports all combinations of CPOL (clock polarity) and CPHA (data phase) in SPI mode, as defined by the SPOL and SPHA bits. Figure 33 shows how these two bits affect the interface signal behavior.



#### Figure 33. SPI CPOL and CPHA

If CPOL is 0, the clock SCK is normally low and positive pulses are generated during transfers. If CPOL is 1, SCK is normally high and negative pulses are generated during transfers.

If CPHA is 0, the data on the MOSI and MISO lines is sampled on the edge corresponding to the first SCK edge after nCE goes low (i.e. the rising edge if CPOL is 0 and the falling edge if CPOL is 1). Data on MISO and MOSI is driven on the opposite edge of SCK.

If CPHA is 1, the data on the MOSI and MISO lines is sampled on the edge corresponding to the second SCK edge after nCE goes low (i.e. the falling edge if CPOL is 0 and the rising edge if CPOL is 1). Data on MISO and MOSI is driven on the opposite edge of SCK.

The SPOL and SPHA bits may be changed between Commands if different slave devices have different requirements. In this case the SUBMODCTRL\_SMODnEN bit should be set to 0 either before or at the same time as SPHA and SPOL are changed, and then set back to 1 before CMD is written.

### 12.7 Bit Orientation

In both I<sup>2</sup>C and SPI modes, the I<sup>2</sup>C/SPI Master supports data transmission either LSB first or MSB first as configured by the LSB bit in the Command. If LSB is 0, data is transmitted and received MSB first. If LSB is 1, data is transmitted and received LSB first.

# 12.8 SPI Flow Control

The I<sup>2</sup>C/SPI Master supports flow control from the slave, which is controlled by several configuration bits. Either read or write (or both) flow control may be implemented. Read flow control is enabled by setting the IOMn\_MSPICFG\_RDFC bit, in which case the I<sup>2</sup>C/SPI Master will check the state of the Flow Control IRQ pin, and if it is inactive the SPI clock will stop at the completion of the current byte transfer until it becomes active. The Flow Control IRQ can be any of the 50 pins as selected by the GPIO\_IOMnIRQ register corresponding to the particular I<sup>2</sup>C/SPI Master. The polarity of the active state of the Flow Control IRQ is selected by the IOMn\_MSPICFGn\_RDFCPOL.

Write flow control is enabled by setting the IOMn\_MSPICFGn\_WTFC bit, but in this case either the Flow Control IRQ or the state of the MISO line may be used for flow control, as selected by the IOMn\_MSPICFGn\_WTFCIRQ bit. If IRQ is selected by setting a one, the clock control is identical to that described for reads above and the IRQ polarity is set by the IOMn\_MSPICFGn\_WTFCPOL bit. If MISO is selected by setting a zero in WTFCIRQ, the clock will be stopped if the MISO line is at the inactive polarity, which is set by the WTFCPOL bit.

Slave devices supporting flow control typically require specific states of the MOSI line prior to the start of a transfer. This state is controlled by the IOMn\_MSPICFGn\_MOSIINV bit. If this bit is zero, MOSI will be driven to a 1 at the start of a write transaction and to a 0 at the start of a read transaction – this is the normal operation of devices with flow control support. If MOSIINV is set to one, these polarities will be inverted.

Flow control may be asserted either prior to the first byte transfer, which will delay the start of SCK, or within each byte transferred, which will pause SCK at the end of that byte. The examples below assume that WTFCPOL or RDFCPOL are set to 0.

Figure 34 shows the operation of flow control at the beginning of a write transfer or a normal read transfer which begins with an offset byte write. Either MISO or IRQ (selected by WTFCIRQ) must be deasserted low within ½ of the SCK period after nCE is asserted low in order to delay the clock. SCK will continue in its inactive state until MISO or IRQ is changed to the active state, and then will begin normal operation.



Figure 34. Flow Control at Beginning of a Write Transfer

Figure 35 shows the operation of flow control at the beginning of a raw read transfer. IRQ must be deasserted low within ½ of the SCK period after nCE is asserted low in order to delay the clock. SCK will continue in its inactive state until IRQ is changed to the active state, and then will begin normal operation.



### Figure 35. Flow Control at Beginning of a Raw Read Transfer

Figure 36 shows the operation of flow control in the middle of a write transfer. MISO or IRQ must be deasserted after the leading edge of SCK on the first bit of the byte (labeled 7) and before the falling edge of the 7th bit of the byte (labeled 1) in order to insure that SCK stops at the end of the byte. De-asserting MISO or IRQ outside of that window can produce unpredictable results. SCK will resume at some point after the assertion of MISO or IRQ.



Figure 36. Flow Control in the Middle of a Write Transfer

Figure 37 shows the operation of flow control in the middle of a read transfer. IRQ must be deasserted after the leading edge of SCK on the first bit of the byte (labeled 7) and before the falling edge of the 7th bit of the byte (labeled 1) in order to insure that SCK stops at the end of the byte. De-asserting IRQ outside of that window can produce unpredictable results. SCK will resume at some point after the assertion of IRQ.



Figure 37. Flow Control in the Middle of a Read Transfer

### 12.9 Minimizing Power

Each I<sup>2</sup>C/SPI Master submodule has an interface enable bit IOMn\_SUBMODCTRL\_SMODnEN. This bit should be kept at 0 (along with PWRCTRL\_DEVPWREN\_PWRIOMn fields) whenever the interface is not being used in order to minimize power consumption.

# 13. I<sup>2</sup>C/SPI Slave (IOS)

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

# 13.1 Local RAM Allocation

The I<sup>2</sup>C/SPI Slave (IOS) is built around a 256-byte local RAM (LRAM), through which all data flows between the CPU AHB and the IO interface. The I<sup>2</sup>C/SPI Slave supports a 128-byte offset space when accessed from the I/O interface.

The LRAM is divided into three separate areas on 8-byte boundaries. These areas are:

- A Direct Area for direct communication between the host and the SoC, which is mapped between the AHB address space and the I/O address space. This area is from LRAM address 0x00 to the address calculated from the 5-bit FIFOBASE field in the FIFO configuration register (FIFOCFG), minus 1. This 5-bit field (IOSLAVE\_FIFOCFG\_FIFOBASE) should contain a value that represents the start of the FIFO Area and, in so doing, defines the size of the Direct Area in 8-byte segments. Part of this area can be defined as IO Slave Read-only starting at any 8-byte segment defined by IOSLAVE\_FIFOCFG\_ROBASE and extending through the end of the Direct Area at FIFOBASE\*8-1.
- 2. A FIFO Area which is used to stream data from the MCU. This memory is directly addressed from the AHB, but accessed from the I/O Interface using a single I/O address 0x7F as a streaming port. The FIFO area is from the LRAM address calculated from the value in the FIFOBASE field, FIFOBASE\*8, to the LRAM address calculated from the value in the FIFOMAX field of the FIFO-CFG register, IOSLAVE\_FIFOCFG\_FIFOMAX.The upper FIFO Area address is FIFOMAX\*8-1. The maximum value for FIFOMAX is 0x20, which would result in an upper FIFO Area address of 0xFF.
- 3. A RAM Area which is accessible only from the AHB Slave. The RAM area is from the LRAM address calculated from the value in the FIFOMAX field of the FIFOCFG register, IOSLAVE\_FIFO-CFG\_FIFOMAX, to address 0xFF. Setting FIFOMAX to 0x20 would result in a RAM area of zero size.

The data in the LRAM is maintained in Deep Sleep Mode.

Figure 38 below shows the LRAM address mapping between the I/O interface and the AHB.



Figure 38. I<sup>2</sup>C/SPI Slave Module LRAM Addressing in Standard Address Mode

# 13.2 Direct Area Functions

The Direct Area is used for direct communications between the interface Host and the MCU. The Host may write a register in this Register Access space, called REGACC, and read it back without requiring the CPU to wake up, so that very low-power interactions are supported. In some cases, however, accesses require interaction with the CPU.

REGACC interrupts are mapped in the Direct Area and operate as follows. Each REGACC interrupt status bit will be set whenever there is a read or write over the I2C or SPI interface in the Direct Area with an offset address which corresponds to a particular REGACC interrupt. Table 69 in Section 13.2.2 lists the offsets to memory locations within the Direct Area and corresponding interrupt bit settings in the REGACCINTSTAT register.

I/O writes to locations 0x0-0xF will set a corresponding interrupt flag in the REGACCINTSTAT register. These locations are typically used for specific commands to the MCU. Note that not all flags need generate an actual interrupt, so small multi-byte commands may be transmitted in this area. For example, a write to location 0x0 will set bit 31 of the REGACCINTSTAT register, a write to location 0x1 will set bit 30 of REGACCINTSTAT, and a write to location 0xF will set bit 16 of the REGACCINTSTAT register.

The upper 16 REGACC interrupts are each generated on an access to a multi-byte range. In this standard mode, each upper 16 REGACC interrupt covers a range which is identified as the last byte of a 32-bit word, starting at 0x10. I/O writes to locations 0x10 to 0x4F will set a corresponding interrupt flag in the REGACCINTSTAT register if the I/O address modulo 4 is 3 (i.e., addresses 0x13, 0x17, 0x1B, etc.). This allows transfers to be sent in a burst with a trigger being generated on the last write, and it also allows specifying a data buffer of any whole word size to have an interrupt generated on access to the last byte of the buffer.

For example, a write to location 0x13 will set bit 15 of the REGACCINTSTAT register, a write to location 0x17 will set bit 14 of REGACCINTSTAT, and a write to location 0x4F will set bit 0 of the REGACCINTSTAT

register. Note that in this mode there are no interrupts which would be set for word accesses in the transaction beyond the first 80 bytes (0x0 to 0x4F address range).

### 13.2.1 Host Burst Write Support

An unlimited-length host burst write mode, enabled by setting the CFG\_WRAPPTR bit, is offered to extend a host write transaction length to any arbitrary size and with ability to generate interrupts on longer (8-byte and 16-byte) address ranges.

This mode re-purposes the FIFO and controls (from purely host read functionality) to be used for host burst write buffering. The size of the FIFO can be set to receive a section, or all, of a host burst transaction. The arbitrarily long transaction size is achieved by the hardware automatically wrapping around the address pointer to the beginning of the FIFO, FIFOBASE\*8, after it hits the end of the FIFO at FIFOMAX\*8-1, essentially creating a hardware ring buffer.

The FIFO base address, as set in the IOSLAVE\_FIFOCFG\_FIFOBASE field, can establish a FIFO start address (and wrap-to address) on any 8-byte segment starting at 0x0 through 0xF8. Additionally, the address pointer automatically skips Direct Area locations 0x78 to 0x7F (if the FIFO Area encompasses these locations) to avoid writing to the Host Registers during a host burst write. The wrap-to address (=FIFOBASE) must not be defined as a location within the Host Register space (address 0x78 when IOSLAVE\_FIFOCFG\_FIFOBASE = 0xF), as undefined results will occur.

For burst writes > 248 bytes, the end address of the FIFO, as defined by IOSLAVE\_FIFOCFG\_FIFOMAX, should be set to maximize each iteration before wraparound occurs. This value is 248 (0xF8) bytes because the address pointer skips addresses 0x78-0x7F. Therefore FIFOMAX should be set to 0x100 to get the largest (0xF8) transfer before wraparound occurs.

A "mapped address" is created in LRAM whose offset is the same as the address pointer when (address pointer < 78), and is (address pointer - 8) when the address pointer is larger than 0x7F. In this mapping to the LRAM, transfers to 0x80, for example, are mapped to LRAM address 0x78, and so on, to prevent a "hole" in the LRAM which would cause overhead in the software to work around this. The highest LRAM address which can be written in this mode is 0xF7.

With respect to REGACC interrupts, multi-byte ranges for each of the 16 higher address interrupts which occur upon Direct Area accesses are different sizes in this Host Burst Write Mode than for the standard Host Read Mode. This is as such to be able to cover the entire iteration of 0x0 to 0xF7 (248) bytes, with the 32 possible interrupts. After the first 16 single-byte locations, 0x0-0xF, for which each sets a corresponding interrupt flag (INT31-INT16) in the REGACCINTSTAT register, the multi-byte ranges are either 8 bytes or 16 bytes long. In Table 69 they are identified with the last byte of an 8-byte or 16-byte range, starting at Direct Area address 0x10. I/O writes to bytes 0x10 to 0xF7 in a 248-byte iteration of a transaction sets a corresponding interrupt flag in the REGACCINTSTAT register when the I/O address corresponds to the last byte location for any of the ranges. (i.e., addresses 0x17, 0x1F, 0x2F, etc.). In this mode, a write to location 0x17 will set bit 15 of the REGACCINTSTAT register, a write to location 0x1F will set bit 14 of REGACCINTSTAT, and a write to location 0xF7 will set bit 0 of the REGACCINTSTAT register.

Note that the application software must process the data loaded in the FIFO in this mode before a wraparound when new data will overwrite any existing data. As well, the application must service any enabled REGACC interrupts and re-arm interrupts on time and as required by the application before new data is loaded.

NOTE

IO Slave Read-Only address mapping, as specified by the IOSLAVE\_ FIFOCFG\_ROBASE setting, is not supported in Host Burst Write Mode (IOSLAVE\_CFG\_WRAPPTR = 1).

### 13.2.2 Mapping of Direct Area Access Interrupts

Table 69 shows the memory locations within the Direct Area for each of the Direct Area access interrupt bits in the REGACCINTSTAT register. The lower-order register bits corresponding to multi-byte access locations are at different offsets for the Host Read Mode and the Host Burst Write Mode. This is to enable interrupts across the entire section of the longer burst write transactions.

| REGACCINTSTAT Bit | Direct Area Offset Address -<br>Host Read Mode<br>(CFG_WRAPPTR = 0) | Direct Area Offset Address -<br>Host Burst Write Mode<br>(CFG_WRAPPTR = 1) |
|-------------------|---------------------------------------------------------------------|----------------------------------------------------------------------------|
| 31                | 0x0                                                                 | 0x0                                                                        |
| 30                | 0x1                                                                 | 0x1                                                                        |
| 29                | 0x2                                                                 | 0x2                                                                        |
| 28                | 0x3                                                                 | 0x3                                                                        |
| 27                | 0x4                                                                 | 0x4                                                                        |
| 26                | 0x5                                                                 | 0x5                                                                        |
| 25                | 0x6                                                                 | 0x6                                                                        |
| 24                | 0x7                                                                 | 0x7                                                                        |
| 23                | 0x8                                                                 | 0x8                                                                        |
| 22                | 0x9                                                                 | 0x9                                                                        |
| 21                | 0xA                                                                 | 0xA                                                                        |
| 20                | 0xB                                                                 | 0xB                                                                        |
| 19                | 0xC                                                                 | 0xC                                                                        |
| 18                | 0xD                                                                 | 0xD                                                                        |
| 17                | 0xE                                                                 | 0xE                                                                        |
| 16                | 0xF                                                                 | 0xF                                                                        |
| 15                | 0x13                                                                | 0x17                                                                       |
| 14                | 0x17                                                                | 0x1F                                                                       |
| 13                | 0x1B                                                                | 0x2F                                                                       |
| 12                | 0x1F                                                                | 0x3F                                                                       |
| 11                | 0x23                                                                | 0x4F                                                                       |
| 10                | 0x27                                                                | 0x5F                                                                       |
| 9                 | 0x2B                                                                | 0x6F                                                                       |
| 8                 | 0x2F                                                                | 0x7F                                                                       |
| 7                 | 0x33                                                                | 0x8F                                                                       |
| 6                 | 0x37                                                                | 0x9F                                                                       |
| 5                 | 0x3B                                                                | 0xAF                                                                       |
| 4                 | 0x3F                                                                | 0xBF                                                                       |
| 3                 | 0x43                                                                | 0xCF                                                                       |
| 2                 | 0x47                                                                | 0xDF                                                                       |
| 1                 | 0x4B                                                                | 0xEF                                                                       |
| 0                 | 0x4F                                                                | 0xF7                                                                       |

### Table 69: Mapping of Direct Area Access Interrupts and Corresponding REGACCINTSTAT Bits

The REGACCINTSTAT register provides status of the 32 individual write interrupts. If an interrupt is enabled and set, it shows as a high bit in this register. The highest priority REGACC bit is bit 31 (set on

access to address 0x00), and the lowest priority is bit 0 (set on access to address 0x4F in Standard Address Mode or 0xF7 in Address Pointer Wrap Mode). The 5-bit IOSLAVE\_PRENC register provides an encoded value of the highest priority of these interrupts to speed software decoding, and is therefore very useful for quickly servicing the highest priority REGACC interrupt (i.e., the one at the lowest offset address). The encoding works such that if interrupt 31 is set, PRENC will be 0. If interrupt 31 is not set and bit 30 is set, PRENC will be 1, and so on to the point where if bits 31-1 are not set and bit 0 is set PRENC will be 31. If no interrupts are set the value in PRENC is indeterminate.

The final special memory space within the Direct Area is a read-only area for the I/O Host, which is from I/O address (FIFOCFG\_ROBASE \* 8) to (FIFOBASE \* 8 - 1). I/O writes to this address space will not change the LRAM, which allows the space to be used for returning status to the I/O Host. ROBASE should have a minimum value of 0x0A, representing a start address of 0x50 to allow space for special commands and burst writes in lower Direct Area space.

# 13.3 FIFO Area Functions

The FIFO is used to provide very efficient flow of data from the MCU to the I/O Host processor with minimal CPU interaction. A FIFO of up to 1023 bytes can be easily maintained by software, with the oldest bytes residing in the LRAM FIFO Area and the newer data being held in system SRAM and transferred to the I<sup>2</sup>C/SPI Slave on demand. Several hardware features support this operation.

Figure 39 shows the basic FIFO operation. The main FIFO is held in a buffer in SRAM, and the oldest data in that FIFO has been transferred to the FIFO Area of the I/O Slave. The IOSLAVE\_FIFOPTR\_FIFOPTR register points to the next byte to be read on the I/O interface. IOSLAVE\_FIFOPTR\_FIFOSIZ holds the current number of valid bytes in the FIFO on the I<sup>2</sup>C/SPI Slave, and FIFOCTR holds the total number of bytes in the FIFO. The value in IOSLAVE\_FIFOCTR may be read indirectly at any time by the Host processor via the FIFOCTRUP\_FIFOCTRLO registers to determine if there is FIFO data available (and how much is currently in the FIFO). I/O Host access to the FIFO counter is at offset 0x7C/0x7D.

#### WARNING

The host read of the FIFOCTR value via FIFOCTRUP\_FIFOCTRLO is not synchronized to the write clock. So if the host read happens during a FIFOCTR update (either through a read-modify-write of FIFOCTR register or an automatic update because of a write to the FIFOINC register by the Slave CPU), it is possible for the count value to be out of sync, impacting the value read in either or both the upper (FIFOCTRUP) and lower (FIFOCTRLO) bytes. This is a very rare case, but proper code would have the host read the two registers for the FIFOCTR value multiple times until consecutive reads are the same.





When the host reads a byte from the FIFO, the data retrieved is pointed to by FIFOPTR, FIFOPTR is incremented and wraps around in the FIFO Area if it reaches FIFOMAX. FIFOSIZ and FIFOCTR are each decremented by one. The Host can read FIFOCTR and then read that many bytes without further checking. Note that this process can continue without requiring a CPU wakeup. If the Host attempts to read the FIFO when FIFOSIZ is 0, the FUNDFL interrupt flag is set in both the I<sup>2</sup>C Slave interrupt block and in the Host interrupt block.

When FIFOSIZ drops below the configured threshold IOSLAVE\_FIFOTHR the FSIZE interrupt flag is set and if enabled an interrupt is sent to the CPU which will wake it up. At that point, the CPU can move as much data from the SRAM FIFO to the I<sup>2</sup>C/SPI Slave FIFO as possible in a single operation and then go back to sleep. Since the FIFO Area can be quite large, CPU wake-ups will be very infrequent. If a write to the FIFOCTR which would increment the value beyond 1023 occurs, the FOVFL interrupt flag is set.

When some other process, such as a sensor read, produces new data for the FIFO, the CPU will add that data to the FIFO in SRAM, wrapping around as necessary. The IOSLAVE\_FIFOINC register is then written with the number of bytes added to the FIFO, which is added to the FIFOCTR register in an atomic fashion. In this way the Host processor can always determine how much read data is available.

The FIFO interface offset 0x7F is treated uniquely by the I<sup>2</sup>C/SPI Slave, in that an access to this address does not increment the Address Pointer. This allows the Host to initiate a burst read from address 0x7F of any length, and each read will supply the next byte in the FIFO.

# 13.4 Rearranging the FIFO

In normal operation the Host reads the oldest data from the FIFO, and the CPU writes new data onto the FIFO. In some cases it is desirable to modify this process, in particular for the FIFO to provide the newest data. The MCU supports such operation using a special control function.

If software is intended to write the current sample to the front of the FIFO, it first checks the IOSLAVE\_FUPD\_IOREAD status bit to ensure that there is not a Host read operation from the FIFO underway. Once IOREAD is clear, software sets the IOSLAVE\_FUPD\_FIFOUPD bit, writes the new sample data to the front of the FIFO and modifies the FIFOPTR to point to the new data. At that point the FIFOUPD bit is cleared.

If the Host attempts a FIFO read operation while the FIFOUPD is set, a RDERR interrupt will be generated to the Host and the FRDERR interrupt flag will be set. The Host must either poll the RDERR interrupt bit at the end of each operation or configure a hardware interrupt. Note that if the software does not support alternate FIFO ordering, the Host does not have to check the RDERR function.

# **13.5 Interface Interrupts**

The CPU may also signal the Host via the IOINT interrupt, which may be connected to an MCU pin and driven to the Host. Eight interrupts are available to be combined into the IOINT interrupt, and the Host can enable, read, clear and set these interrupts via the I/O interface. Software on the CPU can set 6 of the interrupts (SWINT0 through SWINT5) to communicate a variety of situations to the Host, and the other two interrupts indicate errors such as an attempt by the Host to read the FIFO when it is empty. A CPU interrupt is generated whenever the Host writes any IOINT registers (for example, to clear an interrupt) so the CPU can manage the interrupt interaction.

The I2C/SPI Slave includes a mechanism to allow the Host CPU and the MCU to each interrupt the other via a set of eight interrupts. The Host CPU accesses these interrupts via interface locations 0x78 - 0x7B, and the Apollo accesses these interrupts in the IOINTCTL Register.

The Host CPU may enable or disable any of the eight interrupts by writing the corresponding bit in the IOINTEN field of the IOINTCTL Register, which is accessed by the Host at interface location 0x78. The Host CPU may then clear or set any of the interrupts by writing a 1 to the corresponding bit of the clear (at location 0x7A) or set (at location 0x7B) registers. The current state of all eight interrupts may be read in the IOINT field at location 0x79. Note that this structure is identical to the standard MCU interrupts in all modules. The SoC can read the value of the eight interrupt enables in the IOINTEN field of IOINTCTL, and can read the values of the eight interrupt status bits in the IOINT field of the IOINTCTL register. These two fields are read only. Table 70 summarizes these I/O interface interrupts and how they can be controlled and read.

| RAM<br>Location | IOINT<br>Register <sup>1</sup> | Function             | SoC Register_Field      | Description                                                                                                   |
|-----------------|--------------------------------|----------------------|-------------------------|---------------------------------------------------------------------------------------------------------------|
| 0x78            | IOINTEN                        | I/O Interrupt Enable | IOINTCTL_IOINTEN (R/O)  | Each interrupt can be individually enabled by I/O<br>Host, but can only be read by the SoC.                   |
| 0x79            | IOINT                          | I/O Interrupt State  | IOINTCTL_IOINT (R/O)    | State of each interrupt, set or cleared, can be read by either the I/O Host or by the SoC.                    |
| 0x7A            | IOINTCLR                       | I/O Interrupt Clear  | IOINTCTL_IOINTCLR (W/O  | Each interrupt can be individually cleared by the I/O Host, but the SoC can (only) clear all of them at once. |
| 0x7B            | IOINTSET                       | I/O Interrupt Set    | IOINTCTL_IOINTSET (W/O) | Each interrupt can be individually set by either the I/O Host or the SoC.                                     |

| Table | 70: | I/O | Interface | Interrupt | Control |
|-------|-----|-----|-----------|-----------|---------|
|-------|-----|-----|-----------|-----------|---------|

1. Readable by the I/O Host

The MCU software may set any of the eight interrupt status register bits by writing a 1 to the corresponding bit of the IOINTSET field of the IOINTCTL Register, and may clear all of the interrupts by writing a 1 to the IOINTCLR bit of the IOINTCTL register. This allows the SoC to generate a software interrupt to the Host device. In addition, a FIFO underflow interrupt FUNDFL in the I2C/SPI Slave will set interrupt bit 7, and a FIFO read error interrupt FRDERR will set interrupt bit 6 of the IO interrupt status register IOINT. Note that the SoC software cannot write the IOINTEN register, so that IO interrupts are controlled completely by the Host processor.

If any of the IOINT interrupt bits are set and the corresponding bit in IOINTEN is set, an IOINT interrupt will be generated. If the GPIO configuration registers have configured PAD4 as IOINT, that interrupt will be driven directly onto PAD\_IO[4]. This pin should be connected to an interrupt input pin of the Host interface device so that it can receive the interrupt and service it.

If the Host device writes to any of the interrupt register access locations (any location in 0x78 - 0x7B) the IOINTW interrupt will be set in the I2C/SPI INTSTAT Register. This allows MCU software to receive a software interrupt from the Host device. Note that this interrupt will occur for all writes by the Host, including a write to clear an interrupt.

# **13.6 Command Completion Interrupts**

Four interrupts in the I2C/SPI Slave module are generated when the Host interface device completes a transfer. This allows the SoC to be easily awakened for any transfer from the Host while maximizing the time the SoC is in sleep mode. The XCMPWR interrupt is generated at the completion of a Host write transfer which includes addresses in the currently configured Direct Register space, and the XCMPRR interrupt is generated on the completion of a Host read transfer to that space. The XCMPWF interrupt is generated at the completion of a Host write transfer which includes the CMPWF interrupt is generated at the completion of a Host write transfer which includes the FIFO address 0x7F (although that is an invalid access), and the XCMPRF interrupt is generated at the completion of a Host read transfer which includes the FIFO address 0x7F.

#### NOTE

A write to 0x7F, which is the FIFO address, uses the address 0xFF since this includes the R/W bit in the upper (first) bit followed by the 7-bit Direct Register address (offset). The prescribed usage of IOS FIFO is only for READ from the host, and hence writing to the FIFO is generally an invalid operation. So, even though XCMPWF flag/interrupt is defined, it is likely never going to be used.

#### NOTE

A burst transfer which begins in the Direct Register address space and is long enough to cause the Address Pointer to be 0x7F can set both the Direct Register and FIFO interrupts, although that would in general be an invalid operation.

# **13.7 Host Address Space and Registers**

The Host of the I/O interface can access 128 bytes in the  $I^2C/SPI$  Slave in either  $I^2C$  or SPI mode. Offsets 0x00 to 0x77 may be directly mapped to the Direct RAM Area. The remaining eight offset locations access hardware functions within the  $I^2C/SPI$  Slave. The R/W indicator is referring to accesses from the Host.

# 13.8 I<sup>2</sup>C Interface

The MCU I<sup>2</sup>C Slave interface operates as a standard slave. The device is accessed at an address configured in the IOSLAVE\_IOSCFG\_I2CADDR field, and supports Fast Mode Plus (up to 1 MHz). Both 7bit and 10-bit address modes are supported, as selected by IOSLAVE\_IOSCFG\_10BIT. The I<sup>2</sup>C interface consists of two lines: one bi-directional data line (SDA) and one clock line (SCL). Both the SDA and the SCL lines must be connected to a positive supply voltage via a pull-up resistor. By definition, a device that sends a message is called the "transmitter", and the device that accepts the message is called the "receiver". The devices that controls the message transfer by driving SCL is called "master". The devices that are controlled by the master are called "slaves". The SoC's I<sup>2</sup>C Slave is always a slave device.

The following protocol has been defined:

- Data transfer may be initiated only when the bus is not busy.
- During data transfer, the data line must remain stable whenever the clock line is high.
- Changes in the data line while the clock line is high will be interpreted as control signals.

A number of bus conditions have been defined (see Figure 40) and are described in the following sections.



Figure 40. Basic I<sup>2</sup>C Conditions

### 13.8.1 Bus Not Busy

Both SDA and SCL remain high.

### 13.8.2 Start Data Transfer

A change in the state of SDA from high to low, while SCL is high, defines the START condition. A START condition which occurs after a previous START but before a STOP is called a RESTART condition, and functions exactly like a normal STOP followed by a normal START.

### 13.8.3 Stop Data Transfer

A change in the state of SDA from low to high, while SCL is high, defines the STOP condition.

### 13.8.4 Data Valid

After a START condition, SDA is stable for the duration of the high period of SCL. The data on SDA may be changed during the low period of SCL. There is one clock pulse per bit of data. Each data transfer is initiated with a START condition and terminated with a STOP condition. The number of data bytes transferred between the START and STOP conditions is not limited. The information is transmitted byte-wide and each receiver acknowledges with a ninth bit.

### 13.8.5 Acknowledge

Each byte of eight bits is followed by one Acknowledge (ACK) bit as shown in Figure 41. This Acknowledge bit is a low level driven onto SDA by the receiver, whereas the master generates an extra ACK related SCL pulse. A slave receiver which is addressed is obliged to generate an Acknowledge after the reception of each byte. Also, on a read transfer a master receiver must generate an Acknowledge after the reception of each byte that has been clocked out of the slave transmitter. The device that acknowledges must pull down the SDA line during the Acknowledge clock pulse in such a way that the SDA line is a stable low during the high period of the Acknowledge related SCL pulse. A master receiver must signal an end-of-data to the slave transmitter by not generating an Acknowledge (a NAK) on the last byte that has been clocked out of the slave. In this case, the transmitter must leave the data line high to enable the master to generate the STOP condition.



Figure 41. I<sup>2</sup>C Acknowledge

# 13.8.6 Address Operation

In I<sup>2</sup>C mode, the I<sup>2</sup>C/SPI Slave supports either 7-bit or 10-bit addressing, selected by the 10BIT bit in the IOSCFG Register. Figure 42 shows the operation in 7-bit mode in which the master addresses the MCU with a 7-bit address configured as 0xD2 in the CFG\_I2CADDR field. After the START condition, the 7-bit address is transmitted MSB first. If this address matches the lower 7 bits of the CFG\_I2CADDR field, the SoC is selected, the eighth bit indicate a write (RW = 0) or a read (RW = 1) operation and the SoC supplies the ACK. The SoC ignores all other address values and does not respond with an ACK.



Figure 42. I<sup>2</sup>C 7-bit Address Operation

Figure 43 shows the operation with which the master addresses the SoC with a 10-bit address configured at 0x536. After the START condition, the 10-bit preamble 0b11110 is transmitted first, followed by the first two address bits and the eighth bit indicating a write (RW = 0) or a read (RW = 1) operation. If the upper two bits match the I2CADDR value, the I<sup>2</sup>C/SPI Slave supplies the ACK. The next transfer includes the lower 8 bits of the address, and if these bits also match I2CADDR the SoC again supplies the ACK. The I<sup>2</sup>C/SPI Slave ignores all other address values and does not respond with an ACK.



### Figure 43. I<sup>2</sup>C 10-bit Address Operation

### 13.8.7 Offset Address Transmission

If the RW bit of the Address Operation indicates a write, the next byte transmitted from the master is the Offset Address as shown in Figure 44. This value is loaded into the Address Pointer of the I<sup>2</sup>C/SPI Slave.



Figure 44. I<sup>2</sup>C Offset Address Transmission

### 13.8.8 Write Operation

In a write operation the master transmitter transmits to the MCU slave receiver. The Address Operation has a RW value of 0, and the second byte contains the Offset Address as in Figure 44. The next byte is written to the register selected by the Address Pointer (which was loaded with the Offset Address) and the Address Pointer is incremented. Subsequent transfers write bytes into successive registers until a STOP condition is received, as shown in Figure 45. Note that if the Address Pointer is at 0x7F, it will not increment on the write.





### 13.8.9 Read Operation

In a read operation, the master first executes an Offset Address Transmission to load the Address Pointer with the desired Offset Address. A subsequent operation will again issue the address of the SoC but with the RW bit as a 1 indicating a read operation. Figure 46 shows this transaction beginning with a RESTART condition, although a STOP followed by a START may also be used. After the address operation, the slave becomes the transmitter and sends the register value from the location pointed to by the Address Pointer, and the Address Pointer is incremented. Subsequent transactions produce successive register values, until the master receiver responds with a NAK and a STOP to complete the operation. Because the Address Pointer holds a valid register address, the master may initiate another read sequence at this point without performing another Offset Address operation. Note that if the Address Pointer is at 0x7F, it will not increment on the read.



Figure 46. I<sup>2</sup>C Read Operation

### 13.8.10 General Address Detection

The I<sup>2</sup>C/SPI Slave may be configured to detect an I<sup>2</sup>C General Address (0x00) write. If this address is detected, the first data byte written is stored in the IOSLAVE\_GADATA Register and the GENAD interrupt flag is set. This allows software to create the appropriate response, which is typically to reset the I<sup>2</sup>C/SPI Slave.

### 13.9 SPI Interface

The I<sup>2</sup>C/SPI Slave includes a standard 3-wire or 4-wire SPI interface. The serial peripheral interface (SPI) bus is intended for synchronous communication between different ICs. 4-wire SPI consists of four signal lines: serial data input (MOSI), serial data output (MISO), serial clock (SCL) and an active low chip enable (nCE). The I<sup>2</sup>C/SPI Slave may be connected to a master with a 3-wire SPI interface by configuring 3-wire mode in the pin configuration block of the GPIO module, which will tie MOSI and MISO together. By definition, a device that sends a message is called the "transmitter", and the device that accepts the message is called the "receiver". The device that controls the message transfer by driving SCL is called "master". The devices that are controlled by the master are called "slaves". The I<sup>2</sup>C/SPI Slave SPI Slave is always a slave device.

The nCE input is used to initiate and terminate a data transfer. The SCL input is used to synchronize data transfer between the master and the slave devices via the MOSI (master to slave) and MISO (slave to master) lines. The SCL input, which is generated by the master, is active only during address and data transfer to any device on the SPI bus.

The I<sup>2</sup>C/SPI Slave supports all SPI configurations of CPOL and CPHA using the SPOL configuration bit. There is one clock for each bit transferred. Address and data bits are transferred in groups of eight bits.

### 13.9.1 Write Operation

Figure 47 shows a SPI write operation. The operation is initiated when the nCE signal to the SoC goes low. At that point an 8-bit Address byte is transmitted from the master on the MOSI line, with the upper RW bit indicating read (if 0) or write (if 1). In this example the RW bit is a one selecting a write operation, and the lower 7 bits of the Address byte contain the Offset Address, which is loaded into the Address Pointer of the  $I^2C/SPI$  Slave.

Each subsequent byte is loaded into the register selected by the Address Pointer, and the Address Pointer is incremented. The operation is terminated by the master by bringing the nCE signal high. Note that the MISO line is not used in a write operation and is held in the high impedance state by the I<sup>2</sup>C/SPI Slave. Note also that if the Address Pointer is 0x7F, it does not increment on the read.



Figure 47. SPI Write Operation

### 13.9.2 Read Operation

Figure 48 shows a read operation. The address is transferred from the master to the slave just as it is in a write operation, but in this case the RW bit is a 0 indicating a read. After the transfer of the last address bit (bit 0), the I<sup>2</sup>C/SPI Slave begins driving data from the register selected by the Address Pointer onto the MISO line, bit 7 first, and the Address Pointer is incremented. The transfer continues until the master brings the nCE line high. Note that if the Address Pointer is 0x7F, it does not increment on the read.



Figure 48. SPI Read Operation

### 13.9.3 Configuring 3-wire vs. 4-wire SPI Mode

The I<sup>2</sup>C/SPI Slave can operate in either 4-wire SPI mode, where the MISO and MOSI signals are on separate wires, or in 3-wire SPI mode where MISO and MOSI share a wire. This configuration is performed in the Pin Configuration module, and no configuration is necessary in the I<sup>2</sup>C/SPI Slave itself.

### 13.9.4 SPI Polarity and Phase

The I<sup>2</sup>C/SPI Slave supports all combinations of CPOL (clock polarity) and CPHA (data phase) in SPI mode. Figure 49 shows how these two bits affect the interface signal behavior.



#### Figure 49. SPI CPOL and CPHA

If CPOL is 0, the clock SCK is normally low and positive pulses are generated during transfers. If CPOL is 1, SCK is normally high and negative pulses are generated during transfers.

If CPHA is 0, the data on the MOSI and MISO lines is sampled on the edge corresponding to the first SCK edge after nCE goes low (i.e. the rising edge if CPOL is 0 and the falling edge if CPOL is 1). Data on MISO and MOSI is driven on the opposite edge of SCK.

If CPHA is 1, the data on the MOSI and MISO lines is sampled on the edge corresponding to the second SCK edge after nCE goes low (i.e. the falling edge if CPOL is 0 and the rising edge if CPOL is 1). Data on MISO and MOSI is driven on the opposite edge of SCK.

The I<sup>2</sup>C/SPI Slave has only a single SPOL bit to control the polarity. If CPOL = CPHA, IOSLAVE\_IOSCFG\_SPOL must be set to 0. If CPOL  $\neq$  CPHA, SPOL must be set to 1.

### 13.10 Bit Orientation

In both I<sup>2</sup>C and SPI modes, the I<sup>2</sup>C/SPI Slave supports data transmission either LSB first or MSB first as configured by the IOSLAVE\_IOSCFG\_LSB bit. If LSB is 0, data is transmitted and received MSB first. If LSB is 1, data is transmitted and received LSB first.

# 13.11 Wakeup Using the I<sup>2</sup>C/SPI Slave

The I<sup>2</sup>C/SPI Slave can continue to operate even if the SoC's CPU is in Sleep or Deep Sleep mode. The hardware will enable and disable the I<sup>2</sup>C/SPI Slave clock and oscillators as necessary. The only consideration in this environment is when the SoC is in a deep sleep mode, such that the HFRC Oscillator is powered down, and a master attempts to access the I<sup>2</sup>C/SPI Slave. In this case the HFRC Oscillator must be powered up before anything is transferred to or from the internal RAM. This process takes roughly 5-10  $\mu$ s, and is initiated by nCE going low in SPI mode or by the detection of a START in I<sup>2</sup>C mode.

#### NOTE

If the host reads or writes the register space from offset 0x78 through 0x7D (IOINT is 0x78 through 0x7B), there is no dependence on the HFRC running because the LRAM is not being accessed, so these transactions occur even if in Deep Sleep. If the host accesses the direct memory space on a read or write (all addresses not in the range 0x78 to 0x7F), or accesses the LRAM through the FIFO read port at offset 0x7F, the LRAM is being accessed and the HFRC must be running.

For I<sup>2</sup>C applications, the time delay is typically not relevant. At the fastest system clock of 1 MHz, the master must transfer 9 bits of address plus 9 bits of offset before any FIFO access can occur, and that is a minimum of 18 µs. The clocks will have started prior to that point in every case.

For SPI applications with fast interface clocks (faster than 1 MHz), the master must be programmed to pull nCE low at least 10 µs prior to sending the first clock. If a master is unable to control the timing of nCE in this way, then there are other options for ensuring that master commands/data are received reliably by the slave are as follows:

- 1. A GPIO interrupt can be configured to wake the SoC from Deep Sleep mode prior to initiating any SPI transfers.
- 2. An IOINT interrupt from the host can be configure to wake up the slave by writing to the IOINT register over the interface.
- 3. The HFRC clock can be forced to remain active during Deep Sleep mode by setting the CLKGEN\_-MISC\_FRCHFRC bit.

Regarding option 3, forcing HFRC to stay active is done by setting the FRCHFRC bit of the CLKGEN's MISC Register. If this bit is set, the HFRC will continue to be active even if the SoC's CPU is in deep sleep mode, so that the slave can immediately begin transferring data independent of the transfer rate. This will result in higher power because the HFRC remains active, so the FRCHFRC bit should only be set if it is known that a transfer is likely to begin prior to another interrupt

There is no delay restriction if the SoC is in normal Sleep mode. In that case the HFRC is not powered down and the I<sup>2</sup>C/SPI Slave clock will start immediately when nCE goes low.

# 13.12 Host Side Address Space and Register

### 13.12.1 Host Address Space and Registers

The Host of the I/O interface can access 128 bytes in the I<sup>2</sup>C/SPI Slave in either I<sup>2</sup>C or SPI mode. Offsets 0x00 to 0x77 may be directly mapped to the Direct RAM Area. The remaining eight offset locations access hardware functions within the I<sup>2</sup>C/SPI Slave. The R/W indicator refers to accesses from the Host.

### 13.12.1.1 HOST\_IER Register

#### Host Interrupt Enable

#### OFFSET: 0x78

This register enables the FIFO read interrupts.

#### Table 71: HOST\_IER Register

| 0        | 0       | 0               | 0        | 0        | 0        | 0        | 0               |
|----------|---------|-----------------|----------|----------|----------|----------|-----------------|
| 7        | 6       | 5               | 4        | 3        | 2        | 1        | 0               |
| FUNDFLEN | RDERREN | <b>SWINT5EN</b> | SWINT4EN | SWINT3EN | SWINT2EN | SWINT1EN | <b>SWINT0EN</b> |

#### Table 72: HOST\_IER Register Bits

| Bit | Name     | Reset | RW | Description                                                                                                                                                                 |  |  |  |  |
|-----|----------|-------|----|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|--|
| 7   | FUNDFLEN | 0x0   | RW | If 1, enable an interrupt that triggers when the FIFO underflows                                                                                                            |  |  |  |  |
| 6   | RDERREN  | 0x0   | RW | <ul> <li>If 1, enable the interrupt which occurs when the Host attempts to access the FIFO when read access is locked</li> <li>If 1, enable activate interrupt 5</li> </ul> |  |  |  |  |
| 5   | SWINT5EN | 0x0   | RW | If 1, enable software interrupt 5                                                                                                                                           |  |  |  |  |
| 4   | SWINT4EN | 0x0   | RW | If 1, enable software interrupt 4                                                                                                                                           |  |  |  |  |
| 3   | SWINT3EN | 0x0   | RW | If 1, enable software interrupt 3                                                                                                                                           |  |  |  |  |
| 2   | SWINT2EN | 0x0   | RW | If 1, enable software interrupt 2                                                                                                                                           |  |  |  |  |
| 1   | SWINT1EN | 0x0   | RW | If 1, enable software interrupt 1                                                                                                                                           |  |  |  |  |
| 0   | SWINT0EN | 0x0   | RW | If 1, enable software interrupt 0                                                                                                                                           |  |  |  |  |

# 13.12.1.2 HOST\_ISR Register

# Host Interrupt Status Register

#### OFFSET: 0x79

The host uses this register to read interrupt status.

#### Table 73: HOST\_ISR Register

| 0          | 0         | 0          | 0          | 0                 | 0          | 0          | 0                 |
|------------|-----------|------------|------------|-------------------|------------|------------|-------------------|
| 7          | 6         | 5          | 4          | 3                 | 2          | 1          | 0                 |
| FUNDFLSTAT | RDERRSTAT | SWINT5STAT | SWINT4STAT | <b>SWINT3STAT</b> | SWINT2STAT | SWINT1STAT | <b>SWINT0STAT</b> |

Table 74: HOST\_ISR Register Bits

| Bit | Name       | Reset | RW                                                                                                                                                                                                                                     | Description                                                                                                               |  |  |  |  |
|-----|------------|-------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------|--|--|--|--|
| 7   | FUNDFLSTAT | 0x0   | RO                                                                                                                                                                                                                                     | This bit is set by writing a 1 to bit 31 of the IOINTCTL Register, or if the Host attempts a FIFO read when FIFOCTR is 0. |  |  |  |  |
| 6   | RDERRSTAT  | 0x0   | RO       This bit is set by writing a 1 to bit 30 of the IOINTCTL Register, or if t         Host attempts a FIFO read when the FIFOUPD bit is a 1.         RO       This bit is set by writing a 1 to bit 29 of the IOINTCTL Register. |                                                                                                                           |  |  |  |  |
| 5   | SWINT5STAT | 0x0   | RO                                                                                                                                                                                                                                     | O This bit is set by writing a 1 to bit 29 of the IOINTCTL Register.                                                      |  |  |  |  |
| 4   | SWINT4STAT | 0x0   | RO                                                                                                                                                                                                                                     | This bit is set by writing a 1 to bit 28 of the IOINTCTL Register.                                                        |  |  |  |  |
| 3   | SWINT3STAT | 0x0   | RO                                                                                                                                                                                                                                     | This bit is set by writing a 1 to bit 27 of the IOINTCTL Register.                                                        |  |  |  |  |
| 2   | SWINT2STAT | 0x0   | RO                                                                                                                                                                                                                                     | This bit is set by writing a 1 to bit 26 of the IOINTCTL Register.                                                        |  |  |  |  |
| 1   | SWINT1STAT | 0x0   | RO                                                                                                                                                                                                                                     | This bit is set by writing a 1 to bit 25 of the IOINTCTL Register.                                                        |  |  |  |  |
| 0   | SWINTOSTAT | 0x0   | RO                                                                                                                                                                                                                                     | This bit is set by writing a 1 to bit 24 of the IOINTCTL Register.                                                        |  |  |  |  |

#### NOTE

All bits are cleared by a write to the IOINTCLR bit of the IOINTCTL Register.

### 13.12.1.3 HOST\_WCR Register

#### Host Interrupt Write-to-Clear Register

#### OFFSET: 0x7A

Write a 1 to a bit in this register to clear a pending interrupt.

### Table 75: HOST\_WCR Register

| 0        | 0       | 0               | 0        | 0               | 0               | 0        | 0        |
|----------|---------|-----------------|----------|-----------------|-----------------|----------|----------|
| 7        | 6       | 5               | 4        | 3               | 2               | 1        | 0        |
| FUNDFLWC | RDERRWC | <b>SWINT5WC</b> | SWINT4WC | <b>SWINT3WC</b> | <b>SWINT2WC</b> | SWINT1WC | SWINTOWC |

| Bit | Name     | Reset | RW | Description                                                                    |
|-----|----------|-------|----|--------------------------------------------------------------------------------|
| 7   | FUNDFLWC | 0x0   | WO | Writing a 1 to this bit will clear the pending interrupt status bit FUNDFLSTAT |
| 6   | RDERRWC  | 0x0   | WO | Writing a 1 to this bit will clear the pending interrupt status bit RDERRSTAT  |
| 5   | SWINT5WC | 0x0   | WO | Writing a 1 to this bit will clear the pending interrupt status bit SWINT5STAT |
| 4   | SWINT4WC | 0x0   | WO | Writing a 1 to this bit will clear the pending interrupt status bit SWINT4STAT |
| 3   | SWINT3WC | 0x0   | WO | Writing a 1 to this bit will clear the pending interrupt status bit SWINT3STAT |
| 2   | SWINT2WC | 0x0   | WO | Writing a 1 to this bit will clear the pending interrupt status bit SWINT2STAT |
| 1   | SWINT1WC | 0x0   | WO | Writing a 1 to this bit will clear the pending interrupt status bit SWINT1STAT |
| 0   | SWINTOWC | 0x0   | WO | Writing a 1 to this bit will clear the pending interrupt status bit SWINT0STAT |

### Table 76: HOST\_WCR Register Bits

### 13.12.1.4 HOST\_WCS Register

# Host Interrupt Write-to-Set Register

#### OFFSET: 0x7B

Write a 1 to a bit in this register to set the status bit of a pending interrupt.

# Table 77: HOST\_WCS Register

| 0        | 0       | 0        | 0        | 0        | 0               | 0        | 0 |
|----------|---------|----------|----------|----------|-----------------|----------|---|
| 7        | 6       | 5        | 4        | 3        | 2               | 1        | 0 |
| FUNDFLWS | RDERRWS | SWINT5WS | SWINT4WS | SWINT3WS | <b>SWINT2WS</b> | SWINT1WS |   |

#### Table 78: HOST\_WCS Register Bits

| Bit | Name     | Reset | RW | Description                                                                  |  |  |  |
|-----|----------|-------|----|------------------------------------------------------------------------------|--|--|--|
| 7   | FUNDFLWS | 0x0   | WO | Writing a 1 to this bit will set the pending interrupt status bit FUNDFLSTAT |  |  |  |
| 6   | RDERRWS  | 0x0   | WO | Writing a 1 to this bit will set the pending interrupt status bit RDERRSTAT  |  |  |  |
| 5   | SWINT5WS | 0x0   | WO | Writing a 1 to this bit will set the pending interrupt status bit SWINT5STAT |  |  |  |
| 4   | SWINT4WS | 0x0   | WO | Writing a 1 to this bit will set the pending interrupt status bit SWINT4STAT |  |  |  |
| 3   | SWINT3WS | 0x0   | WO | Writing a 1 to this bit will set the pending interrupt status bit SWINT3STAT |  |  |  |
| 2   | SWINT2WS | 0x0   | WO | Writing a 1 to this bit will set the pending interrupt status bit SWINT2STAT |  |  |  |
| 1   | SWINT1WS | 0x0   | WO | Writing a 1 to this bit will set the pending interrupt status bit SWINT1STAT |  |  |  |
| 0   | SWINTOWS | 0x0   | WO | Writing a 1 to this bit will set the pending interrupt status bit SWINT0STAT |  |  |  |

# 13.12.1.5 FIFOCTRLO Register

### FIFOCTR Low Byte

#### OFFSET: 0x7C

This register allows the host to read the lower eight bits of the FIFOCTR register.

#### Table 79: FIFOCTRLO Register

| 0 | 0 | 0  | 0   | 0   | 0 | 0 | 0 |
|---|---|----|-----|-----|---|---|---|
| 7 | 6 | 5  | 4   | 3   | 2 | 1 | 0 |
|   |   | FI | FOC | TRI | 0 |   |   |

#### Table 80: FIFOCTRLO Register Bits

| Bit | Name      | Reset | RW | Description                           |
|-----|-----------|-------|----|---------------------------------------|
| 7:0 | FIFOCTRLO | 0x0   | RO | Reads the lower eight bits of FIFOCTR |

# 13.12.1.6 FIFOCTRUP Register

#### FIFOCTR Upper Byte

#### OFFSET: 0x7D

This register allows the host to read the upper two bits of the FIFOCTR register.

#### Table 81: FIFOCTRUP Register

| 0 | 0    | 0 | 0 | 0 | 0 | 0 | 0 |
|---|------|---|---|---|---|---|---|
| 7 | 6    | 5 | 4 | 3 | 2 | 1 | 0 |
|   | RSVD |   |   |   |   |   |   |

#### Table 82: FIFOCTRUP Register Bits

| Bit | Name      | Reset | RW | Description                         |
|-----|-----------|-------|----|-------------------------------------|
| 1:0 | FIFOCTRUP | 0x0   | RO | Reads the upper two bits of FIFOCTR |
# 13.12.1.7 FIFO Register

## **FIFO Read Data**

## OFFSET: 0x7F

Read this register for FIFO data.

# Table 83: FIFO Register

| 0    | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|------|---|---|---|---|---|---|---|
| 7    | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
| FIFO |   |   |   |   |   |   |   |

### Table 84: FIFO Register Bits

| Bit | Name | Reset | RW | Description                    |
|-----|------|-------|----|--------------------------------|
| 7:0 | FIFO | 0x0   | RO | Reads the top byte of the FIFO |

# 14. Universal Asynchronous Receiver/Transmitter (UART)

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

# 14.1 Enabling and Selecting the UART Clock

The UART module receives two clocks - UART\_clk which is used to derive the UART serial clock and UART\_hclk, which is the bus interface clock of the UART module. Unlike other modules, the UART requires a bus clock whenever it is transmitting or receiving, so special controls are required when the UART is to transfer data while the MCU is in a sleep mode and its normal bus clocks are not operating.

UART\_clk is selected in the UARTx\_CR\_CLKSEL field, with values from 3 MHz to 48 MHz plus a disabled value NOCLK, and is enabled by the UARTx\_CR\_CLKEN bit. If the UART is inactive, CLKSEL should be set to the NOCLK value (0) to minimize power, and the CLKEN bit should be 0. When the UART is active, the serial clock is created by the baud rate generator based on UART\_clk. A higher UART\_clk frequency can produce more precise serial clock frequencies, but will cause the UART to use more power. It is thus recommended that UART\_clk be set to the minimum frequency which produces acceptable serial clocks.

# 14.2 Configuration

The UART Register Block may be set to configure the UART Module. The data width, number of stop bits, and parity may all be configured using the UART\_LCRH register.

The baud rate is configured using the integer UART\_IBRD and UART\_FBRD registers. The correct values for UART\_IBRD and UART\_FBRD may be determined according to the following equation:

 $F_{UART}/(16 \cdot BR) = IBRD + FBRD$ 

F<sub>UART</sub> is the frequency of the UART clock. BR is the desired baud rate. IBRD is the integer portion of the baud rate divisor. FBRD is the fractional portion of the baud rate divisor.

The UART Module supports independent CTS and RTS hardware flow control. All flow control configuration may be set using the UART\_CR register.

# 14.3 Transmit FIFO and Receive FIFO

The transmit and receive FIFOs may both be accessed via the same 8-bit word in the UART\_DR register. The transmit FIFO stores up to 32 8-bit words and can be written using writes to UART\_DR. The receive FIFO stores up to 32 12-bit words and can be read using reads to UART\_DR. Note that each 12-bit receive FIFO word includes an 8-bit data word and a 4-bit error status word.

# 15. Universal Serial Bus (USB)

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

NOTE

The Apollo4 Lite SoC and Apollo4 Blue Lite SoC do not include a USB module. This chapter and its content do not apply to these SoCs.

# 15.1 Functional Overview

The USB subsystem provides support for USB full speed (12 Mbps) and high speed (480 Mbps) interface. This interface is primarily used for bulk data transfer, firmware updates and charging detect.

The USB controller supports up to 5 IN / 5 OUT endpoints plus 1 control. The FIFO sizing for each endpoint is dynamically configurable up to 4 kB.

The MCU has an integrated USB 2.0 PHY with support for suspend mode operation. Battery charger detection is supported within the PHY to enable battery charge algorithm execution and control of the external battery charge / power management IC. The charger detection supports Battery Charging Specification 1.2 (BC1.2) and also supports other non-BC1.2 standards.

The CPU interface to the USB Controller provides access to the control/status registers and the FIFOs for each endpoint. It also generates interrupts to the CPU when packets are successfully transmitted or received, and when the core enters suspend mode or resumes from suspend mode. The USB controller interfaces to the CPU over the APB.

The USB Controller can be configured to allow the connection of the controller to the USB PHY to be controlled by software, where the PHY can be switched between normal mode and non-driving mode by setting/clearing the CFG0\_SoftConn bit. When the SoftConn bit is set to 1, the PHY is placed in its normal mode and the D+/D- lines of the USB bus are enabled. When the SoftConn bit is zero, the PHY is put into non-driving mode and D+ and D- are tri-stated. The USB Controller then appears to the CPU as if it has been disconnected.

The RAM controller uses RAM to buffer packets between the CPU and USB. It takes the FIFO pointers from the endpoint controllers, converts them to address pointers within the RAM block and generates the RAM access control signals.

#### NOTE

Due to the absence of DMA to/from the USB interface, USB high-speed (HS) mode is feasible on an Apollo4 family SoC only in a use case where the CPU can be completely dedicated to and fully occupied by the USB task. HS mode generates 60 MB/s traffic which uses the majority of the CPU bandwidth. One such use case which could utilize HS mode effectively is when the CPU does nothing but data transfer to/from the USB endpoint, such as when programming eMMC over USB.

# 15.2 USB Reset

When a reset condition is detected on the USB, the USB Controller performs the following actions:

• Sets CFG0\_FuncAddr to 0.

- Sets CFG3\_ENDPOINT (index) to 0.
- Flushes all endpoint FIFOs.
- Clears all control/status registers.
- Enables all endpoint interrupts.
- Generates a Reset interrupt.

If the HSEnab bit in the CFG0 register was set, the USB Controller also tries to negotiate for high-speed operation. Whether high-speed operation is selected is indicated by the CFG0\_HSMode bit.

When the software receives a Reset interrupt, it should close any open pipes and wait for bus enumeration to begin.

# 15.3 Soft Connect/Disconnect

As mentioned earlier, the USB Controller can be configured to allow the connection of the controller to the USB PHY to be controlled by software, where the PHY can be switched between normal mode and nondriving mode by setting/clearing the CFG0\_AMSPECIFIC bit. When the AMSPECIFIC bit is set to 1, the PHY is placed in its normal mode and the D+/D- lines of the USB bus are enabled. When the AMSPECIFIC bit is zero, the PHY is put into non-driving mode and D+ and D- are tri-stated. The USB Controller then appears to the CPU as if it has been disconnected.

After a hardware reset (nRST = 0), AMSPECIFIC is cleared to 0. The USB Controller will therefore appear disconnected until the software has set AMSPECIFIC to 1. The application software can then choose when to set the PHY to its normal mode. Systems with a lengthy initialization procedure may use this to ensure that initialization is complete and the system is ready to perform enumeration before connecting to the USB.

Once the AMSPECIFIC bit has been set to 1, the software can also simulate a disconnect by clearing this bit.

# 15.4 High-speed Mode

If the HSENAB bit of the CFG0 register is set (it is set by default at power-up), then when the USB Controller is reset by a USB reset signal from the hub, it will negotiate for High-speed mode. If the USB host, and all hubs between the USB Controller and the host, support High-speed operation then the HSMODE bit of the CFG0 register will be set and the USB Controller will operate in High-speed mode. If the High-speed negotiation fails, the HSMODE bit will not be set and the USB Controller will operate in Full-speed mode only.

### 15.5 USB Interrupt Handling

When the CPU is interrupted with a USB interrupt, it needs to read the interrupt status register to determine which endpoint(s) have caused the interrupt and jump to the appropriate routine. If multiple endpoints have caused the interrupt, Endpoint 0 should be serviced first, followed by the other endpoints. The Suspend interrupt should be serviced last.

A flowchart for the USB Interrupt Service Routine is shown in Figure 50.



Figure 50. USB Interrupt Service Routine

# 15.6 Index 0 Register Fields

NOTE This section is included here to describe how certain fields in the USB Controller's IDX0 register are muxed and named to reflect that muxing. These fields are referenced throughout this chapter and, in the interest of clarity and brevity, references to these multi-function fields will use the field name appropriate to the endpoint that is being discussed.

The Index 0 Register (IDX0) has fields whose functions are different depending on whether the Endpoint index has been set in CFG3\_ENDPOINT to Endpoint 0 or to a non-zero endpoint, e.g., IN Endpoint1 - IN Endpoint5 or OUT Endpoint1 - OUT Endpoint5.

Table 85 below lists the functionality for these fields. See the field descriptions in the IDX0 register for more information.

| Field Name in Register Set  | ENDPOINT0 Field<br>Name | ENDPOINT0 Use                                                                                                                                                                                                      | All Other Endpoint<br>Field Name | All Other Endpoint<br>Use                                                                                                                                                     |
|-----------------------------|-------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| IncompTxServiceSetupEnd     | ServiceSetupEnd         | CPU writes a 1 to this<br>bit to clear the Setu-<br>pEnd bit. Cleared<br>automatically.                                                                                                                            | IncompTx                         | Indicates where a<br>large packet has been<br>split into 2 or 3 pack-<br>ets for transmission<br>but insufficient I<br>tokens have been<br>received to send all<br>the parts. |
| ClrDataTogServicedOutPktRdy | ServicedOutPktRdy       | CPU writes a 1 to this<br>bit to clear the OutPk-<br>tRdy bit. Cleared<br>automatically.                                                                                                                           | ClrDataTog                       | CPU writes a 1 to this<br>bit to reset the end-<br>point IN data toggle to<br>0.                                                                                              |
| SentStallSendStall          | SendStall               | CPU writes a 1 to this<br>bit to terminate the<br>current transaction.<br>The STALL hand-<br>shake will be trans-<br>mitted and then this<br>bit will be cleared<br>automatically.                                 | SentStall                        | Set when a STALL<br>handshake is trans-<br>mitted. The FIFO is<br>flushed and the InPk-<br>tRdy bit is cleared.                                                               |
| SendStallSetupEnd           | SetupEnd                | Set when a control<br>transaction ends<br>before the DataEnd<br>bit has been set. An<br>interrupt will be gen-<br>erated and the FIFO<br>flushed. Cleared by<br>CPU writing a 1 to the<br>ServicedSetupEnd<br>bit. | SendStall                        | CPU writes a 1 to this<br>bit to issue a STALL<br>handshake to an IN<br>token. CPU clears<br>this bit to terminate<br>the stall condition.                                    |

#### Table 85: Index 0 (IDX0) Register Multi-function Fields

| Field Name in Register Set | ENDPOINT0 Field<br>Name | ENDPOINT0 Use                                                                                                                                                                                                                                                  | All Other Endpoint<br>Field Name | All Other Endpoint<br>Use                                                                                                                                                                                                                                                                                                                                  |
|----------------------------|-------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| FlushFIFODataEnd           | DataEnd                 | CPU sets this bit:<br>1. When setting InPk-<br>tRdy for the last data<br>packet.<br>2. When clearing Out-<br>PktRdy after unload-<br>ing the last data<br>packet.<br>3. When setting InPk-<br>tRdy for a zero length<br>data packet.<br>Cleared automatically. | FlushFIFO                        | CPU writes a 1 to this<br>bit to flush the next<br>packet to be transmit-<br>ted from the endpoint<br>IN FIFO. The FIFO<br>pointer is reset and<br>the InPktRdy bit is<br>cleared. May be set<br>simultaneously with<br>InPktRdy to abort the<br>packet that has just<br>been loaded into the<br>FIFO.                                                     |
| UnderRunSentStall          | SentStall               | Set when a STALL<br>handshake is trans-<br>mitted. The CPU<br>should clear this bit.                                                                                                                                                                           | UnderRun                         | In ISO mode, this bit<br>is set when a zero<br>length data packet is<br>sent after receiving an<br>IN token with the InP-<br>ktRdy bit not set.<br>In Bulk/Interrupt<br>mode, this bit is set<br>when a NAK is<br>returned in response<br>to an IN token.<br>The CPU should clear<br>this bit.                                                             |
| FIFONotEmptyInPktRdy       | InPktRdy                | CPU sets this bit after<br>loading a data packet<br>into the FIFO.<br>Cleared automati-<br>cally when the data<br>packet has been<br>transmitted. An inter-<br>rupt is generated<br>when this bit is<br>cleared (if enabled).                                  | FIFONotEmpty                     | Set when there is at<br>least 1 packet in the<br>IN FIFO.                                                                                                                                                                                                                                                                                                  |
| InPktRdyOutPktRdy          | OutPktRdy               | Set when a data<br>packet has been<br>received. An interrupt<br>is generate when this<br>bit is set (if enabled).<br>CPU clears this bit by<br>setting the Serviced-<br>OutPktRdy bit.                                                                         | InPktRdy                         | CPU sets this bit after<br>loading a data packet<br>into the FIFO.<br>Cleared automati-<br>cally when a data<br>packet has been<br>transmitted. If FIFO is<br>double-buffered, it is<br>also automatically<br>cleared when there is<br>space for a second<br>packet in the FIFO.<br>An interrupt is gener-<br>ate (if enabled) when<br>the bit is cleared. |

| Table 85: Index 0 | (IDX0) Register | <b>Multi-function Fields</b> |
|-------------------|-----------------|------------------------------|
|-------------------|-----------------|------------------------------|

# **15.7 Response to USB Conditions or Host Actions**

The USB Controller core responds automatically to certain conditions on the USB bus or actions by the host. The details are given in the following sections.

## 15.7.1 Stall Issued to Control Transfer

The USB Controller core will automatically issue a STALL handshake to a Control transfer under the following conditions:

- 1. The host sends more data during an OUT Data phase of a Control transfer than was specified in the device request during the SETUP phase. This condition is detected by the USB Controller when the host sends an OUT token (instead of an IN token) after the CPU has unloaded the last OUT packet and set IDX0\_DATAEND.
- 2. The host requests more data during an IN data phase of a Control transfer than was specified in the device request during the SETUP phase. This condition is detected by the USB Controller when the host sends an IN token (instead of an OUT token) after the CPU has cleared IDX0\_InPktRdy and set IDX0\_DATAEND in response to the ACK issued by the host to what should have been the last packet.
- 3. The Host sends more than IDX0\_MAXPAYLOAD data with an OUT data token.
- 4. The Host sends more than a zero length data packet for the OUT Status phase.

# 15.7.2 Zero Length OUT Data Packets in Control Transfers

A zero-length OUT data packet is used to indicate the end of a Control transfer. In normal operation, such packets should only be received after the entire length of the device request has been transferred (i.e., after the CPU has set IDX0\_DATAEND). If, however, the host sends a zero-length OUT data packet before the entire length of device request has been transferred, this signals the premature end of the transfer. In this case, the USB Controller will automatically flush any IN token loaded by CPU ready for the Data phase from the FIFO and set IDX0\_SETUPEND.

# 15.8 Suspend/Resume

When the USB Controller sees no activity on the USB for 3 ms, it will go into Suspend mode. If the Suspend interrupt has been enabled, a Suspend interrupt will also be generated. At this point, the USB Controller can then be left active or the application may arrange to disable the USB Controller by stopping its clock. When in Suspend mode, the CFG0\_SUSPEN bit will go low (if enabled).

The USB may exit Suspend mode by sending Resume signaling on the bus. If the Resume interrupt is enabled, an interrupt will be generated.

No Resume interrupt is generated when Suspend mode is exited by the CPU. When the CFG0\_RESUME bit is set, then the USB Controller will exit Suspend mode. When this bit is set, the USB Controller will exit Suspend mode and drive Resume signaling onto the bus. The CPU should clear this bit after 10 ms (a maximum of 15 ms) to end Resume signaling.

Alternatively, software may perform a "Remote Wakeup". How the USB Controller responds depends on whether it has been left active or inactive during the suspend, which is discussed below.

### 15.8.1 USB Controller Active During Suspend

When the USB Controller goes into Suspend mode, the transceiver will also be put into Suspend mode by the SUSPENDM line if the CFG0\_Enabl bit is set. When the USB Controller remains active, however, it can detect when Resume signaling occurs on the USB. It will then bring the transceiver out of Suspend mode and generate a Resume interrupt.

### 15.8.2 USB Controller Inactive During Suspend

When the Suspend interrupt described above is received, the software may disable the USB Controller by stopping its clock. However, the USB Controller will not then be able to detect Resume signaling on the USB.

#### 15.8.3 Remote Wakeup

If the USB Controller is in Suspend mode and the software wants to initiate a remote wakeup, it should set the CFG0\_Resume bit to 1. If the clock to the USB Controller has been stopped, it will need to be restarted before this write can occur (by clearing CLKCTRL\_PHYAPBCLKDIS bit).

The software should leave the CFG0\_Resume bit set for approximately 10 ms (minimum of 2 ms, a maximum of 15 ms) then reset it to 0. By this time the hub should have taken over driving Resume signaling on the USB.

NOTE

No Resume interrupt will be generated when the software initiates a remote wakeup.

# 15.9 Start of Frame Packets

The USB Controller should receive a Start-Of-Frame packet from the host once every millisecond when in Full-speed mode, or every 125 microseconds when in High-speed mode. When the SOF packet is received, the 11-bit frame number contained in the packet is written into the CFG3\_FRMNUM field. In normal running, the core will also generate a SOF interrupt (CFG2\_SOF bit), if enabled by setting the CFG2\_SOFE bit, to indicate that the core's internal frame timer is starting a new frame.

Following a reset or a resume, there is a period of synchronization during which a SOF\_PULSE and a SOF interrupt will not necessarily be generated following each received SOF packet. After synchronization has been achieved, the core expects to receive a SOF packet every millisecond (or every 125 microseconds). If no SOF packet is received after 1.00358 ms (or 125.125  $\mu$ s), it is assumed that the packet has been lost and a SOF\_PULSE (together with a SOF interrupt, if enabled) will still be generated though the Frame register will not be updated. The USB Controller will continue to generate a SOF\_PULSE every millisecond (or 125 microseconds) and will resynchronize these pulses to the received SOF packets when these packets are successfully received again.

# 15.10 Dynamic FIFO Sizing

The USB Controller is configured to have a single overall FIFO size of 4 kB, areas of which may then be allocated to the different endpoints, excluding endpoint 0, when the USB Controller is initialized.

The allocation of FIFO space to the different endpoints requires the specification for each IN and OUT endpoint of:

- The start address of the FIFO within the RAM block
- The maximum size of packet to be supported
- Whether double-buffering is required

(These last two together define the amount of space that needs to be allocated to the FIFO.)

These details may be specified through the four register fields shown in Table 86, which are in the Indexed area of the USB Controller register map. The IDX2\_INFIFOSZ and IDX2\_OUTFIFOSZ fields are each 5-bit fields. The most significant bit of each is set if double-packet buffering is supported. The four lower bits of each field is set to the maximum packet size to be allowed, where  $2^{(D3:D0 + 3)}$  is the maximum packet size. E.g., if D3:D0 = 7, then the maximum packet size is 1024 bytes. Since the total FIFO size is limited to 4 kB, then the maximum value of either of these 4-bit size fields is 9. If double-packet buffering is not selected, then the FIFO will be the same size as the maximum packet size, otherwise it will be twice the size.

| Address | Register_Field     | Description               |
|---------|--------------------|---------------------------|
| 1A      | IDX2_INFIFOSZ      | IN Endpoint FIFO size     |
| 1B      | IDX2_OUTFIFOSZ     | OUT Endpoint FIFO size    |
| 1C, 1D  | FIFOADD_INFIFOADD  | IN Endpoint FIFO address  |
| 1E, 1F  | FIFOADD_OUTFIFOADD | OUT Endpoint FIFO address |

#### Table 86: USB Controller Register Map Additions for Dynamic FIFO Sizing

#### NOTE

The option of setting FIFO sizes dynamically only applies to Endpoints 1 to 5. The Endpoint 0 FIFO is fixed to 64 bytes and starts at address 0. It is the responsibility of the firmware (and the system designer) to ensure that all the IN and OUT endpoints that are active in the current USB configuration have a block of RAM assigned exclusively to them that is at least as large as the maximum packet size set for the endpoint.

The FIFOADD\_INFIFOADD and FIFOADD\_OUTFIFOADD fields are each 13-bits in length and are programmed to specify the start address of an endpoint FIFO, where the field value x 8 is the start address. Since the size of the FIFO for Endpoint 0 is fixed at 64 bytes, the start address of the FIFO for any other Endpoint is no lower than address 0x40, which means that its address field value is at least 8.

# 15.11 Endpoint 0 Handling

Endpoint 0 is the main control endpoint of the core. As such, the routines required to service Endpoint 0 are more complicated than those required to service other endpoints.

The software is required to handle all the Standard Device Requests that may be received via Endpoint 0. These are described in Universal Serial Bus Specification, Revision 2.0, Chapter 9. The protocol for these device requests involves different numbers and types of transaction per transfer. To accommodate this, the CPU needs to take a state machine approach to command decoding and handling.

The Standard Device Requests can be divided into three categories: Zero Data Requests (in which all the information is included in the command), Write Requests (in which the command will be followed by additional data), and Read Requests (in which the device is required to send data back to the host).

This section examines the sequence of events that the software must perform to process the different types of device request.

NOTE

The Setup packet associated with any Standard Device Request should include an 8-byte command. Any Setup packet containing a command field of anything other than 8 bytes will be automatically rejected by the USB Controller core.

### 15.11.1 Zero Data Requests

Zero data requests have all their information included in the 8-byte command and require no additional data to be transferred. Examples of zero data Standard Device Requests are: SET\_FEATURE, CLEAR\_FEATURE, SET\_ADDRESS, SET\_CONFIGURATION, SET\_INTERFACE.

The sequence of events will begin, as with all requests, when the software receives an Endpoint 0 interrupt. The IDX0\_OutPktRdy bit will also have been set. The 8-byte command should then be read from the Endpoint 0 FIFO, decoded and the appropriate action taken. For example if the command is SET\_ADDRESS, the 7-bit address value contained in the command should be written to the CFG0 FuncAddr field.

Both the IDX0\_ServicedOutPktRdy bit (indicating that the command has been read from the FIFO) and the IDX0\_DataEnd bit (indicating that no further data is expected for this request) should be set.

When the host moves to the status stage of the request, a second Endpoint 0 interrupt will be generated to indicate that the request has completed. No further action is required from the software: the second interrupt is just a confirmation that the request completed successfully.

If the command is an unrecognized command, or for some other reason cannot be executed, then when it has been decoded, the IDX0\_ServicedOutPktRdy bit and the IDX0\_SendStall bit should be set. When the host moves to the status stage of the request, the USB Controller will send a STALL to tell the host that the request was not executed. A second Endpoint 0 interrupt will be generated and the IDX0\_SentStall bit will be set.

If the host sends more data after the DataEnd bit has been set, then the USB Controller will send a STALL. An Endpoint 0 interrupt will be generated and the SentStall bit will be set.

#### 15.11.2 Write Requests

Write requests involve an additional packet (or packets) of data being sent from the host after the 8-byte command. An example of a write Standard Device Request is: SET\_DESCRIPTOR.

The sequence of events will begin, as with all requests, when the software receives an Endpoint 0 interrupt. The IDX0\_OutPktRdy bit will also have been set. The 8-byte command should then be read from the Endpoint 0 FIFO and decoded.

As with a zero data request, the IDX0\_ServicedOutPktRdy bit (indicating that the command has been read from the FIFO) should be set, but in this case the IDX0\_DataEnd bit should not be set (indicating that more data is expected).

When a second Endpoint 0 interrupt is received, the IDX0 register should be read to check the endpoint status. The IDX0\_OutPktRdy bit should be set to indicate that a data packet has been received. The IDX2\_ENDPTOUTCOUNT field should then be read to determine the size of this data packet. The data packet can then be read from the Endpoint 0 FIFO.

If the length of the data associated with the request (indicated by the wLength field in the command) is greater than the maximum packet size for Endpoint 0, further data packets will be sent. In this case, the IDX0\_ServicedOutPktRdy bit should be set, but the IDX0\_DataEnd bit should not be set.

When all the expected data packets have been received, the IDX0\_ServicedOutPktRdy bit and the DataEnd bit (indicating that no more data is expected) should be set.

When the host moves to the status stage of the request, another Endpoint 0 interrupt will be generated to indicate that the request has completed. No further action is required from the software, the interrupt is just a confirmation that the request completed successfully.

If the command is an unrecognized command, or for some other reason cannot be executed, then when it has been decoded, the IDX0\_ServicedOutPktRdy bit and th IDX0\_SendStall bit should be set. When the host sends more data, the USB Controller will send a STALL to tell the host that the request was not executed. An Endpoint 0 interrupt will be generated and the IDX0\_SentStall bit will be set.

If the host sends more data after the DataEnd has been set, then the USB Controller will send a STALL. An Endpoint 0 interrupt will be generated and the SentStall bit will be set.

### 15.11.3 Read Requests

Read requests have a packet (or packets) of data sent from the function to the host after the 8-byte command. Examples of read Standard Device Requests are: GET\_CONFIGURATION, GET\_INTERFACE, GET\_DESCRIPTOR, GET\_STATUS, SYNCH\_FRAME.

The sequence of events will begin, as with all requests, when the software receives an Endpoint 0 interrupt. The IDX0\_OutPktRdy bit will also have been set. The 8-byte command should then be read from the Endpoint 0 FIFO and decoded. The IDX0\_ServicedOutPktRdy bit (indicating that the command was read from the FIFO) should then be set.

The data to be sent to the host should then be written to the Endpoint 0 FIFO. If the data to be sent is greater than the maximum packet size for Endpoint 0, only the maximum packet size should be written to the FIFO. The IDX0\_InPktRdy bit (indicating that there is a packet in the FIFO to be sent) should be set. When the packet has been sent to the host, another Endpoint 0 interrupt will be generated and the next data packet can be written to the FIFO.

When the last data packet has been written to the FIFO, the IDX0\_InPktRdy bit and the IDX0\_DataEnd bit (indicating that there is no more data after this packet) should be set.

When the host moves to the status stage of the request, another Endpoint 0 interrupt will be generated to indicate that the request has completed. No further action is required from the software: the interrupt is just a confirmation that the request completed successfully.

If the command is an unrecognized command, or for some other reason cannot be executed, then when it has been decoded, the IDX0\_ServicedOutPktRdy bit and the IDX0\_SendStall bit should be set. When the host requests data, the USB Controller will send a STALL to tell the host that the request was not executed. An Endpoint 0 interrupt will be generated and the IDX0\_SentStall bit will be set.

If the host requests more data after the DataEnd has been set, then the USB Controller will send a STALL. An Endpoint 0 interrupt will be generated and the SentStall bit will be set.

## 15.11.4 Endpoint 0 States

As illustrated in Figure 51, the Endpoint 0 control needs three modes – IDLE, TX and RX – corresponding to the different phases of the control transfer and the states Endpoint 0 enters for the different phases of the transfer. The default mode on power-up or reset should be IDLE.

IDX0\_OutPktRdy becoming set when Endpoint 0 is in IDLE state indicates a new device request. Once the device request is unloaded from the FIFO, the USB Controller decodes the descriptor to find whether there is a Data phase and, if so, the direction of the Data phase for the control transfer (in order to set the FIFO direction).

Depending on the direction of the Data phase, Endpoint 0 goes into either TX state or RX state. If there is no Data phase, Endpoint 0 remains in IDLE state to accept the next device request.

The actions that the CPU needs to take at the different phases of the possible transfers (e.g. Loading the FIFO, Setting InPktRdy) are indicated in the diagram on the following page.

NOTE

The USB Controller changes the FIFO direction depending on the direction of the Data phase independently of the CPU.



Figure 51. Endpoint 0 States

# 15.11.5 Endpoint 0 Service Routine

An Endpoint 0 interrupt is generated:

• When the core sets the IDX0\_OutPktRdy bit after a valid token has been received and data has been written to the FIFO.

- When the core clears the IDX0\_InPktRdy bit after the packet of data in the FIFO has been successfully transmitted to the host.
- When the core sets the IDX0\_SentStall bit after a control transaction is ended due to a protocol violation.
- When the core sets the IDX0\_SetupEnd bit because a control transfer has ended before IDX0\_DataEnd bit is set.

Whenever the Endpoint 0 service routine is entered, the firmware must first check to see if the current control transfer has been ended due to either a STALL condition or a premature end of control transfer. If the control transfer ends due to a STALL condition, the IDX0\_SentStall bit would be set. If the control transfer ends due to a premature end of control transfer, the SetupEnd bit would be set. In either case, the firmware should abort processing the current control transfer and set the state to IDLE.

Once the firmware has determined that the interrupt was not generated by an illegal bus state, the next action taken depends on the Endpoint state as shown in Figure 52.

If Endpoint 0 is in IDLE state, the only valid reason an interrupt can be generated is as a result of the core receiving data from the USB bus. The service routine must check for this by testing the OutPktRdy bit. If this bit is set, then the core has received a SETUP packet. This must be unloaded from the FIFO and decoded to determine the action the core must take. Depending on the command contained within the SETUP packet, Endpoint 0 will enter one of three states:

- If the command is a single packet transaction (SET\_ADDRESS, SET\_INTERFACE, etc.) without any data phase, the endpoint will remain in IDLE state.
- If the command has an OUT data phase (SET\_DESCRIPTOR, etc.) the endpoint will enter RX state.
- If the command has an IN data phase (GET\_DESCRIPTOR, etc.) the endpoint will enter TX state.

If the endpoint is in TX state, the interrupt indicates that the core has received an IN token and data from the FIFO has been sent.

The firmware must respond to this either by placing more data in the FIFO if the host is still expecting more data2 or by setting the DataEnd bit to indicate that the data phase is complete. Once the data phase of the transaction has been completed,

Endpoint 0 should be returned to IDLE state to await the next control transaction.

If the endpoint is in RX state, the interrupt indicates that a data packet has been received. The firmware must respond by unloading the received data from the FIFO. The firmware must then determine whether it has received all of the expected data2. If it has, the firmware should set the DataEnd bit and return Endpoint 0 to IDLE state. If more data is expected, the firmware should set the IDX0\_ServicedOutPktRdy bit to indicate that it has read the data in the FIFO and leave the endpoint in RX state.



Figure 52. Endpoint 0 Service Routine

# 15.11.5.1 Idle Mode

IDLE mode is the mode the Endpoint 0 control needs to select at power-on or reset and is the mode to which the Endpoint 0 control should return when the RX and TX modes are terminated.

It is also the mode in which the SETUP phase of control transfer is handled as outlined in Figure 53 below.





## 15.11.5.2 TX Mode

When the endpoint is in TX state, all arriving IN tokens need to be treated as part of a Data phase until the required amount of data has been sent to the host. If either a SETUP or an OUT token is received whilst the endpoint is in the TX state, this will cause a Setup End condition to occur as the core expects only IN tokens.

Three events can cause TX mode to be terminated before the expected amount of data has been sent:

- The host sends an invalid token causing a IDX0\_SetupEnd condition.
- The firmware sends a packet containing less than the maximum packet size for Endpoint 0 (IDX0\_MAX-PAYLOAD).
- The firmware sends an empty data packet.

Until the transaction is terminated, the firmware simply needs to load the FIFO when it receives an interrupt which indicates that a packet has been sent from the FIFO. (An interrupt is generated when InPktRdy is cleared.)

When the firmware forces the termination of a transfer (by sending a short or empty data packet), it should set the DataEnd bit to indicate to the core that the Data phase is complete and that the core should next receive an acknowledge packet.



Figure 54. IN Data Phase for Control Transfer

### 15.11.5.3 RX Mode

In RX mode, all arriving data should be treated as part of a Data phase until the expected amount of data has been received. If either a SETUP or an IN token is received while the endpoint is in RX state, this will cause a Setup End condition to occur as the core expects only OUT tokens.

Three events can cause RX mode to be terminated before the expected amount of data has been received:

- The host sends an invalid token causing an IDX0\_SetupEnd condition
- The host sends a packet which contains less than the maximum packet size for Endpoint 0
- The host sends an empty data packet

Until the transaction is terminated, the firmware simply needs to unload the FIFO when it receives an interrupt which indicates that new data has arrived (IDX1\_OutPktRdy set) and to clear OutPktRdy by setting the IDX0\_ServicedOutPktRdy bit.

When the firmware detects the termination of a transfer (by receiving either the expected amount of data or an empty data packet), it should set the DataEnd bit to indicate to the core that the Data phase is complete and that the core should receive an acknowledge packet next.



Figure 55. OUT Data Phase for Control Transfer

## 15.11.6 Error Handling

A control transfer may be aborted due to a protocol error on the USB, the host prematurely ending the transfer, or if the function controller software wishes to abort the transfer (e.g., because it cannot process the command).

The USB Controller will automatically detect protocol errors and send a STALL packet to the host under the following conditions:

- The host sends more data during the OUT Data phase of a write request than was specified in the command. This condition is detected when the host sends an OUT token after the IDX0\_DataEnd bit has been set.
- The host request more data during the IN Data phase of a read request than was specified in the command. This condition is detected when the host sends an IN token after the DataEnd bit has been set.
- The host sends more than IDX0\_MAXPAYLOAD data bytes in an OUT data packet.
- The host sends a non-zero length DATA1 packet during the STATUS phase of a read request.

When the USB Controller has sent the STALL packet, it sets the IDX0\_SentStall bit and generates an interrupt. When the software receives an Endpoint 0 interrupt with the SentStall bit set, it should abort the current transfer, clear the SentStall bit, and return to the IDLE state.

If the host prematurely ends a transfer by entering the STATUS phase before all the data for the request has been transferred, or by sending a new SETUP packet before completing the current transfer, then the SetupEnd bit will be set and an Endpoint 0 interrupt generated. When the software receives an Endpoint 0 interrupt with the SetupEnd bit set, it should abort the current transfer, set the IDX0\_ServicedSetupEnd bit, and return to the IDLE state. If the OutPktRdy bit is set this indicates that the host has sent another SETUP packet and the software should then process this command.

If the software wants to abort the current transfer, because it cannot process the command or has some other internal error, then it should set the SendStall bit. The USB Controller will then send a STALL packet to the host, set the SentStall bit and generate an Endpoint 0 interrupt.

# 15.12 IN Endpoint Packet Handling

The sizes of the IN FIFOs for Endpoints 1 to 5 are selected through the INFIFOSZ field of the IDX2 register. The maximum size of data packets that may be placed in an IN endpoint's FIFO for transmission is programmable and is determined by the values written to the INDX0\_MAXPAYLOAD field.

The use of single or double packet buffering is part of the specification for the endpoint FIFO – see "Dynamic FIFO Sizing" on page 154.) When double packet buffering is enabled, two data packets can be buffered in the FIFO. When single packet buffering is enabled, only one packet can be buffered even if the packet is less than half the FIFO size.

NOTE

The maximum packet size set for any endpoint must not exceed the FIFO size. Also note that unexpected results may occur if the MAXPAYLOAD field is written to while data is in the FIFO.

## 15.12.1 Single Packet Buffering

If the size of the IN endpoint FIFO is less than twice the maximum packet size for this endpoint (as set in the IDX2\_INFIFOSZ field), only one packet can be buffered in the FIFO and single packet buffering is enabled.

As each packet to be sent is loaded into the IN FIFO, the InPktRdy bit in the IDX0 register needs to be set. If the IDX0\_AUTOSET field is set, the IDX0\_InPktRdy bit will be automatically set when a maximum-sized packet is loaded into the FIFO. For packet sizes less than the maximum and where IDX0\_AUTOSET may not be used (high bandwidth Isochronous/Interrupt transactions), IDX0\_InPktRdy will always have to be set manually (i.e., by the CPU).

When the IDX1\_InPktRdy bit is set, either manually or automatically, the packet is deemed ready to be sent. The FIFONotEmpty bit in IDX0 is also set.

When the packet has been successfully sent, both InPktRdy and FIFONotEmpty are cleared and the appropriate IN endpoint interrupt generated (if enabled). The next packet can then be loaded into the FIFO.

### 15.12.2 Double Packet Buffering

NOTE

Double packet buffering is *disabled* if DPKTBUFDIS field of the IDX0 register is set. The default setting for this bit is enabled. However, for double packet buffering to be enabled, bit 4 of the 5-bit IDX2\_INFIFOSZ field must be set.

The following conditions must exist to enable buffering of two data packets (double packet buffering):

- The size of the IN endpoint FIFO (as set in the IDX2\_INFIFOSZ field) is at least twice the maximum packet size for the endpoint (as set in the IDX1\_MAXPAYLOAD field).
- Bit 4 of the IDX2\_INFIFOSZ field must be set.
- DPKTBUFDIS field of the IDX0 register is cleared.

When the first packet to be received is loaded into the OUT FIFO, the IDX1\_OutPktRdy field is set and the appropriate OUT endpoint interrupt is generated (if enabled) to signal that a packet can now be unloaded from the FIFO.

As each packet to be sent is loaded into the IN FIFO, the IDX1\_InPktRdy bit needs to be set. If the IDX0\_AUTOSET field is set, the InPktRdy bit will be automatically set when a maximum-sized packet is loaded into the FIFO. For packet sizes less than the maximum, InPktRdy will always have to be set manually (i.e., by the CPU).

When the InPktRdy bit is set, either manually or automatically, the packet is deemed ready to be sent. The FIFONotEmpty bit in IDX0 is also set.

After the first packet is loaded, InPktRdy will then immediately be cleared and an interrupt is generated. A second packet can now be loaded into the IN FIFO and InPktRdy set again (either manually or automatically if the packet is the maximum size). Both packets are now ready to be sent.

When the first packet has been successfully sent, InPktRdy is cleared and the appropriate IN endpoint interrupt generated (if enabled) to signal that another packet can now be loaded into the IN FIFO. The state of the FIFONotEmpty bit at this point indicates how many packets may be loaded. If the FIFONotEmpty bit is set then there is another packet in the FIFO and only one more packet can be loaded. If the FIFONotEmpty bit is clear then there are no packets in the FIFO and two more packets can be loaded.

## 15.12.3 High Bandwidth Isochronous Endpoints

In High-speed mode, Isochronous IN endpoints can transmit up to three 'USB' packets in any microframe, with a payload of up to 1024 bytes in each packet, corresponding to a data transfer rate of up to 3072 bytes per microframe.

The USB Controller supports this by allowing the user to load data packets of up to 3072 bytes (i.e., 3 × 1024 bytes) into the associated FIFO in a single transaction. From the point of view of the software in the CPU, the operation is then exactly as described above for Single Packet Buffering or Double Packet Buffering (as appropriate) except that IDX0\_InPktRdy will always need to be set manually (i.e., by the CPU) as IDX0\_AUTOSET does not operate with high-bandwidth Isochronous transfers.

Any data packet loaded into the FIFO that is larger than the maximum payload is automatically split into 'USB' packets of the maximum payload, or smaller, for transmission over the USB. The number of USB packets transmitted per microframe and the maximum payload in each packet is defined through the INDX0 register. The INDX0\_MAXPAYLOAD determine the maximum payload in any USB packet while the PKTSPLITOPTION field determines the maximum number of such packets that can be sent in one microframe (2 or 3). Together, these set the maximum size of packet that can be loaded into the FIFO.

At least one USB packet will always be sent: the number of further USB packets sent in the same microframe will depend on the amount of data loaded into the FIFO. The InPktRdy bit will be cleared and an interrupt generated only when all the packets have been sent.

Each USB packet is sent in response to an IN token. If, at the end of a microframe, the USB Controller has not received enough IN tokens to send all the USB packets (e.g. because one of the IN tokens received was corrupted), the remainder of the data packet will be flushed from the FIFO. The IDX0\_InPktRdy bit will then be cleared and the IncompTx bit in the IDX0 register set to indicate that not all of the data loaded into the FIFO was sent.

#### NOTE

Any second data packet in the FIFO will not be flushed.



Figure 56. High-speed Isochronous IN Endpoint Transmission

# 15.12.4 Optional Special Handling

The packets transferred in Bulk operations are defined by the USB Specification to be either 8, 16, 32, 64 or 512 bytes in size, with the 512 byte option only applying to High Speed transfers. For some system designs, however, it may be more convenient for the application software to write larger amounts of data to an endpoint in a single operation than can be transferred in a single USB operation. A particular case in point is where the same endpoint is used for high-speed transfers of 512 bytes under certain circumstances but for full-speed transfers under other circumstances. When operating at full-speed, the maximum amount of data transferred in a single operation is then just 64 bytes.

To cater to such circumstances, the USB Controller includes a configuration option which, if selected, allows larger data packets to be written to Bulk IN endpoints which are then split into packets of an appropriate (specified) size for transfer across the USB bus. (A similar option exists for reading from Bulk OUT endpoints in larger volumes than individual USB packets.)

Under this option, the IDX0 register for the endpoint is increased to 16 bits and the MAXPAYLOAD field of the register defines the payload for each individual transfer, while the PKTSPLITOPTION field defines a multiplier. The application software can then write data packets of size multiplier × payload to the FIFO which the USB Controller will then split into individual packets of the stated payload for transmission over

the USB. From the application software's point of view, the resulting operation will not differ from the transmission of a single USB packet except in the size of the packet written.

#### NOTE

This feature is only for use with Bulk endpoints and, in accordance with the USB Specification, the payload must be either 8, 16, 32, 64 or 512 bytes with the 512-byte option only applicable for High-Speed transfers. The payload recorded in the INDX0\_MAXPAYLOAD field must also match the wMaxPacketSize field of the Standard Endpoint Descriptor for the endpoint. The associated FIFO must also be large enough to accommodate the data packet prior to being split.

# 15.13 OUT Endpoint Packet Handling

The sizes of the OUT FIFOs for Endpoints 1 to 5 are determined through the IDX2\_OUTFIFOSZ field. The maximum amount of data received by an OUT endpoint in any frame or microframe (in High-speed mode) is programmable and is determined by the value written to the IDX1\_MAXPAYLOAD field for that endpoint. (maximum payload × number of transactions/microframe (where applicable)).

The use of single or double packet buffering is part of the specification for the endpoint FIFO. When double packet buffering is enabled, two data packets can be buffered in the FIFO: when single packet buffering is enabled, only one packet can be buffered even if the packet is less than half the FIFO size.

| NOTE                                                   |
|--------------------------------------------------------|
| The maximum packet size must not exceed the FIFO size. |

# 15.13.1 Single Packet Buffering

If the size of the OUT endpoint FIFO is less than twice the maximum packet size for this endpoint (as set in the IDX2\_OUTFIFOSZ field), only one data packet can be buffered in the FIFO and single packet buffering is enabled.

When a packet is received and placed in the OUT FIFO, the IDX1\_OutPktRdy bit and the FIFOFULL bit in IDX1 are set and the appropriate OUT endpoint is generated (if enabled) to signal that a packet can now be unloaded from the FIFO.

After the packet has been unloaded, the IDX1\_OutPktRdy field needs to be cleared in order to allow further packets to be received. If the AUTOCLEAR bit in IDX1 is set and a maximum-sized packet is unloaded from the FIFO, the IDX1\_OutPktRdy field is cleared automatically. The FIFOFULL bit is also cleared. For packet sizes less than the maximum, OutPktRdy will always have to be cleared manually (i.e., by the CPU).

| Remainder (MAXPAYLOAD/4)        | Actual Bytes<br>Read | Packet Sizes Which Will Clear OutPktRdy              |
|---------------------------------|----------------------|------------------------------------------------------|
| 0 (i.e., MAXPAYLOAD = 64 bytes) | MAXPAYLOAD           | MAXPAYLOAD, MAXPAYLOAD-1, MAXPAYLOAD-2, MAXPAYLOAD-3 |
| 3 (i.e., MAXPAYLOAD = 63 bytes) | MAXPAYLOAD+1         | MAXPAYLOAD, MAXPAYLOAD-1, MAXPAYLOAD-2               |
| 2 (i.e., MAXPAYLOAD = 62 bytes) | MAXPAYLOAD+2         | MAXPAYLOAD, MAXPAYLOAD-1                             |
| 1 (i.e., MAXPAYLOAD = 61 bytes) | MAXPAYLOAD+3         | MAXPAYLOAD                                           |

Table 87: Unloaded Packet Sizes < MAXPAYLOAD Requiring Manual Clear of OutPktRdy

# 15.13.2 Double Packet Buffering

#### NOTE

Double packet buffering is *disabled* if DPKTBUFDIS field of the IDX0 register is set. The default setting for this bit is enabled. However, for double packet buffering to be enabled, bit 4 of the 5-bit IDX2\_OUTFIFOSZ field must be set.

The following conditions must exist to enable buffering of two data packets (double packet buffering):

- The size of the OUT endpoint FIFO (as set in the IDX2\_OUTFIFOSZ field) is at least twice the maximum packet size for the endpoint (as set in the IDX1\_MAXPAYLOAD field).
- Bit 4 of the IDX2\_OUTFIFOSZ field must be set.
- DPKTBUFDIS field of the IDX1 register is cleared.

When the first packet to be received is loaded into the OUT FIFO, the IDX1\_OutPktRdy field is set and the appropriate OUT endpoint interrupt is generated (if enabled) to signal that a packet can now be unloaded from the FIFO.

NOTE

The FIFOFULL bit in IDX1 is not set at this point: it is only set if a second packet is received and loaded into the OUT FIFO.

After each packet has been unloaded, OutPktRdy needs to be cleared in order to allow further packets to be received. If the AUTOCLEAR bit in IDX1 is set and a maximum-sized packet is unloaded from the FIFO, the OutPktRdy field will be cleared automatically. For packet sizes less than the maximum, OutPktRdy will always have to be cleared manually (i.e., by the CPU).

If the FIFOFULL bit was set to 1 when OutPktRdy is cleared, the USB Controller will first clear the FIFOFULL bit. It will then set OutPktRdy again to indicate that there is another packet waiting in the FIFO to be unloaded.

### 15.13.3 High Bandwidth Isochronous Endpoints

In High-speed mode, Isochronous OUT endpoints can receive up to three 'USB' packets in any microframe, with a payload of up to 1024 bytes in each packet, corresponding to a data transfer rate of up to 3072 bytes per microframe.

The USB Controller supports this by automatically combining all the USB packets received during a microframe into a single packet of up to 3072 bytes (i.e., 3 × 1024 bytes) within the OUT FIFO. From the point of view of the software in the CPU, the operation is then exactly as described above for Single Packet Buffering or Double Packet Buffering (as appropriate) except that OutPktRdy will always need to be cleared manually (i.e., by the CPU) as AUTOCLEAR does not operate with high-bandwidth Isochronous transfers.

The maximum number of USB packets that may be received in any microframe and the maximum payload of these packets are defined through the IDX1 register. The IDX1\_MAXPAYLOAD field determines the maximum payload in any USB packet while the IDX1\_PKTSPLITOPTION determines the maximum number of these packets that may be received in a microframe (2 or 3).

The number of USB packets sent in any microframe will depend on the amount of data to be transferred, and is indicated through the PIDs used for the individual packets. If the indicated number of packets have not been received by the end of a microframe, the IncompRx bit in the IDX1 register will be set to indicate that the data in the FIFO is incomplete. Equally, if a packet of the wrong data type is received, then the PID Error bit is the IDX1 register will be set. In each case an interrupt will, however, still be generated to allow the data that has been received to be read from the FIFO.

### NOTE

The circumstances in which a PID Error or IncompRx is reported depends on the precise sequence of packets received. When the core is operating in Peripheral mode, the details are as follows.



Figure 57. Packets Sent per Microframe

| Number of<br>Packets<br>Expected | Data Packet(s)<br>Received | Response                        | Number of<br>Packets<br>Expected | Data Packet(s)<br>Received | Response      |
|----------------------------------|----------------------------|---------------------------------|----------------------------------|----------------------------|---------------|
|                                  | DATA0 ('D0')               | ОК                              |                                  | D0                         | ОК            |
| 1                                | DATA1 ('D1')               | PID Error set                   |                                  | D1                         | IncompRx set  |
|                                  | DATA2 ('D2')               | PID Error set                   |                                  | D2                         | IncompRx set  |
|                                  | MDATA ('DM')               | PID Error set                   |                                  | DM                         | IncompRx set  |
|                                  | D0                         | ОК                              |                                  | DM D0                      | PID Error set |
|                                  | D1                         | IncompRx set                    |                                  | DM D1                      | ОК            |
|                                  | D2                         | IncompRx set<br>+ PID Error set | 3                                | DM D2                      | IncompRx set  |
| 2                                | DM                         | IncompRx set                    |                                  | DM DM                      | IncompRx set  |
|                                  | DM D0                      | PID Error set<br>OK             |                                  | DM DM D0                   | PID Error set |
|                                  | DM D1                      |                                 |                                  | DM DM D1                   | PID Error set |
|                                  | DM D2                      | PID Error set                   |                                  | DM DM D2                   | ОК            |
|                                  | DM DM                      | PID Error set                   |                                  | DM DM DM                   | PID Error set |

 Table 88: PID Errors and IncompRx Responses in Peripheral Mode

# 15.13.4 Optional Special Handling

The packets transferred in Bulk operations are defined by the USB Specification to be either 8, 16, 32, 64 or 512 bytes in size, with the 512 byte option only applying to High Speed transfers. For some system designs, however, it may be more convenient for the application software to read larger amounts of data from an endpoint in a single operation than can be transferred in a single USB operation. A particular case in point is where the same endpoint is used for high-speed transfers of 512 bytes under certain circumstances but for full-speed transfers under other circumstances. When operating at full-speed, the maximum amount of data transferred in a single operation is then just 64 bytes.

To cater for such circumstances, the USB Controller includes a configuration option which, if selected, causes the USB Controller to combine the packets received across the USB bus into larger data packets prior to being read by the application software. (A similar option exists for writing to Bulk IN endpoints in larger volumes than individual USB packets – see Section 9.4.)

The IDX1 register for the endpoint has the bottom 11 bits of the register defined as the payload, MAXPAYLOAD field, for each individual transfer, while the PKTSPLITOPTION field defines a multiplier. The USB Controller will then combine the appropriate number of the USB packets it receives into a single data packet of size multiplier × payload within the FIFO before asserting OutPktRdy to alert the application software to the presence of a packet to read in the FIFO. From the application software's point of view, the resulting operation will not differ from the receipt of a single USB packet except in the size of the packet read.

This facility is offered as a configuration option rather than as a standard feature because it increases the gate count.

#### NOTE

This feature is only for use with Bulk endpoints and, in accordance with the USB Specification, the payload must be either 8, 16, 32, 64 or 512 bytes with the 512byte option only applicable for High-Speed transfers. The payload recorded in the IDX1\_MAXPAYLOAD field must also match the wMaxPacketSize field of the Standard Endpoint Descriptor for the endpoint. The associated FIFO must also be large enough to accommodate the amalgamated data packet.

#### NOTE

OutPktRdy is only set when either the specified number of packets have been received or a "short" USB packet is received (i.e., a packet of less than the specified payload for the endpoint). If a protocol is being used whereby the endpoint receives bulk transfers that are a multiple of the recorded payload size with no short packet to terminate it, the IDX1\_MAXPAYLOAD field should not be programmed to expect more packets than there are in the transfer (otherwise the software will not be interrupted at the end of the transfer).

# 15.14 Bulk Transactions

## 15.14.1 Bulk IN Endpoint

A Bulk IN endpoint is used to transfer non-periodic data from the function controller to the host. The following optional features are available for use with a Bulk IN endpoint:

Double packet buffering

The use of single or double packet buffering is part of the specification for the endpoint FIFO) When enabled, up to two packets can be stored in the FIFO awaiting transmission to the host.

AutoSet

When the AutoSet feature is enabled, the IDX0\_InPktRdy bit will be automatically set when a packet of MAXPAYLOAD bytes has been loaded into the FIFO.

Automatic Packet Splitting

For some system designs, it may be convenient for the application software to write larger amounts of data to an endpoint in a single operation than can be transferred in a single USB operation. A particular case in point is where the same endpoint is used for high-speed transfers of 512 bytes under certain circumstances but for full-speed transfers under other circumstances. When operating at full-speed, the maximum amount of data transferred in a single operation is then just 64 bytes. To support such circumstances, the USB Controller includes a configuration option which, if selected, allows larger data packets to be written to Bulk endpoints which are then split into packets of an appropriate (specified) size for transfer across the USB bus. The necessary packet size information is set via the IDX0\_MAXPAYLOAD field.

### 15.14.1.1 Setup

Before using a Bulk IN endpoint the INDX0\_MAXPAYLOAD field must be written with the maximum packet size (in bytes) for the endpoint. This value should be the same as the wMaxPacketSize field of the Standard Endpoint Descriptor for the endpoint. In addition, the relevant interrupt enable bit in the CFG2 register should be set to 1 (if an interrupt is required for this endpoint), and the relevant fields in the IDX0 register should be set as shown in Table 89.

| Field      | Setting | Description                                                                                         |
|------------|---------|-----------------------------------------------------------------------------------------------------|
| AutoSet    | 0/1     | Set to 1 if the AutoSet feature is required.                                                        |
| ISO        | 0       | Set to 0 to enable Bulk protocol.                                                                   |
| Mode       | 1       | Set to 1 to ensure the FIFO is enabled (only necessary if the FIFO is shared with an OUT endpoint). |
| FrcDataTog | 0       | Set to 0 to allow normal data toggle operation.                                                     |
| DPktBufDis | 0       | Set to 0 to enable Double Packet Buffering.<br>Set to 1 to disable double packet buffering.         |

When a Bulk IN endpoint is first configured, following a SET\_CONFIGURATION or SET\_INTERFACE command on Endpoint 0, then IDX0\_CIrDataTog bit should be set. This will ensure that the data toggle (which is handled automatically by the USB Controller) starts in the correct state. Also if there are any data packets in the FIFO (indicated by the IDX0\_FIFONotEmpty bit being set), they should be flushed by setting

the IDX0\_FlushFIFO bit. It may be necessary to set this bit twice in succession if double buffering is enabled.

### 15.14.1.2 Operation

When data is to be transferred over a Bulk IN pipe, a data packet is loaded into the FIFO and the IDX0 register written to set the InPktRdy bit. When the packet has been sent, the InPktRdy bit will be cleared by the USB Controller and an interrupt generated so that the next packet can be loaded into the FIFO. If double packet buffering is enabled (i.e., if the size of the FIFO is at least twice the maximum packet size set in the IDX0 register), then after the first packet has been loaded and the InPktRdy bit set, InPktRdy will be immediately cleared by the USB Controller and an interrupt generated so that a second packet can be loaded into the FIFO. This means the software can operate the same way, loading a packet when it receives an interrupt, regardless of whether double packet buffering is enabled or not.

In general, the packet size must not exceed the payload specified in the IDX0\_MAXPAYLOAD field. This defines the maximum packet size for a single transfer over the USB and, for bulk transfers, is required by the USB Specification to be either 8, 16, 32, 64 (Full-Speed or High-Speed) or 512 bytes (High-Speed only). If more than this amount of data is to be transferred, this needs to be sent as multiple USB packets which should all carry the full payload, except for the last packet which holds the residue.

The exception to this rule applies where the automatic Bulk packet splitting option has been selected when the core was configured. Where this option has been selected, packets up to 32 times IDX0\_MAXPAYLOAD can be written to the FIFO (assuming that the FIFO is big enough to accept these larger packets) which are then split by the core into packets of the appropriate size for transfer over the USB. The size of the packets written to the FIFO is given by m × payload where IDX0\_PKTSPLITOPTION = m - 1. All the application software needs to do to take advantage of this feature is to set the appropriate values in the IDX0\_MAXPAYLOAD field (and ensure that the value written to it matches the value given in the wMaxPacketSize field of the Standard Endpoint Descriptor for the associated endpoint). As far as the application software is concerned, the process of transferring these larger packets is no different from that used to transfer a standard-sized Bulk packet.

The host may determine that all the data for a transfer has been sent by knowing the total size of the data block. Alternatively, it may infer that all the data have been sent when it receives a packet which is less than the MAXPAYLOAD in size. In the latter case, if the total size of the data block is an exact multiple of the payload, it will be necessary for the function to send a null packet after all the data has been sent. This is done by setting InPktRdy when the next interrupt is received, without loading any data into the FIFO.

#### 15.14.1.3 Error Handling

If the software wants to shut down the Bulk IN pipe, it should set the IDX0\_SendStall bit. When the USB Controller receives the next IN token, it will send a STALL to the host, set the IDX0\_SentStall bit and generate an interrupt. When the software receives an interrupt with the SentStall bit set, it should clear the SentStall bit. It should leave the SendStall bit set until it is ready to re-enable the Bulk IN pipe.

#### NOTE

If the host failed to receive the STALL packet for some reason, it will send another IN token, so it is advisable to leave the SendStall bit set until the software is ready to re-enable the Bulk IN pipe. When a pipe is re-enabled, the data toggle sequence should be restarted by setting the IDX0\_CIrDataTog bit.

#### 15.14.2 Bulk OUT Endpoint

A Bulk OUT endpoint is used to transfer non-periodic data from the host to the function controller.

The following optional features are available for use with a Bulk OUT endpoint:

### Double packet buffering

The use of single or double packet buffering is part of the specification for the endpoint FIFO). When enabled, up to two packets can be stored in the FIFO.

AutoClear

When the AutoClear feature is enabled, the IDX1\_OutPktRdy bit will be automatically cleared when a packet of MAXPAYLOAD bytes has been unloaded from the FIFO.

Automatic Packet Combining

For some system designs, it may be convenient for the application software to read larger amounts of data from an endpoint in a single operation than can be transferred in a single USB operation. A particular case in point is where the same endpoint is used for high-speed transfers of 512 bytes under certain circumstances but for full-speed transfers under other circumstances. When operating at full-speed, the maximum amount of data transferred in a single operation is then just 64 bytes. To support such circumstances, the USB Controller includes a configuration option which, if selected, causes the USB Controller to combine the packets received across the USB bus into larger data packets prior to being read by the application software. The necessary packet size information is set via the PKTSPLITOPTION and MAXPAYLOAD fields.

### 15.14.2.1 Setup

Before using a Bulk OUT endpoint, the IDX0\_MAXPAYLOAD field must be written with the maximum packet size (in bytes) for the endpoint. This value should be the same as the wMaxPacketSize field of the Standard Endpoint Descriptor for the endpoint. In addition, the relevant interrupt enable bit in the IntrOutE register should be set to 1 (if an interrupt is required for this endpoint) and CFG2 register should be set as shown in Table 90. Bit D0 is unused/Read-only.

| Field      | Setting | Description                                                                                 |
|------------|---------|---------------------------------------------------------------------------------------------|
| AutoClear  | 0/1     | Set to 1 if the AutoClear feature is required.                                              |
| ISO        | 0       | Set to 0 to enable Bulk protocol.                                                           |
| DisNye     | 0       | Set to 0 to allow normal PING flow control.                                                 |
| DPktBufDis | 0       | Set to 0 to enable double packet buffering.<br>Set to 1 to disable double packet buffering. |

Table 90: IDX0 Register Field Settings for Bulk OUT Endpoint

When a Bulk OUT endpoint is first configured, following a SET\_CONFIGURATION or SET\_INTERFACE command on Endpoint 0, the IDX1 register should be written to set the ClrDataTog bit. This will ensure that the data toggle (which is handled automatically by the USB Controller) starts in the correct state. Also if there are any data packets in the FIFO (indicated by the OutPktRdy bit being set), they should be flushed by setting the FlushFIFO bit. It may be necessary to set this bit twice in succession if double buffering is enabled.

### 15.14.2.2 Operation

When a data packet is received by a Bulk OUT endpoint, the IDX1\_OutPktRdy bit is set and an interrupt is generated. The software should read the IDX2\_ENDPTOUTCOUNT field for the endpoint to determine the size of the data packet. The data packet should be read from the FIFO, then the OutPktRdy bit should be cleared.

The packets received should not exceed the size specified in the IDX1\_MAXPAYLOAD field (because this should match the value set in the wMaxPacketSize field of the endpoint descriptor sent to the host). When a block of data larger than wMaxPacketSize needs to be sent to the function, it will be sent as multiple packets. All the packets will be wMaxPacketSize in size, except the last packet which will contain the remainder. The software may use an application specific method of determining the total size of the block and hence when the last packet has been received. Alternatively, it may infer that the entire block has been received when it receives a packet which is less than wMaxPacketSize in size. (If the total size of the data block is a multiple of wMaxPacketSize, a null data packet will be sent after the data to signify that the transfer is complete.)

1. In general, the application software will need to read each packet from the FIFO individually. The exception to this rule applies where the option for automatic combining of Bulk packets has been selected when the core was configured. Where this option has been selected, the core can receive up to 32 packets at a time and combine them into a single packet within the FIFO (assuming that the FIFO is big enough to accept these larger packets). The size of the packets written to the FIFO is given by m × wMaxPacketSize where IDX1\_PKTSPLITOPTION = m – 1. All the application software needs to do to take advantage of this feature is set the appropriate values in the IDX1\_MAXPAYLOAD field (and ensure that the value written to it matches the value given in the wMaxPacketSize field of the endpoint descriptor). As far as the application software is concerned, the process of transferring these larger packets is no different from that used to transfer a standard-sized Bulk packet.

## 15.14.2.3 Error Handling

If the software wants to shut down the Bulk OUT pipe, it should set the SendStall bit. When the USB Controller receives the next packet it will send a STALL to the host, set the SentStall bit and generate an interrupt.

When the software receives an interrupt with the SentStall bit set, it should clear the SentStall bit. It should leave the SendStall bit set until it is ready to re-enable the Bulk OUT pipe.

NOTE If the host failed to receive the STALL packet for some reason, it will send another packet, so it is advisable to leave the SendStall bit set until the software is ready to re-enable the Bulk OUT pipe. When a Bulk OUT pipe is re-enabled, the data toggle sequence should be restarted by setting the IDX1\_ClrDataTog bit.

# 15.14.3 Interrupt Transactions

### 15.14.3.1 Interrupt IN Endpoint

An Interrupt IN endpoint is used to transfer periodic data from the function controller to the host.

An Interrupt IN endpoint uses the same protocol as a Bulk IN endpoint and can be used the same way. Interrupt IN endpoints also support one feature that Bulk IN endpoints do not, in that they support continuous toggle of the data toggle bit. This feature is enabled by setting the FrcDataTog bit in the IDX0 register. When this bit is set to 1, the USB Controller will consider the packet as having been successfully sent and toggle the data bit for the endpoint, regardless of whether an ACK was received from the host.

### 15.14.3.2 Interrupt OUT Endpoint

An Interrupt OUT endpoint is used to transfer periodic data from the host to a function controller.

An Interrupt OUT endpoint uses almost the same protocol as a Bulk OUT endpoint and can be used the same way. The one difference is that Interrupt endpoints do not support PING flow control. This means that the USB Controller should never respond with a NYET handshake, only ACK/NAK/STALL. To ensure this,

the DisNye bit in the IDX1 register should be set to 1 to disable the transmission of NYET handshakes in High-speed mode.
## 15.15 Isochronous Transactions

#### 15.15.1 Isochronous IN Endpoint

An Isochronous IN endpoint is used to transfer periodic data from the function controller to the host. This section describes the use of full-speed Isochronous IN endpoints and low bandwidth (1 packet per microframe) high-speed Isochronous IN endpoints. High bandwidth high-speed (> 8 Mbps) endpoints are described in a later section.

The following optional features are available for use with an Isochronous IN endpoint:

Double packet buffering

The use of single or double packet buffering is part of the specification for the endpoint FIFO.) When enabled, up to two packets can be stored in the FIFO awaiting transmission to the host. Double packet buffering is generally advisable for Isochronous IN endpoints in order to avoid data underrun (see 'Operation' below).

AutoSet

When the AutoSet feature is enabled, the InPktRdy bit will be automatically set when a packet of MAXPAYLOAD bytes has been loaded into the FIFO. However, this feature is not particularly useful with Isochronous endpoints because the packets transferred often are not maximum packet size and the InCSRL register needs to be accessed following every packet to check for Underrun errors.

#### 15.15.1.1 Setup

Before using an Isochronous IN endpoint, the MAXPAYLOAD field must be written with the maximum packet size (in bytes) for the endpoint. This value should be the same as the wMaxPacketSize field of the Standard Endpoint Descriptor for the endpoint. In addition, the relevant interrupt enable bit in the CFG2 register should be set to 1 (if an interrupt is required for this endpoint) and the high byte of the IDX0 register should be set as shown in Table 91. (Bit D0 is unused):

| Field      | Setting | Description                                                                                         |
|------------|---------|-----------------------------------------------------------------------------------------------------|
| AutoSet    | 0/1     | Set to 1 if the AutoSet feature is required.                                                        |
| ISO        | 1       | Set to 1 to enable Isochronous protocol.                                                            |
| Mode       | 1       | Set to 1 to ensure the FIFO is enabled (only necessary if the FIFO is shared with an OUT endpoint). |
| FrcDataTog | 0       | Ignored in Isochronous mode.                                                                        |
| DPktBufDis | 0       | Set to 0 to enable double packet buffering.<br>Set to 1 to disable double packet buffering.         |

Table 91: IDX0 Register Field Settings for Isochronous IN Endpoint

#### 15.15.1.2 Operation

An Isochronous endpoint does not support data retries, so if data underrun is to be avoided, the data to be sent to the host must be loaded into the FIFO before the IN token is received. The host will send one IN token per frame (or microframe in Highspeed mode), however the timing within the frame (or microframe) can vary. If an IN token is received near the end of one frame and then at the start of the next frame, there will be little time to reload the FIFO. For this reason, double buffering is usually required for an Isochronous IN endpoint.

The AutoSet feature can be used with an Isochronous IN endpoint, in the same way as for a Bulk IN endpoint. However, unless the data arrives from the source at an absolutely consistent rate, synchronized to the host's frame clock, the size of the packets sent to the host will have to increase or decrease from frame to frame (or from microframe to microframe) to match the source data rate. This means that the actual packet sizes will not always be MAXPAYLOAD in size, rendering the AutoSet feature useless.

An interrupt is generated whenever a packet is sent to the host and the software may use this interrupt to load the next packet into the FIFO and set the InPktRdy bit in the IDX0 register in the same way as for a Bulk IN endpoint. As the interrupt could occur almost any time within a frame(/microframe), depending on when the host has scheduled the transaction, this may result in irregular timing of FIFO load requests. If the data source for the endpoint is coming from some external hardware, it may be more convenient to wait until the end of each frame (or microframe) before loading the FIFO as this will minimize the requirement for additional buffering. This can be done by using the SOF interrupt to trigger the loading of the next data packet. The CFG2\_SOF bit is set once per frame(/microframe) when a SOF packet is received. The interrupts may still be used to set the InPktRdy bit in IDX0 and to check for data overruns/underruns (see 'Error Handling' below).

Starting up a double-buffered Isochronous IN pipe can be a source of problems. Double buffering requires that a data packet is not transmitted until the frame(/microframe) after it is loaded. There is no problem if the function loads the first data packet at least one frame(/microframe) before the host sets up the pipe (and therefore starts sending IN tokens). But if the host has already started sending IN tokens by the time the first packet is loaded, the packet may be transmitted in the same frame(/microframe) as it is loaded, depending on whether it is loaded before, or after, the IN token is received. This potential problem can be avoided by setting the ISO Update bit in the CFG2 register. When this bit is set to 1, any data packet loaded into an Isochronous IN endpoint FIFO will not be transmitted until after the next SOF packet has been received, thereby ensuring that the data packet is not sent too early.

#### 15.15.1.3 Error Handling

If the endpoint has no data in its FIFO when an IN token is received, it will send a null data packet to the host and set the UnderRun bit in the IDX0 register. This is an indication that the software is not supplying data fast enough for the host. It is up to the application to determine how this error condition is handled.

If the software is loading one packet per frame(/microframe) and it finds that the InPktRdy bit in the IDX0 register is set when it wants to load the next packet, this indicates that a data packet has not been sent (perhaps because an IN token from the host was corrupted). It is up to the application how it handles this condition: it may choose to flush the unsent packet by setting the FlushFIFO bit in the IDX0 register, or it may choose to skip the current packet.

#### 15.15.2 Isochronous OUT Endpoint

An Isochronous OUT endpoint is used to transfer periodic data from the host to the function controller. This section describes the use of Full-speed Isochronous OUT endpoints and low bandwidth (1 packet per microframe) High-speed Isochronous OUT endpoints. High bandwidth High-speed (> 8 Mbps) endpoints are described in a later section.

The following optional features are available for use with an Isochronous OUT endpoint:

Double packet buffering

The use of single or double packet buffering is part of the specification for the endpoint FIFO.) When enabled, up to two packets can be stored in the FIFO awaiting transmission to the host.

NOTE

Double packet buffering is generally advisable for Isochronous OUT endpoints in order to avoid data overrun (see 'Operation' below).

#### AutoClear

When the AutoClear feature is enabled, the IDX1\_OutPktRdy bit will be automatically cleared when a packet of MAXPAYLOAD bytes has been unloaded from the FIFO. However, this feature is not particularly useful with Isochronous endpoints because the packets transferred often are not maximum packet size and the IDX1 register needs to be accessed following every packet to check for Overrun or CRC errors.

#### 15.15.2.1 Setup

Before using an Isochronous OUT endpoint, the MAXPAYLOAD field must be written with the maximum packet size (in bytes) for the endpoint. This value should be the same as the wMaxPacketSize field of the Standard Endpoint Descriptor for the endpoint.

In addition, the relevant interrupt enable bit in the CFG2 register should be set to 1 (if an interrupt is required for this endpoint) and IDX1 register should be set as shown in Table 92.

| Field      | Setting | Description                                                                                 |  |
|------------|---------|---------------------------------------------------------------------------------------------|--|
| AutoClear  | 0/1     | Set to 1 if the AutoClear feature is required.                                              |  |
| ISO        | 0       | to 1 to enable Isochronous protocol.                                                        |  |
| DisNye     | 0       | Ignored in Isochronous mode.                                                                |  |
| DPktBufDis | 0       | Set to 0 to enable double packet buffering.<br>Set to 1 to disable double packet buffering. |  |

Table 92: IDX0 Register Field Settings for Isochronous OUT Endpoint

#### 15.15.2.2 Operation

An Isochronous endpoint does not support data retries so if a data overrun is to be avoided, there must be space in the FIFO to accept a packet when it is received. The host will send one packet per frame (or microframe in High-speed mode), however the time within the frame can vary. If a packet is received near the end of one frame(/microframe) and another arrives at the start of the next frame, there will be little time to unload the FIFO. For this reason, double buffering is usually required for an Isochronous OUT endpoint.

The AutoClear feature can be used with an Isochronous OUT endpoint, in the same way as for a Bulk OUT endpoint. However, unless the data sink receives data at an absolutely consistent rate and is synchronized to the host's frame clock, the size of the packets sent from the host will have to increase or decrease from frame to frame (or from microframe to microframe) to match the required data rate. This means that the actual packet sizes will not always be MAXPAYLOAD in size, rendering the AutoClear feature useless.

An interrupt is generated whenever a packet is received from the host and the software may use this interrupt to unload the packet from the FIFO and clear the OutPktRdy bit in the IDX1 register in the same way as for a Bulk OUT endpoint.

As the interrupt could occur almost any time within a frame(/microframe), depending on when the host has scheduled the transaction, the timing of FIFO unload requests will probably be irregular. If the data sink for the endpoint is going to some external hardware, it may be better to minimize the requirement for additional buffering by waiting until the end of each frame(/microframe) before unloading the FIFO. This can be done by using the SOF interrupt to trigger the unloading of the data packet. Other interrupts may still be used to clear the OutPktRdy bit in the IDX1 register and to check for data overruns/underruns (see 'Error Handling' below).

#### 15.15.2.3 Error Handling

If there is no space in the FIFO to store a packet when it is received from the host, the OverRun bit in the IDX1 register will be set. This is an indication that the software is not unloading data fast enough for the host. It is up to the application to determine how this error condition is handled.

If the USB Controller finds that a received packet has a CRC error, it will still store the packet in the FIFO and set the IDX1\_OutPktRdy bit and the IDX1\_DataError bit. It is left up to the application how this error condition is handled.

#### 15.15.2.4 High-speed High-bandwidth IN Endpoint

Full-speed Isochronous endpoints and low bandwidth High-speed Isochronous endpoints transfer a single packet per frame (/microframe). In High-speed mode, this allows a maximum data transfer rate of 8 MB/ sec (64 Mbps), as the maximum payload for any packet on the USB is 1024 bytes. However, if the endpoint is defined to be high bandwidth, it can support data transfer rates up to 24 MB/sec (192 Mbps) by performing up to three such transactions per microframe.

High bandwidth lsochronous endpoints use PID sequencing to ensure that the host and function know how many packets are being transferred and to detect lost packets. The USB Controller handles the PID sequencing and the splitting of data into 'USB' packets for transmission over the USB automatically. To the software, it should seem as though a single packet of up to 3072 bytes (3 × 1024 bytes) is being transferred.

When setting up a high bandwidth Isochronous IN endpoint, the IDX0\_MAXPAYLOAD field should be written with the same value that was used in the wMaxPacketSize field of the endpoint descriptor. The lower 11 bits specify the maximum payload of a single transaction over the USB, while bits 11 and 12 specify the maximum number of such transactions per microframe.

The data to be sent in a microframe is loaded into the FIFO as a single packet and the IDX0\_InPktRdy bit is set in the same way as for a low bandwidth Isochronous IN endpoint. If the amount of data loaded exceeds the maximum data payload specified in the lower 11 bits of MAXPAYLOAD, the USB Controller will automatically split the data into two or three 'USB' packets for transmission.

The only difference the software should see between a high bandwidth and a low bandwidth Isochronous IN endpoint is an additional error condition that can occur with a high bandwidth endpoint. If the USB Controller has split the data to be sent into two or three 'USB' packets but has not received sufficient IN tokens from the host to send all the packets, the remainder of the data packet will be flushed from the FIFO at the end of the microframe and the InPktRdy bit will be cleared. In addition, the IDX0\_IncompTx bit will be set to indicate to the software that not all the data was sent. Any second data packet in the FIFO will not be flushed.



Figure 58. High-Speed High-Bandwidth IN Endpoint

#### 15.15.3 High-speed High-bandwidth OUT Endpoint

Full-speed Isochronous endpoints and low bandwidth High-speed Isochronous endpoints transfer a single packet per frame (/microframe). In High-speed mode, this allows a maximum data transfer rate of 8 MB/ sec (64 Mbps), as the maximum payload for any packet on the USB is 1024 bytes. However, if the endpoint is defined to be high bandwidth, it can support data transfer rates up to 24 MB/sec (192 Mbps) by performing up to three such transactions per microframe.

High bandwidth lsochronous endpoints use PID sequencing to ensure that the host and function know how many packets are being transferred and to detect lost packets. The USB Controller handles the PID sequencing and the combining of data from two or three USB packets automatically. To the software, it should seem as though a single packet, of up to 3072 bytes (3 × 1024 bytes), has been received.

When setting up a high bandwidth Isochronous OUT endpoint, the IDX1 register should be written with the same value that was used in the wMaxPacketSize field of the endpoint descriptor. The lower 11 bits specify the maximum payload of a single transaction over the USB, while bits 11 and 12 specify the maximum number of such transactions per microframe.

The USB Controller will combine the data from each of the packets received during the microframe and place the combined data packet into the FIFO. The IDX1\_OutPktRdy bit will be set when all the data packets for that microframe have been received. The software can then unload the data from the FIFO in the same way as for a low bandwidth Isochronous OUT endpoint.



Figure 59. High-speed High-bandwidth OUT Endpoint

The number of USB packets sent in any microframe will depend on the amount of data to be transferred, and is indicated through the PIDs used for the individual packets. If the indicated number of packets have not been received by the end of a microframe, the IDX1\_IncompRx bit will be set to indicate that the data in the FIFO is incomplete. Equally, if a packet of the wrong data type is received, then the IDX1\_DisNye bit will be set. In each case, an interrupt will, however, still be generated to allow the data that has been received to be read from the FIFO.

The circumstances in which a PID Error (indicated in IDX1\_DisNye field) or IncompRx is reported depends on the precise sequence of packets received. When the core is operating in Peripheral mode, the details are as follows.

| No. pkts<br>expected | Data pkt(s) received | Response                        | No. pkts<br>expected | Data pkt(s) received | Response      |
|----------------------|----------------------|---------------------------------|----------------------|----------------------|---------------|
|                      | DATA0 ('D0')         | ОК                              |                      | D0                   | ОК            |
| 1                    | DATA1 ('D1')         | PID Error set                   |                      | D1                   | IncompRx set  |
|                      | DATA2 ('D2')         | PID Error set                   |                      | D2                   | IncompRx set  |
|                      | MDATA ('DM')         | PID Error set                   |                      | DM                   | IncompRx set  |
|                      | D0                   | ОК                              |                      | DM D0                | PID Error set |
|                      | D1                   | IncompRx set                    |                      | DM D1                | ОК            |
|                      | D2                   | IncompRx set +<br>PID Error set |                      | DM D2                | IncompRx set  |
|                      | DM                   | IncompRx set                    | 3                    | DM DM                | IncompRx set  |
| 2                    | DM D0                | PID Error set                   |                      | DM DM D0             | PID Error set |
|                      | DM D1                | ок                              |                      | DM DM D1             | PID Error set |
|                      | DM D2                | PID Error set                   |                      | DM DM D2             | ок            |
|                      | DM DM                | PID Error set                   |                      | DM DM DM             | PID Error set |

Table 93: PID Error or IncompRx Reporting in Peripheral Mode

## **15.16 Transaction Flows**

In the following transaction flows, host actions are shown against a white background, and the USB Controller actions are shown shaded.

#### 15.16.1 Control Transactions

#### 15.16.1.1 Setup Phase





#### 15.16.1.2 IN Data Phase



Figure 61. IN Data Phase Transaction



Figure 62. Status Phase Following IN Data Phase Transaction

#### 15.16.1.3 OUT Data Phase



Figure 63. OUT Data Phase Transaction



#### Figure 64. Status Phase Following OUT Data Phase Transaction

#### 15.16.2 Bulk/Interrupt Transactions

#### 15.16.2.1 IN Transactions



Figure 65. Bulk/Interrupt IN Transaction

#### 15.16.2.2 OUT Transaction



Figure 66. Bulk OUT Transaction

## 15.16.3 Full-speed Low-bandwidth Isochronous Transaction

## 15.16.3.1 IN Transaction



Figure 67. Full-speed Isochronous IN Transaction

#### 15.16.3.2 OUT Transaction



Figure 68. Full-speed Isochronous OUT Transaction

#### 15.16.4 High-bandwidth Isochronous Transactions

### 15.16.4.1 IN Transaction



Figure 69. High-bandwidth Isochronous IN Transaction

#### 15.16.4.2 OUT Transaction



Figure 70. High-bandwidth Isochronous OUT Transaction

#### 15.17 Test Modes

The USB Controller supports the four USB 2.0 test modes defined for High-speed functions. The test modes are entered by writing to the TestMode register (address 0Fh). A test mode is usually requested by the host sending a SET\_FEATURE request to Endpoint 0. When the software receives the request, it should wait until the Endpoint 0 transfer has completed (when it receives the Endpoint 0 interrupt indicating that the status phase has completed) then write to the TestMode register.

NOTE These test modes have no purpose in normal operation.

#### 15.17.1 Test Mode Test\_SE0\_NAK

To enter the Test\_SE0\_NAK test mode, the software should set the Test\_SE0\_NAK bit by writing 6'h01 to the TestMode register. The USB Controller will then go into a mode in which it responds to any valid IN token with a NAK.

#### 15.17.2 Test Mode Test Test\_J

To enter the Test\_J test mode, the software should set the Test\_J bit by writing 6'h02 to the TestMode register. The USB Controller will then go into a mode in which it transmits a continuous J on the bus.

#### 15.17.3 Test Mode Test\_K

To enter the Test\_K test mode, the software should set the Test\_K bit by writing 6'h04 to the TestMode register. The USB Controller will then go into a mode in which it transmits a continuous K on the bus.

#### 15.17.4 Test Mode Test\_Packet

To execute the Test\_Packet test, the software should first write the standard test packet (shown below) to the Endpoint 0 FIFO and set the InPktRdy bit in the CSR0 register (D1). It should then write 6'h08 to the TestMode register to enter Test\_Packet test mode.

The 53 byte test packet to load is as follows (all bytes in hex). The test packet only has to be loaded once, the USB Controller will keep re-sending the test packet without any further intervention from the software.

00 00 00 00 00 00 00 00

AA AA AA AA AA AA AA

AA EE EE EE EE EE EE EE

EE FE FF FF FF FF FF FF

FF FF FF FF FF 7F BF DF

EF F7 FB FD FC 7E BF DF

EF F7 FB FD 7E

This data sequence is defined in Universal Serial Bus Specification Revision 2.0, Section 7.1.20. The USB Controller will add the DATA0 PID to the head of the data sequence and the CRC to the end.

## **16. Secure Digital Input Output (SDIO)**

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.





## 16.1 Functional Description

The SD/SDIO Host Controller, or referred here as just SD Host Controller, interfaces to the system bus and has the components and functionality described in the following sub-sections.

## 16.1.1 Host Controller Register Set

The Host Controller register set implements the registers defined by the SD Host Controller Specification (Version 3.00). The registers are byte/DWORD accessible from the Target Interface.

The Host Controller register set also implements the Data Port Registers for the PIO Mode transfers. The register set provides the control signals to rest of the blocks and monitors the status signals from the blocks to set interrupt status bits and eventually generate an interrupt signal to the host bus. The Host Controller register set acts as the bridge between host CPU and Host Controller. The SD/SDIO controller registers are programmed by the host processor through the host (AHB) target interface. Interrupts are generated to the host processor based on the values set in the interrupt status register and interrupt enable registers.

#### 16.1.2 PIO/DMA Controller

The PIO/DMA Controller module implements the SDMA and ADMA2 engines as defined in the SD Host Controller Specification and maintains the block transfer counts for PIO operation. It interacts with the registers set and starts the DMA engine when a command with data transfer is involved. The DMA Controller interfaces to the host (AHB) master module to generate transfers. On the other side it interfaces with the block buffer to store/fetch block data. The DMA Controller implements a separate DMA for SDMA operation and separate DMA for the ADMA2 operation. In addition it implements the Host Transaction Generator that generates controls for the Host Master Interface module.

#### 16.1.3 Block Buffer

The Host Controller uses a dual-port block buffer (read/write on both ports) that is used to store the block data during SD transfers. The size of the block buffer is 2 kB. This allows support for a 512-byte block size for SD/eMMC and up to a 1 kB block size for SDIO implementations. The block buffer uses circular buffer architecture. One side of the block buffer is interfaced with the DMA controller and operates on the host clock. The other side of the block buffer interfaces with SD control logic and operates on the SD clock. During a write transaction (data transferred from the SoC to the SD3.0 / SDIO3.0 / eMMC4.51 card), the data is fetched from the system memory and is stored in the block buffer. When a block of data is available, the SD control logic will transfer it onto the SD interface. The DMA controller continues to fetch additional blocks of data when the block buffer has space. During a read transaction (data transferred from SD3.0 / SDIO3.0 / eMMC4.51 card to the host controller), the data from the SD3.0 / SDIO3.0 / eMMC4.51 card will be written to the block buffer and at the end, when the CRC of the block is valid, the data is committed. When a block of data is available, then the DMA controller transfers this data to the system memory. The SD control logic meanwhile receives the next block of data provided there is space in the block buffer. If the Host Controller cannot accept any data from the SD3.0 / SDIO3.0 / eMMC4.51 card, then it will issue read wait (if the card supports a read wait mechanism) to stop the data transfer from the card or by stopping the clock.

NOTE

FIFO depth can be varied using parameter passed to the core using the 'dot parameter instantiation'.

NOTE

When the Block Buffer size is twice that of the block size, the block buffer behaves like a ping-pong buffer.

#### 16.1.4 SD Clock Generation

The SD Clock Generator module generates the SD clock from the reference clock (xin\_clk), based on the controls programmed in the Clock Control Register. These include the Clock Divide Value, SD Clock Enable etc. The outputs from this module are the SD\_CLK and the SD\_CARD Clock. The SD\_CLK is used by the SD control logic and the SD\_CARD clock connected to the "CLK" pin on the SD interface. This module also generates system resets to various clock domains.

#### 16.1.5 SD Card Detect

The SD Card detect logic monitors the SD\_CD# pin for card insertion/removal events. It implements debouncing logic to filter the false transitions on the SD\_CD# pin. Card insertion and removal events are reported in the SD host register set from which the interrupt is eventually generated.

#### 16.1.6 SD Timeout Control

The SD timeout control logic implements a timeout check between block transfers. It uses the contents of the Timeout Control Register to implement a timeout between blocks. This module operates under the control of the Transmit Control and Receive Control modules (based on direction). When a timeout is detected, the event is reported to Transmit Control or Receive Control module.

#### 16.1.7 SD Command Control

The SD Command Control module generates a command sequence on the CMD line of the SD interface for every new command programmed by the software. The Command Control module also implements the

response reception and checks the validity of the response. It uses the response type field to determine the length of the response and the presence of the CRC7 field. The response is received on the receive clock, which is either the looped back clock or the tuned clock. Once the response is received, the contents of the response (start bit, command Index, CRC7 and end bit) are verified and the response status is captured in various status bits in the module's register set. It also implements a timeout check upon response reception to make sure that the response is received within the specified time (5 or 64 clocks based on command type). The received response is then stored in appropriate fields in the Response Register. The SD Command Control module generates controls in the SD Transmit Control module and SD Receive Control module based on transfer direction. The SD Command Control module also generates an auto command (AutoCMD12 or AutoCMD23) when enabled.

#### 16.1.8 SD Transmit Control

The SD Transmit Control module is used for write transfers when transferring data to the card. Once the command is issued, this module waits for the block of data to be available in the block buffer and transfers this on to the SD's DATn line(s). Based on the configuration of the data lines (1-bit, 4-bits or 8-bits), the data from the block buffer is appropriately routed. The CRC16 is individually calculated on a per-lane basis and is attached at the end of the block transfer before the END bit. In the case of DDR operation, it implements separate CRC16 for each edge of the clock. At the end of block transfer, it waits for the CRC response on the DAT0 line and the result of the CRC check is stored in the CRC status register field.

This module also checks for a Write Busy indication (DAT0 Line) before transferring the next block of data. A timeout check is implemented to make sure that the Write Busy is asserted before the specified time limit.

#### 16.1.9 SD Receive Control

The SD Receive Control module is used for read transfers when receiving data from the card. Once the command is issued, this module waits for the block of data to be received from the card. Based on the configuration of the data lines (1-bit, 4-bits or 8-bits), the data from the SD interface is assembled into byte, and eventually in 32-bit word, format before it is written to the block buffer. The CRC16 is individually calculated on a per-lane basis and is compared to the received CRC16 at the end of the block transfer before the END Bit. In the case of DDR operation, it implements separate CRC16 checks for each edge of the clock.

The data is received on the receive clock. This receive clock is either the looped-back clock (SDCARD\_CLK from the IO\_BUF) or the tuned clock using DLL or DLY elements. A timeout check is performed to make sure that the gap between received blocks does not exceed the specified time limit.

#### 16.1.10 SD Tuning Block

The SD tuning block is used for SDR104 or SDR50 (optionally when enabled) and eMMC Hs200 modes to tune the receive clock. The tuning block generates the delay controls in the external Delay Controller module. The tuning modules receive the 64-byte tuning block (SD mode) or 128-byte tuning block (eMMC mode) and maintains a tuning vector to determine the optimal delay. The tuning block can be configured with a number of delay taps (maximum of 32). Using this the tuning block performs the tuning and selects the optimal tap point for the receive clock.

#### 16.1.11 SD Intf Control

The SD interface control block maps the internal signals to the external SD Interface and vice versa. Based on the bus width (1-, 4- or 8-bit) the internal signals are driven out appropriately. In the case of DDR operation, the outputs are driven on the negative edge of the SD\_CLK.

The inputs from RxFlops module are latched on the RX\_CLK (looped-back or tuned clock) and is output to the Receive Control module for further processing.

## 16.2 Clocks

The Host Controller has the following clocks:

**xin\_clk** (external input) - This is the main clock used on the SD interface side of the Host Controller. This clock input is used to generate the SD clock based on the divisor value programmed by the host driver. To support SDR50, this clock needs to be about 100 MHz (use 96 MHz HFRC clock).

**ahb\_clk** (external input, commonly called host\_clk) - host bus clock. Based on the selected host interface, the appropriate clock is used. The host bus interface modules (AHB master/target modules), the SD register set and the DMA controller operate on this clock.

**sd\_clk** (internally generated) - This clock is derived from xin\_clk based on the clock control register value programmed by the driver. This clock is used by most of the SD control logic (SD command control, SD transmit control, SD tuning module and the other side of the block buffer).

**sdcard\_clk** - same as sd\_clk, except that this is available only when SD clock enable is set by the driver. This is the clock supplied to the SD card. The Host Controller supports both full speed and high speed cards. For high speed cards, the Host Controller should clock out the data on the rising edge of clk\_sd and for full speed cards, the Host Controller should clock out the data on the falling edge of clk\_sd. During read transactions, when the read FIFOs are full and there is no space to accept one block of data from the card, the host controller will halt the transaction. This will stop the clock to the card so an overrun condition will not occur on the SD side.

**sdcard\_clk\_dly** - This is the delayed version of the sd\_clk. This clock is used to flop the SD outputs to provide hold time on the output pins.

**rxclk\_in** - This is the looped-back clock from the sdcard\_clk. This will account for the delay on the clock being driven onto the SD Interface through the chip and the IO pad.

**rx\_clk** - This is selected from the rxclk\_in (after manual tap delay) or the tuned clock (auto tap delay). The response receive logic in the command control and the SD receive controller operates on this clock. The received data is assembled into bytes and is synchronized to sd\_clk before writing into the block buffer.

#### 16.2.1 SD clocking Architecture

Figure 72 shows the architecture of clocks used in the interface part of the SD Controller. The Primary Clock input **xin\_clk** is the main clock used to generate various derived clocks.



Figure 72. SDIO Module Clock Derivation

## 16.3 Advanced DMA

The SD Host Controller Standard Specification version 2.00 defines the ADMA (Advanced DMA) transfer algorithm. The DMA algorithm defined in the SD Host Controller Standard Specification version 1.00 is called SDMA (Single-operation DMA). SDMA has a disadvantage that a DMA interrupt is generated at every page boundary which requires the CPU to reprogram a new system address. This SDMA algorithm creates a performance bottleneck when interrupting at every page boundary. ADMA adopts a scatter-gather DMA algorithm so that higher data transfer speed is achievable. The CPU can program a list of data transfers between system memory and SD card in the descriptor table before executing ADMA. This enables ADMA to operate without interrupting the CPU. Furthermore, ADMA can support not only32-bit system memory addressing but also 64-bit system memory addressing (the 32-bit system memory addressing uses the lower 32-bit field of 64-bit address registers).

There are two types of ADMA - ADMA1 and ADMA2. ADMA1 supports data transfer of only 4KB-aligned data in system memory. ADMA2 improves upon this restriction so that data of any location and any size can be transferred in system memory. The format of the descriptor table is different between them. Version 2.00 of the SD Host Controller specification defines ADMA2 as standard ADMA and recommends supporting ADMA2 rather than ADMA1. DMA mode ADMA1 is not supported in versions 3.0 of the specification and later. When the term "ADMA" is used in this document, it means ADMA2.

#### 16.3.1 Block Diagram of ADMA2

Figure 73 shows the ADMA2 algorithm block diagram.



#### Figure 73. ADMA2 Block Diagram

A 32-bit addressing Descriptor Table is created in system memory by the CPU. Each descriptor line (one executable unit) consists of address, length and attribute fields. The attribute specifies operation of the descriptor line. ADMA2 includes blocks for SDMA, state machine and registers. ADMA2 does not use 32-bit SDMA System Address Register (offset 0) but uses the 64-bit Advanced DMA System Address register (offset 058h) for descriptor pointer. Writing to the Command Register triggers off an ADMA2 transfer.

ADMA2 fetches one descriptor line and executes it. This procedure is repeated until the end of the descriptor is reached (End=1 in attribute).

#### 16.3.2 An Example of ADMA2 Programming

The CPU forms the Descriptor Table when each slice is placed somewhere in contiguous system memory. It describes each descriptor with a set of address, length and attributes fields. Each sliced data is transferred in turn as programmed in the descriptor, as shown in Figure 74.



Figure 74. Example Descriptor Table

#### 16.3.3 Data Address and Data Length Requirements

There are 3 requirements when programing the descriptor.

- 1. The minimum unit of address is 4 bytes.
- 2. The maximum data length of each descriptor line is less than 64KB.
- 3. Total Length = Length 1 + Length 2 + Length 3 + ... + Length n = multiple of Block Size.

If the total length of a descriptor is not a multiple of block size, an ADMA2 transfer might not be terminated. In this case, the transfer should be aborted by a data timeout.

The Block Count register limits the maximum blocks transferred to 65535 blocks. If an ADMA2 operation is less than or equal to 65535 blocks transferred, then the Block Count register can be used. In this case, total length of the Descriptor Table is equal to the block size multiplied by the block count. If an ADMA2 operation is more than 65535 blocks transferred, then the Block Count Register is disabled by setting Block Count Enable to 0 in the Transfer Mode Register. In this case, the length of the data transfer is not designated by block count but by the Descriptor Table. Therefore, the timing of detecting the last block on the SD bus may be different and it affects the control of Read Transfer Active, Write Transfer Active and DAT line Active in the Present State register. In case of a read operation, several blocks may be read more than required. The CPU should ignore an out of range error if the read operation is for the last block of memory area.

#### 16.3.4 Descriptor Table

Figure 75 shows a 32-bit addressing Descriptor Table.

| Addre  | ss Field | Lei    | ngth                                                          | Rese    | erved       |                                                                                                                                            |                        | Attri                  | bute      |                        |                          |                     |
|--------|----------|--------|---------------------------------------------------------------|---------|-------------|--------------------------------------------------------------------------------------------------------------------------------------------|------------------------|------------------------|-----------|------------------------|--------------------------|---------------------|
| 63     | 32       | 2 31   | 16                                                            | 15      | 06          | 05                                                                                                                                         | 04                     | 03                     | 02        | 01                     | 00                       |                     |
| 32-bit | Address  | 16-bit | Length                                                        | 000     | 000         | Act2                                                                                                                                       | Act1                   | 0                      | Int       | End                    | Valid                    |                     |
|        |          |        |                                                               |         |             |                                                                                                                                            |                        |                        |           |                        |                          |                     |
|        |          |        |                                                               | Valid   | Vali<br>ADI | id=1 india<br>MA Error                                                                                                                     | ates this<br>Interrupt | line of de<br>and stop | SCRIPTOR  | is effecti<br>o prever | ve. If Vali<br>it runawa | id=0 generate<br>y. |
| /      |          |        |                                                               | End     | Enc         | End=1 indicates end of descriptor. The Transfer Complete Interrupt is<br>generated when the operation of the descriptor line is completed. |                        |                        |           |                        |                          |                     |
| 1      |          |        |                                                               | Int     | INT<br>line | =1 gener<br>is compl                                                                                                                       | rates DM<br>leted.     | A Interrup             | ot when t | he opera               | ition of th              | e descriptor        |
|        | _        |        |                                                               |         |             |                                                                                                                                            |                        |                        |           |                        |                          |                     |
| Act2   | Act1     | Symbol | Com                                                           | ment    |             |                                                                                                                                            |                        | Opera                  | ation     |                        |                          |                     |
| 0      | 0        | Nop    | No Operation Do not execute current line and go to next line. |         |             |                                                                                                                                            |                        |                        |           |                        |                          |                     |
| 0      | 1        | rsv    | reserved (Same                                                |         |             | e as Nop. Do not execute current line and go to next line.)                                                                                |                        |                        |           | line.)                 |                          |                     |
| 1      | 0        | Tran   | Transfe                                                       | er Data | Trans       | nsfer data of one descriptor line                                                                                                          |                        |                        |           |                        |                          |                     |
| 1      | 1        | Link   | Link Descriptor Link to another descriptor                    |         |             |                                                                                                                                            |                        |                        |           |                        |                          |                     |

Figure 75. 32-bit Addressing Descriptor Table

One descriptor line consumes a 64-bit (8-byte) memory space. Attributes are used to specify an action in the control descriptor. Three separate actions are specified. A "Nop" operation skips the current descriptor line and fetches the next one. A "Tran" operation transfers data designated by the address and length fields. A "Link" operation is used to connect two separate descriptors. The address field of the link operation points to the next Descriptor Table. The combination of Act2=0 and Act1=1 is reserved and performs the same operation as Nop. A future version of the controller may use this field and define a new operation. A 32-bit address is stored in the lower 32 bits of a 64-bit address register. The Address field should be set on a 32-bit boundary (lower 2 bits are always set to 0) for a 32-bit address descriptor table.

Figure 76 below shows the definition of the length field in the Descriptor Table.

| Length Field | Value Of Length |
|--------------|-----------------|
| 0000h        | 65536bytes      |
| 0001h        | 1byte           |
| 0002h        | 2bytes          |
|              |                 |
| FFFFh        | 65535bytes      |

Figure 76. Descriptor Table Length Field Definitions

## **16.4 Driver flow sequence**

The following sub-sections describe the flow sequence for three types of SDIO transactions:

- 1. Non DMA data transaction
- 2. DMA data transaction
- 3. ADMA data transaction

#### 16.4.1 Non-DMA Transaction

The sequence for transactions when not using DMA (non-DMA) is shown in Figure 77 and the steps are listed in Table 94.





| Step | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
|------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 1    | Set the value corresponding to the executed data byte length of one block to Block Size register.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
| 2    | Set the value corresponding to the executed data block count to Block Count register.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
| 3    | Set the value corresponding to the issued command to Argument register.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
| 4    | Set the value to Multi / Single Block Select and Block Count Enable. Set the value corresponding to the issued command to Data Transfer Direction, Auto CMD12 Enable and DMA Enable.                                                                                                                                                                                                                                                                                                                                                                                                                                 |
| 5    | Set the value corresponding to the issued command to Command register.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
|      | When writing the upper byte of Command register, SD command is issued.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
| 6    | Wait for the Command Complete Interrupt                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
| 7    | Write 1 to the Command Complete in the Normal Interrupt Status register for clearing this bit.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |
| 8    | Read Response register and get necessary information in accordance with the issued command.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
| 9    | In the Case Where this sequence is for write to a card, go to step (10-W). In case of read from a card, go to step (10-R).                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| 10-W | Wait for Buffer Write Ready Interrupt.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
|      | Non DMA Write Transfer                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
|      | On receiving the Buffer Write Ready interrupt the ARM processor will act as a master and start transferring the data via Buffer data port register (fifo_1). Transmitter starts sending the data in SD bus when a block of data is ready in fifo_1. While transmitting the data in sd bus the buffer write ready interrupt is sent to the ARM Processor for the second block of data. The ARM processor will act as a master and start sending the second block of data via Buffer data port register to fifo_2. Buffer write ready interrupt will be asserted only when a fifo is empty to receive a block of data. |
| 11-W | Write 1 to the Buffer Write Ready in the Normal Interrupt Status register for clearing this bit.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
| 12-W | Write block data (in according to the number of bytes specified at the step (1)) to Buffer Data Port register.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |
| 13-W | Repeat until all blocks are sent and then go to step (14).                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| 10-R | Non DMA Read Transfer                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
|      | Buffer Read Ready interrupt is asserted whenever a block of data is ready in one of the fifo's. On receiving the Buffer Read Ready interrupt the ARM processor will act as a master and start read-<br>ing the data via Buffer data port register (fifo_1). Receiver start reading the data from SD bus only when a fifo is empty to receive a block of data. When both the fifo's are full the host controller will stop the data coming from the card through read wait mechanism (if card supports read wait) or through clock stopping.<br>Wait for Buffer Read Ready Interrupt                                  |
| 11-R | Write 1 to the Buffer Read Ready in the Normal Interrupt Status register for clearing this bit.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |
| 12-R | Read block data (in according to the number of bytes specified at the step (1)) from the Buffer Data Port register.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
| 13-R | Repeat until all blocks are received and then go to step (14).                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |
| 14   | If this sequence is for Single or Multiple Block Transfer, go to step (15). In case of Infinite Block Transfer, go to step (17).                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
| 15   | Wait for Transfer Complete Interrupt.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
| 16   | Write 1 to the Transfer Complete in the Normal Interrupt Status register for clearing this bit.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |
| 17   | Perform the sequence for Abort Transaction.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |

Table 94: Non-DMA Transfer Flow Sequence Steps

Note: Step (1) and (2) can be executed at same time. Step (4) and (5) can be executed at same time.

#### 16.4.2 DMA Transaction

The burst types like 8-beat incrementing burst or 4-beat incrementing burst or single transfer is used to transfer or receive the data from system memory mainly to avoid the hold of the Host/System bus by the master for a longer time.

The sequence for using DMA is shown in Figure 78 and the flow sequence steps are listed in Table 95.





| Table 95: DM | A Transfer Flow | w Sequence Steps |
|--------------|-----------------|------------------|
|--------------|-----------------|------------------|

| Step | Description                                                                                                                                                                          |
|------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 1    | Set the system address for DMA in the System Address register.                                                                                                                       |
| 2    | Set the value corresponding to the executed data byte length of one block in the Block Size register.                                                                                |
| 3    | Set the value corresponding to the executed data block count in the Block Count register.                                                                                            |
| 4    | Set the value corresponding to the issued command to Argument register.                                                                                                              |
| 5    | Set the value to Multi / Single Block Select and Block Count Enable. Set the value corresponding to the issued command to Data Transfer Direction, Auto CMD12 Enable and DMA Enable. |

| Step | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
|------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 6    | Set the value corresponding to the issued command to Command register.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
|      | When writing the upper byte of Command register, SD command is issued.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
| 7    | Wait for the Command Complete Interrupt                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
| 8    | Write 1 to the Command Complete in the Normal Interrupt Status register for clearing this bit.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |
| 9    | Read Response register and get necessary information in accordance with the issued command.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
|      | DMA read transfer                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
|      | On receiving the response end bit from the card for the write command (data flowing from Host to Card) the SD Host controller will act as the master and request the System/Host bus. After receiving the grant the host controller will start reading a block of data from the system memory and fills the first fifo. Whenever a block of data is ready the transmitter will start sending the data in SD bus. While transmitting the data in SD bus the host controller requests the bus to fill the second block in second fifo. Ping Pong fifo's are used to increase the throughput. Similarly the host controller reads a block of data from the system memory. Transfer complete Interrupt will be set only after transferring all the blocks of data to the card. |
|      | DMA write transfer                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                         |
|      | The block of data received from the Card (data flowing from Card to Host) is stored in first half of the fifo. Whenever a block of data is ready the SD Host controller will act as the master and request the System/Host bus. After receiving the grant the host controller will start writing a block of data into the system memory from the first fifo.                                                                                                                                                                                                                                                                                                                                                                                                               |
|      | While transmitting the data into System memory the host controller will receive the second block of data and store in second fifo. Similarly the host controller write a block of data into the system memory whenever data is ready. This will continue till all the blocks are transferred to the System memory. Transfer complete Interrupt will be set only after transferring all the blocks of data to the System memory.                                                                                                                                                                                                                                                                                                                                            |
|      | Note: Host controller will receive a block of data from the card only when it has room to store a block of data in fifo. When both the fifo's are full the host controller will stop the data coming from the card through read wait mechanism (if card supports read wait) or through clock stopping.                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
| 10   | Wait for the Transfer Complete Interrupt and DMA Interrupt.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
| 11   | If Transfer Complete is set 1, go to Step(4) else if DMA Interrupt is set to 1, go to Step(12). Transfer Complete is higher priority than DMA Interrupt.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |
| 12   | Write 1 to the DMA Interrupt in the Normal Interrupt Status register to clear this bit.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
| 13   | Set the next system address of the next data position to the System Address register and go to Step (10).                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
| 14   | Write 1 to the Transfer Complete and DMA Interrupt in the Normal Interrupt Status register to clear this bit.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |

# Note: Step (2) and Step (3) can be executed at same time. Step (5) and Step (6) can also be executed at same time.

As an example, if the host wants to transfer 4 kB of data to the card, assuming a maximum block size of 512 bytes, then the host driver will program the Block Size Register as 512 and the Block Count Register with the value 8. The master and transmitter residing inside the Host Controller will get the information (how much data to transfer) from these registers. Using the above information, the master will initiate a data read transaction (to read a block of data - 512 bytes from system memory). Whenever a block of data is ready in the FIFO, the transmitter will start transmitting the block of data (512) on the SD bus. After transmitting the entire block of data to the card, the transmitter will wait for a status response from the card. The transmitter will send the next block of data only after it receives a good status response from the card for the previous block of data, otherwise the transaction will be aborted and the host will go for a fresh transaction.

## 16.4.3 ADMA Transactions

The sequence for using DMA is shown in Figure 78 and the flow sequence steps are listed in Table 95.



Figure 79. ADMA Transaction Flow Sequence

| Step | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
|------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 1    | Create Descriptor table for ADMA in the system memory                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
| 2    | Set the Descriptor address for ADMA in the ADMA System Address register.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
| 3    | Set the value corresponding to the executed data byte length of one block in the Block Size.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
| 4    | Set the value corresponding to the executed data block count in the Block Count register. If the Block Count Enable in the Transfer Mode register is set to 1, total data length can be designated by the Block Count register and the Descriptor Table. These two parameters shall indicate same data length. However, transfer length is limited by the 16-bit Block Count register. If the Block Count Enable in the Transfer Mode register is set to 0, total data length is designated by not Block Count register but the Descriptor Table. In this case, ADMA reads more data than length programmed in descriptor from SD card. Too much read operation is aborted asynchronously and extra read data is discarded when the ADMA is completed |
| 5    | Set the argument value to the Argument register.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |
| 6    | Set the value to the Transfer Mode register. The host driver determines Multi / Single Block Select, Block Count Enable, Data Transfer Direction, Auto CMD12 Enable and DMA Enable.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |
| 7    | Set the value to the Command register.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
|      | Note: When writing to the upper byte [3] of the Command register, the SD com-<br>mand is issued and DMA is started.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |
| 8    | Wait for the Command Complete Interrupt.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
| 9    | Write 1 to the Command Complete in the Normal Interrupt Status register to clear this bit.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |
| 10   | Read Response register and get necessary information of the issued command.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| 11   | Wait for the Transfer Complete Interrupt and ADMA Error Interrupt.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
| 12   | If Transfer Complete is set 1, go to Step (13) else if ADMA Error Interrupt is set to 1, go to Step                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |
| 13   | Write 1 to the Transfer Complete Status in the Normal Interrupt Status register to clear this bit.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
| 14   | Write 1 to the ADMA Error Interrupt Status in the Error Interrupt Status register to clear this bit.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
| 15   | Abort ADMA operation. SD card operation should be stopped by issuing abort command. If necessary, the host driver checks ADMA Error Status register to detect why ADMA error is generated.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |

Note: Step (3) and Step (4) can be executed simultaneously. Step (6) and Step (7) can also be executed simultaneously.

## 16.5 Using the AmbiqSuite SDK to Program and Use the SDIO Module

In order to access the eMMC card, follow the steps listed below.

```
    Define a global SD card host pointer
```

```
am_hal_card_host_t *pSdhcCardHost = NULL;
```

#### Configure the SDIO pins

1-bit, 4-bit and 8-bit bus widths are supported. Set the bus width appropriately for the hardware design.

```
//
// Configure SDIO PINs.
//
am_bsp_sdio_pins_enable(AM_HAL_HOST_BUS_WIDTH_8);
```

#### Find and initialize the SD card host

Currently only one SD card host is supported, so always use 'AM\_HAL\_SDHC\_CARD\_HOST', 'true' parameter. This forces the SD card host to reinitialize from scratch. Using 'false' will not force the SD card host to do reinitialization.

```
//
// Get the uderlying SDHC card host instance
//
pSdhcCardHost = am_hal_get_card_host(AM_HAL_SDHC_CARD_HOST, true);
if (pSdhcCardHost == NULL)
{
    am_util_stdio_printf("No such card host and stop\n");
    while(1);
}
am_util_stdio_printf("card host is found\n");
```

#### Define an eMMC card and check if it is present

Currently only the eMMC card type is supported.

```
am_hal_card_t eMMCard;
//
// check if card is present
//
while (am_hal_card_host_find_card(pSdhcCardHost, &eMMCard) != AM_HAL_STATUS_SUCCESS)
{
    am_util_stdio_printf("No card is present now\n");
    am_util_delay_ms(1000);
    am_util_stdio_printf("Checking if card is available again\n");
}
```

#### Initialize the eMMC card

Optionally, the user can provide a card power control callback function during card initialization, as well as establish a card "power off" policy. Two policies are defined - one powers off only the SDHC controller and the other powers off both the SDHC controller and the eMMC card.

After initialization, the eMMC card is running with the default setting - 1-bit bus width and lowest clock speed

#### Configure the eMMC card

Configure the bus width, bus speed, bus voltage and eMMC bus mode according to the hardware design and eMMC chip.

#### Perform synchronous block read or block write

A synchronous block read or block write function needs to specify the start sector (or block) and sector (block) numbers.
The sector (or block) size is 512 bytes.

The read & write buffer size should be **512\*BLK\_NUM**.

The return value, **ui32Status**, includes the successful transferred sector (or block) number in the upper 16 bits and the transfer status in the lower 16 bits.

```
ui32Status = am_hal_card_block_read_sync(&eMMCard, START_BLK, BLK_NUM, (uint8_t *)ui8RdBuf);
am_util_stdio_printf("Synchronous Reading %d blocks is done, Xfer Status %d\n", ui32Status >> 16, ui32Status & 0xffff);
ui32Status = am_hal_card_block_write_sync(&eMMCard, START_BLK, BLK_NUM, (uint8_t *)ui8WrBuf);
am_util_stdio_printf("Synchronous Writing %d blocks is done, Xfer Status %d\n", ui32Status >> 16, ui32Status & 0xffff);
```

#### Perform asynchronous block read or block write

An asynchronous block read or block write needs to define an SDIO interrupt service routine and a register callback function.

```
void am_sdio_isr(void)
{
    uint32_t ui32IntStatus;
    am_hal_sdhc_intr_status_get(pSdhcCardHost->pHandle, &ui32IntStatus, true);
    am_hal_sdhc_intr_status_clear(pSdhcCardHost->pHandle, ui32IntStatus);
    am_hal_sdhc_interrupt_service(pSdhcCardHost->pHandle, ui32IntStatus);
}
```

```
am_hal_card_register_evt_callback(&eMMCard, am_hal_card_event_test_cb);
```

```
void am_hal_card_event_test_cb(am_hal_host_evt_t *pEvt)
 {
     am_hal_card_host_t *pHost = (am_hal_card_host_t *)pEvt->pCtx;
     if (AM_HAL_EVT_XFER_COMPLETE == pEvt->eType &&
        pHost->AsyncCmdData.dir == AM_HAL_DATA_DIR_READ)
     {
        bAsyncReadIsDone = true;
        am_util_stdio_printf("Last Read Xfered block %d\n", pEvt->ui32BlkCnt);
    }
     if (AM HAL EVT XFER COMPLETE == pEvt->eType &&
        pHost->AsyncCmdData.dir == AM_HAL_DATA_DIR_WRITE)
     {
        bAsyncWriteIsDone = true;
        am_util_stdio_printf("Last Write Xfered block %d\n", pEvt->ui32BlkCnt);
     }
    if (AM_HAL_EVT_SDMA_DONE == pEvt->eType &&
        pHost->AsyncCmdData.dir == AM_HAL_DATA_DIR_READ)
     {
        am_util_stdio_printf("SDMA Read Xfered block %d\n", pEvt->ui32BlkCnt);
     }
    if (AM_HAL_EVT_SDMA_DONE == pEvt->eType &&
        pHost->AsyncCmdData.dir == AM_HAL_DATA_DIR_WRITE)
     {
        am_util_stdio_printf("SDMA Write Xfered block %d\n", pEvt->ui32BlkCnt);
     }
    if (AM_HAL_EVT_DAT_ERR == pEvt->eType)
     {
L
        am_util_stdio_printf("Data error type %d\n", pHost->AsyncCmdData.eDataError);
    if (AM_HAL_EVT_CARD_PRESENT == pEvt->eType)
// Write 'BLK NUM' blocks to the eMMC flash
11
bAsyncWriteIsDone = false;
am_hal_card_block_write_async(&eMMCard, START_BLK, BLK_NUM, (uint8_t *)ui8WrBuf);
// Read 'BLK_NUM' blocks to the eMMC flash
11
bAsyncReadIsDone = false;
am_hal_card_block_read_async(&eMMCard, START_BLK, BLK_NUM, (uint8_t *)ui8RdBuf);
```

### 16.5.1 Calibration for 48 MHz SDR or DDR

Due to board layout and other factors, the highest specified data rate may require a timing calibration to assure robust communication. A calibration function is available which allows the user to determine the proper timing settings for the SDIO interface. Performing this calibration is recommended before using the block-oriented read or write functions.

In order to get more accurate Tx/Rx settings, a much larger number of sectors may be used for the calibration. Note that doing this may require more time for the calibration to determine optimum Tx/Rx timing settings.

After calling this function, the Tx/Rx settings will take effective immediately.

```
am_hal_card_emmc_calibrate(AM_HAL_HOST_UHS_DDR50, 48000000, AM_HAL_HOST_BUS_WIDTH_4,
    (uint8_t *)ui8WrBuf, START_BLK, 2, ui8TxRxDelays);
am_util_stdio_printf("SDIO TX delay - %d, RX Delay - %d\n", ui8TxRxDelays[0], ui8TxRxDelays[1]);
```

If the Tx/Rx settings are already known, then they can be installed directly by calling the below function.

```
uint8_t ui8TxRxDelays[2] = { 4, 5 };
am_hal_card_host_set_txrx_delay(pSdhcCardHost, ui8TxRxDelays);
```

# 17. Display Controller (DC)

The Display Controller (DC) is a dedicated hardware block allowing offload of display interface and control functions. The display controller is configured for 4-layer blending. The configuration register file controls the operation of the display controller.

Timing parameters are programmable and data are fetched for each layer by a dedicated DMA engine. Additional functions include:

- Gamma adjustment
- Dithering application

The DC can be configured to continuously refresh the display unit from the display buffer while the GPU accesses the memory. The refresh rate, resolution and color depth of the display determine the parameters of the DC.

The DC supports up to four layers sourced from different memory regions. Each layer can have separate color modes, alpha blending, and filtering attributes. The main control registers for each layer is the LAYERnMODE (n = 0-3) registers.

During the blending process, a translucent foreground color (current layer) with a background color (previous layer) are combined and a new blended color is produced. Foreground color translucency may range from completely transparent to completely opaque. If the foreground color is completely transparent, the blended color will be the background color and if the foreground color is completely opaque, the blended color will be the foreground color. When the translucency is somewhere in between, the blended color is computed as a weighted average of the foreground and background colors.

The DC supports palette color mapping and gamma correction per layer.

Dithering is the process of degrading the color image with a method that tries to produce better results than information truncation. Dithering is used to create the illusion of "color depth" in images with a limited color palette. In a dithered image, colors that are not available in the palette are approximated by a diffusion of colored pixels from within the available palette. The human eye perceives the diffusion as a mixture of the colors within it.

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

NOTE

The Apollo4 Lite SoC and Apollo4 Blue Lite SoC do not include a Display Controller module. This chapter and its content do not apply to these SoCs. The MSPI is used to interface to display panels from the Lite or Blue Lite SoC. DBI Type-B is not a supported display interface.

Please see the MSPI section for considerations when connecting to a display panel over MSPI instead of the Display Controller on other Apollo4 SoCs.

### 17.1 Software Support

The Display Controller is supported in both FreeRTOS and BareMetal (no operating system). A library of primitive graphics functions is available in pure ANSI C with no dependencies and is supported for FreeRTOS. The software package comes with a bit-accurate emulator and the NEMA® | PIX-Presso, a utility software for converting images to/from formats suitable for low power embedded devices.

#### 17.1.1 Supported Software

OS support

- FreeRTOS support
- No OS (BareMetal)
- Graphics API support
  BareMetal Library in portable ANSI C
  Software Emulators and suites
- DC API
- NEMA® PIX-Presso

## 17.2 Timing Generator

The DC continually refreshes the display unit from the display buffer while the GPU accesses the memory. The refresh rate, resolution and color depth of the display determine the parameters of the DC.

The Timing Generator is designed to be easily programmed using timing information in the same format as X11 Modeline definitions. Figure 80 shows how the parameters are defined.



Figure 80. Display Controller Timing

The Timing Generator requires timing parameters as specified in the equations below for the vertical and horizontal sections, where the horizontal timing is pixel clock based while the vertical timing is line based.

Equations 1-3 determine in pixel clock cycles, the X11 Modeline horizontal:

```
Front Porch (FRONTPORCHXY_FPCLKCYCLES field)

FRONTPORCHXY_FPCLKCYCLES = ResolutionX + FrontP orchX (1)
```

```
Blanking Period (BLANKINGXY_HSYNCPULSE field)
```

```
BLANKINGXY_HSYNCPULSE = ResolutionX + FrontPorchX + BlankingX (2)
```

```
Back Porch (BACKPORCHXY_BPCLKCYCLES field)
BACKPORCHXY_BPCLKCYCLES = ResolutionX + FrontP orchX + BlankingX + BackPorchX (3)
```

Equations 4-6 determine in number of lines, the X11 Modeline vertical:

Front Porch (FRONTPORCHXY\_FLINES field)
FRONTPORCHXY\_FLINES = ResolutionY + FrontPorchY(4)

Blanking Period (BLANKINGXY\_VSYNCLINES field) BLAN K\_XY\_VSYNCLINES = ResolutionY + FrontP orchY + BlankingY (5)

Back Porch (BACKPORCHXY\_BLINES field) BACKPORCHXY\_BLINES = ResolutionY + FrontP orchY + BlankingY + BackP orchY (6)

## 17.3 Layer Overlays

The Display Controller supports four layers (n= 0, 1, 2, 3) sourced from different memory regions. Each layer can have separate color modes, alpha blending, and filtering attributes. The main control registers for each layer is the LAYERnMODE (n = 0-3) registers.

Layering starts with a 32-bit background color (RGBA), as shown in Figure 81, which is applied on the entire screen. If it is not needed the background register BGCOLOR field is set to 0.



Figure 81. Display Controller RGBA Background Color

Layer 0 is applied on top of that with a requested blending method as shown in Figure 82. The background color can be used for blending and global values, such as alpha blending and palettes. If no layer is enabled (LAYERnMODE\_LAYERnEN = 0), only the background color will be displayed.





The base frame address is set in the LAYERnADDR registers. The number of bytes from one row of pixels to the next row of pixels in memory is called stride. These bytes affect how the image is stored in memory and the LAYERnSTRIDE\_LAYERnSTRIDEDIST bits are used to add the space between the frame lines.

For each layer, the start location (LAYERnSTARTXY), the visible size (LAYERnSIZEXY) and the resolution (LAYERnRESXY) are needed. For example, if the resolution of an image is 800x600 and the size of the required image for layer 0 is 400x300 at location (68, 32) then the value of the registers are shown in Table 97:

| Parameter                  | Register       | Register Field | Value |
|----------------------------|----------------|----------------|-------|
| Resolution X               | RESXY          | XRES           | 800   |
| Resolution Y               | RESXY          | YRES           | 600   |
| Start coordinate X         | LAYER0_STARTXY | LAYER0XOFF     | 68    |
| Start coordinate Y         | LAYER0_STARTXY | LAYER0YOFF     | 32    |
| Pixel size in X coordinate | LAYER0_SIZEXY  | LAYER0PIXSZEX  | 400   |
| Pixel size in y coordinate | LAYER0_SIZEXY  | LAYER0PIXSZEY  | 300   |

#### Table 97: Layer 0 Example

Each layer supports Alpha Blending (LAYERnMODE\_LAYERnDBLEND). If no background is desired, the start x,y coordinates can be placed on (0,0) and the size (LAYERnSIZEXY) can be equal to resolution (registers LAYERnRESXY).

The layers can have different resolutions depending on the layer needs. In addition, each layer has a choice of color output modes that are set by the LAYERnMODE\_LAYERnCOLMODE field. So, each layer can have different color formats. Figure 83 shows how the second layer is overlayed.







Figure 84. Display Controller Third and Fourth Layers

## 17.4 Blending Modes

During blending process, a translucent foreground color (current layer) with a background color (previous layer) are combined and a new blended color is produced. Foreground color's translucency may range from completely transparent to completely opaque. If the foreground color is completely transparent, the blended color will be the background color and if the foreground color is completely opaque, the blended color will be the foreground color. When the translucency ranges in between, the blended color is computed as a weighted average of the foreground and background colors.Each layer is blended on top of the previous generated blended layer based on the following equation:

$$c = c_{\mathsf{S}} * F_{\mathsf{S}} + c_{\mathsf{d}} * F_{\mathsf{d}}$$

Table 98 shows the blending modes the Display Controller supports:

| LAYERnDBLEND<br>Field Value | Blending Mode                             | F <sub>s</sub>     | $F_{d}$            |
|-----------------------------|-------------------------------------------|--------------------|--------------------|
| 0000                        | blend black                               | 0                  | 0                  |
| 0001                        | blend white                               | 1                  | 1                  |
| 0010                        | blend alpha source                        | a <sub>s</sub>     | a <sub>s</sub>     |
| 0011                        | blend alpha global                        | $a_{g}$            | $a_{g}$            |
| 0100                        | blend alpha source and alpha global       | $a_{s} * a_{g}$    | $a_{s} * a_{g}$    |
| 0101                        | blend inverted source                     | 1 - a <sub>s</sub> | $1 - a_s$          |
| 0110                        | blend inverted global                     | 1 – a <sub>g</sub> | $1 - a_g$          |
| 0111                        | blend inverted source and inverted global | $1 - (a_s * a_g)$  | $1 - (a_s * a_g)$  |
| 1010                        | blend alpha destination                   | $a_{d}$            | a <sub>d</sub>     |
| 1101                        | blend inverted destination                | 1 - a <sub>d</sub> | 1 - a <sub>d</sub> |

#### Table 98: Blending Modes

Blending is enabled when the LAYERnMODE\_LAYERnDBLEND field is set with the respective values of Table 98. The destination blending function refers to the current layer while the source blending function refers to the previous layer.

## 17.5 Palette/Gamma Correction

Palette/Gamma correction is enabled when the MODE\_GAMARAMPEN register bit is set. For Palette/ Gamma correction, the Color Look Up Table (LnLUT) memory must be programmed. The Global Palette/ Gamma register memory (GLLUT) is a 3x256x8 memory array that holds the RGB value for each of the 256 colors in the palette.

The same memories can be used to map RGB values to new RGB values to perform gamma correction. In this mode, the memory area containing the color data to the display will contain the LUT indexes instead of the actual color data. Then, the LUT maps these indexes to the color values contained in the palette registers before being sent to the LCD display. There are also optional LUT on a per layer basis.

Figure 85 shows that each RGB value is independently mapped to a new RGB value based on the content of the lookup tables. This also allows a 256-color palette as each gray value is mapped to a distinct color.



Figure 85. Gamma Correction of RGB Values

## 17.6 Dithering

Dithering is the process of degrading the color image with a method that tries to produce better results than information truncation.

Dithering is used to create the illusion of "color depth" in images with a limited color palette. In a dithered image, colors that are not available in the palette are approximated by a diffusion of colored pixels from within the available palette. The human eye perceives the diffusion as a mixture of the colors within it. Dithering is enabled by setting the MODE\_DITHEREN bit.



a) RGBA8888

b) RGBA4444 c) RGBA4444 with dithering Figure 86. Dithering on Limited Color Palette

### 17.7 Color Modes

The DC supports multiple color formats. The supported color formats are described in the following sections.

#### 17.7.1 Input Formats

#### 17.7.1.1 Binary BW 1-bit

Values range from 0 (white) to 1 (black)

 BW
 BW<

#### 17.7.1.2 Grayscale 4-bit

Values range from 0 (black) to 16 (white)

#### 

#### 17.7.1.3 Grayscale 8-bit

Values range from 0 (black) to 255 (white)

L7 L6 L5 L4 L3 L2 L1 L0 L7 L6 L5 L4 L3 L2 L1 L0

#### 17.7.1.4 Palette 256 8-bit (LUT8)

Values range from 0 to 255. These are looked-up from the Palette Color Table.

|    |                  |    | Pix | el 0 |    |    |    |
|----|------------------|----|-----|------|----|----|----|
| 7  | 6                | 5  | 4   | 3    | 2  | 1  | 0  |
| 17 | <mark>1</mark> 6 | 15 | 14  | 13   | 12 | 11 | 10 |
|    |                  |    | Byt | :e 0 |    |    |    |

#### 17.7.1.5 RGBA 8888 32-bits

**R7 R6 R5 R4 R3 R2 R1 R0 G7 G6 G5 G4 G3 G2 G1 G0 B7 B6 B5 B4 B3 B2 B1 B0 A7 A6 A5 A4 A3 A2 A1 A0** 

### 17.7.1.6 ARGB 8888 32-bit

A7 A6 A5 A4 A3 A2 A1 A0 **R7 R6 R5 R4 R3 R2 R1 R0 G7 G6 G5 G4 G3 G2 G1 G0 B7 B6 B5 B4 B3 B2 B1 B0** 

17.7.1.7 ABGR 8888 32-bit

 A7
 A6
 A5
 A4
 A3
 A2
 A1
 A0
 B7
 B6
 B5
 B4
 B3
 B2
 B1
 B0
 G7
 G6
 G5
 G4
 G3
 G2
 G1
 G0
 R7
 R6
 R3
 R2
 R1
 R0

17.7.1.8 BGRA 8888 32-bit

 B7
 B6
 B5
 B4
 B3
 B2
 B1
 B0
 G7
 G6
 G5
 G4
 G3
 G2
 G1
 G0
 R7
 R6
 R5
 R4
 R3
 R2
 R1
 R0
 A7
 A6
 A5
 A4
 A3
 A2
 A1
 A0

17.7.1.9 RGB 888 24-bit

 R6
 R5
 R4
 R3
 R2
 R1
 R0
 G7
 G6
 G5
 G4
 R3
 R2
 R1
 R0
 G7
 G6
 G5
 G4
 R3
 R2
 R1
 R0
 R1
 R1<

17.7.1.10 RGBA 5551 16-bit

 R4
 R3
 R2
 R1
 R0
 G4
 G3
 G2
 G1
 G0
 B4
 B3
 B2
 B1
 B0
 A0
 R4
 R3
 R2
 R1
 R0
 G4
 G3
 G2
 G1
 G0
 B4
 B3
 B2
 B1
 B0
 A0
 R4
 R3
 R2
 R1
 R0
 G4
 G3
 G2
 G1
 G0
 B4
 B3
 B2
 B1
 B0
 A0

17.7.1.11 RGB 565 16-bit

 R4
 R3
 R2
 R1
 R0
 G5
 G4
 G3
 G2
 G1
 G0
 B4
 B3
 B2
 B1
 B0
 R4
 R3
 R2
 R1
 R0
 G5
 G4
 G3
 G2
 G1
 G0
 B4
 B3
 B2
 B1
 B0
 R4
 R3
 R2
 R1
 R0
 G5
 G4
 G3
 G2
 G1
 G0
 B4
 B3
 B2
 B1
 B0

17.7.1.12 RGB 332 8-bit

 R2
 R1
 R0
 G2
 G1
 G0
 B1
 B0
 R2
 R1
 R0
 G2
 G1
 G0
 B1
 B0
 B1
 B0
 R2
 G1
 G0
 B1
 B0
 R2
 R1
 R0
 G2
 G1
 G0
 B1
 B0
 B1
 B0
 B1
 B0
 B1
 B1
 B1
 B1<

17.7.1.13 YUYV 32-bit 2-pixels

 Y06
 Y06
 Y04
 Y03
 Y02
 Y01
 Y00
 U7
 U6
 U5
 U4
 U3
 U2
 U1
 U0
 Y17
 Y16
 Y13
 Y12
 Y11
 Y10
 V7
 V6
 V5
 V4
 V3
 V2
 V1
 V0

### 17.7.1.14 YUY2 32-bit 2-pixels

U7 U6 U5 U4 U3 U2 U1 U0 Y07 Y06 Y05 Y04 Y03 Y02 Y01 Y00 V7 V6 V5 V4 V3 V2 V1 V0 Y17 Y16 Y15 Y14 Y13 Y12 Y11 Y10

### 17.7.1.15 YUV Packed

| Y37 | Y36 | Y35 | Y34 | Y33 | Y32 | Y31 | Y30 | Y27 | Y26 | Y25 | Y24 | Y23 | Y22 | Y21 | Y20 | Y17 | Y16 | Y15 | Y14 | Y13 | Y12 | Y11 | Y10 | Y07 | Y06 | Y05 | Y04 | Y03 | Y02 | Y01 | Y00 |
|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|
| Y77 | Y76 | Y75 | Y74 | Y73 | Y72 | Y71 | Y70 | Y67 | Y66 | Y65 | Y64 | Y63 | Y62 | Y61 | Y60 | Y57 | Y56 | Y55 | Y54 | Y53 | Y52 | Y51 | Y50 | Y47 | Y46 | Y45 | Y44 | Y43 | Y42 | Y41 | Y40 |
|     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |
|     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |
|     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |
| U17 | U16 | U15 | U14 | U13 | U12 | U11 | U10 | U07 | U06 | U05 | U04 | U03 | U02 | U01 | U00 | V17 | V16 | V15 | V14 | V13 | V12 | V11 | V10 | V07 | V06 | V05 | V04 | V03 | V02 | V01 | V00 |
| U37 | U36 | U35 | U34 | U33 | U32 | U31 | U30 | U27 | U26 | U25 | U24 | U23 | U22 | U21 | U20 | V37 | V36 | V35 | V34 | V33 | V32 | V31 | V30 | V27 | V26 | V25 | V24 | V23 | V22 | V21 | V20 |
|     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |
|     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |
|     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |     |

### 17.7.1.16 TSc4 16-pixels / 64-bits

| TS | c4 | TSc4 |
|----|----|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|
| TS | c4 | TSc4 |
| TS | c4 | TSc4 |
| TS | c4 | TSc4 |

### 17.7.1.17 TSc4 16-pixel / 96-bits

| _ |      |      |      |      |      |      |      |      |      | _    |      |      |      |      |      |      |      |      |      | _    |      |      |      |      |      |      |      |      |      |      |      |      |
|---|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|------|
|   | TSc6 |
|   | TSc6 |
|   | TSc6 |
|   | TSc6 |
|   | TSc6 |
|   | TSc6 |

### 17.7.1.18 TSc6A 16-pixels with Alpha / 96-bits

| TSc | ia ts | Sc6A | TSc6A |
|-----|-------|------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|
| TSc | ia ts | Sc6A | TSc6A |
| TSc | ia ts | Sc6A | TSc6A |
| TSc | ia ts | Sc6A | TSc6A |
| TSc | SA TS | Sc6A | TSc6A |
| TSc | ia ts | Sc6A | TSc6A |

### 17.7.1.19 Custom formats

Additional color formats can be easily supported upon our client's requests. Please do not hesitate to contact us for more information.

#### 17.7.1.20 Color expansion

The internal color format on the DC is always on 8-bit format. Therefore lower order color formats are expanded to 8-bits. This is achieved by high-order bit replication. For example, a format with just 5 bits on a color component is replicated as:

 $C[7:0] = \{C[4:0]; C[4:2]\}$ 

- 17.7.2 Output Formats
- 17.7.2.1 RGB 888 24-bit

R7 R6 R5 R4 R3 R2 R1 R0 G7 G6 G5 G4 G3 G2 G1 G0 B7 B6 B5 B4 B3 B2 B1 B0

17.7.2.2 RGB 666 18-bit

B5 Β4 B3 B2 R1 R0 G5 G4 G3 G2 G1 G0 Β1 B0 R5 R4 R3 R2

#### 17.7.2.3 RGB 565 16-bit

R4 R1 R0 G5 G4 G3 G2 G1 G0 Β4 B3 B2 B1 **B**0 R3 R2

#### 17.7.2.4 RGB 444 12-bit

| - R3 | 8 R2 | R1 | R0 | G3 | G2 | G1 | G0 | B3 | B2 | B1 | B0 |
|------|------|----|----|----|----|----|----|----|----|----|----|

17.7.2.5 RGB 332 8-bit



17.7.2.6 RGB 111 3-bit



## 17.7.2.7 Binary

Values range from 0 (black or white) to 1 (black or white)

LO

## 17.8 TSc Framebuffer Decompression

TS Framebuffer compression operates in screen blocks (4x4 pixel blocks) and, depending on the configuration, achieves TSC<sup>™</sup>4, TSC<sup>™</sup>6 and TSC<sup>™</sup>6a lossy, fixed-ratio compression.

- TSC<sup>™</sup>4 is a 6:1 compression (4bpp)
- TSC<sup>™</sup>6 is a 4:1 compression (6bpp)
- TSC<sup>™</sup>6a is a 4:1 compression (6bpp) with alpha channel

Compression is performed at run time using minimal hardware. Pixel data can be stored in the framebuffer in compressed form and decompressed in the DC. Figure 87 shows the TS compression operation. The output of the TSC<sup>™</sup>4 compression is 64 bits per 4x4 block of pixels and the output of the TSC<sup>™</sup>6 compression is 96 bits per 4x4 block of pixels.



Figure 87. FTSC™4 /TSC™6 Framebuffer Compression Module

## 17.9 Display Formats

The Display Controller is enabled, and display data format and interface configuration are selected by setting fields in the MODE register. The DISPFMT field in this register sets one of several data formats. The DBITYPEBEN field enables the DBI Type-B interface used for the Serial/Dual/Quad SPI, and the DC400ACT field enables the Display Controller.

When the DBITYPEBEN field is set to enable DBI Type-B interface, the DBICFG register is used to activate the interface, specify 3- or 4-wire SPI, DualSPI or QuadSPI, and set data width, data word order and color format.

NOTE Use of the DPI-2 interface is not supported and therefore is not described in this Display Formats section.

The following sections describe the display formats the DC supports.

To see an image from a frame buffer on a given TFT display at least one layer will need to be defined and set up using one or more LAYERn registers.

### 17.9.1 MIPI DBI-Type B (Display Bus Interface)

Internally the DC uses a MIPI DBI-Type B interface to the Display Serial Interface (DSI) module. The data width can be 8, 9 or 16 bits wide.

During a write cycle, the DC sends data to the DSI. WRX is driven from high to low then pulled back to high during the write cycle. The DC provides information during the write cycle while the DSI reads the information on the rising edge of WRX. DCX is driven low while command information is on the interface and is pulled high when data is present.

Figure 88 depicts the write sequence timing waveforms. Note that blue line indicates the high impedance state.





During a read cycle, the DC reads data from the DSI. RDX is driven from high to low then pulled back to high during the read cycle. The DSI provides information to the DC during the read cycle which is valid on the rising edge of RDX. DCX is driven high during the read cycle. Figure 89 depicts the read sequence timing waveforms. Note that blue line indicates the high impedance state.



Figure 89. DBI-Type B Read Sequence

DBI-Type B interface is enabled through the MODE\_DBITYPEBEN register bit. After the DBICFG\_CSXCFG bit is set, then the DBICFG\_CSXSET bit controls the setting of the CSX signal. Setting the DBICFG\_RESXLOW bit sets the RESX signal. The output data width for this interface is controlled by the TYPEBWIDTH field, and the color format by the DBICOLORFMT field. The CLKCTRL register is set accordingly to activate the DBIB\_CLK. The DBI\_CMD register controls the read/write commands from/to the DBI-Type B interface, and register DBI\_RDAT stores the value of data during read mode.

### 17.9.1.1 DBI-Type B Output Formats - 8bit Interface

The following figures show the color coding DBI-Type B Interface support when Data Bus (DBIB\_DB) is 8 bits wide.

### 17.9.1.1.1 RGB332, 8bits/pixel, 256 colors

| Data      | D7       | D6       | D5      | D4       | D3       | D2       | D1       | D0       |
|-----------|----------|----------|---------|----------|----------|----------|----------|----------|
| Pixel n   | R1, Bit2 | R1, Bit1 | R1 Bit0 | G1, Bit2 | G1, Bit1 | G1, Bit0 | B1, Bit1 | B1, Bit0 |
| Pixel n+1 | R2, Bit2 | R2, Bit1 | R2 Bit0 | G2, Bit2 | G2, Bit1 | G2, Bit0 | B2, Bit1 | B2, Bit0 |
| Pixel n+2 | R3, Bit2 | R3, Bit1 | R3 Bit0 | G3, Bit2 | G3, Bit1 | G3, Bit0 | B3, Bit1 | B3, Bit0 |

The Data order is as follows, MSB = D7, LSB = D0. Picture Data is MSB = Bit2, LSB =Bit0 for Red and Green data, MSB = Bit1, LSB = Bit0 for Blue data.

#### 17.9.1.1.2 RGB444, 12bits/pixel 4,096 colors

| Data                | D7       | D6       | D5       | D4       | D3       | D2       | D1       | D0       |
|---------------------|----------|----------|----------|----------|----------|----------|----------|----------|
| Pixel n             | R1, Bit3 | R1, Bit2 | R1, Bit1 | R1, Bit0 | G1, Bit3 | G1, Bit2 | G1, Bit1 | G1, Bit0 |
| Pixel n / Pixel n+1 | B1, Bit3 | B1, Bit2 | B1, Bit1 | B1, Bit0 | R2, Bit3 | R2, Bit2 | R2, Bit1 | R2, Bit0 |
| Pixel n+1           | G2, Bit3 | G2, Bit2 | G2, Bit1 | G2, Bit0 | B2, Bit3 | B2, Bit2 | B2, Bit1 | B1, Bit0 |

The Data order is as follows, MSB = D7, LSB = D0. Picture Data is MSB = Bit3, LSB = Bit0 for Red, Green, and Blue data.

#### 17.9.1.1.3 RGB565, 16bits/pixel 65,536 colors

| Data      | D7       | D6       | D5       | D4       | D3       | D2       | D1       | D0       |
|-----------|----------|----------|----------|----------|----------|----------|----------|----------|
| Pixel n   | R1, Bit4 | R1, Bit3 | R1, Bit2 | R1, Bit1 | R1, Bit0 | G1, Bit5 | G1, Bit4 | G1, Bit3 |
| Pixel n   | G1, Bit2 | G1, Bit1 | G1, Bit0 | B1, Bit4 | B1, Bit3 | B1, Bit2 | B1, Bit1 | B1, Bit0 |
| Pixel n+1 | R2, Bit4 | R2, Bit3 | R2, Bit2 | R2, Bit1 | R2, Bit0 | G2, Bit5 | G2, Bit4 | G2, Bit3 |
| Pixel n+1 | G2, Bit2 | G2, Bit1 | G2, Bit0 | B2, Bit4 | B2, Bit3 | B2, Bit2 | B2, Bit1 | B1, Bit0 |

The Data order is as follows, MSB = D7, LSB = D0. Picture Data is MSB = Bit5, LSB = Bit0 for Green data, MSB = Bit4, LSB = Bit0 for Red and Blue data.

#### 17.9.1.1.4 RGB666, 18bits/pixel 262,144 colors

| D7       | D6                                                                         | D5                                                                                                                                                                                                          | D4                                                                                                                                                     | D3                                                                                                                                                                                                                                                                                                                                                                                                | D2                                                                                                                                                                                                                 | D1                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | D0                                                                                                                                                                                                                                                             |
|----------|----------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| R1, Bit5 | R1, Bit4                                                                   | R1, Bit3                                                                                                                                                                                                    | R1, Bit2                                                                                                                                               | R1, Bit1                                                                                                                                                                                                                                                                                                                                                                                          | R1, Bit0                                                                                                                                                                                                           |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |                                                                                                                                                                                                                                                                |
| G1, Bit5 | G1, Bit4                                                                   | G1, Bit3                                                                                                                                                                                                    | G1, Bit2                                                                                                                                               | G1, Bit1                                                                                                                                                                                                                                                                                                                                                                                          | G1, Bit0                                                                                                                                                                                                           |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |                                                                                                                                                                                                                                                                |
| B1, Bit5 | B1, Bit4                                                                   | B1, Bit3                                                                                                                                                                                                    | B1, Bit2                                                                                                                                               | B1, Bit1                                                                                                                                                                                                                                                                                                                                                                                          | B1, Bit0                                                                                                                                                                                                           |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |                                                                                                                                                                                                                                                                |
| R2, Bit5 | R2, Bit4                                                                   | R2, Bit3                                                                                                                                                                                                    | R2, Bit2                                                                                                                                               | R2, Bit1                                                                                                                                                                                                                                                                                                                                                                                          | R2, Bit0                                                                                                                                                                                                           |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |                                                                                                                                                                                                                                                                |
| G2, Bit5 | G2, Bit4                                                                   | G2, Bit3                                                                                                                                                                                                    | G2, Bit2                                                                                                                                               | G2, Bit1                                                                                                                                                                                                                                                                                                                                                                                          | G2, Bit0                                                                                                                                                                                                           |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |                                                                                                                                                                                                                                                                |
| B2, Bit5 | B2, Bit4                                                                   | B2, Bit3                                                                                                                                                                                                    | B2, Bit2                                                                                                                                               | B2, Bit1                                                                                                                                                                                                                                                                                                                                                                                          | B2, Bit0                                                                                                                                                                                                           |                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |                                                                                                                                                                                                                                                                |
|          | D7<br>R1, Bit5<br>G1, Bit5<br>B1, Bit5<br>R2, Bit5<br>G2, Bit5<br>B2, Bit5 | D7       D6         R1, Bit5       R1, Bit4         G1, Bit5       G1, Bit4         B1, Bit5       B1, Bit4         R2, Bit5       R2, Bit4         G2, Bit5       G2, Bit4         B2, Bit5       B2, Bit4 | D7D6D5R1, Bit5R1, Bit4R1, Bit3G1, Bit5G1, Bit4G1, Bit3B1, Bit5B1, Bit4B1, Bit3R2, Bit5R2, Bit4R2, Bit3G2, Bit5G2, Bit4G2, Bit3B2, Bit5B2, Bit4B2, Bit3 | D7         D6         D5         D4           R1, Bit5         R1, Bit4         R1, Bit3         R1, Bit2           G1, Bit5         G1, Bit4         G1, Bit3         G1, Bit2           B1, Bit5         B1, Bit4         B1, Bit3         B1, Bit2           R2, Bit5         R2, Bit4         R2, Bit3         R2, Bit2           G2, Bit5         B2, Bit4         B2, Bit3         B2, Bit3 | D7D6D5D4D3R1, Bit5R1, Bit4R1, Bit3R1, Bit2R1, Bit1G1, Bit5G1, Bit4G1, Bit3G1, Bit2G1, Bit1B1, Bit5B1, Bit4B1, Bit3B1, Bit2B1, Bit1R2, Bit5R2, Bit4R2, Bit3R2, Bit2R2, Bit1G2, Bit5B2, Bit4B2, Bit3B2, Bit2B2, Bit1 | D7         D6         D5         D4         D3         D2           R1, Bit5         R1, Bit4         R1, Bit3         R1, Bit2         R1, Bit1         R1, Bit3           G1, Bit5         G1, Bit4         G1, Bit3         G1, Bit2         G1, Bit1         G1, Bit3           B1, Bit5         B1, Bit4         B1, Bit3         B1, Bit2         B1, Bit1         B1, Bit3           R2, Bit5         R2, Bit4         R2, Bit3         R2, Bit2         R2, Bit1         R2, Bit3           G2, Bit5         G2, Bit4         G2, Bit3         G2, Bit2         G2, Bit1         G2, Bit3           B2, Bit5         B2, Bit4         B2, Bit3         B2, Bit3         B2, Bit3         B2, Bit3         B2, Bit3         B2, Bit3 | D7D6D5D4D3D2D1R1, Bit5R1, Bit4R1, Bit3R1, Bit2R1, Bit1R1, Bit0G1, Bit5G1, Bit4G1, Bit3G1, Bit2G1, Bit1G1, Bit0B1, Bit5B1, Bit4B1, Bit3B1, Bit2B1, Bit1B1, Bit0R2, Bit5R2, Bit4R2, Bit3R2, Bit2R2, Bit1R2, Bit0B2, Bit5B2, Bit4B2, Bit3B2, Bit2B2, Bit1B2, Bit0 |

The Data order is as follows, MSB = D7, LSB = D0. Picture Data is MSB = Bit5, LSB = Bit0 for Red, Green, and Blue data.

### 17.9.1.1.5 RGB888, 24bits/pixel 16,777,216 colors

| Data      | D7       | D6       | D5       | D4       | D3       | D2       | D1       | D0       |
|-----------|----------|----------|----------|----------|----------|----------|----------|----------|
| Pixel n   | R1, Bit7 | R1, Bit6 | R1, Bit5 | R1, Bit4 | R1, Bit3 | R1, Bit2 | R1, Bit1 | R1, Bit0 |
| Pixel n   | G1, Bit7 | G1, Bit6 | G1, Bit5 | G1, Bit4 | G1, Bit3 | G1, Bit2 | G1, Bit1 | G1, Bit0 |
| Pixel n   | B1, Bit7 | B1, Bit6 | B1, Bit5 | B1, Bit4 | B1, Bit3 | B1, Bit2 | B1, Bit1 | B1, Bit0 |
| Pixel n+1 | R2, Bit7 | R2, Bit6 | R2, Bit5 | R2, Bit4 | R2, Bit3 | R2, Bit2 | R2, Bit1 | R2, Bit0 |
| Pixel n+1 | G2, Bit7 | G2, Bit6 | G2, Bit5 | G2, Bit4 | G2, Bit3 | G2, Bit2 | G2, Bit1 | G2, Bit0 |
| Pixel n+1 | B2, Bit7 | B2, Bit6 | B2, Bit5 | B2, Bit4 | B2, Bit3 | B2, Bit2 | B2, Bit1 | B2, Bit0 |

The Data order is as follows, MSB = D7, LSB = D0. Picture Data is MSB = Bit7, LSB = Bit0 for Red, Green, and Blue data.

### 17.9.1.2 DBI-Type B Output Formats - 9bit Interface

The following figures show the color coding DBI-Type B Interface supports when Data Bus (DBIB\_DB) is 9 bits wide:

#### 17.9.1.2.1 RGB666, 18bits/pixel 262,144 colors

| Data      | D8       | D7       | D6       | D5       | D4       | D3       | D2       | D1       | D0       |
|-----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|
| Pixel n   | R1, Bit5 | R1, Bit4 | R1, Bit3 | R1, Bit2 | R1, Bit1 | R1, Bit0 | G1, Bit5 | G1, Bit4 | G1, Bit3 |
| Pixel n   | G1, Bit2 | G1, Bit1 | G1, Bit0 | B1, Bit5 | B1, Bit4 | B1, Bit3 | B1, Bit2 | B1, Bit1 | B1, Bit0 |
| Pixel n+1 | R2, Bit5 | R2, Bit4 | R2, Bit3 | R2, Bit2 | R2, Bit1 | R2, Bit0 | G2, Bit5 | G2, Bit4 | G2, Bit3 |
| Pixel n+1 | G2, Bit2 | G2, Bit1 | G2, Bit0 | B2, Bit5 | B2, Bit4 | B2, Bit3 | B2, Bit2 | B2, Bit1 | B2, Bit0 |

The Data order is as follows, MSB = D8, LSB = D0. Picture Data is MSB = Bit5, LSB = Bit0 for Red, Green, and Blue data.

#### 17.9.1.3 DBI-Type B Output Formats - 16bit Interface

The following figures show the color coding DBI-Type B Interface supports when Data Bus (DBIB\_DB) is 16 bits wide:RGB332, 8bits/pixel, 256 colors

| Data                  | D15      | D14      | D13      | D12      | D11      | D10      | D9       | D8       | D7       | D6       | D5       | D4       | D3       | D2       | D1       | D0       |
|-----------------------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|
| Pixel n / Pixel n+1   | R1, Bit2 | R1, Bit1 | R1, Bit0 | G1, Bit2 | G1, Bit1 | G1, Bit0 | B1, Bit1 | B1, Bit0 | R2, Bit2 | R2, Bit1 | R2, Bit0 | G2, Bit2 | G2, Bit1 | G2, Bit0 | B2, Bit1 | B2, Bit2 |
| Pixel n+2 / Pixel n+3 | R3, Bit2 | R3, Bit1 | R3, Bit0 | G3, Bit2 | G3, Bit1 | G3, Bit0 | B3, Bit1 | B3, Bit0 | R4, Bit2 | R4, Bit1 | R4, Bit0 | G4, Bit2 | G4, Bit1 | G4, Bit0 | B4, Bit1 | B4, Bit2 |
| Pixel n+4 / Pixel n+5 | R5, Bit2 | R5, Bit1 | R5, Bit0 | G5, Bit2 | G5, Bit1 | G5, Bit0 | B5, Bit1 | B5, Bit0 | R6, Bit2 | R6, Bit1 | R6, Bit0 | G6, Bit2 | G6, Bit1 | G6, Bit0 | B6, Bit1 | B6, Bit2 |

The Data order is as follows: MSB = D15, LSB = D0. Picture data is MSB = Bit2, LSB = Bit0 for Red and Green data, and MSB = Bit1, LSB = Bit0 for Blue data.

### 17.9.1.3.1 RGB444, 16bits/pixel 65,536 colors

| Data      | D15 | D14 | D13 | D12 | D11      | D10      | D9       | D8       | D7       | D6       | D5       | D4       | D3       | D2       | D1       | D0       |
|-----------|-----|-----|-----|-----|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|
| Pixel n   |     |     |     |     | R1, Bit3 | R1, Bit2 | R1, Bit1 | R1, Bit0 | G1, Bit3 | G1, Bit2 | G1, Bit1 | G1, Bit0 | B1, Bit3 | B1, Bit2 | B1, Bit1 | B1, Bit0 |
| Pixel n+1 |     |     |     |     | R2, Bit3 | R2, Bit2 | R2, Bit1 | R2, Bit0 | G2, Bit3 | G2, Bit2 | G2, Bit1 | G2, Bit0 | B2, Bit3 | B2, Bit2 | B2, Bit1 | B2, Bit0 |
| Pixel n+2 |     |     |     |     | R3, Bit3 | R3, Bit2 | R3, Bit1 | R3, Bit0 | G3, Bit3 | G3, Bit2 | G3, Bit1 | G3, Bit0 | B3, Bit3 | B3, Bit2 | B3, Bit1 | B3, Bit0 |

The Data order is as follows, MSB = D15, LSB = D0. Pixel data is MSB = Bit3, LSB = Bit0 for Red, Green, and Blue data.

#### 17.9.1.3.2 RGB565, 16bits/pixel 65,536 colors



The Data order is as follows, MSB = D15, LSB = D0. Pixel data is MSB = Bit5, LSB = Bit0 for Green data and MSB = Bit4, LSB = Bit0 for Red and Blue data.

#### 17.9.1.3.3 RGB666, 18bits/pixel 262,144 colors - Option 1

| Data                | D15      | D14      | D13      | D12      | D11      | D10      | D9 | D8 | D7       | D6       | D5       | D4       | D3       | D2       | D1 | D0 |
|---------------------|----------|----------|----------|----------|----------|----------|----|----|----------|----------|----------|----------|----------|----------|----|----|
| Pixel n             | R1, Bit5 | R1, Bit4 | R1, Bit3 | R1, Bit2 | R1, Bit1 | R1, Bit0 |    |    | G1, Bit5 | G1, Bit4 | G1, Bit3 | G1, Bit2 | G1, Bit1 | G1, Bit0 |    |    |
| Pixel n / Pixel n+1 | B1, Bit5 | B1, Bit4 | B1, Bit3 | B1, Bit2 | B1, Bit1 | B1, Bit0 |    |    | R2, Bit5 | R2, Bit4 | R2, Bit3 | R2, Bit2 | R2, Bit1 | R2, Bit0 |    |    |
| Pixel n+1           | G2, Bit5 | G2, Bit4 | G2, Bit3 | G2, Bit2 | G2, Bit1 | G2, Bit0 |    |    | B2, Bit5 | B2, Bit4 | B2, Bit3 | B2, Bit2 | B2, Bit1 | B2, Bit0 |    |    |

The Data order is as follows, MSB = D15, LSB = D0. Pixel data is MSB = Bit5, LSB = Bit0 for Red, Green, and Blue data.

#### 17.9.1.3.4 RGB666, 18bits/pixel 262,144 colors - Option 2

| Data      | D15      | D14      | D13      | D12      | D11      | D10      | D9 | D8 | D7       | D6       | D5       | D4       | D3       | D2       | D1 | D0 |
|-----------|----------|----------|----------|----------|----------|----------|----|----|----------|----------|----------|----------|----------|----------|----|----|
| Pixel n   |          |          |          |          |          |          |    |    | R1, Bit5 | R1, Bit4 | R1, Bit3 | R1, Bit2 | R1, Bit1 | R1, Bit0 |    |    |
| Pixel n   | G1, Bit5 | G1, Bit4 | G1, Bit3 | G1, Bit2 | G1, Bit1 | G1, Bit0 |    |    | B1, Bit5 | B1, Bit4 | B1, Bit3 | B1, Bit2 | B1, Bit1 | B1, Bit0 |    |    |
| Pixel n+1 |          |          |          |          |          |          |    |    | R2, Bit5 | R2, Bit4 | R2, Bit3 | R2, Bit2 | R2, Bit1 | R2, Bit0 |    |    |
| Pixel n+1 | G2, Bit5 | G2, Bit4 | G2, Bit3 | G2, Bit2 | G2, Bit1 | G2, Bit0 |    |    | B2, Bit5 | B2, Bit4 | B2, Bit3 | B2, Bit2 | B2, Bit1 | B2, Bit0 |    |    |

The Data order is as follows, MSB = D15, LSB = D0. Pixel data is MSB = Bit5, LSB = Bit0 for Red, Green, and Blue data.

### 17.9.1.3.5 RGB888, 24bits/pixel 16,777,216 colors - Option 1



The Data order is as follows, MSB = D15, LSB = D0. Pixel data is MSB = Bit7, LSB = Bit0 for Red, Green, and Blue data.

#### 17.9.1.3.6 RGB888, 24bits/pixel 16,777,216 colors - Option 2

| Data      | D15      | D14      | D13      | D12      | D11      | D10      | D9       | D8       | D7       | D6       | D5       | D4       | D3       | D2       | D1       | D0       |
|-----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|----------|
| Pixel n   |          |          |          |          |          |          |          |          | R1, Bit7 | R1, Bit6 | R1, Bit5 | R1, Bit4 | R1, Bit3 | R1, Bit2 | R1, Bit1 | R1, Bit0 |
| Pixel n   | G1, Bit7 | G1, Bit6 | G1, Bit5 | G1, Bit4 | G1, Bit3 | G1, Bit2 | G1, Bit1 | G1, Bit0 | B1, Bit7 | B1, Bit6 | B1, Bit5 | B1, Bit4 | B1, Bit3 | B1, Bit2 | B1, Bit1 | B1, Bit0 |
| Pixel n+1 |          |          |          |          |          |          |          |          | R2, Bit7 | R2, Bit6 | R2, Bit5 | R2, Bit4 | R2, Bit3 | R2, Bit2 | R2, Bit1 | R2, Bit0 |
| Pixel n+1 | G2, Bit7 | G2, Bit6 | G2, Bit5 | G2, Bit4 | G2, Bit3 | G2, Bit2 | G2, Bit1 | G2, Bit0 | B2, Bit7 | B2, Bit6 | B2, Bit5 | B2, Bit4 | B2, Bit3 | B2, Bit2 | B2, Bit1 | B2, Bit0 |

The Data order is as follows, MSB = D15, LSB = D0. Pixel data is MSB = Bit7, LSB = Bit0 for Red, Green, and Blue data.

#### 17.9.2 SPI, 3-/4-wire Serial Peripheral Interface

The Display Controller supports SPI serial interface with three or four distinct signals. The 3-wire (9-bit data word) serial interface uses the DC\_SPI\_CS\_N (chip select), DISP\_SPI\_SCK (serial clock), and DISP\_SPI\_SD (bi-directional serial data) or DISP\_SPI\_SDO (serial data out only). The 9-bit data word consists of 1-bit data/command indicator and 8-bit data word.

The 4-wire (8-bit data word) serial interface uses DC\_SPI\_CS\_N (chip select), DISP\_SPI\_SCK (serial clock), DISP\_SPI\_SD (bi-directional serial data) or DISP\_SPI\_SDO (serial data out only), and one additional signal DISP\_SPI\_DCX (data/command indicator). The DISP\_SPI\_SD or DISP\_SPI\_SDO signal is regarded as a command when DISP\_SPI\_DCX is low and as data when DISP\_SPI\_DCX is high.

The chip select can be assigned to any GPIO0-GPIO104 via the PINCFGn\_NCESRCn field. See the "Implementing Display Controller Connections" section in the GPIO chapter for pad configuration information for these SPI signals.

The following figures shows how the DC is connected to an LCD monitor using the aforementioned methods.



Figure 90. SPI 3- or 4-wire Interface (Dashed Line Used in 4-wire) - Data Out Only



Figure 91. SPI 3- or 4-wire Interface (Dashed Line Used in 4-wire) - Bi-directional Data

NOTE

Because of pad function limitation, each of the MSPI2 and the Display SPI signals is available on only one pad, and since signals for each of these two interfaces share the same pads, only one of MSPI2 or Display SPI can be used at a time.

To enable the DC to use one of these interfaces, the DC\_MODE\_DBITYPEBEN bit must be set. Then the selection of SPI3 or SPI4 is done by setting the DC\_DBICFG\_SPI3 or DC\_DBICFG\_SPI4 bit.

After the LCD is initialized and during the write mode, the Display Controller sends commands and data to the LCD. The 3-wire serial data packet (DISP\_SPI\_SD) contains a control bit and a transmission byte, 9 bits in total. In 4-wire serial interface, data packet (DISP\_SPI\_SD) contains only the transmission byte (8 bits) and the control bit is transferred by the DISP\_SPI\_DCX signal separately. The MSB is transmitted first.

The CS signal, the GPIO configured as GPIO\_PINCFGn\_NCESRCn = DC\_SPI\_CS\_N, is configurable. It can be set high or low to indicate the start of the data transmission. Data can be sampled either by the falling or the rising edge of the DISP\_SPI\_SCK depending on the configuration. Also, the clock polarity is configurable (whether the clock starts at high or low edge). If the CS pin remains high/low (depending on

the configuration) after the last bit of DISP\_SPI\_SD signal, the serial interface expects the DISP\_SPI\_DCX bit (3-wire serial interface) or D7 (4-wire serial interface) of the next byte at the next rising/falling edge of DISP\_SPI\_SCK. The serial clock (DISP\_SPI\_SCK) is idle when no communication is taking place.

Figure 92 and Figure 93 depict the corresponding timing waveforms of the 3-wire and 4-wire serial interface during write mode respectively. In these examples the falling edge of SPI\_CS indicates the start of data transmission and data are sampled at the rising edge of the serial clock. The SPI\_DC signal in Figure 93 when low, indicates a command and when high, indicates data.







Figure 93. SPI 4-wire Serial Write Transmission

In case of SPI LCD displays, command and data are bind. The addresses are generated and the address bits can be reversed depending on the type of the LCD display. These displays use the 4-wire serial interface with DC signal not being used. Figure 94 shows one line transmission data.



Figure 94. Single Line Update Mode

During the mode select period, 6 bits are transmitted. During the address select period, 10 bits of the line address are transmitted. The data are transited soon after and the transmission duration depends on the RGB format. After the end of the line, 16 clock cycles are needed. Dummy data are sent during that period and can be either High or Low. Figure 95 shows multiple lines transmission data.



Figure 95. Multiple Lines Update Mode

As with the transmission of only one line, the mode select period needs 6 clock cycles and the address select period needs 10 clock cycles. The duration of the transmitted data depends on the RGB format and the number of lines of each frame from the screen resolution. The dummy data transmission, needed after the end of each line, takes 6 clock cycles. Dummy data can be either High or Low. At the end of the frame (nth line) the duration of the dummy data transmission is 16 clock cycles.

#### 17.9.2.1 SPI Output formats

Through the SPI interface, the DC supports the following output color formats:

- Binary (Black & White) 1 bit/pixel
- RGB111 3 bits/pixel
- RGB332 8 bits/pixel
- RGB565 16 bits/pixel
- RGB666 18 bits/pixel
- RGB888 24 bits/pixel

#### 17.9.2.1.1 Configuration 0xC1 (RGB111 - Option 0)

| Data                | D7 | D6 | D5 | D4 | D3 | D2 | D1 | D0 |
|---------------------|----|----|----|----|----|----|----|----|
| Pixel n / Pixel n+1 |    |    | R1 | G1 | B1 | R2 | G2 | B2 |

The Data order is as follows, MSB = D7, LSB = D0.

### 17.9.2.1.2 Configuration 0xC9 (RGB111 - Option 1)

| Data                  | D7 | D6 | D5 | D4 | D3 | D2 | D1 | D0 |
|-----------------------|----|----|----|----|----|----|----|----|
| Pixel n / Pixel $n+1$ |    | R1 | G1 | B1 |    | R2 | G2 | B2 |

The Data order is as follows, MSB = D7, LSB = D0.

### 17.9.2.1.3 Configuration 0xD1 (RGB111 - Option 2)

| Data                | D7 | D6 | D5 | D4 | D3 | D2 | D1 | D0 |
|---------------------|----|----|----|----|----|----|----|----|
| Pixel n / Pixel n+1 | R1 | G1 | B1 |    | R2 | G2 | B2 |    |

The Data order is as follows, MSB = D7, LSB = D0.

### 17.9.2.1.4 Configuration 0xE1 (RGB111 - Option 4)

| Data                                | D7 | D6 | D5 | D4 | D3 | D2 | D1 | D0 |
|-------------------------------------|----|----|----|----|----|----|----|----|
| Pixel n / Pixel $n+1$ / Pixel $n+2$ | R1 | G1 | B1 | R2 | G2 | B2 | R3 | G3 |

The Data order is as follows, MSB = D7, LSB = D0.

### 17.9.2.1.5 Configuration 0xC2 (RGB332 - Option 0)

| Data      | D7       | D6       | D5      | D4       | D3       | D2       | D1       | D0       |
|-----------|----------|----------|---------|----------|----------|----------|----------|----------|
| Pixel n   | R1, Bit2 | R1, Bit1 | R1 Bit0 | G1, Bit2 | G1, Bit1 | G1, Bit0 | B1, Bit1 | B1, Bit0 |
| Pixel n+1 | R2, Bit2 | R2, Bit1 | R2 Bit0 | G2, Bit2 | G2, Bit1 | G2, Bit0 | B2, Bit1 | B2, Bit0 |
| Pixel n+2 | R3, Bit2 | R3, Bit1 | R3 Bit0 | G3, Bit2 | G3, Bit1 | G3, Bit0 | B3, Bit1 | B3, Bit0 |

The Data order is as follows, MSB = D7, LSB = D0. Picture Data is MSB = Bit2, LSB = Bit0 for Red and Green data, MSB = Bit1, LSB = Bit0 for Blue data.

### 17.9.2.1.6 Configuration 0xC3 (RGB444 - Option 0)

| Data                | D7       | D6       | D5       | D4       | D3       | D2       | D1       | D0       |
|---------------------|----------|----------|----------|----------|----------|----------|----------|----------|
| Pixel n             | R1, Bit3 | R1, Bit2 | R1, Bit1 | R1, Bit0 | G1, Bit3 | G1, Bit2 | G1, Bit1 | G1, Bit0 |
| Pixel n / Pixel n+1 | B1, Bit3 | B1, Bit2 | B1, Bit1 | B1, Bit0 | R2, Bit3 | R2, Bit2 | R2, Bit1 | R2, Bit0 |
| Pixel n+1           | G2, Bit3 | G2, Bit2 | G2, Bit1 | G2, Bit0 | B2, Bit3 | B2, Bit2 | B2, Bit1 | B1, Bit0 |

The Data order is as follows, MSB = D7, LSB = D0. Picture Data is MSB = Bit3, LSB = Bit0 for Red, Green, and Blue data.

### 17.9.2.1.7 Configuration 0xC5 (RGB565 - Option 0)

| Data      | D7       | D6       | D5       | D4       | D3       | D2       | D1       | D0       |
|-----------|----------|----------|----------|----------|----------|----------|----------|----------|
| Pixel n   | R1, Bit4 | R1, Bit3 | R1, Bit2 | R1, Bit1 | R1, Bit0 | G1, Bit5 | G1, Bit4 | G1, Bit3 |
| Pixel n   | G1, Bit2 | G1, Bit1 | G1, Bit0 | B1, Bit4 | B1, Bit3 | B1, Bit2 | B1, Bit1 | B1, Bit0 |
| Pixel n+1 | R2, Bit4 | R2, Bit3 | R2, Bit2 | R2, Bit1 | R2, Bit0 | G2, Bit5 | G2, Bit4 | G2, Bit3 |
| Pixel n+1 | G2, Bit2 | G2, Bit1 | G2, Bit0 | B2, Bit4 | B2, Bit3 | B2, Bit2 | B2, Bit1 | B1, Bit0 |

The Data order is as follows, MSB = D7, LSB = D0. Picture Data is MSB = Bit5, LSB = Bit0 for Green data, MSB = Bit4, LSB = Bit0 for Red and Blue data.

### 17.9.2.1.8 RGB666 (0xC6 - Option 0)

| Data      | D7       | D6       | D5       | D4       | D3       | D2       | D1 | D0 |
|-----------|----------|----------|----------|----------|----------|----------|----|----|
| Pixel n   | R1, Bit5 | R1, Bit4 | R1, Bit3 | R1, Bit2 | R1, Bit1 | R1, Bit0 |    |    |
| Pixel n   | G1, Bit5 | G1, Bit4 | G1, Bit3 | G1, Bit2 | G1, Bit1 | G1, Bit0 |    |    |
| Pixel n   | B1, Bit5 | B1, Bit4 | B1, Bit3 | B1, Bit2 | B1, Bit1 | B1, Bit0 |    |    |
| Pixel n+1 | R2, Bit5 | R2, Bit4 | R2, Bit3 | R2, Bit2 | R2, Bit1 | R2, Bit0 |    |    |
| Pixel n+1 | G2, Bit5 | G2, Bit4 | G2, Bit3 | G2, Bit2 | G2, Bit1 | G2, Bit0 |    |    |
| Pixel n+1 | B2, Bit5 | B2, Bit4 | B2, Bit3 | B2, Bit2 | B2, Bit1 | B2, Bit0 |    |    |

The Data order is as follows, MSB = D7, LSB = D0. Picture Data is MSB = Bit5, LSB = Bit0 for Red, Green, and Blue data.

| Data      | D7       | D6       | D5       | D4       | D3       | D2       | D1       | D0       |
|-----------|----------|----------|----------|----------|----------|----------|----------|----------|
| Pixel n   | R1, Bit7 | R1, Bit6 | R1, Bit5 | R1, Bit4 | R1, Bit3 | R1, Bit2 | R1, Bit1 | R1, Bit0 |
| Pixel n   | G1, Bit7 | G1, Bit6 | G1, Bit5 | G1, Bit4 | G1, Bit3 | G1, Bit2 | G1, Bit1 | G1, Bit0 |
| Pixel n   | B1, Bit7 | B1, Bit6 | B1, Bit5 | B1, Bit4 | B1, Bit3 | B1, Bit2 | B1, Bit1 | B1, Bit0 |
| Pixel n+1 | R2, Bit7 | R2, Bit6 | R2, Bit5 | R2, Bit4 | R2, Bit3 | R2, Bit2 | R2, Bit1 | R2, Bit0 |
| Pixel n+1 | G2, Bit7 | G2, Bit6 | G2, Bit5 | G2, Bit4 | G2, Bit3 | G2, Bit2 | G2, Bit1 | G2, Bit0 |
| Pixel n+1 | B2, Bit7 | B2, Bit6 | B2, Bit5 | B2, Bit4 | B2, Bit3 | B2, Bit2 | B2, Bit1 | B2, Bit0 |

## 17.9.2.1.9 RGB888 (0xC7 - Option 0)

The Data order is as follows, MSB = D7, LSB = D0. Picture Data is MSB = Bit7, LSB = Bit0 for Red, Green, and Blue data.

### 17.9.3 DualSPI and QuadSPI Interface

The Display Controller supports DualSPI and QuadSPI interface using DC\_QSPI\_CS\_N (chip select), DISP\_QSPI\_SCK (serial clock), and either two or four bi-directional serial data lines. For QuadSPI, the DISP\_QSPI\_D3 - DISP\_QSPI\_D0 lines are used, and for DualSPI, the DISP\_SPI\_DCX and DISP\_SPI\_SD lines are used. For this section, all other information is applicable in either DualSPI or QuadSPI mode.

The chip select can be assigned to any GPIO0-GPIO104 via the PINCFGn\_NCESRCn field. See the "Implementing Display Controller Connections" section in the GPIO chapter for pad configuration information for these SPI signals. Figure 96 and Figure 97 show how the DC is connected to an LCD monitor in either QuadSPI or DualSPi mode using the aforementioned signals.







#### Figure 97. DualSPI Interface

To enable the DC to use one of these interfaces, the DC\_MODE\_DBITYPEBEN bit must be set. After the LCD is initialized and during the write mode, the Display Controller sends commands and data to the LCD. The data packet transmitted on the data lines contains a control bit and a transmission byte, 9 bits in total. The MSB is transmitted first.

The CS signal, the GPIO configured as GPIO\_PINCFGn\_NCESRCn = DC\_QSPI\_CS\_N for QuadSPI or DC\_SPI\_CS\_N for DualSPI, is configurable. It can be set high or low to indicate the start of the data transmission. Data can be sampled either by the falling or the rising edge of the clock depending on the configuration. Also, the clock polarity is configurable (whether the clock starts at high or low edge). If the CS pin remains high/low (depending on the configuration) after the last bit of the data signal, the serial interface expects D7 of the next byte at the next rising/falling edge of the clock. The serial clock is idle when no communication is occurring.

The relative timing of the clock, data and chip selectsignals generally conform to the 3-wire SPI interface timing as shown and described in "SPI, 3-/4-wire Serial Peripheral Interface" on page 240.

# **18. Display Serial Interface (DSI)**

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

#### NOTE

The Apollo4 Lite SoC and Apollo4 Blue Lite SoC do not include a DSI module. This chapter and its content do not apply to these SoCs.

# **19. Graphics and the GFX Library on the Apollo4 Family MCUs**

This chapter is a programming guide for graphics and the use of the GFX library and GPU on the MCU.

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

### **19.1** Introduction to Graphics

Computer graphics is the science of communicating visually via a display and its interaction devices. It is a cross-disciplinary field in which physics, mathematics, human perception, human-computer interaction and engineering blend, towards creating artificial images with the help of programming. It heavily involves computations, creation and manipulation of data and is based on a set of well-defined principles. There are several structural elements that computer graphics are built upon, the most significant of which can be found in the following list.

**Pixel** in digital imaging is the smallest addressable element in an all points addressable display device. A pixel is generally considered as the smallest single component of a digital image and is often used as a measurement unit.

**Raster** image (or bitmapped image) is a matrix data structure representing the actual image content. Raster graphics are resolution-bound therefore unable to scale up without apparent loss of quality.

**Vector graphics** is a technique of using polygons, plane figures bound by a finite chain of straight-line segments closing a loop, to represent images. Vector graphics have inherent scale up abilities, only depending on the rendering device capability.

**Rasterization** is the process of converting an image described in a vector graphics format to a raster image consists of pixels for output on a video display or for storage in a bitmap format.

**Texture** is the digital representation of an object's surface. In addition to two-dimensional qualities such as color and transparency, a texture also incorporates three-dimensional ones such as reflectiveness. Well-defined textures are very important for realistic three-dimensional image representation.

**Texture mapping** is the process of wrapping a pre-defined texture around any two or three-dimensional object. Through this process, digital images and objects obtain a high level of detail.

**Texel** is the fundamental unit of texture space. Textures are represented by arrays of texels in the same way that pictures are represented by arrays of pixels.

**Vertex** is a data structure that describes the location of an object by properly define its corners as positions of points in two or three-dimensional space.

**Primitives** in computer graphics are the simplest geometric objects a system can handle. Common sets of two-dimensional primitives include lines, points, triangles and polygons while all other geometric elements are built up from these primitives. In three-dimensions, properly positioned triangles or polygons can be used as primitives to model more complex forms.

**Blending** is the process in which two or more images are combined per-pixel and weights to create new pictures.

**Fragment** is the data necessary to generate a single-pixel primitive. This data is possible to include raster position, color or texture coordinates.

**Interpolation** in computer graphics is the process of generating intermediate values between two known reference points to give the appearance of continuity and smooth transition. Several distinct interpolation techniques are used in both computer graphics and animation, such as linear, bilinear, spline and polynomial interpolation.

**Graphics Pipeline** is an abstract sequence that incorporates the basic operations of generic rasterizer implementations, in particular:

- (Vertex) Per-vertex transformation to screen space
- (Rasterize) Per-triangle iteration over pixels with perspective-correct interpolation
- (Pixel) Per-pixel shading
- (Output Merge) Merging the output of shading with the current color and depth buffers

The term "pipeline" is used due to the sequential steps that are used for the actual transformation from mathematical model to pixels; the results of the one stage are pushed on to the next stage so that the first stage can begin processing the next element immediately.



Figure 98. Rendering Flow

## 19.2 GPU and Graphics Software Support

The following is supported in software for graphics development:

- FreeRTOS support
  - Bare Metal (no-OS)
- Graphics API support
  - GFX API library in pure C
- Available software emulators and suites
  - NEMA® PIX-Presso
  - NEMA® GUI-Builder

The GFX library is available in pure ANSI C with no dependencies and is ported to FreeRTOS with the following features:

- Enables high quality 2.5D graphics on RTOS and OS-less systems.
- Proprietary low level library that interfaces directly with the GPU and provides a software abstraction layer to organize and employ drawing commands with ease and efficiency.
- Can be used as a back-end to existing APIs and as a standalone Graphics API.

The software package includes:

- NEMA®|GUI-Builder, a graphical cross-platform software framework enabling rapid high-end Graphics User Interface (GUI) development on low resource hardware
- NEMA® PIX-Presso, a utility software for converting images to/from formats suitable for low power embedded devices
# **19.3 GFX Library Architecture**

The GFX Library is a low level library that interfaces directly with the GPU and provides a software abstraction layer to organize and employ drawing commands with ease and efficiency. The target of the GFX library is to be able to be used as a back-end to existing APIs (such as OpenGL®, DirectFB or any proprietary one) but also to expose higher level drawing functions, so as to be used as a standalone Graphics API. Its small footprint, efficient design and lack of any external dependencies, makes it ideal for use in embedded applications. By leveraging the sophisticated architecture, it allows great performance with minimum CPU/SoC usage and power consumption.

The GFX library includes a set of higher level calls, forming a complete standalone Graphics API for applications in systems where no other APIs are needed. This API is able to carry out draw operations from as simple as lines, triangles and quadrilaterals to more complex ones like blitting and perspective correct texture mapping.

The GFX library is built on a modular architecture. An implementor may use only the lower layers of the architecture that provides communication to the GPU hardware, synchronization and basic primitives drawing. The very thin Hardware Abstraction Layer allows for fast integration to the underlying hardware. The upper low level drawing API acts as a back-end interface for accelerating any higher 3rd party Graphics API.

The GFX library modules are generally stacked one over another, forming a layered scheme. This gives the implementor the freedom to tailor the software stack according to ones needs.



Figure 99. GFX Library Architecture

The lowest layer is a thin Hardware Abstraction Layer (HAL). It includes some hooks for basic interfacing with the hardware such as register accessing, interrupt handling etc.

The layer above is the Command List Manager. It provides the appropriate API for creating, organizing and issuing Command Lists. This topic is discussed in detail in Section 19.6.1 on page 267.

Above the Command List Manager lies the Hardware Programming Layer (HPL). This is a set of helper functions that assemble commands for programming the GPU. These commands actually write to the GPU's Configuration Register File, which is used to program the submodules of the GPU.

Alongside the HPL resides the Blender module. This module programs the GPU's Programmable Processing Core. It creates binary executables for the Core. These executables correspond to the various blending modes that are supported by the GFX library.

On top of the GFX library stack lies the graphics API. This API offers function calls to draw geometry primitives (lines, triangles, quadrilaterals etc), blit images, render text, transform geometry objects, perform perspective correct texture mapping etc. When using the GFX library as a back-end for a third party Graphics API, much of the GFX library Graphics API may be disabled.

# **19.4 Graphics Pipeline of the GPU**

The GPU has been designed for graphics efficiency in ultra-compact silicon area. Its fixed-point data path and instruction set architecture (ISA) are tailored to GUIs acceleration and small display applications leading to substantial improvements in power consumption and silicon area. The GPU microarchitecture combines hardware-level support for multi-threading, VLIW and low-level vector processing in the most power efficient way. The GPU is connected to the CPU core via a 32/64-bit AXI4 bus.

The GPU is a modular architecture which includes a separate programmable Fragment Processing Core, a Z Buffer Unit, a Texture Map Unit and Render Output Unit. It utilize a fixed-point data path and custom VLIW Instruction Set Architecture (ISA) in a heavily multi-threaded execution pipeline which is optimized for extracting the maximum performance in 3D acceleration in a power and silicon area constrained system.

### **19.4.1** Instruction Set Architecture

The microarchitecture of the GPU for fragment processing is a four-issue VLIW core. The compact GPU VLIW Instruction Set Architecture (ISA) has been carefully built to support a variety of graphic operations in a few instructions. Each VLIW instruction is 64 bits long and consists of four sub-instructions (corresponding to sub-operations), as shown in Figure 100. In accordance with Figure 100, the sub-operations for the GPU are: i) instructions for Alpha channel arithmetic operations (SUBOP0), ii) instructions for RGB vector arithmetic operations (SUBOP1), iii) memory addressing instructions for reading and writing to memory (SUBOP2), and iv) branching instructions based on comparisons between RGB or Alpha values (SUBOP3).

Each VLIW bundle is designed to perform six arithmetic operations per cycle (simultaneously). If more than one VLIW bundle is enabled, the addressing sub-operations (SUBOP2) are issued once for all the enabled bundles while the remaining sub-operations (SUBOP0, SUBOP1, and SUBOP3) are issued once for each VLIW bundle.



Figure 100. GPU VLIW fragment instruction format

A Yield bit is responsible to stop the execution of a VLIW bundle when it is activated. Consider the register file organization, four simultaneous, read/write operations to general purpose registers (four general purpose registers in total; R0 to R3) and four read-only operations to constant registers (four constant registers in total; C0 to C3) are allowed at each cycle. Consider the alpha channel sub-operations (SUBOP0), more complex arithmetic operations such as "1-alpha" are natively supported (i.e., they can be done without extra instructions). For example, see Table 102 in which the sub-operand A1 is set to 0x4 value which reads the value 1-alpha of R1 register.

# 19.4.1.1 SUBOP0 format set

The SUBOP0 instructions is a subset of the GPU ISA that contains instructions for alpha operations. These instructions have one destination operand that can be written, see Table 99.

| Value | Register | Bits | Summary for DestA         |
|-------|----------|------|---------------------------|
| 0x0   | R0       | 7:0  | Writing R0's Alpha value. |
| 0x1   | R1       | 7:0  | Writing R1's Alpha value. |
| 0x2   | R2       | 7:0  | Writing R2's Alpha value. |
| 0x3   | R3       | 7:0  | Writing R3's Alpha value. |

#### Table 99: DestA Operand

The Alpha operations that can be executed are described in Table 100.

| Opcode | Instruction | Dest  | <b>A</b> 0 | <b>A</b> 1 | A2 | Summary                                                                             |
|--------|-------------|-------|------------|------------|----|-------------------------------------------------------------------------------------|
| 0x0    | NOP         |       |            |            |    | No operation                                                                        |
| 0x1    | AMOV        | DestA |            | A1         |    | Move A1 to DestA register                                                           |
| 0x2    | AMADD       | DestA | A0         | A1         | A2 | The result of multiplication A0 with A1 is added with A2 and saved on Dest Register |
| 0x3    | AMUL        | DestA |            | A1         |    | The result of reciprocal with A1 is saved on<br>DestA register                      |

Table 100: SUBOP0-Instructions

The source operands are A0, A1 and A2. The configuration values are described in Table 101, Table 102 and Table 103, respectively.

| Value | Register | Bits | Summary for Operand A0   |
|-------|----------|------|--------------------------|
| 0x0   | R0       | 7:0  | Read Alpha value from R0 |
| 0x1   | R1       | 7:0  | Read Alpha value from R1 |
| 0x2   | R2       | 7:0  | Read Alpha value from R2 |
| 0x3   | R3       | 7:0  | Read Alpha value from R3 |

Table 101: A0 Operand

| Value | Register  | Bits  | Summary for Operand A1            |
|-------|-----------|-------|-----------------------------------|
| 0x0   | R0        | 7:0   | Read Alpha value from R0          |
| 0x1   | R1        | 7:0   | Read Alpha value from R1          |
| 0x2   | R2        | 7:0   | Read Alpha value from R2          |
| 0x3   | R3        | 7:0   | Read Alpha value from R3          |
| 0x4   | R0        | 7:0   | Read (1 - Alpha) value from R0    |
| 0x5   | R1        | 7:0   | Read (1 - Alpha) value from R1    |
| 0x6   | R2        | 7:0   | Read (1 - Alpha) value from R2    |
| 0x7   | R3        | 7:0   | Read (1 - Alpha) value from R3    |
| 0x8   | R0        | 15:8  | Read Blue value from R0           |
| 0x9   | R0        | 23:16 | Read Green value from R0          |
| 0xa   | R0        | 31:24 | Read Red value from R0            |
| 0xb   | C0        | 7:0   | Read Alpha value from constant C0 |
| 0xc   | C1        | 7:0   | Read Alpha value from constant C1 |
| 0xd   | C2        | 7:0   | Read Alpha value from constant C2 |
| 0xe   | Immediate | 7:0   | Read 0xff immediate value         |
| Oxf   | Immediate | 7:0   | Read 0x0 immediate value          |

# Apollo4 SoC Family Programmer's Guide

### Table 102: A1 Operand

| Value | Register | Bits | Summary for Operand A2            |
|-------|----------|------|-----------------------------------|
| 0x0   | R0       | 7:0  | Read Alpha value from R0          |
| 0x1   | R1       | 7:0  | Read Alpha value from R1          |
| 0x2   | R2       | 7:0  | Read Alpha value from R2          |
| 0x3   | R3       | 7:0  | Read Alpha value from R3          |
| 0x4   | C0       | 7:0  | Read Alpha value from constant C0 |
| 0x5   | C1       | 7:0  | Read Alpha value from constant C1 |
| 0x6   | C2       | 7:0  | Read Alpha value from constant C2 |
| 0x7   | C3       | 7:0  | Read Alpha value from constant C3 |

#### Table 103: A2 Operand

## 19.4.1.2 SUBOP1 format set

The SUBOP1 instructions is a subset of the GPU ISA that contains instructions for the RGB operations. Operands are vectors of 3 elements. In case that an operand reads alpha value, the 8bit alpha value Becomes a vector of 3 elements with the same value. The RGB operations that can be executed are described in Table 104.

| Opcode | Instruction | Dest    | RGB0 | RGB1 | RGB2 | Summary                                                                                            |
|--------|-------------|---------|------|------|------|----------------------------------------------------------------------------------------------------|
| 0x0    | NOP         |         |      |      |      | No operation.                                                                                      |
| 0x1    | RMADD       | DestRGB | RGB0 | RGB1 | RGB2 | The result of multiplication RGB0<br>with RGB1 is added to RGB2 and<br>saved on Dest Register.     |
| 0x2    | RMOV        | DestRGB |      | RGB1 |      | Move RGB1 to Dest register.                                                                        |
| 0x3    | RADD        | DestRGB |      | RGB1 | RGB2 | Add RGB1 with RGB2 and save it to Dest register.                                                   |
| 0x5    | RMSB        | DestRGB | RGB0 | RGB1 | RGB2 | RGB2 is subed from the result of multiplication between RGB0 with RGB1 and saved on Dest Register. |
| 0x6    | RMAX        | DestRGB |      | RGB1 | RGB2 | If RGB2 is greater than RGB1 then<br>RGB0 is written to DST else RGB1.                             |
| 0x7    | RMIN        | DestRGB |      | RGB1 | RGB2 | If RGB2 is less than RGB1 then RGB0 is written to DST else RGB1.                                   |
| 0x8    | ADDXY       | DestRGB |      | RGB1 | RGB2 | R2 XY values are added with R1<br>XY values                                                        |
| 0x9    | ANDXY       | DestRGB |      | RGB1 | RGB2 | R2 XY values are ANDed with R1 XY values                                                           |
|        |             |         |      |      |      |                                                                                                    |

#### Table 104: SUBOP1 Instructions

These instructions have one destination operand that can be written, see Table 105.

| Value | Register | Bits | Summary for Operand DestRGB |
|-------|----------|------|-----------------------------|
| 0x0   | R0       | 31:8 | Writing R0's RGB values     |
| 0x1   | R1       | 31:8 | Writing R1's RGB values     |
| 0x2   | R2       | 31:8 | Writing R2's RGB values     |
| 0x3   | R3       | 31:8 | Writing R3's RGB values     |

## Table 105: DestRGB Operand

The source operands are RGB0, RGB1 and RGB2. The accepted values are described in Table 106, Table 107 and Table 108, respectively.

| Value | Register | Bits | Summary for RGB0       |
|-------|----------|------|------------------------|
| 0x0   | R0       | 7:0  | Read RGB value from R0 |
| 0x1   | R1       | 7:0  | Read RGB value from R1 |
| 0x2   | R2       | 7:0  | Read RGB value from R2 |
| 0x3   | R3       | 7:0  | Read RGB value from R3 |

## Table 106: RGB0 Operand

| Value | Register | Bits | Summary for RGB1                          |
|-------|----------|------|-------------------------------------------|
| 0x0   | R0       | 31:8 | Read RGB values from R0                   |
| 0x1   | R1       | 31:8 | Read RGB values from R1                   |
| 0x2   | R2       | 31:8 | Read RGB values from R2                   |
| 0x3   | R3       | 31:8 | Read RGB values from R3                   |
| 0x4   | R0       | 7:0  | Read Alpha value from R0                  |
| 0x5   | R1       | 7:0  | Read Alpha value from R1                  |
| 0x6   | R2       | 7:0  | Read Alpha value from R2                  |
| 0x7   | R3       | 7:0  | Read Alpha value from R3                  |
| 0x8   | R0       | 31:8 | Read (1 - R)(1 - G)(1 - B) values from R0 |
| 0x9   | R1       | 31:8 | Read (1 - R)(1 - G)(1 - B) values from R1 |
| 0xa   | R2       | 31:8 | Read (1 - R)(1 - G)(1 - B) values from R2 |
| 0xb   | R3       | 31:8 | Read (1 - R)(1 - G)(1 - B) values from R3 |
| Охс   | R0       | 7:0  | Read (1 - Alpha) value from R0            |
| 0xd   | R1       | 7:0  | Read (1 - Alpha) value from R1            |
| 0xe   | R2       | 7:0  | Read (1 - Alpha) value from R2            |
| 0xf   | R3       | 7:0  | Read (1 - Alpha) value from R3            |
| 0x10  | C0       | 31:8 | Read RGB values from constant C0          |
| 0x11  | C1       | 31:8 | Read RGB values from constant C1          |

Table 107: RGB1 Operand

| Value | Register | Bits | Summary for RGB2                 |
|-------|----------|------|----------------------------------|
| 0x0   | R0       | 31:8 | Read RGB values from R0          |
| 0x1   | R1       | 31:8 | Read RGB values from R1          |
| 0x2   | R2       | 31:8 | Read RGB values from R2          |
| 0x3   | R3       | 31:8 | Read RGB values from R3          |
| 0x4   | C0       | 31:8 | Read RGB values from constant C0 |
| 0x5   | C1       | 31:8 | Read RGB values from constant C1 |
| 0x6   | C2       | 31:8 | Read RGB values from constant C2 |
| 0x7   | C3       | 31:8 | Read RGB values from constant C3 |

Table 108: RGB2 Operand

## 19.4.1.3 SUBOP2 format set

The SUBOP2 instructions is a subset of the GPU ISA that contains instructions for addressing, reading and writing to memory. The Addressing operations that can be executed are described in Table 109.

| Opcode | Instruction | Dest  | Coords | Image | Summary                       |
|--------|-------------|-------|--------|-------|-------------------------------|
| 0x0    | NOP         |       |        |       | No operation.                 |
| 0x1    | CREADTEX    | PIXEL | Coords | IMG   | Conditionally Read<br>Texture |
| 0x3    | READTEX     | PIXEL | Coords | IMG   | Read from Texture             |
| 0x5    | PIXOUT      | PIXEL | Coords | IMG   | Pixout                        |

Table 109: SUBOP2 Instructions

Table 110 provides a summary for Pixel Operand while Table 111 and Table 112 describe the Coords Operand.

| Value | Register | Bits | Summary for PIXEL Operand |
|-------|----------|------|---------------------------|
| 0x0   | R0       | 7:0  | Writing R0's value.       |
| 0x1   | R1       | 7:0  | Writing R1's value.       |
| 0x2   | R2       | 7:0  | Writing R2's value.       |
| 0x3   | R3       | 7:0  | Writing R3's value.       |

#### Table 110: Pixel Operand

| Value | Register | Bits | Summary for Coords Operand           |
|-------|----------|------|--------------------------------------|
| 0x0   | iY, iX   |      | Read iY, iX values for addressing.   |
| 0x1   | i_R10    | 31:0 | Read TY4, TX4 values for addressing. |

#### Table 111: Addressing Coords Operand

| Value | Register        | Summary for Coords Operand |
|-------|-----------------|----------------------------|
| 0x0   | <b>TEX0BASE</b> | Read Image from IMAGE0.    |
| 0x1   | TEX1BASE        | Read Image from IMAGE1.    |
| 0x2   | TEX2BASE        | Read Image from IMAGE2.    |
| 0x3   | TEX3BASE        | Read Image from IMAGE3.    |

#### Table 112: TEXnBASE Coords Operand

## 19.4.1.4 SUBOP3 format set

The SUBOP3-instructions is a subset of the GPU ISA that contains compare instructions between the RGB values of the two operands LHS and RHS. The Compare operations that can be executed are described in Table 113.

| Opcode | Instruction | LHS | RHS | Summary                            |
|--------|-------------|-----|-----|------------------------------------|
| 0x0    | NOP         | LHS | RHS | No operation                       |
| 0x1    | EQ          | LHS | RHS | Compare if LHS is equal to RHS     |
| 0x2    | NEQ         | LHS | RHS | Compare if LHS is not equal to RHS |
| 0x3    | LESS        | LHS | RHS | Compare if LHS is less than RHS    |

#### Table 113: SUBOP3 Instructions

The source operands are LHS and RHS. The accepted values are described on Table 114.

| Value | Register | Bits | Summary for LHS and RHS Operands |
|-------|----------|------|----------------------------------|
| 0x0   | R0       | 31:8 | Read RGB values from R0          |
| 0x1   | R1       | 31:8 | Read RGB values from R1          |
| 0x2   | R2       | 31:8 | Read RGB values from R2          |
| 0x3   | R3       | 31:8 | Read RGB values from R3          |
| 0x4   | C0       | 31:8 | Read RGB values from constant C0 |
| 0x5   | C1       | 31:8 | Read RGB values from constant C1 |
| 0x6   | C2       | 31:8 | Read RGB values from constant C2 |
| 0x7   | C3       | 31:8 | Read RGB values from constant C3 |

| Table 114: LHS | , RHS | Operands |
|----------------|-------|----------|
|----------------|-------|----------|

## 19.4.2 Configuration Register File

The GPU is programmed through a set of registers. This register set is called the Configuration Register File (CRF) and each sub-module of the GPU is programmed through a subset of the CRF. The CRF can be memory mapped to the CPU address space, thus making it directly accessible. Writing the CRF directly is considered inefficient since it consumes a large volume of the CPU resources and ties the CPU execution to the GPU. For this reason, it can also be accessed indirectly through the Command List Processor (CLP).

## 19.4.3 Command List Processor

In order to decouple CPU and GPU execution and achieve both better performance and lower power consumption, the GPU incorporates an advanced Command List Processor (CLP), capable of reading entire list of commands from the main memory and relay them to the Configuration Register File.

The CPU pre-assembles Command Lists (CL) prior to submitting them to the Command List Processor for execution, while a single Command List can be submitted multiple times. This approach alleviates the CPU from recalculating drawing operations for repetitive tasks, resulting in more efficient resource utilization.

The steps for writing commands to the Configuration Registers through the Command List Processor are the following:

- 1. The CPU assembles a Command List, through the GFX library.
- 2. The CPU submits the Command List for execution. The Command List Processor is informed of a pending Command List.
- 3. The Command List Processor reads the Command List from the System Memory.
- 4. The Command List Processor relays the commands to the Configuration Register File.

### 19.4.4 Vertex Processing

The Vertex Processing unit in the rendering pipeline handles the processing of individual vertices. The Vertex Processing unit utilizes 64-bit VLIW (very long instruction word) instructions and performs computations on vertices which sends them to the Rasterizer unit through the Configuration Registers. The Vertex Processing unit is programmable through binary executables called Vertex Shaders. It transforms each vertex's 3D position in object space to the 2D coordinate at which it will appear on the screeen. It also calculates the depth value for the Z-buffer and manipulates properties such as position, depth, color and texture coordinates.

### 19.4.5 Rasterizer

The GPU is capable of drawing a multitude of geometrical shapes called Geometric Primitives, such as lines, rectangles, triangles and quadrilaterals. The Rasterizer Unit reads the coordinates of the primitives' vertices and feeds the rest of the graphics pipeline with the fragments contained in the geometry. A fragment contains information concerning a single pixel. This information includes raster position (coordinates), texture coordinates, interpolated color alpha and depth values.

The Rasterizer of the GPU gives each core (1-4) a specific pixel to draw as part of the image. Each core can draw independently from the others, but a shape should be fully drawn, before the next one starts.

### 19.4.6 Texture Map Unit

The Texture Map Unit produces texels that sends to the Fragment Processing Core. It is fed with texture's attributes (base address, dimensions, color format) and the required coordinates. The Texture Map Unit performs some internal processing and outputs the corresponding texel. Generating a texture element requires a series of operations like wrapping (clamp, mirror, repeat etc), reading corresponding color values from memory, converting the color values to RGBA8888 format and performing filtering if necessary. If texture compression technique is used, then on-the-fly decompression is performed.

### 19.4.7 Fragment Processing Core

The Fragment Processing Core is the main processing unit of the GPU's architecture. It is a 64-bit VLIW processor which performs computations on the fragments coming from the Rasterizer Unit and on the texels coming from the Texture Map Unit and calculates the final color and depth to a fragment. The Core is programmable through binary executables called Fragment Shaders.

### 19.4.8 Render Output Unit

The Render Output Unit (ROP) is the last stage of the Graphics Pipeline. The Fragment Processing Core feeds the Render Output Unit with the pixel's coordinates and color value. Before the color value is written to the memory, the color is converted to the Frame Buffer's format. If texture compression technique is used, then decompression is performed while reading from the Frame Buffer and compression is performed while writing to the Frame Buffer.

With the H/W Blender, the Render Output Unit reads pixels from the Fragment Processing Core (source) and pixels from the Frame Buffer (destination) to perform blending. Blending requires a series of

calculations between the source (foreground) and destination (background) color fragments to produce the final color, which is written back to memory. The following equations are used for the final color:

$$Fc = Sc \cdot Sf + Dc \cdot Df$$
  $Fa = Sa \cdot Sf + Da \cdot Df$ 

The Color and Alpha values range from 0 to 1, therefore each calculation result is also clamped to the same range. The available Blend Factors and the resulting RGBA values are listed in Table 118. Figure 108 shows the effect of the blending modes.

# **19.5 Frame Buffer Compression**

Framebuffer compression operates in screen blocks (4x4 pixel blocks) and, depending on the configuration, achieves TSC<sup>™</sup>4, TSC<sup>™</sup>6 and TSC<sup>™</sup>6a lossy, fixed-ratio compression.

- TSC<sup>™</sup>4 is a 6:1 compression (4bpp)
- TSC<sup>™</sup>6 is a 4:1 compression (6bpp)
- TSC<sup>™</sup>6a is a 4:1 compression (6bpp) with alpha channel

Compression is performed at run time using minimal hardware. Pixel data can be stored in the framebuffer in compressed form and decompressed in the NEMA®|dc. Figure 101 shows the TS compression operation. The output of the TSC<sup>™</sup>4 compression is 64 bits per 4x4 block of pixels and the output of the TSC<sup>™</sup>6 compression is 96 bits per 4x4 block of pixels.



Figure 101. TSC™4 /TSC™6 Framebuffer Compression Module

Compression is performed using the tsc binary file located in NemaTS/Hardware/bin/tsc directory. The syntax is:

./ tsc [ options ] <source > <target >

where <source> is the source image and <target> is the compressed image.

The available [options] are:

| -c -mode <mode></mode>                                                               | Compress source image in png format using TS compression (default option)                                                    |
|--------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------|
| -d <image_width>x<image_<br>heigth&gt; -mode <mode></mode></image_<br></image_width> | Decompress source image of TS format to target image in png format.                                                          |
| -е                                                                                   | Use diff command to compare the source image (in png format) with the target image (in png format). The result is rms error. |

The available modes <mode> are:

- [0] TSfc4 Use TS framebuffer (de)compression 4bpp (default)
- [1] TSfc6 Use TS framebuffer (de)compression 6bpp
- [2] TSfc6A Use TS framebuffer (de)compression 6bpp with alpha channel support
- [3] TStc4 Use TS texture (de)compression 4bpp
- [4] TStc6 Use TS texture (de)compression 6bpp

• [5] TStc6A Use TS texture (de)compression 6bpp with alpha channel support

Examples:

If a source image input.png needs compression in TSC<sup>™</sup>4 format, the command is:

./tsc -c -mode TSfc4 input .png output . tsfc4

or:

./tsc -c -mode [0] input .png output . tsfc4

If a source image in TSC<sup>™</sup>4 format input.tsfc4 needs decompression in 256x256 png format, the command is:

./tsc -d 256 x256 -mode TSfc4 input . tsfc4 output .png

or:

./tsc -d 256 x256 -mode [0] input . tsfc4 output .png

To compare two images in png format (input1.png and input2.png), the command is:

./tsc -e input1 .png input2 .png

The NEMA®| PIX-Presso can be used to aid the development of applications that use TSC formats. NEMA®| PIX-Presso is a utility for converting images to formats suitable for low power embedded devices and can produce TSC<sup>™</sup>4 and TSC<sup>™</sup>6 compressed textures which the developer can then load to memory.



Figure 102. NEMA® PIX-Presso State after a Conversion

NEMA®| PIX-Presso emulates the h/w compression and decompression algorithms, so the developer can check the resulting quality of the image and decide if TSC compression can be used for the specific application, or which TSc algorithm (TSC™4 or TSC™6) is more appropriate.

# 19.6 GFX Library API Guidelines

This section provides an overview of the GFX Library API, including some simple guidelines on how to use the library for development purposes. The information included in this section can be found elsewhere in this chapter but spans across several paragraphs, which makes it daunting to get started. This section introduces some simple code examples using the primitives of the GFX library to draw a graphics image in a timely manner.

## 19.6.1 Command Lists

A Command List (CL) is considered to be one of the most important features of the GPU. CL usage facilitates GPU and CPU decoupling, while its inherent re-usability greatly contributes to the decrease of the computational effort of the CPU. This approach renders the overall architecture capable of drawing complicated scenes while keeping the CPU workload to the very minimum.

The design principles of CLs allow developers to extend the features of their application while optimizing its functionality at the same time. For instance, a CL is capable of jumping to another CL, thus forming a chain of seamlessly interconnected commands. In addition, a CL is able to branch to another CL and once the branch execution is concluded, resume its functionality after the branching point.

The GFX library helps developers to easily take advantage of all these features through certain basic function calls that trigger the whole spectrum of CL capabilities. A short presentation of the most fundamental subset of them is listed in the following sections.

## 19.6.1.1 Create

The most straightforward command for initiating a simple coding example is the "Create" command which is listed below.

nema\_cmdlist\_t nema\_cl\_create(void)

This fundamental command allocates and initializes a new Command List for later use.

### 19.6.1.2 Bind

#### nema\_cl\_bind\_cmdlist(nema\_cmdlist\_t \*cl)

This command sets the referred Command List as active. From that point on, each subsequent drawing call will incrementally be incorporated in the active Command List. At any time, all drawing operations should be called when there is a bound Command List.

### 19.6.1.3 Unbind

nema\_cl\_unbind\_cmdlist(void)

Unbind the currently bound Command List.

### 19.6.1.4 Submit

```
nema_cl_submit_cmdlist(nema_cmdlist_t *cl)
```

Submit the referred Command List for execution. If this CL is currently the one that is bound, this call unbinds it. When a CL is submitted for execution, it should never be altered until it finishes execution. Writing in such a CL results in undefined behavior.

A typical routine for drawing would be the following:

```
nema_cmdlist_t cl = nema_cl_create(); // Create a new CL
nema_cl_bind_cmdlist(&cl); // Bind it
/* Drawing Operations */ // Draw scene
nema_cl_unbind_cmdlist(); // Unbind CL (optionally)
nema_cl_submit_cmdlist(&cl); // Submit CL for execution
```

### 19.6.2 Binding Textures

Every drawing operation should have an effect on a given destination texture. The texture must reside in some memory space which is visible to the GPU.

The above function binds a texture to serve as destination. The texture's attributes (GPU address, width, height, format and stride) are written inside the bound CL. Each subsequent drawing operation will have an effect on this destination texture.

Most common graphics operations include some kind of image blitting (copying), like drawing a background image, GUI icons or even font rendering. The following command binds a texture to be used as foreground:

This function call has a very similar functionality to the NemaGFX\_bind\_dst\_tex. It has one extra argument, NEMA\_tex\_mode\_t mode, that determines how to read a texture (point/bilinear sampling, wrapping mode etc).

The above example can now be extended as follows:

```
NEMA_RGBA8888,
                                         // Image format (32bit, rgba)
                  320*4):
                                       // Stride in bytes (width*4 bytes per pixel)
                                       // Bind Foreground Texture:
nema_bind_src_tex(SRC_IMAGE,
                                      // Source address
                                     // width, height
                  320, 240,
                  NEMA_RGBA8888 ,
                                        // Image format (32bit, rgba)
                  320*4,
                                       // Stride in bytes (width*4 bytes per pixel)
                  NEMA_FILTER_PS);
                                        // Do point sampling (default option)
/* Drawing Operations */
                                       // Draw scene
nema_cl_unbind_cmdlist();
                                      // Unbind CL (optionally)
nema_cl_submit_cmdlist(&cl);
                                   // Submit CL for execution
```

### 19.6.3 Clipping

When drawing a scene, it is often necessary to be able to define a rectangular area that the GPU is allowed to draw. This way, if some parts of a primitive (e.g. a triangle) falls outside the clipping area, that part is not going to be drawn at all, assuring correctness, better performance and improved power efficiency. The Clipping Rectangle can be defined as follows:

void nema\_set\_clip(int32\_t x, int32\_t y, int32\_t w, int32\_t h)

This function defines a Clipping Rectangle whose upper left vertex coordinates are (x, y) and its dimensions are w·h.

The default Clipping Rectangle usually is the entire canvas. In the above examples, we used textures with dimensions of 320x240. So, adding Clipping would result the following:

```
nema_cmdlist_t cl = nema_cl_create(); // Create a new CL
nema_cl_bind_cmdlist(&cl);
                                       // Bind it
                                       // Bind Destination Texture:
                                      // Destination address
nema_bind_dst_tex(DST_IMAGE,
                   320, 240,
NEMA_RGBA8888,
                                       // width, height
                                         // Image format (32bit, rgba)
                   320*4);
                                       // Stride in bytes (width*4 bytes per pixel)
nema_bind_src_tex (SRC_IMAGE,
320, 240,
NEMA_RGBA8888,
                                       // Bind Foreground Texture:
                                       // Source address
                                       // width, height
                                         // Image format (32bit, rgba)
                   320*4,
                                       // Stride in bytes (width*4 bytes per pixel)
                   NEMA_FILTER_PS);
                                         // Do point sampling (default option)
nema_set_clip(0, 0, 320, 240);
                                      // Define a 320x240 Clipping Rectangle
/* Drawing Operations */
                                       // Draw scene
nema_cl_unbind_cmdlist();
                                       // Unbind CL (optionally)
nema_cl_submit_cmdlist(&cl); // Submit CL for execution
```

## 19.6.4 Blending - Programming the Core

When building a graphical interface, the developer has to define what would be the result of drawing a pixel on the canvas. Since the canvas already contains the previous drawn scene, there must be a consistent way to determine how the source or foreground color (the one that is going to be drawn) will blend with the destination or background color that is already drawn. The source pixel can be fully opaque, thus will be drawn over the destination one, or it can be translucent and the result would be a blend of both the source and destination colors.

For example, blitting a background image of a GUI would require the Source Texture to cover entirely whatever is already drawn on the canvas. Afterwards, blitting an icon would require the background to be partially visible on the translucent areas of the icon. In order to make this possible, the GFX library incorporates a powerful set of predefined blending modes that allow the developer to build functional and eye catching applications:

```
void nema_set_blend_fill(nema_blend_mode_t blending_mode)
void nema_set_blend_blit(nema_blend_mode_t blending_mode)
```

These two functions refer to blending when filling a primitive (e.g. triangle) with a color or when blitting a texture respectively.

The previous example, after setting the correct blending mode for blitting a background texture, would evolve to the following:

```
nema_cmdlist_t cl = nema_cl_create(); // Create a new CL
nema_cl_bind_cmdlist(&cl);
                                      // Bind it
                                      // Bind Destination Texture:
nema_bind_dst_tex(DST_IMAGE,
320, 240,
                                     // Destination address
                                     // width, height
                  NEMA_RGBA8888,
                                       // Image format (32bit, rgba)
                  320*4);
                                      // Stride in bytes (width*4 bytes per pixel)
nema_bind_src_tex (SRC_IMAGE,
320, 240,
NEMA_RGBA8888,
                                      // Bind Foreground Texture:
                                     // Source address
                                       // width, height
                                       // Image format (32bit, rgba)
                  320*4,
                                       // Stride in bytes (width*4 bytes per pixel)
                  NEMA_FILTER_PS);
                                       // Do point sampling (default option)
                                     // Define a 320x240 Clipping Rectangle
nema_set_clip(0, 0, 320, 240);
nema_set_blend_blit(NEMA_BL_SRC);
                                        // Program the Core to draw the source color
                                       // without blending it with the destination
                                       // texture
                                      // Draw scene
/* Drawing Operations */
                                      // Unbind CL (optionally)
nema_cl_unbind_cmdlist();
nema_cl_submit_cmdlist(&cl); // Submit CL for execution
```

### 19.6.5 Drawing

Finally, after setting up the above, the CL contains all the information needed to blit an image or fill a Geometric Primitive with color. The GFX library has a rich set of functions to do that. For the example

above, let's assume that we need to draw a background 320x240 image, starting at screen coordinate (0,0) (the upper left corner of the canvas), and then draw a red rectangle that starts at point (20, 30) with dimensions 100x200:

```
nema_cmdlist_t cl = nema_cl_create(); // Create a new CL
nema_cl_bind_cmdlist(&cl);
                                      // Bind it
                                      // Bind Destination Texture:
nema_bind_dst_tex(DST_IMAGE,
320, 240,
                                     // Destination address
                                      // width, height
                  NEMA_RGBA8888,
                                         // Image format (32bit, rgba)
                                      // Stride in bytes (width*4 bytes per pixel)
                   320*4);
nema_bind_src_tex (SRC_IMAGE,
320, 240,
NEMA_RGBA8888,
                                      // Bind Foreground Texture:
                                     // Source address
                                      // width, height
                                       // Image format (32bit, rgba)
                   320*4,
                                      // Stride in bytes (width*4 bytes per pixel)
                   NEMA_FILTER_PS);
                                       // Do point sampling (default option)
                                   // Define a 320x240 Clipping Rectangle
nema_set_clip(0, 0, 320, 240);
nema_set_blend_blit(NEMA_BL_SRC);
                                        // Program the Core to draw the source
                                       // texture without blending it with the
                                       // destination texture
nema_blit(0, 0);
                                      // Blit the bound Source Texture to
                                       // Destination Texture
nema_set_blend_fill(NEMA_BL_SRC);
                                        // Program the Core to fill the Geometric
                                       // Primitive without blending it with the
                                       // destination texture
nema_fill_rect(20, 30, 100, 200, RED); // Fill a rectangular area with red color
nema_cl_unbind_cmdlist();
                                      // Unbind CL (optionally)
nema_cl_submit_cmdlist(&cl);
                                    // Submit CL for execution
```

The overall process described in the previous paragraphs, produces the output presented in the following figures.



Figure 103. Original Empty Framebuffer



Figure 104. Image Background



Figure 105. Final Output of the Drawing Process

# 19.7 Color Modes and Binding Textures

### 19.7.1 Color Modes

The GPU natively support a large set of texture formats, therefore they are capable of performing fast read and write operations by executing on the fly color conversion/decompression. Native formats expand from full 32-bit RGBA to 1-bit black and white colors, together with an optional proprietary compressed 4-bit-perpixel lossy format and can all be used as source or destination textures. The list of all supported formats is presented in Table 115 for quick reference purposes.

| Color Mode         | Description                                  |
|--------------------|----------------------------------------------|
| RGBX8888           | 32-bit color with no transparency            |
| RGBA8888           | 32-bit color with transparency               |
| XRGB8888           | 32-bit color with no transparency            |
| ARGB8888           | 32-bit color with transparency               |
| BGRA8888           | 32-bit color with transparency               |
| BGRX8888           | 32-bit color with transparency               |
| RGBA5650           | 16-bit color with no transparency            |
| RGBA5551           | 16-bit color with 1-bit transparency         |
| RGBA4444           | 16-bit color with transparency               |
| RGBA3320           | 8-bit color with no transparency             |
| L8                 | 8-bit gray scale (luminance) color           |
| A8                 | 8-bit translucent color                      |
| L2                 | 2-bit grayscale (luminance) color            |
| L4                 | 4-bit grayscale (luminance) color            |
| BW1                | 1-bit color (black or white)                 |
| UYVY               | UYVY color                                   |
| TSC <sup>™</sup> 4 | 4-bit proprietary compressed                 |
| YUV                | YUV                                          |
| Z24_8              | 32-bit (24+4) depth and stencil              |
| Z16                | 16-bit depth<br>Table 115: Supported Formats |

## 19.7.2 Binding Textures

The color modes discussed in 19.7.1 can be used for both source and destination textures. The GPU incorporate 4 texture slots allowing 4 textures to be bound simultaneously. This means that the hardware allows a single Shader to read from and/or write to 4 different textures. These textures have to be bound before the Shader is submitted for execution. The GFX library uses pre-assembled Shaders to perform blending operations. These Shaders are built upon the conventions of Table 116.

| Texture Slot | <u>Texture Usage</u>           |
|--------------|--------------------------------|
| NEMA_TEX0    | Destination/Background Texture |
| NEMA_TEX1    | Foreground Texture             |
| NEMA_TEX2    | Background Texture             |
| NEMA_TEX3    | DepthBuffer                    |
|              | Table 116: Shader conventions  |

For further clarifying the aforementioned conventions, let's assume the following example: We need to draw the scene shown in Figure 106, consisted of a background image and two icons. The scene requires the 3 source textures shown in Figure 107, and a Framebuffer, i.e., the destination texture.



Figure 106. Rendered Scene with Two Icons



Figure 107. Scene Textures

## Scenario 1

The scene will be drawn in 3 passes: first draw the background, then draw lcon\_0 and last draw lcon\_1.

- 1. First Pass: Draw Background
  - A. Bind the Framebuffer to NEMA\_TEX0 slot
  - B. Bind the Background texture to NEMA\_TEX1 slot
  - C. Set corresponding blending mode (NEMA\_BL\_SRC)
  - D. Blit NEMA\_TEX1 slot to NEMA\_TEX0 slot
- 2. Second Pass: Draw Icon\_0
  - A. Bind the Framebuffer to NEMA\_TEX0 slot
  - B. Bind Icon0 to NEMA\_TEX1 slot
  - C. Set corresponding blending mode (e.g. NEMA\_BL\_SRC\_OVER)
  - D. Blit NEMA\_TEX1 slot to NEMA\_TEX0 slot
- 3. Third Pass: Drawlcon\_1
  - A. Bind the Framebuffer to NEMA\_TEX0 slot
  - B. Bind Icon1 to NEMA\_TEX1 slot
  - C. Set corresponding blending mode (e.g. NEMA\_BL\_SRC\_OVER)
  - D. Blit NEMA\_TEX1 to NEMA\_TEX0 slot

If we build a single Command List for the above operations, we need to bind the Framebuffer only once, in the beginning of the draw process. This sequence will result in 3 blitting operations. However, there is a more efficient approach, described in Scenario 2.

### Scenario 2

The scene will be drawn in 2 passes: first draw lcon\_0 on top of the background and then draw lcon\_1.

- 1. First Pass: Draw Icon\_0 on top of the Background
  - A. Bind the Framebuffer to NEMA\_TEX0 slot
  - B. Bind Icon0 to NEMA\_TEX1 slot

- C. Bind the Background texture to NEMA\_TEX2 slot
- D. Set corresponding blending mode (e.g. NEMA\_BL\_SRC\_OVER)
- E. Blit NEMA\_TEX1 on top of NEMA\_TEX2 slot to NEMA\_TEX0 slot
- 2. Second Pass: Draw Icon\_1
  - A. Bind the Framebuffer to NEMA\_TEX0 slot
  - B. Bind Icon1 to NEMA\_TEX1 slot
  - C. Set corresponding blending mode (e.g. NEMA\_BL\_SRC\_OVER)
  - D. Blit NEMA\_TEX1 slot to NEMA\_TEX0 slot

## 19.7.2.1 Texture Binding Functions

Use this function to bind the destination texture. It implies binding to NEMA\_TEX0 slot:

Use the following function to bind the foreground (source) texture. This is needed only for Blit operations. Fill operations don't have a source texture. This function implies binding to NEMA\_TEX1 slot:

The following function binds a background texture to NEMA\_TEX2 slot. This is needed when the Blending Mode to be used does not use the destination texture (NEMA\_TEX0) as background texture at the blending operation.

# **19.8 Geometry Primitives**

In computer graphics, geometry primitives are the basic geometric shapes that a system can draw. In the GPU architecture, these primitives are generated by the Rasterizer module. The Rasterizer generates the fragments contained inside the primitive and feeds them to the Programmable Core for processing. The GPU can draw the following Geometry Primitives:

- Points
- Lines
- Filled Triangles
- Filled Rectangles
- Filled Quadrilaterals

All the aforementioned primitives can be processed by the Programmable Core to do simple operations (e.g filling with a constant color or gradient, blitting etc) or more advanced ones (e.g. blurring, edge detection etc). The GPU functionality can be extended through software to draw:

- Triangles
- Rectangles
- Polygons
- Filled Polygons
- Triangle Fans
- Triangle Strips
- Circles
- Filled Circles
- Arcs
- Rounded Rectangles

# 19.9 Blending

Alpha blending is a basic process in computer graphics. It refers to a convex combination of two colors, a translucent source (foreground) and a destination (background) one, allowing transparency effects. The basic blending algorithms described by Porter and Duff in [<XREF>2], define a set of mathematical operations for the Color channels (RGB) and the Alpha (transparency) channel of a fragment. Blending process is essential for rendering fonts and/or creating GUIs. In the graphics pipeline, blending is carried-out in the Graphics Core.

### 19.9.1 Blending in the Graphics Core

The Graphics Core is a programmable VLIW processor, which allows rapid calculations between colors. Normally, such per-fragment calculations are an overwhelming computational burden for a CPU or SoC. The Graphics Core is programmed through instructions in binary form, called Shaders. However, in embedded applications, running a compiler for creating these kinds of Shaders is not a realistic scenario. Therefore, the GFX library provides a lightweight and (user-friendly) easy to use interface that employs pre-assembled commands to create a powerful set of blending algorithms.

The Graphics Core can be programmed through the following functions:

```
void nema_set_blend_fill(nema_blend_mode_t blending_mode)
void nema_set_blend_blit(nema_blend_mode_t blending_mode)
```

These functions are defined in NemaGFX\_blender.h file. They should be used on fill and blit operations respectively. The blending\_mode argument is possible to be a predefined blending mode or a more refined User Defined Mode.

### 19.9.2 Notations and Conventions

Blending requires a series of calculations between the source (foreground) and destination (background) color fragments for producing the final color, which will be written in memory. The Color and Alpha channels are noted as follows:

- Sc: Source Color
- Sa: Source Alpha
- *Sf*: Source Blend Factor (multiplier)
- Dc: Destination Color
- Da: Destination Alpha
- *Df*: Destination Blend Factor (multiplier)
- Fc: Final Color
- Fa: Final Alpha
- Cc: Constant Color
- Ca: Constant Alpha

The Color and Alpha values range from 0 to 1, therefore each calculation result is also clamped to the same range. For consistency reasons Color and Alpha calculations are always described separately, as in some cases these calculations are not identical. When a constant color is used (noted as Cc and Ca), it can be set using the following function:

void nema\_set\_const\_color(uint32\_t rgba)

### 19.9.3 Predefined Blending Modes

Predefined Blending Modes is a set of commonly used modes, each implying different calculations between the source and destination colors for Color (RGB) channel and Alpha channel respectively. Table

1304 presents the entire list of the available Predefined Blending Modes along with the corresponding calculations that produce the final fragment color. Table 117 shows the result for each Predefined Blending Mode.

| Predefined Blending Modes | <u>RGB</u>                   | <u>ALPHA</u>                |
|---------------------------|------------------------------|-----------------------------|
| NEMA_BL_SIMPLE            | Sc * Sa + Dc *(1 - Sa)       | Sa *Sa+Da *(1 - Sa)         |
| NEMA_BL_CLEAR             | 0                            | 0                           |
| NEMA_BL_SRC               | Sc                           | Sa                          |
| NEMA_BL_SRC_OVER          | Sc + Dc *(1 - Sa)            | Sa+Da ∗(1−Sa)               |
| NEMA_BL_DST_OVER          | Sc *(1 - Da) + Dc            | Sa *(1 – Da) + Da           |
| NEMA_BL_SRC_IN            | Sc *Da                       | Sa *Da                      |
| NEMA_BL_DST_IN            | Dc *Sa                       | Da *Sa                      |
| NEMA_BL_SRC_OUT           | Sc ∗(1 − Da)                 | Sa ∗(1 − <i>D</i> a)        |
| NEMA_BL_DST_OUT           | <i>Dc</i> *(1 – S <i>a</i> ) | <i>Da</i> ∗(1 – Sa)         |
| NEMA_BL_SRC_ATOP          | Sc *Da + Dc *(1 - Sa)        | Sa ∗Da+Da ∗(1−Sa)           |
| NEMA_BL_DST_ATOP          | Sc *(1 - Da) + Dc * Sa       | Sa *(1-Da)+Da *Sa           |
| NEMA_BL_ADD               | Sc + Dc                      | Sa +Da                      |
| NEMA_BL_XOR               | Sc *(1 - Da) + Dc *(1 - Sa)  | Sa *(1 - Da) + Da *(1 - Sa) |

**Table 117: Predefined Blending Modes** 



#### Figure 108. Predefined Blending Modes

For instance, drawing a translucent red rectangle would require the following calls:

```
nema_set_blend_fill(NEMA_BL_SIMPLE);
nema_fill_rect(52, 52, 14, 4, nema_rgba(0xff, 0, 0, 0x80));
```

Tip: When in doubt, usually the NEMA\_BL\_SIMPLE blending mode is the safest choice.

The overall process starts with an empty Framebuffer, shown in Figure 109. Next, the process continues by blitting the textures shown in Figure 110.



Figure 109. Original Framebuffer before Blending

**DS** Figure 110. Scene Textures

# 19.9.4 User Defined Modes

Developers are able to create custom blending modes by using different factors for the source (foreground) and destination (background) color, through the following function:

The ops argument of the above function refers to additional operations, presented in Section 19.9.5 on page 282, therefore for the time being should be set to zero (0). The calculations result to the final color, using the following two equations:

 $Fc = Sc \cdot Sf + Dc \cdot Df \quad (1)$  $Fa = Sa \cdot Sf + Da \cdot Df \quad (2)$ 

The available Blend Factors are listed in Table 118. Figure 111 shows the available custom blending modes. As a result, the previous example is possible to be rewritten as:

nema\_set\_blend\_fill(nema\_blending\_mode(NEMA\_BF\_SRCALPHA, NEMA\_BF\_INVSRCALPHA, 0)); nema\_fill\_rect(52, 52, 14, 4, nema\_rgba(0xff, 0, 0, 0x80));

|      | Blend Factors (Sf or Df) |
|------|--------------------------|
| <br> |                          |

| NEMA_BF_ZERO            | 0               |
|-------------------------|-----------------|
| NEMA_BF_ONE             | 1               |
| NEMA_BF_SRCCOLOR        | Sc              |
| NEMA_BF_INVSRCCOLOR     | (1 –Sc)         |
| NEMA_BF_SRCALPHA        | Sa              |
| NEMA_BF_SRC_INVSRCALPHA | (1 <i>−Sa</i> ) |
| NEMA_BF_DESTALPHA       | Da              |

| NEMA_BF_INVDESTALPHA | (1 <i>−Da</i> ) |
|----------------------|-----------------|
| NEMA_BF_DESTCOLOR    | Dc              |
| NEMA_BF_INVDESTCOLOR | (1 <i>−Dc</i> ) |
| NEMA_BF_CONSTCOLOR   | Cc              |
| NEMA_BF_CONSTALPHA   | Ca              |





Figure 111. User-defined Blending Modes

## 19.9.5 Additional Operations

The GFX library allows the following operations, which can be applied together with the previously mentioned blending modes through the aforementioned function:

The additional supported operations are listed in Table 119. An example of the overall process can be found in Figure 112 and Figure 113.

| ops Arguments  | Description                                                          |
|----------------|----------------------------------------------------------------------|
| SRC_MODULATE_A | Multiply source alpha channel with Ca constant before blending.      |
|                | <i>Ca</i> is defined by callingNemaGFX_set_const_color().            |
| SRC_FORCE_A    | Replace source alpha channel with <i>Ca</i> before blending.         |
|                | Overrides SRC_MODULATE_A option.                                     |
|                | <i>Ca</i> is defined by calling NemaGFX_set_const_color().           |
| SRC_COLORIZE   | Multiply source color channels (RGB) with <i>Cc</i> before blending. |
|                | Cc is defined by calling NemaGFX_set_const_color().                  |
| SRC_COLORKEY   | Ignore fragment when source color matches the source color key,      |
|                | which is defined by calling NemaGFX_set_src_color_key().             |
| DST_COLORKEY   | Ignore fragment when destination color matches the destination color |
|                | key, which is defined by calling NemaGFX_set_dst_color_key().        |
|                |                                                                      |

Table 119: ops Arguments



Figure 112. Source Textures



Figure 113. Additional Operations Example

## 19.10 Fonts

Drawing text on the screen is an important element of any Graphical User Interface. To draw a string you will need a Typeface, the text to be drawn and some attributes on how the text is to be displayed. Typefaces are sourced in TrueType (TTF) file type, which contains scalable representations of typefaces described as vector curves. Scalable fonts are converted to raster fonts (bitmap fonts) by rasterization to a particular size and format. Raster fonts are drawn on the screen as a series of images with each letter drawn after the other using the correct letter width. To facilitate this process, the GFX library handles text display and alignment using special functions.



Figure 114. Vector and Bitmap Fonts

The first step is to convert a TrueType font to Bitmap font. This is done off-line with the NemaGFX\_FontUtil tool.

```
./nema_font_convert <font.ttf> -size <size> -format <A1|A2|RGBA>
-output <font\_bitmap>
```

The conversion generates a font\_bitmap.h and font\_bitmap.bin file at the specified size and format. The <size> parameter defines the height of the font. Fonts can be mono-spaced (fixed-width) or not.

Typefaces can be converted to the following formats:

A1:1 bit per pixel

A2:2 bits per pixel antialiased

A4:4 bits per pixel antialiased

A8:8 bits per pixel antialiased RGBA32:Sub-pixel Antialiased

Before drawing text we need to bind the data structure of a typeface.

void nema\_bind\_font (font\_t \*fontdata)

The text can be drawn through the following function:

```
void nema_draw_text (char *text, int *x, int *y, int clip_x, int clip_y, int clip_w,
int clip_h, uint32_t color, font_attr_t attributes)
```

The arguments of the above function refer to:

- The text to be drawn
- The (x, y) screen coordinates that the text should be drawn to
- The clipping box (x, y, width, height) in case some part of the text should be hidden
- The color of the text
- Additional font attributes

The additional font attributes are:

- JUST\_LEFT:Justify Left
- JUST\_CENTRE: Justify Centre
- JUST\_RIGHT: Justify Right
- WRAP:Wrap to a new line if exceeding

Arguments \*x, \*y return the position of the cursor after the text is drawn.

# 19.11 GFX Library Platform Porting

The GFX library has been designed to be easily portable in a variety of different platforms. This includes systems with or without an operating system. In order to port the GFX library successfully, one must take into account the target platform and adapt the HAL (Hardware Abstraction Layer) accordingly.

The HAL is a thin layer of the library that:

- Communicates directly with the system hardware and the GPU drivers (when drivers are available)
- Performs the communication between the host and the GPU (access to the GPU registers)
- Handles interrupts
- Implements the memory management scheme (memory allocation, deallocation, mapping and un-mapping)

### 19.11.1 Platform Specific HAL

Each target platform has:

- A nema\_sys\_defs.h header file located in NemaGFX\_SDK/common/platforms folder
- A separate *nema\_hal.c* file, located in *NemaGFX\_SDK/NemaGFX/platforms/* folder

In order to port the GFX library to a new platform, it is advised that the corresponding source files of an already ported platform are used as templates.

#### 19.11.2 nema\_sys\_defs.h

nema\_sys\_defs.h contains all the global definitions and inclusions that are platform specific. For example, the GFX library uses a set of integer types defined inside the C standard stdint.h header. If the platform's compiler supports the stdint.h, then it should be included in the nema\_sys\_defs.h header file. If the compiler does not support the stdint.h, then the following types should be defined:

- int8\_t, uint8\_t,
- int16\_t, uint16\_t,
- int32 t, uint32 t,
- int64\_t, uint64\_t

If the GFX library is compiled for a platform that runs multiple processes and/or multiple threads, the following definitions should be added respectively:

```
#define NEMA_MULTI_PROCESS
#define NEMA_MULTI_THREAD
```

#### 19.11.3 nema\_hal.c

*nema\_hal.c* contains all the platform specific functions that implement hardware register read/write operations, interrupt handling, memory management and mutex support. It acts as a thin layer which incorporates all the platform dependent portions of a the GFX library implementation.

#### 19.11.3.1 System Initialization

The nema\_init() function initializes the GFX library and calls the nema\_sys\_init() function which is responsible for the system initialization. The system initialization includes:

GPU register memory mapping
- Graphics Memory mapping
- Mutex initialization
- Ring Buffer allocation and initialization

#### 19.11.3.2 Register Read/Write

The host CPU writes to and reads from the GPU's configuration registers. The functions *nema\_reg\_read()* and *nema\_reg\_write()* are used for the communication of the CPU with the GPU.

When the target platform does not need memory virtualization (e.g. bare-metal systems), the access to the GPU's registers is straightforward by using the register's physical memory address. The only prerequisite in this case, is the appropriate memory mapping of the GPU registers to the system's main memory.

The following examples, illustrate the register read and write operations for bare-metal systems.

```
uint32_t nema_reg_read(uint32_t reg) {
    uint32_t *ptr = (uint32_t *)(nema_regs + reg);
    return *ptr;
}
void nema_reg_write(uint32_t reg, uint32_t value) {
    uint32_t *ptr = (uint32_t *)(nema_registers_base_addr + reg);
    *ptr = value;
}
```

On platforms that support virtual memory (i.e. Linux, Android), the GPU registers' physical addresses must be mapped to the virtual memory addresses before an access is attempted. This can be performed in three ways.

The first one is to utilize the GPU driver (if applicable), and perform the memory mapping using the *mmap* system call.

```
nema_regs_base_virt = mmap(NULL, 0x1000, PROT_READ | PROT_WRITE, MAP_SHARED, nema_fd,
0);
```

The second way in Linux systems, is to use the /dev/mem instead of the unema driver as shown in the following example.

```
// Map and initialize Graphics Memory
nema_devmem_mmap(&gmem_base_virt, gmem_base_phys, gmem_size, "VideoMemory");
// Memory map nema registers
nema_devmem_mmap((void **)&nema_regs, nema_regs_base_phys, 0x1000, "NemaRegisters
");
```

The third way in Linux systems, is to perform the read/write operations to the registers via respective *IOCTL* calls to the unema driver.

```
uint32_t nema_reg_read(uint32_t reg) {
    unema_ioctl_readwrite_t ioctl_rw;
    ioctl_rw.reg = reg;
    ioctl(nema_fd, UNEMA_IOCTL_REG_READ, &ioctl_rw);
    return ioctl_rw.value;
}
void nema_reg_write(uint32_t reg, uint32_t value) {
    unema_ioctl_readwrite_t ioctl_rw;
    ioctl_rw.reg = reg;
    ioctl_rw.value = value;
    ioctl(nema_fd, UNEMA_IOCTL_REG_WRITE, &ioctl_rw);
}
```

## 19.11.3.3 Interrupt Handling

The interrupt handler is executed when an interrupt is triggered by the GPU. Its purpose is to awaken suspended processes and clear the interrupt.

When the GPU kernel driver is available (Linux and Android systems), the interrupt handler is defined in the unema driver. When the GPU kernel driver is not available (bare-metal and RTOS based systems), the interrupt handler has to be defined in the nema\_hal.c file.

A typical interrupt handler for bare-metal systems is the following:

The HAL should implement the nema\_wait\_irq function for interrupt handling. Its purpose is to suspend the process (put to sleep) if the GPU is idle or until the GPU signals an interrupt. Its implementation is platform (CPU) dependent. If the kernel driver is available, then IOCTL calls are used, otherwise it is manually defined according to the CPU platform.

```
void nema_wait_irq(void)
{
```

Typically, the GPU will raise an interrupt when it has finished executing a Command List. The ID of the last Command List that has been executed can be read from the Configuration Register NEMA\_CLID. The function nema\_wait\_irq\_cl(int cl\_id) should wait until the content of the NEMA\_CLID register is greater or equal than the cl\_id argument.

## 19.11.3.4 Memory Management

At this stage, the memory management scheme should be implemented. Memory can be considered to consist of two parts: host memory (memory available only to the CPU) and graphics memory (memory available both to the GPU and the CPU).

Host memory can be allocated by using systems' default *malloc* method:

```
//System malloc
void nema_host_free (void *ptr ) {
    free(ptr);
}
void *nema_host_malloc (unsigned size) {
    return malloc(size);
}
```

If such a method is not available, the graphics memory allocator can be used:

```
//Think Silicon's graphics memory allocator
void nema_host_free (void *ptr ) {
    tsi_free(ptr);
}
void *nema_host_malloc (unsigned size) {
    return tsi_malloc(size);
}
```

Graphics memory is the portion of system memory that the GPU is allowed to have access. In this case, it is necessary that the graphics memory occupies a contiguous physical memory space. On systems that do not support virtual memory (e.g. bare-metal systems), graphics memory can be allocated in the same way as host memory (using *malloc*, *tsi\_malloc* or other custom memory allocator).

The GFX library includes the following API calls for memory allocation, deallocation and mapping:

•^:nema\_buffer\_create() - Allocate memory

^:nema\_buffer\_create\_pool() - Allocate memory from specific memory pool

.nema\_buffer\_map() - Map allocated memory space for CPU access

.rema\_buffer\_unmap() - Unmap previously mapped memory space

•^:nema\_buffer\_destroy() - Deallocate memory space

A reference example that illustrates this memory allocation scheme can be found in:

Software/NemaGFX\_SDK/NemaGFX/platforms/lattice\_mico32\_no\_OS/nema\_hal.c.

In this file, functions *nema\_buffer\_create* and *nema\_buffer\_destroy* adopt this specific scheme.

When virtual memory is used (e.g. Linux or Android systems), graphics memory should be allocated by the system's contiguous memory allocator (e.g. *ION* for Android, *CMA* for Linux).

Linux based system: The unema kernel driver pre-allocates a contiguous physical memory space using Linux CMA. When the GFX library is initialized, this memory space is mapped to the process' virtual space.

Android based system: When the GFX library is initialized, the *ION* kernel module is opened. On each subsequent graphics memory allocation or deallocation (*nema\_buffer\_create* or *nema\_buffer\_destroy*), the corresponding *IOCTL* to the *ION* module is called.

The GFX library supports multiple memory pools. This can be useful for systems with non-uniform memory hierarchy that have different characteristics (e.g. latency, throughput, etc). If such a feature is not needed, *nema\_buffer\_create\_pool()* can be set to redirect to *nema\_buffer\_create()*:

```
nema_buffer_t nema_buffer_create_pool(int pool, unsigned size) {
#ifdef NEMA_MULTI_MEM_POOLS //defined in nema_sys_defs.h
    nema_mutex_lock(MUTEX_MALLOC);
    nema_buffer_t bo;
    bo.base_virt = tsi_malloc_pool(pool, size);
    bo.base_phys = (uint32_t) tsi_virt2phys(bo.base_virt);
    bo.size = size;
    bo.fd = 0;
    nema_mutex_unlock(MUTEX_MALLOC);
    return bo;
#else
    return nema_buffer_create(size);
#endif
}
```

## 19.11.3.5 Support for Multi-Process Multi-Threaded systems

The GFX library is designed to support a wide variety of systems, from bare-metal to Linux platforms. These systems might also support multiple processes and/or multiple threads within a process. the GFX library is a graphics API that can manage resource sharing among multiple processes/threads if needed, using mutices and thread local storage (TLS). To support multiple processes, only mutices are necessary. For multiple threads, both mutices and TSL are needed.

The most obvious shared resource is the GPU itself. Multiple processes/threads send work to the GPU using Command Lists (CL). When a CL is executed by the GPU, it is guaranteed that it will not be interrupted by another CL. Each CL also needs to set the entire state of the GPU and not rely on previous CLs. So the only thing that needs to be taken care of, when multiple processes/threads are running, is the submission of a CL for execution. In this case, a simple mutex is used by the library.

The same applies for memory management. When a buffer is created or destroyed, a mutex ensures that no allocation or deallocation is performed by two processes or threads concurrently.

During system initialization, nema\_init() should call nema\_sys\_init() only once for each process. New threads within the process must not call nema\_init() again. The nema\_sys\_init() should not reinitialize the memory allocator nor the Ring Buffer, unless no other running process performed the aforementioned initializations.

For multi-threaded environments, *TLS\_VAR* should be defined inside *nema\_sys\_defs.h*. The GFX library uses *TLS\_VAR* as a prefix to declare thread local variables. For example, when using *GCC* the following definition should be added:

```
#ifdef NEMA_MULTI_THREAD
#define TLS_VAR __thread
#else
#define TLS_VAR
#endif
```

## 19.11.4 Bare Metal Display Control Library

The API for the NEMA functions to communicate with the display controller in bare metal systems is described in the following sections.

## 19.11.4.1 nemadc\_get\_config

Read Configuration Register

## **Return Value:**

uint32\_t: return implemented configuration

## 19.11.4.2 nemadc\_get\_crc

Read CRC Checksum Register

## **Return Value:**

unsigned int: return crc checksum of last frame for testing

## 19.11.4.3 nemadc\_init

Initializes the Display Controller and structures

## **Return Value:**

int: true if Display Controller is found

## 19.11.4.4 nemadc\_set\_bgcolor

Set Background Color before layer overlays

## Arguments:

int: Colour as a 32-bit rgba value

## Return Value:

void: void

## 19.11.4.5 nemadc\_timing

Sets Display Timing parameters and activates display

## Arguments:

int resx: Resolution X
int resy: Resolution Y
int fpx: front porch X
int fpy: front porch Y
int blx: blanking X
int bly: blanking Y

int bpx: backporch X

int bpy: back porch Y

## **Return Value:**

void: void

## 19.11.4.6 nemadc\_set\_mode

Set Operation Mode

## Arguments:

int mode: Mode of Operation (reg00)

## **Return Value:**

void: void

## 19.11.4.7 nemadc\_get\_status

Get STATUS

## Arguments:

void: void

## Return Value:

uint32\_t: Status

## 19.11.4.8 nemadc\_request\_vsync\_non\_blocking

Request a VSync Interrupt without blocking

## **Return Value:**

void: void

## 19.11.4.9 nemadc\_set\_layer

Set the Layer mode. This function can enable a layer and set attributes to it

## Arguments:

int layer\_no: The layer number

display\_layer layer: layer attributes structure

## **Return Value:**

void: void

## 19.11.4.10 nemadc\_clkdiv

Sets the built-in Clock Dividers and DMA Line Prefetch. (See Configuration Register 0x4)

## Arguments:

int div: Sets Divider 1
int div2: Sets Divider 2
int dma\_prefetch: Sets number of lines for the dma to prefetch
Return Value:

void: void

## 19.11.4.11 nemadc\_clkctrl

Control the Clock Gaters

## Arguments:

int div: Sets the Clock Gater

## **Return Value:**

void: void

## 19.11.4.12 nemadc\_layer\_enable

Enables a layer

## Arguments:

int layer\_no: layer to enable

## Return Value:

void: void

## 19.11.4.13 nemadc\_layer\_disable

Disables a layer

## Arguments:

int layer\_no: layer to disable

## **Return Value:**

void: void

## 19.11.4.14 nemadc\_get\_col\_mode

Read Color Mode Register

## **Return Value:**

uint32\_t: return color mode register

## 19.11.4.15 nemadc\_set\_palette

Sets an entry in the lut8 Palatte Gamma table

## Arguments:

**int index:** Color Index **int colour:** 32-bit RGBA colour value or Gamma index

## **Return Value:**

void: void

## 19.11.4.16 nemadc\_get\_palette

Reads an entry from the lut8 Palatte Gamma table

## Arguments:

int index: Color Index

## Return Value:

int: Return Colour for given palette index

## 19.11.4.17 nemadc\_set\_layer\_gamma\_lut

Sets an entry in the lut8 Palatte Gamma table for a layer

## Arguments:

int layer: Layer to set the entry

int index: Color Index

int colour: 32-bit RGBA colour value or Gamma index

## Return Value:

void: void

## 19.11.4.18 nemadc\_get\_layer\_gamma\_lut

Sets an entry in the lut8 Palatte Gamma table for a layer

## Arguments:

int layer: layer to read Gammalut

int index: Color Index

## Return Value:

int: Return Pallete index

## 19.11.4.19 nemadc\_MIPI\_out

Send command or data to MIPI Interface

## Arguments:

int cmd: command

## **Return Value:**

void: void

## 19.11.4.20 nemadc\_MIPI\_CFG\_out

Send command or data to MIPI Interface

## Arguments:

int cfg: configuration mode

## **Return Value:**

void: void

## 19.11.4.21 nemadc\_MIPI\_in

Read data from MIPI Interface

## Arguments:

void: void

## **Return Value:**

void: void

## 19.11.4.22 nemadc\_MIPI\_updateregion

Does Partial Update in MIPI

## Arguments:

int start\_x: start x coordinate int start\_y: start y coordinate int end\_x: end x coordinate int end\_y: end y coordinate int mode: mode of operation

## Return Value:

void: void

## **19.12 GFX Library Functions**

This section provides an overview of the implemented functions of the GFX library.

## 19.12.1 nema\_blender.h File Reference

#### Enumerations

enum nema\_blend\_factors\_t {
 NEMA\_BF\_ZERO,
 NEMA\_BF\_ONE,
 NEMA\_BF\_SRCCOLOR,
 NEMA\_BF\_INVSRCCOLOR,
 NEMA\_BF\_SRCALPHA,
 NEMA\_BF\_INVSRCALPHA,
 NEMA\_BF\_DESTALPHA,
 NEMA\_BF\_INVDESTALPHA,
 NEMA\_BF\_INVDESTCOLOR,
 NEMA\_BF\_INVDESTCOLOR,
 NEMA\_BF\_CONSTCOLOR,
 NEMA\_BF\_CONSTALPHA }

enumnema\_blend\_mode\_t{
 NEMA\_BL\_SIMPLE,
 NEMA\_BL\_CLEAR,
 NEMA\_BL\_SRC,
 NEMA\_BL\_SRC\_OVER,
 NEMA\_BL\_DST\_OVER,
 NEMA\_BL\_DST\_OVER,
 NEMA\_BL\_DST\_IN,
 NEMA\_BL\_DST\_IN,
 NEMA\_BL\_SRC\_OUT,
 NEMA\_BL\_SRC\_OUT,
 NEMA\_BL\_DST\_OUT,
 NEMA\_BL\_DST\_OUT,
 NEMA\_BL\_DST\_ATOP,
 NEMA\_BL\_ADD,
 NEMA\_BL\_XOR }

enum nema\_blend\_op\_t {
 NEMA\_BLOP\_NONE,
 NEMA\_BLOP\_NO\_USE\_ROPBL,
 NEMA\_BLOP\_DST\_CKEY\_NEG,
 NEMA\_BLOP\_SRC\_PREMULT,
 NEMA\_BLOP\_MODULATE\_A,
 NEMA\_BLOP\_FORCE\_A,
 NEMA\_BLOP\_MODULATE\_RGB,
 NEMA\_BLOP\_SRC\_CKEY,
 NEMA\_BLOP\_DST\_CKEY,
 NEMA\_BLOP\_MASK }

## Functions

static uint32\_t nema\_blending\_mode (nema\_blend\_factors\_t src, nema\_blend\_factors\_t dst, nema\_blend\_op\_t ops)

Return blending mode given source and destination blending factors and additional blending operations.

void nema\_set\_blend (uint32\_tblending\_mode, nema\_tex\_tdst\_tex, nema\_tex\_tfg\_tex, nema\_tex\_t bg\_ex)

Set blending mode.

static void nema\_set\_blend\_fill (uint32\_t blending\_mode)

Set blending mode for filling.

- static void nema\_set\_blend\_fill\_compose (uint32\_t blending\_mode)
   Set blending mode for filling with composing.
- static void nema\_set\_blend\_blit (uint32\_t blending\_mode)

Set blending mode for blitting.

- static void nema\_set\_blend\_blit\_compose (uint32\_t blending\_mode)
  - Set blending mode for blitting with composing.
- void nema\_set\_const\_color (uint32\_t rgba)
   Set constant color.
- static void nema\_set\_src\_color\_key (uint32\_t rgba)
   Set source color key.
- void nema\_set\_dst\_color\_key (uint32\_t rgba) Set destination color key.

## 19.12.1.1 Enumeration Type Documentation

## 19.12.1.1.1 enum nema\_blend\_factors\_t

## Enumerator

NEMA\_BF\_ZERO 0 NEMA\_BF\_ONE 1 NEMA\_BF\_SRCCOLOR Sc NEMA\_BF\_INVSRCCOLOR (1-Sc) NEMA\_BF\_SRCALPHA Sa NEMA\_BF\_INVSRCALPHA (1-Sa) NEMA\_BF\_DESTALPHA Da NEMA\_BF\_INVDESTALPHA (1-Da) NEMA\_BF\_DESTCOLOR Dc NEMA\_BF\_INVDESTCOLOR (1-Dc) NEMA\_BF\_CONSTCOLOR Cc NEMA\_BF\_CONSTALPHA Ca

## 19.12.1.1.2 enum nema\_blend\_mode\_t

## Enumerator

| NEMA_BL_SIMPLE   | Sa * Sa+Da * (1 - Sa)        |
|------------------|------------------------------|
| NEMA_BL_CLEAR    | 0                            |
| NEMA_BL_SRC      | Sa                           |
| NEMA_BL_SRC_OVER | Sa + Da * (1-Sa)             |
| NEMA_BL_DST_OVER | Sa * (1-Da)+Da               |
| NEMA_BL_SRC_IN   | Sa * Da                      |
| NEMA_BL_DST_IN   | Da * Sa                      |
| NEMA_BL_SRC_OUT  | Sa * (1 - Da)                |
| NEMA_BL_DST_OUT  | Da * (1 - Sa)                |
| NEMA_BL_SRC_ATOP | Sa * Da + Da * (1 - Sa)      |
| NEMA_BL_DST_ATOP | Sa * (1 - Da) + Da * Sa      |
| NEMA_BL_ADD      | Sa + Da                      |
| NEMA_BL_XOR      | Sa * (1 - Da) + Da * (1 -Sa) |

## 19.12.1.1.3 enum nema\_blend\_op\_t

## Enumerator

• NEMA\_BLOP\_NONE

No extra blendingoperation

NEMA\_BLOP\_NO\_USE\_ROPBL

Don't use Rop Blender even if present

NEMA\_BLOP\_DST\_CKEY\_NEG

Apply Inverse Destination Color Keying-draw only when dst color doesn't match colorkey

• NEMA\_BLOP\_SRC\_PREMULT

Pre-multiply Source Color with Source Alpha (cannot be used with NEMA\_BLOP\_MODULATE\_RGB)

NEMA\_BLOP\_MODULATE\_A

Modulate by Constant Alpha value *NEMA\_BLOP\_FORCE\_A* Force Constant Alpha value *NEMA\_BLOP\_MODULATE\_RGB* Modulate by Constant Color (RGB) values

NEMA\_BLOP\_SRC\_CKEY

Apply Source Color Keying - draw only when src color doesn't match colorkey

• NEMA\_BLOP\_DST\_CKEY

Apply Destination Color Keying - draw only when dst colormatches colorkey

NEMA\_BLOP\_MASK

## 19.12.1.2 Function Documentation

## 19.12.1.2.1 static uint32\_t nema\_blending\_mode (nema\_blend\_factors\_t src, nema\_blend\_factors\_t dst, nema\_blend\_op\_t ops) [inline], [static]

Return blending mode given source and destination blending factors and additional blending operations.

Parameters

| SrC | Source Blending Factor         |
|-----|--------------------------------|
| dst | Destination Blending Factor    |
| ops | Additional Blending Operations |

Returns

Final Blending Mode

# 19.12.1.2.2 void nema\_set\_blend(uint32\_tblending\_mode, nema\_tex\_tdst\_tex, nema\_tex\_t fg\_tex, nema\_tex\_t bg\_tex)

Set blending mode.

| blending_mode | Blending mode to be set      |
|---------------|------------------------------|
| dst_tex       | Destination Texture          |
| fg_tex        | Foreground (source) Texture  |
| bg_tex        | Background (source2) Texture |

# 19.12.1.2.3 static void nema\_set\_blend\_blit (uint32\_t blending\_mode) [inline], [static]

Set blending mode for blitting.

#### Parameters

blending\_mode Blending mode to be set

## 19.12.1.2.4 static void nema\_set\_blend\_blit\_compose (uint32\_t blending\_mode) [inline], [static]

Set blending mode for blitting with composing.

Parameters

blending\_mode Blending mode to be set

# 19.12.1.2.5 static void nema\_set\_blend\_fill (uint32\_t blending\_mode) [inline], [static]

Set blending mode for filling.

Parameters

blending\_mode Blending mode to be set

## 19.12.1.2.6 static void nema\_set\_blend\_fill\_compose (uint32\_t blending\_mode) [inline], [static]

Set blending mode for filling with composing.

Parameters

blending\_mode Blending mode to be set

## 19.12.1.2.7 void nema\_set\_const\_color (uint32\_t rgba)

Set constant color.

Parameters

rgba | RGBA color

See Also

nema\_rgba()

## 19.12.1.2.8 void nema\_set\_dst\_color\_key (uint32\_t rgba)

Set destination color key.

Parameters

rgba RGBA color key

See Also nema\_rgba()

## 19.12.1.2.9 static void nema\_set\_src\_color\_key (uint32\_t rgba) [inline], [static]

Set source color key.

Parameters

rgba RGBA color key

See Also

nema\_rgba()

## 19.12.1.2.10 nema\_cmdlist.h File Reference

### Data Structures

struct nema\_cmdlist\_t

#### Functions

nema\_cmdlist\_t nema\_cl\_create\_prealloc (nema\_buffer\_t \*bo)

Create a new Command List into a preallocated space.

- nema\_cmdlist\_t nema\_cl\_create\_sized (uint32\_t size\_bytes)
  - Create a new, non expandable Command List of specific size.
- nema\_cmdlist\_t nema\_cl\_create (void)
  - Create a new expandable Command List.
- void nema\_cl\_destroy (nema\_cmdlist\_t \*cl)
   Destroy/Free a Command List.
- void nema\_cl\_rewind (nema\_cmdlist\_t \*cl)

Reset position of next command to be written to the beginning. Doesn't clear the List's contents.

void nema\_cl\_bind (nema\_cmdlist\_t \*cl)

Define in which Command List each subsequent commands are going to be inserted.

void nema\_cl\_unbind (void)

Unbind current bound Command List, if any.

void nema\_cl\_submit (nema\_cmdlist\_t \*cl)

Enqueue Command List to the Ring Buffer for execution.

void nema\_cl\_wait (nema\_cmdlist\_t \*cl)

Wait for Command List to finish.

• int nema\_cl\_add\_cmd (uint32\_t reg, uint32\_t data)

Add a command to the bound Command List.

int nema\_cl\_add\_multiple\_cmds (int cmd\_no, uint32\_t \*cmd)

Add multiple commands to the bound Command List.

- uint32\_t \* nema\_cl\_get\_space (int cmd\_no)
- void nema\_cl\_branch (nema\_cmdlist\_t \*cl)

Branch from the bound Command List to a different one. Return is implied.

void nema\_cl\_jump (nema\_cmdlist\_t \*cl)

Jump from the bound Command List to a different one. No return is implied.

void nema\_cl\_return (void)

Add an explicit return command to the bound Command List.

## 19.12.1.3 Function Documentation

## 19.12.1.3.1 int nema\_cl\_add\_cmd (uint32\_t reg, uint32\_t data)

Add a command to the bound Command List.

Parameters

| reg  | Hardware register to be written |
|------|---------------------------------|
| data | Data to be written              |

#### Returns

0 if no error has occurred

## 19.12.1.3.2 int nema\_cl\_add\_multiple\_cmds (int cmd\_no, uint32\_t \* cmd)

Add multiple commands to the bound Command List.

Parameters

| cmd_no | Numbers of commands to add          |
|--------|-------------------------------------|
| cmd    | Pointer to the commands to be added |

#### Returns

0 if no error has occurred

## 19.12.1.3.3 void nema\_cl\_bind (nema\_cmdlist\_t \* cl)

Define in which Command List each subsequent commands are going to be inserted.

Parameters

*cl* Pointer to the Command List

## 19.12.1.3.4 void nema\_cl\_branch (nema\_cmdlist\_t \* cl)

Branch from the bound Command List to a different one. Return is implied.

Parameters

cl Pointer to the Command List to branch to

## 19.12.1.3.5 nema\_cmdlist\_t nema\_cl\_create (void)

Create a new expandable Command List. Returns

The instance of the new Command List

## 19.12.1.3.6 nema\_cmdlist\_t nema\_cl\_create\_prealloc (nema\_buffer\_t \* bo)

Create a new Command List into a preallocated space.

Parameters

| addr_virt  | Command List's address (preallocated) |
|------------|---------------------------------------|
| size_bytes | Command List's size in bytes          |

Returns

The instance of the new Command List

## 19.12.1.3.7 nema\_cmdlist\_t nema\_cl\_create\_sized (uint32\_t size\_bytes)

Create a new, non expandable Command List of specific size.

Parameters

size\_bytes Command List's size in bytes

Returns

The instance of the new Command List

## 19.12.1.3.8 void nema\_cl\_destroy (nema\_cmdlist\_t \* cl)

Destroy/Free a Command List.

Parameters

cl Pointer to the Command List

## 19.12.1.3.9 uint32\_t\* nema\_cl\_get\_space (int cmd\_no)

private

## 19.12.1.3.10 void nema\_cl\_jump (nema\_cmdlist\_t \* cl)

Jump from the bound Command List to a different one. No return is implied.

Parameters

cl Pointer to the Command List to jump to

## 19.12.1.3.11 void nema\_cl\_return(void)

Add an explicit return command to the bound Command List.

## 19.12.1.3.12 void nema\_cl\_rewind (nema\_cmdlist\_t \* cl)

Reset position of next command to be written to the beginning. Doesn't clear the List's contents.

Parameters

*cl* Pointer to the Command List

## 19.12.1.3.13 void nema\_cl\_submit (nema\_cmdlist\_t \* cl)

Enqueue Command List to the Ring Buffer for execution.

Parameters

*cl* Pointer to the Command List

## 19.12.1.3.14 void nema\_cl\_unbind (void)

Unbind current bound Command List, if any.

## 19.12.1.3.15 void nema\_cl\_wait (nema\_cmdlist\_t \* cl)

Wait for Command List to finish.

Parameters

cl Pointer to the Command List

## 19.12.1.3.16 nema\_easing.h File Reference

## Functions

- float nema\_ez\_linear (float p)
  - Linear easing, no acceleration.
- float nema\_ez\_quad\_in (float p)
   Quadratic easing in, accelerate from zero.
- float nema\_ez\_quad\_out (float p)

Quadratic easing out, decelerate to zero velocity.

float nema\_ez\_quad\_in\_out (float p)

Quadratic easing in and out, accelerate to halfway, then decelerate.

float nema\_ez\_cub\_in (float p)

Cubic easing in, accelerate from zero.

float nema\_ez\_cub\_out (float p)

Cubic easing out, decelerate to zero velocity.

float nema\_ez\_cub\_in\_out (float p)

Cubic easing in and out, accelerate to halfway, then decelerate.

float nema\_ez\_quar\_in (float p)

Quartic easing in, accelerate from zero.

float nema\_ez\_quar\_out (float p)

Quartic easing out, decelerate to zero velocity.

float nema\_ez\_quar\_in\_out (float p)

Quartic easing in and out, accelerate to halfway, then decelerate.

float nema\_ez\_quin\_in (float p)

Quintic easing in, accelerate from zero.

float nema\_ez\_quin\_out (float p)

Quintic easing out, decelerate to zero velocity.

float nema\_ez\_quin\_in\_out (float p)

Quintic easing in and out, accelerate to halfway, then decelerate.

float nema\_ez\_sin\_in (float p)

Sinusoidal easing in, accelerate from zero.

float nema\_ez\_sin\_out (float p)

Sinusoidal easing out, decelerate to zero velocity.

float nema\_ez\_sin\_in\_out (float p)

Sinusoidal easing in and out, accelerate to halfway, then decelerate.

float nema\_ez\_circ\_in (float p)

Circular easing in, accelerate from zero.

float nema\_ez\_circ\_out (float p)

Circular easing out, decelerate to zero velocity.

float nema\_ez\_circ\_in\_out (float p)

Circular easing in and out, accelerate to halfway, then decelerate.

float nema\_ez\_exp\_in (float p)

Exponential easing in, accelerate from zero.

float nema\_ez\_exp\_out (float p)

Exponential easing out, decelerate to zero velocity.

float nema\_ez\_exp\_in\_out (float p)

Exponential easing in and out, accelerate to halfway, then decelerate.

- float nema\_ez\_elast\_in (float p)
   Elastic easing in, accelerate from zero.
- float nema\_ez\_elast\_out (float p)

Elastic easing out, decelerate to zero velocity.

float nema\_ez\_elast\_in\_out (float p)

Elastic easing in and out, accelerate to halfway, then decelerate.

float nema\_ez\_back\_in (float p)

Overshooting easing in, accelerate from zero.

float nema\_ez\_back\_out (float p)

Overshooting easing out, decelerate to zero velocity.

float nema\_ez\_back\_in\_out (float p)

Overshooting easing in and out, accelerate to halfway, then decelerate.

float nema\_ez\_bounce\_out (float p)

Bouncing easing in, accelerate from zero.

float nema\_ez\_bounce\_in (float p)

Bouncing easing out, decelerate to zero velocity.

float nema\_ez\_bounce\_in\_out (float p)

Bouncing easing in and out, accelerate to halfway, then decelerate.

float nema\_ez (float A, float B, float steps, float cur\_step, float(\*ez\_func)(float p))

Convenience function to perform easing between two values given number of steps, current step and easing function.

## 19.12.1.4 Function Documentation

## 19.12.1.4.1 float nema\_ez (float A,float B,float steps,float cur\_step,float()(float p)ez\_func)

Convenience function to perform easing between two values given number of steps, current step and easing function.

2

### Parameters

| A        | Initial value within range [0, 1]      |
|----------|----------------------------------------|
| В        | Finale value within range [0, 1]       |
| steps    | Total number of steps                  |
| cur_step | Current Step                           |
| ez_func  | pointer to the desired easing function |

Returns

Eased value

## 19.12.1.4.2 float nema\_ez\_back\_in (float p)

Overshooting easing in, accelerate from zero.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.3 float nema\_ez\_back\_in\_out (float p)

Overshooting easing in and out, accelerate to halfway, then decelerate.

Parameters

| p | Input value, typically within the [0, 1] range |
|---|------------------------------------------------|

Returns

Eased value

## 19.12.1.4.4 float nema\_ez\_back\_out (float p)

Overshooting easing out, decelerate to zero velocity.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.5 float nema\_ez\_bounce\_in (float p)

Bouncing easing out, decelerate to zero velocity.

Parameters

p Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.6 float nema\_ez\_bounce\_in\_out (float p)

Bouncing easing in and out, accelerate to halfway, then decelerate.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.7 float nema\_ez\_bounce\_out (float p)

Bouncing easing in, accelerate from zero.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.8 float nema\_ez\_circ\_in (float p)

Circular easing in, accelerate from zero.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.9 float nema\_ez\_circ\_in\_out (float p)

Circular easing in and out, accelerate to halfway, then decelerate. Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.10 float nema\_ez\_circ\_out (float p)

Circular easing out, decelerate to zero velocity.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.11 float nema\_ez\_cub\_in (float p)

Cubic easing in, accelerate from zero.

Parameters

*p* Input value, typically within the [0, 1] range

Returns Eased value

## 19.12.1.4.12 float nema\_ez\_cub\_in\_out (float p)

Cubic easing in and out, accelerate to halfway, then decelerate.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.13 float nema\_ez\_cub\_out (float p)

Cubic easing out, decelerate to zero velocity.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.14 float nema\_ez\_elast\_in (float p)

Elastic easing in, accelerate from zero.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.15 float nema\_ez\_elast\_in\_out (float p)

Elastic easing in and out, accelerate to halfway, then decelerate.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.16 float nema\_ez\_elast\_out (float p)

Elastic easing out, decelerate to zero velocity.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.17 float nema\_ez\_exp\_in (float p)

Exponential easing in, accelerate from zero.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.18 float nema\_ez\_exp\_in\_out (float p)

Exponential easing in and out, accelerate to halfway, then decelerate.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.19 float nema\_ez\_exp\_out (float p)

Exponential easing out, decelerate to zero velocity.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.20 float nema\_ez\_linear (float p)

Linear easing, no acceleration.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.21 float nema\_ez\_quad\_in (float p)

Quadratic easing in, accelerate from zero.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.22 float nema\_ez\_quad\_in\_out (float p)

Quadratic easing in and out, accelerate to halfway, then decelerate.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.23 float nema\_ez\_quad\_out (float p)

Quadratic easing out, decelerate to zero velocity.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.24 float nema\_ez\_quar\_in (float p)

Quartic easing in, accelerate from zero.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.25 float nema\_ez\_quar\_in\_out (float p)

Quartic easing in and out, accelerate to halfway, then decelerate.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.26 float nema\_ez\_quar\_out (float p)

Quartic easing out, decelerate to zero velocity.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.27 float nema\_ez\_quin\_in (float p)

Quintic easing in, accelerate from zero.

Parameters

p Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.28 float nema\_ez\_quin\_in\_out (float p)

Quintic easing in and out, accelerate to halfway, then decelerate.

Parameters

p Input value, typically within the [0, 1] range

Returns Eased value

## 19.12.1.4.29 float nema\_ez\_quin\_out (float p)

Quintic easing out, decelerate to zero velocity.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.30 float nema\_ez\_sin\_in (float p)

Sinusoidal easing in, accelerate from zero

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.31 float nema\_ez\_sin\_in\_out (float p)

Sinusoidal easing in and out, accelerate to halfway, then decelerate.

Parameters

*p* Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.1.4.32 float nema\_ez\_sin\_out (float p)

Sinusoidal easing out, decelerate to zero velocity.

Parameters

p Input value, typically within the [0, 1] range

Returns

Eased value

## 19.12.2 nema\_font.h File Reference

#### Data Structures

- struct nema\_glyph\_t
- struct nema\_font\_t

#### Macros

- #define NEMA\_ALIGNX\_CENTER Align horizontally centered
- #define NEMA\_ALIGNX\_JUSTIFY Justify horizontally
- #define NEMA\_ALIGNX\_LEFT Align horizontally to the left
- #define NEMA\_ALIGNX\_MASK Horizontal alignment mask
- #define NEMA\_ALIGNX\_RIGHT Align horizontally to the right
- #define NEMA\_ALIGNY\_BOTTOM Align vertically to the bottom
- #define NEMA\_ALIGNY\_CENTER Align vertically centered
- #define NEMA\_ALIGNY\_JUSTIFY Justify vertically
- #define NEMA\_ALIGNY\_MASK Vertical alignment mask
- #define NEMA\_ALIGNY\_TOP Align vertically to the top
- #define NEMA\_TEXT\_WRAP Use text wrapping

## Functions

void nema\_bind\_font (nema\_font\_t \*font)

Bind the font to use in future nema\_print() calls.

- int nema\_string\_get\_bbox (char \*str, int \*w, int \*h, int max\_w)
   Get the bounding box's width and height of a string.
- void nema\_print (char \*str, int \*x, int \*y, int w, int h, uint32\_tfg\_col, uint32\_t align)
   Print pre-formatted text.

## 19.12.2.1 Function Documentation

## 19.12.2.1.1 void nema\_bind\_font (nema\_font\_t \* font)

Bind the font to use in future nema\_print() calls.

Parameters

font Pointer to font

# 19.12.2.1.2 void nema\_print (char \* str, int \* x, int \* y, int w, int h, uint32\_t fg\_col, uint32\_t align)

Print pre-formatted text.

Parameters

| str    | Pointer to string                                                                  |
|--------|------------------------------------------------------------------------------------|
| X      | X coordinate of the starting point.                                                |
|        | After execution, x contains the starting point for the next string to be drawn at. |
| У      | Y coordinate of the starting point.                                                |
|        | After execution, y contains the starting point for the next string to be drawn at. |
| W      | Width of the drawing area                                                          |
| h      | Height of the drawing area                                                         |
| fg_col | Foreground color of text                                                           |
| align  | Alignment and wrapping mode                                                        |

## 19.12.2.1.3 int nema\_string\_get\_bbox (char \* str, int \* w, int \* h, int max\_w)

Get the bounding box's width and height of a string.

| str                                                  | Pointer to string                                 |
|------------------------------------------------------|---------------------------------------------------|
| W                                                    | Pointer to variable where width should be written |
| h Pointer to variable where height should be writter |                                                   |
| max_w                                                | Max allowed width                                 |

Returns

Number of carriage returns

## 19.12.3 nema\_graphics.h File Reference

### Enumerations

enum nema\_rotation\_t {
 NEMA\_ROT\_000\_CCW,
 NEMA\_ROT\_090\_CCW,
 NEMA\_ROT\_180\_CCW,
 NEMA\_ROT\_270\_CCW,
 NEMA\_ROT\_000\_CW,
 NEMA\_ROT\_270\_CW,
 NEMA\_ROT\_180\_CW,
 NEMA\_ROT\_090\_CW,
 NEMA\_ROT\_090\_CW,
 NEMA\_MIR\_VERT,
 NEMA\_MIR\_HOR }

## Functions

void nema\_emulate\_p (void)

Emulate GPU functionality when using the GFX fragment shaders (blending modes).

- uint32\_t nema\_rgba (unsigned char R, unsigned char G, unsigned char B, unsigned char A) Return Nema internal RGBA color.
- uint32\_t nema\_premultiply\_rgba (uint32\_t rgba)

Pre-multiply RGB channels with Alpha channel.

int nema\_init (void)

Initialize NemaGFX library.

void nema\_bind\_src\_tex (uint32\_t baseaddr\_phys, uint32\_t width, uint32\_t height, nema\_tex\_- format\_t format, int32\_t stride, nema\_tex\_mode\_t mode)

Program Texture Unit with a foreground (source) texture (NEMA\_TEX1)

void nema\_bind\_src2\_tex (uint32\_t baseaddr\_phys, uint32\_t width, uint32\_t height, nema\_tex\_- format\_t format, int32\_t stride, nema\_tex\_mode\_t mode)

Program Texture Unit with a background texture ((NEMA\_TEX2)

void nema\_bind\_dst\_tex (uint32\_t baseaddr\_phys, uint32\_t width, uint32\_t height, nema\_tex\_- format\_t format, int32\_t stride)

Program Texture Unit with a destination texture (NEMA\_TEX0)

void nema\_bind\_depth\_buffer (uint32\_t baseaddr\_phys, uint32\_t width, uint32\_t height)

Bind Depth Buffer.

- void nema\_clear (uint32\_t rgba8888)
  - Clear destination texture with color.
- void nema\_clear\_depth (uint32\_t val)
  - Clear depth buffer with specified value.
- void nema\_draw\_line (int x0, int y0, int x1, int y1, uint32\_t rgba8888)
   Draw a colored line.
- void nema\_draw\_circle (int x0, int y0, int r, uint32\_t rgba8888)
   Draw a colored circle.
- void nema\_draw\_rounded\_rect (int x0, int y0, int w, int h, int r, uint32\_t rgba8888)
   Draw a colored rectangle with rounded edges.
- void nema\_draw\_rect (int x, int y, int w, int h, uint32\_t rgba8888)
   Draw a colored rectangle.
- void nema\_fill\_circle (int x, int y, int r, uint32\_trgba8888)
   Fill a circle with color.
- void nema\_fill\_triangle (intx0, inty0, intx1, inty1, intx2, inty2, uint32\_trgba8888)
   Fill a triangle with color.
- void nema\_fill\_rounded\_rect (int x0, int y0, int w, int h, int r, uint32\_t rgba8888)
   Fill a rectangle with rounded edges with color.
- void nema\_fill\_rect (int x, int y, int w, int h, uint32\_t rgba8888)
   Fill a rectangle with color.
- void nema\_fill\_quad (int x0, int y0, int x1, int y1, int x2, int y2, int x3, int y3, uint32\_trgba8888)
   Fill a quadrilateral with color.
- void nema\_blit (int x, int y)

Blit source texture to destination texture.

- void nema\_blit\_rect (int x, int y, int w, int h)
  - Blit source texture to destination's specified rectangle (crop or wrap when needed)
- void nema\_blit\_rect\_fit (int x, int y, int w, int h)

Blit source texture to destination. Fit (scale) texture to specified rectangle.

void nema\_blit\_rotate (int x, int y, nema\_rotation\_trotation)

Rotate and Blit source texture to destination.

- void nema\_blit\_rotate\_partial (int sx, int sy, int sw, int sh, int x, int y, nema\_rotation\_t rotation)
   Rotate and Blit partial source texture to destination.
- void nema\_blit\_quad\_fit (float dx0, float dy0, float dx1, float dy1, float dx2, float dy2, float dx3, float dy3)
   Blit source texture to destination. Fit texture to specified quadrilateral.

## 19.12.3.1 Enumeration Type Documentation

## 19.12.3.1.1 enum nema\_rotation\_t

### Enumerator

| NEMA_ROT_000_CCW | No rotation                          |
|------------------|--------------------------------------|
| NEMA_ROT_090_CCW | Rotate 90 degrees counter-clockwise  |
| NEMA_ROT_180_CCW | Rotate 180 degrees counter-clockwise |
| NEMA_ROT_270_CCW | Rotate 270 degrees counter-clockwise |
| NEMA_ROT_000_CW  | No rotation                          |
| NEMA_ROT_270_CW  | Rotate 270 degrees clockwise         |
| NEMA_ROT_180_CW  | Rotate 180 degrees clockwise         |
| NEMA_ROT_090_CW  | Rotate 90 degrees clockwise          |
| NEMA_MIR_VERT    | Mirror Vertically                    |
| NEMA_MIR_HOR     | Mirror Horizontally                  |

## 19.12.3.2 Function Documentation

# 19.12.3.2.1 void nema\_bind\_depth\_buffer (uint32\_t baseaddr\_phys, uint32\_t width, uint32\_t height)

Bind Depth Buffer.

Parameters

| baseaddr_phys | Address of the depth buffer, as seen by the GPU |
|---------------|-------------------------------------------------|
| width         | Buffer width                                    |
| height        | Buffer hight                                    |

# 19.12.3.2.2 void nema\_bind\_dst\_tex (uint32\_t baseaddr\_phys,uint32\_t width,uint32\_t height, nema\_tex\_format\_t format, int32\_t stride)

Program Texture Unit with a destination texture (NEMA\_TEX0)

| baseaddr_phys | Address of the destination texture, as seen by the GPU   |
|---------------|----------------------------------------------------------|
| width         | Texture width                                            |
| height        | Texture hight                                            |
| format        | Texture format                                           |
| stride        | Texture stride. If negative, it's calculated internally. |

## 19.12.3.2.3 void nema\_bind\_src2\_tex (uint32\_t baseaddr\_phys,uint32\_t width,uint32\_t height, nema\_tex\_format\_t format,int32\_t stride,nema\_tex\_mode\_t mode)

Program Texture Unit with a background texture ((NEMA\_TEX2)

#### Parameters

| baseaddr_phys | Address of the source2 texture, as seen by the GPU       |
|---------------|----------------------------------------------------------|
| width         | Texture width                                            |
| height        | Texture hight                                            |
| format        | Texture format                                           |
| stride        | Texture stride. If negative, it's calculated internally. |
| mode          | Wrapping and Filtering mode                              |

## 19.12.3.2.4 void nema\_bind\_src\_tex (uint32\_t baseaddr\_phys,uint32\_t width,uint32\_t height, nema\_tex\_format\_t format,int32\_t stride,nema\_tex\_mode\_t mode)

Program Texture Unit with a foreground (source) texture (NEMA\_TEX1)

Parameters

| baseaddr_phys | Address of the source texture, as seen by the GPU        |
|---------------|----------------------------------------------------------|
| width         | Texture width                                            |
| height        | Texture hight                                            |
| format        | Texture format                                           |
| stride        | Texture stride. If negative, it's calculated internally. |
| mode          | Wrapping and Filtering mode                              |

## 19.12.3.2.5 void nema\_blit (int x, int y)

Blit source texture to destination texture.

Parameters

| X | destination x coordinate |
|---|--------------------------|
| У | destination y coordinate |

See Also

nema\_set\_blend\_fill()

# 19.12.3.2.6 void nema\_blit\_quad\_fit (float dx0,float dy0,float dx1,float dy1,float dx2, float dy2, float dx3, float dy3)

Blit source texture to destination. Fit texture to specified quadrilateral.

| dx0 | x coordinate at the first vertex of the quadrilateral |
|-----|-------------------------------------------------------|
| dy0 | y coordinate at the first vertex of the quadrilateral |

| dx1 | x coordinate at the second vertex of the quadrilateral |
|-----|--------------------------------------------------------|
| dy1 | y coordinate at the second vertex of the quadrilateral |
| dx2 | x coordinate at the third vertex of the quadrilateral  |
| dy2 | y coordinate at the third vertex of the quadrilateral  |
| dx3 | x coordinate at the fourth vertex of the quadrilateral |
| dy3 | y coordinate at the fourth vertex of the quadrilateral |

See Also

nema\_set\_blend\_blit()

## 19.12.3.2.7 void nema\_blit\_rect (int x, int y, int w, int h)

Blit source texture to destination's specified rectangle (crop or wrap when needed)

Parameters

| X | destination x coordinate |
|---|--------------------------|
| У | destination y coordinate |
| W | destination width        |
| h | destination height       |

See Also

nema\_set\_blend\_blit()

## 19.12.3.2.8 void nema\_blit\_rect\_fit (int x, int y, int w, int h)

Blit source texture to destination. Fit (scale) texture to specified rectangle.

Parameters

| Х | destination x coordinate |
|---|--------------------------|
| У | destination y coordinate |
| W | destination width        |
| h | destination height       |

See Also nema\_set\_blend\_blit()

## 19.12.3.2.9 void nema\_blit\_rotate (int x, int y, nema\_rotation\_t rotation)

Rotate and Blit source texture to destination.

| X        | destination x coordinate |
|----------|--------------------------|
| У        | destination y coordinate |
| rotation | Rotation to be done      |

See Also

nema\_set\_blend\_blit()

## 19.12.3.2.10 void nema\_blit\_rotate\_partial (int sx,int sy,int sw,int sh,int x,int y, nema\_rotation\_t rotation)

Rotate and Blit partial source texture to destination.

Parameters

| SX       | source upper left x coordinate  |
|----------|---------------------------------|
| sy       | source upper left y coordinate  |
| SW       | source width of partial region  |
| sh       | source height of partial region |
| X        | destination x coordinate        |
| У        | destination y coordinate        |
| rotation | Rotation to be done             |

See Also

nema\_set\_blend\_blit()

## 19.12.3.2.11 void nema\_clear (uint32\_t rgba8888)

Clear destination texture with color.

Parameters

rgba8888 32-bit RGBA color

See Also

nema\_rgba()

## 19.12.3.2.12 void nema\_clear\_depth (uint32\_t val)

Clear depth buffer with specified value.

Parameters

val Clear value

## 19.12.3.2.13 void nema\_draw\_circle (int x0, int y0, int r, uint32\_t rgba8888)

Draw a colored circle.
| хO       | x coordinate of the circle's center |
|----------|-------------------------------------|
| y0       | y coordinate of the circle's center |
| r        | circle's radius                     |
| raba8888 | Color to be used                    |

See Also

nema\_set\_blend\_fill() nema\_rgba()

# 19.12.3.2.14 void nema\_draw\_line (int x0, int y0, int x1, int y1, uint32\_t rgba8888)

Draw a colored line.

Parameters

| xO       | x coordinate at the beginning of the line |
|----------|-------------------------------------------|
| y0       | y coordinate at the beginning of the line |
| x1       | x coordinate at the end of the line       |
| y1       | y coordinate at the end of the line       |
| rgba8888 | Color to be used                          |

See Also

nema\_set\_blend\_fill() nema\_rgba()

#### 19.12.3.2.15 void nema\_draw\_rect (int x, int y, int w, int h, uint32\_t rgba8888)

Draw a colored rectangle.

Parameters

| Х        | x coordinate of the upper left vertex of the rectangle |
|----------|--------------------------------------------------------|
| У        | y coordinate at the upper left vertex of the rectangle |
| W        | width of the rectangle                                 |
| h        | height of the rectangle                                |
| rgba8888 | Color to be used                                       |

See Also

nema\_set\_blend\_fill() nema\_rgba()

void nema\_draw\_rounded\_rect (int x0,int y0,int w,int h,int r,uint32\_t rgba8888)

Draw a colored rectangle with rounded edges.

| x0 x coordinate of the upper left vertex of the rectangle |
|-----------------------------------------------------------|
|-----------------------------------------------------------|

| уО       | y coordinate at the upper left vertex of the rectangle |
|----------|--------------------------------------------------------|
| W        | width of the rectangle                                 |
| h        | height of the rectangle                                |
| r        | corner radius                                          |
| rgba8888 | Color to be used                                       |

See Also

nema\_set\_blend\_fill() nema\_rgba()

# 19.12.3.2.16 void nema\_emulate\_p(void)

Emulate Nema|P functionality when using NemaGFX fragment shaders (blending modes) .

# 19.12.3.2.17 void nema\_fill\_circle (int x, int y, int r, uint32\_t rgba8888)

Fill a circle with color.

Parameters

| X        | x coordinate of the circle's center |
|----------|-------------------------------------|
| У        | y coordinate of the circle's center |
| r        | circle's radius                     |
| rgba8888 | Color to be used                    |

See Also

nema\_set\_blend\_fill() nema\_rgba()

# 19.12.3.2.18 void nema\_fill\_quad (int x0, int y0, int x1, int y1, int x2, int y2, int x3, int y3, uint32\_t rgba8888)

Fill a quadrilateral with color.

Parameters

| х0       | x coordinate at the first vertex of the quadrilateral  |
|----------|--------------------------------------------------------|
| y0       | y coordinate at the first vertex of the quadrilateral  |
| x1       | x coordinate at the second vertex of the quadrilateral |
| y1       | y coordinate at the second vertex of the quadrilateral |
| x2       | x coordinate at the third vertex of the quadrilateral  |
| y2       | y coordinate at the third vertex of the quadrilateral  |
| х3       | x coordinate at the fourth vertex of the quadrilateral |
| у3       | y coordinate at the fourth vertex of the quadrilateral |
| rgba8888 | Color to be used                                       |

See Also

nema\_set\_blend\_fill() nema\_rgba()

#### 19.12.3.2.19 void nema\_fill\_rect (int x, int y, int w, int h, uint32\_t rgba8888)

Fill a rectangle with color.

Parameters

| Х        | x coordinate of the upper left vertex of the rectangle |
|----------|--------------------------------------------------------|
| У        | y coordinate at the upper left vertex of the rectangle |
| W        | width of the rectangle                                 |
| h        | height of the rectangle                                |
| rgba8888 | Color to be used                                       |

See Also

nema\_set\_blend\_fill() nema\_rgba()

#### 19.12.3.2.20 void nema\_fill\_rounded\_rect (int x0, int y0, int w, int h, int r, uint32\_trgba8888)

Fill a rectangle with rounded edges with color.

#### Parameters

| h        | height of the rectangle |
|----------|-------------------------|
| r        | corner radius           |
| rgba8888 | Color to be used        |

#### See Also

nema\_set\_blend\_fill() nema\_rgba()

# 19.12.3.2.21 void nema\_fill\_triangle (int x0, int y0, int x1, int y1, int x2, int y2, uint32\_t rgba8888)

Fill a triangle with color.

Parameters

| хO       | x coordinate at the first vertex of the triangle  |
|----------|---------------------------------------------------|
| y0       | y coordinate at the first vertex of the triangle  |
| x1       | x coordinate at the second vertex of the triangle |
| y1       | y coordinate at the second vertex of the triangle |
| x2       | x coordinate at the third vertex of the triangle  |
| y2       | y coordinate at the third vertex of the triangle  |
| rgba8888 | Color to be used                                  |

See Also

nema\_set\_blend\_fill()

# 19.12.3.2.22 int nema\_init (void)

Initialize NemaGFX library.

Returns

-1 on error

# 19.12.3.2.23 uint32\_t nema\_premultiply\_rgba (uint32\_t rgba)

Pre-multiply RGB channels with Alpha channel.

Parameters

rgba RGBA color

Returns

Pre-multiplied RGBA color

# 19.12.3.2.24 uint32\_tnema\_rgba(unsigned char R, unsigned char G, unsigned char B, unsigned char A)

Return Nema internal RGBA color.

Parameters

| R | Red component   |
|---|-----------------|
| G | Green component |
| В | Blue component  |
| A | Alpha component |

Returns

RGBA value

# 19.12.4 nema\_hal.h File Reference

#### **Data Structures**

struct nema\_buffer\_t

#### Macros

- #define MUTEX\_RB
- #define MUTEX\_MALLOC

# Functions

- int32\_t nema\_sys\_init (void)
  Initialize system. Implementor defined. Called in nema\_init().
- void nema\_wait\_irq (void)
  Wait for interrupt from the GPU.
- void nema\_wait\_irq\_cl (int cl\_id)
  Wait for a Command List to finish.
- uint32\_t nema\_reg\_read (uint32\_t reg) Read Hardware register.
- void nema\_reg\_write (uint32\_t reg, uint32\_t value)
  Write Hardware Register.
- nema\_buffer\_t nema\_buffer\_create (unsigned size)
  Create memory buffer.
- nema\_buffer\_t nema\_buffer\_create\_pool (int pool, unsigned size)
  Create memory buffer at a specific pool.
- void \* nema\_buffer\_map (nema\_buffer\_t \*bo)
  Maps buffer.
- void nema\_buffer\_unmap (nema\_buffer\_t \*bo)
  Unmaps buffer.
- void nema\_buffer\_destroy (nema\_buffer\_t \*bo)
  Destroy/deallocate buffer.
- uint32\_t nema\_buffer\_phys (nema\_buffer\_t \*bo)
  Get physical (GPU) base address of a given buffer.
- void \* nema\_host\_malloc (unsigned size)

Allocate memory for CPU to use (typically, standard malloc() is called)

- void nema\_host\_free (void \*ptr)
  - Free memory previously allocated with nema\_host\_malloc()
- int nema\_mutex\_lock (int mutex\_id)

Mutex Lock for multiple processes/threads.

int nema\_mutex\_unlock (int mutex\_id)
 Mutex Unlock for multiple processes/threads.

# 19.12.4.1 Function Documentation

# 19.12.4.1.1 nema\_buffer\_t nema\_buffer\_create (unsigned size)

Create memory buffer.

size Size of buffer in bytes

Returns

nema\_buffer\_t struct

# 19.12.4.1.2 nema\_buffer\_t nema\_buffer\_create\_pool (int pool, unsigned size)

Create memory buffer at a specific pool.

Parameters

| pool | ID of the desired memory pool |
|------|-------------------------------|
| size | Size of buffer in bytes       |

Returns nema\_buffer\_t struct

# 19.12.4.1.3 void nema\_buffer\_destroy (nema\_buffer\_t \* bo)

Destroy/deallocate buffer.

Parameters

*bo* Pointer to buffer struct

Returns

void

# 19.12.4.1.4 void\* nema\_buffer\_map (nema\_buffer\_t \* bo)

Maps buffer.

Parameters

bo Pointer to buffer struct

Returns

Virtual pointer of the buffer (same as in bo->base\_virt)

# 19.12.4.1.5 uint32\_t nema\_buffer\_phys (nema\_buffer\_t \* bo)

Get physical (GPU) base address of a given buffer.

bo Pointer to buffer struct

Returns

Physical base address of a given buffer

# 19.12.4.1.6 void nema\_buffer\_unmap (nema\_buffer\_t \* bo)

Unmaps buffer.

Parameters

bo Pointer to buffer struct

Returns

void

# 19.12.4.1.7 void nema\_host\_free (void \* ptr)

Free memory previously allocated with nema\_host\_malloc()

Parameters

*ptr* Pointer to allocated memory (virtual)

Returns

void

See Also nema\_host\_malloc()

# 19.12.4.1.8 void\* nema\_host\_malloc (unsigned size)

Allocate memory for CPU to use (typically, standard malloc() is called)

Parameters

size Size in bytes

Returns

Pointer to allocated memory (virtual)

See Also nema\_host\_free()

# 19.12.4.1.9 int nema\_mutex\_lock (int mutex\_id)

Mutex Lock for multiple processes/threads.

MUTEX\_RB | or MUTEX\_MALLOC

Returns

int

#### 19.12.4.1.10 int nema\_mutex\_unlock (int mutex\_id)

Mutex Unlock for multiple processes/threads.

Parameters

MUTEX\_RB or MUTEX MALLOC

Returns

int

#### 19.12.4.1.11 uint32\_t nema\_reg\_read (uint32\_t reg)

Read Hardware register.

Parameters

reg Register to read

Returns Value read from the register See Also nema\_reg\_write

#### 19.12.4.1.12 void nema\_reg\_write (uint32\_t reg, uint32\_t value)

Write Hardware Register.

Parameters

| reg   | Register to write   |  |
|-------|---------------------|--|
| value | Value to be written |  |

Returns

void()

See Also nema\_reg\_read()

# 19.12.4.1.13 int32\_t nema\_sys\_init (void)

Initialize system. Implementor defined. Called in nema\_init().

#### Parameters

void

Returns 0 if no errors occurred See Also nema init()

#### 19.12.4.1.14 void nema\_wait\_irq(void)

Wait for interrupt from the GPU.

Parameters

void

Returns

void

# 19.12.4.1.15 void nema\_wait\_irq\_cl (int cl\_id)

Wait for a Command List to finish.

Parameters

cl\_id | Command List ID

Returns

void

#### 19.12.5 nema\_math.h File Reference

#### Macros

- #define NEMA\_E
- #define NEMA\_LOG2E
- #define NEMA\_LOG10E
- #define NEMA\_LN2
- #define NEMA\_LN10
- #define NEMA\_PI
- #define NEMA\_PI\_2
- #define NEMA\_PI\_4
- #define NEMA 1 PI
- #define NEMA\_2\_PI

- #define NEMA\_2\_SQRTPI
- #define NEMA\_SQRT2
- #define NEMA\_SQRT1\_2
- #define nema\_min2 (a, b)

Find the minimum of two values.

#define nema\_max2 (a, b)

Find the maximum of two values.

- #define nema\_clamp (val, min, max)
  Clamp value.
- #define nema\_abs (a)

Calculate the absolute value.

- #define nema\_floats\_equal (x, y)
  Compare two floats.
- #define nema\_float\_is\_zero (x)
  Checks if value x is zero.
- #define nema\_deg\_to\_rad (d)

Convert degrees to radians.

#define nema\_rad\_to\_deg(r)

Convert radians to degrees.

- #define nema\_i2fx (a)
  Convert integer to 16.16 fixed point.
- #define nema\_f2fx(a)

Convert float to 16.16 fixed point.

- #define nema\_floor(f)
  Floor function.
- #define nema\_ceil(f)
  Ceiling function.

#### Functions

- float nema\_sin (float angle\_degrees)
  Fast sine approximation of a given angle.
- float nema\_cos (float angle\_degrees)

Fast cosine approximation of a given angle.

- float nema\_tan (float angle\_degrees)
  Fast tangent approximation of a given angle.
- float nema\_pow (float x, float y)
  A rough approximation of x raised to the power of y. USE WITH CAUTION!

float nema\_sqrt (float x)

A rough approximation of the square root of x. USE WITH CAUTION!

float nema\_atan (float x)

A floating-point approximation of the inverse tangent of x.

#### 19.12.5.1 Macro Definition Documentation

#### 19.12.5.1.1 #define NEMA\_1\_PI

1/pi

#### 19.12.5.1.2 #define NEMA\_2\_PI

2/pi

#### 19.12.5.1.3 #define NEMA\_2\_SQRTPI

2/sqrt(pi)

#### 19.12.5.1.4 #define nema\_abs(a)

Calculate the absolute value.

Parameters

a Value

Returns

The absolute value of a

#### 19.12.5.1.5 #define nema\_ceil(f)

Ceiling function.

Parameters

a Value to be ceiled

Returns

ceiled value

#### 19.12.5.1.6 #define nema\_clamp(val,min,max)

Clamp value.

| val | Value to clamp |
|-----|----------------|
| min | Minimum value  |
| max | Minimum value  |

Returns

Clamped value

#### 19.12.5.1.7 #define nema\_deg\_to\_rad(d)

Convert degrees to radians.

Parameters

d Angle in degrees

Returns

Angle in radians

#### 19.12.5.1.8 #define NEMA\_E

е

#### 19.12.5.1.9 #define nema\_f2fx(a)

Convert float to 16.16 fixed point.

Parameters

| а | Value | t I | converted |
|---|-------|-----|-----------|
|   |       | 0 ( |           |

Returns

16.16 fixed point value

#### 19.12.5.1.10 #define nema\_float\_is\_zero(x)

Checks if value x is zero.

Parameters

x X value

Returns

1 if x == 0, 0 if x != 0

#### 19.12.5.1.11 #define nema\_floats\_equal(x,y)

Compare two floats.

Parameters

| X | First float  |
|---|--------------|
| У | Second float |

Returns

1 if x == y, 0 if x != y

# 19.12.5.1.12 #define nema\_floor(f)

Floor function.

Parameters

| а | Value | t | b | floored |
|---|-------|---|---|---------|
|   |       | 0 | е |         |

Returns

floored value

# 19.12.5.1.13 #define nema\_i2fx(a)

Convert integer to 16.16 fixed point.

Parameters

| а | Value | t | converted |
|---|-------|---|-----------|
|   |       | 0 | ;         |

Returns

16.16 fixed point value

# 19.12.5.1.14 #define NEMA\_LN10

ln(10)

# 19.12.5.1.15 #define NEMA\_LN2

ln(2)

# 19.12.5.1.16 #define NEMA\_LOG10E

log10(e)

# 19.12.5.1.17 #define NEMA\_LOG2E

log2(e)

# 19.12.5.1.18 #define nema\_max2(a,b)

Find the maximum of two values.

Parameters

| а | First value  |
|---|--------------|
| b | Second value |

Returns

The maximum of a and b

# 19.12.5.1.19 #define nema\_min2(a,b)

Find the minimum of two values.

Parameters

| а | First value  |
|---|--------------|
| b | Second value |

Returns

The minimum of a and b

#### 19.12.5.1.20 #define NEMA\_PI

рі

# 19.12.5.1.21 #define NEMA\_PI\_2

pi/2

# 19.12.5.1.22 #define NEMA\_PI\_4

pi/4

# 19.12.5.1.23 #define nema\_rad\_to\_deg(r)

Convert radians to degrees.

r Angle in radians

Returns

Angle in degrees

#### 19.12.5.1.24 #define NEMA\_SQRT1\_2

1/sqrt(2)

#### 19.12.5.1.25#define NEMA\_SQRT2

sqrt(2)

#### 19.12.5.2 Function Documentation

#### 19.12.5.2.1 float nema\_atan (float x)

A floating-point approximation of the inverse tangent of x.

Parameters

x X value

Returns

Inverse tangent (angle) of x in degrees

#### 19.12.5.2.2 float nema\_cos (float angle\_degrees)

Fast cosine approximation of a given angle.

Parameters

angle degrees Angle in degrees

Returns

Cosine of the given angle

#### 19.12.5.2.3 float nema\_pow (float x, float y)

#### Arough approximation of x raised to the power of y. USE WITH CAUTION!

Parameters

x base value. Must be non negative.

y power value

Returns

the result of raising x to the power y

#### 19.12.5.2.4 float nema\_sin (float angle\_degrees)

Fast sine approximation of a given angle.

Parameters

angle\_degrees Angle in degrees

Returns

Sine of the given angle

#### 19.12.5.2.5 float nema\_sqrt (float x)

Arough approximation of the square root of x. USE WITH CAUTION!

Parameters

| X       | X value. Must be non negative |
|---------|-------------------------------|
| \return | The square root of x          |

#### 19.12.5.2.6 float nema\_tan (float angle\_degrees)

Fast tangent approximation of a given angle.

Parameters

angle\_degrees Angle in degrees

Returns

Tangent of the given angle

#### 19.12.6 nema\_matrix3x3.h File Reference

#### Typedefs

typedef float nema\_matrix3x3\_t [3][3]

#### Functions

void nema\_mat3x3\_load\_identity (nema\_matrix3x3\_t m)

Load Identity Matrix.

- void nema\_mat3x3\_translate (nema\_matrix3x3\_t m, float tx, float ty)
  Apply translate transformation.
- void nema\_mat3x3\_scale (nema\_matrix3x3\_t m, float sx, float sy)
  Apply scale transformation.
- void nema\_mat3x3\_shear (nema\_matrix3x3\_t m, float shx, float shy)
  Apply shear transformation.
- void nema\_mat3x3\_mirror (nema\_matrix3x3\_t m, int mx, int my)
  Apply mirror transformation.
- void nema\_mat3x3\_rotate (nema\_matrix3x3\_t m, float angle\_degrees)
  Apply rotation transformation.
- void nema\_mat3x3\_mul (nema\_matrix3x3\_t m, nema\_matrix3x3\_t\_m)
  Multiply two 3x3 matrices (m = m\*\_m)
- void nema\_mat3x3\_mul\_vec (nema\_matrix3x3\_t m, float \*x, float \*y) Multiply vector with matrix.
- void nema\_mat3x3\_mul\_vec\_affine (nema\_matrix3x3\_t m, float \*x, float \*y)
  Multiply vector with affine matrix.
- void nema\_mat3x3\_adj (nema\_matrix3x3\_t m)

Calculate adjoint.

void nema\_mat3x3\_div\_scalar (nema\_matrix3x3\_t m, floats)

Divide matrix with scalar value.

int nema\_mat3x3\_invert (nema\_matrix3x3\_t m)

Invert matrix.

• int nema\_mat3x3\_quad\_to\_rect (int width, int height, float sx0, float sy0, float sx1, float sy1, float sx2, float sy2, float sx3, float sy3, nema\_matrix3x3\_t m)

Map rectangle to quadrilateral.

# 19.12.6.1 Function Documentation

# 19.12.6.1.1 void nema\_mat3x3\_adj (nema\_matrix3x3\_t m)

Calculate adjoint.

Parameters

*m* Matrix

# 19.12.6.1.2 void nema\_mat3x3\_div\_scalar (nema\_matrix3x3\_t m, float s)

Divide matrix with scalar value.

| т | Matrix to divide |
|---|------------------|
| S | scalar value     |

#### 19.12.6.1.3 int nema\_mat3x3\_invert (nema\_matrix3x3\_t m)

Invert matrix.

Parameters

*m* Matrix to invert

#### 19.12.6.1.4 void nema\_mat3x3\_load\_identity (nema\_matrix3x3\_t m)

Load Identity Matrix.

Parameters

*m* Matrix to be loaded

#### 19.12.6.1.5 void nema\_mat3x3\_mirror (nema\_matrix3x3\_t m, int mx, int my)

Apply mirror transformation.

Parameters

| m  | Matrix to apply transformation   |
|----|----------------------------------|
| mx | if non-zero, mirror horizontally |
| my | if non-zero, mirror vertically   |

#### 19.12.6.1.6 void nema\_mat3x3\_mul (nema\_matrix3x3\_t m, nema\_matrix3x3\_t \_m)

Multiply two 3x3 matrices (m = m\*\_m)

Parameters

| m | left matrix, will be overwritten by the result |
|---|------------------------------------------------|
| m | right matrix                                   |

#### 19.12.6.1.7 void nema\_mat3x3\_mul\_vec (nema\_matrix3x3\_t m, float \* x, float \* y)

Multiply vector with matrix.

| m | Matrix to multiply with |
|---|-------------------------|
| X | Vector x coefficient    |
| у | Vector y coefficient    |

#### 19.12.6.1.8void nema\_mat3x3\_mul\_vec\_affine (nema\_matrix3x3\_t m,float \* x,float \* y

Multiply vector with affine matrix.

Parameters

| m | Matrix to multiply with |
|---|-------------------------|
| X | Vector x coefficient    |
| У | Vector y coefficient    |

# 19.12.6.1.9 int nema\_mat3x3\_quad\_to\_rect (int width, int height, float sx0, float sy0, float sx1, float sy1,float sx2,float sy2,float sx3,float sy3,nema\_matrix3x3\_t m)

Map rectangle to quadrilateral.

Parameters

| width  | Rectangle width                                        |
|--------|--------------------------------------------------------|
| height | Rectangle height                                       |
| sx0    | x coordinate at the first vertex of the quadrilateral  |
| sy0    | y coordinate at the first vertex of the quadrilateral  |
| sx1    | x coordinate at the second vertex of the quadrilateral |
| sy1    | y coordinate at the second vertex of the quadrilateral |
| sx2    | x coordinate at the third vertex of the quadrilateral  |
| sy2    | y coordinate at the third vertex of the quadrilateral  |
| sx3    | x coordinate at the fourth vertex of the quadrilateral |
| sy3    | y coordinate at the fourth vertex of the quadrilateral |
| m      | Mapping matrix                                         |

# 19.12.6.1.10 void nema\_mat3x3\_rotate (nema\_matrix3x3\_t m, float angle\_degrees)

Apply rotation transformation.

Parameters

| т             | Matrix to apply transformation |
|---------------|--------------------------------|
| angle_degrees | Angle to rotate in degrees     |

# 19.12.6.1.11 void nema\_mat3x3\_shear (nema\_matrix3x3\_t m, float shx, float shy)

Apply shear transformation.

| m   | Matrix to apply transformation |
|-----|--------------------------------|
| shx | X shearing factor              |
| shy | Y shearing factor              |

# 19.12.6.1.12 void nema\_mat3x3\_translate (nema\_matrix3x3\_t m, float tx, float ty)

Apply translate transformation.

Parameters

| m  | Matrix to apply transformation |
|----|--------------------------------|
| tx | X translation factor           |
| ty | Y translation factor           |

#### 19.12.7 nema\_matrix4x4.h File Reference

#### Typedefs

typedef float nema\_matrix4x4\_t [4][4]

#### Functions

- void nema\_mat4x4\_load\_identity (nema\_matrix4x4\_t m)
  - Load a 4x4 Identity Matrix.
- void nema\_mat4x4\_mul (nema\_matrix4x4\_t m, nema\_matrix4x4\_t m\_l, nema\_matrix4x4\_t m\_r) Multiply two 4x4 matrices.
- void nema\_mat4x4\_mul\_vec (nema\_matrix4x4\_tm, float \*x, float \*y, float \*z, float \*w)
  Multiply a 4x1 vector with a 4x4 matrix.
- void nema\_mat4x4\_translate (nema\_matrix4x4\_t m, float tx, float ty, float tz)
  - Apply translate transformation.
- void nema\_mat4x4\_scale (nema\_matrix4x4\_t m, float sx, float sy, float sz)
  Apply scale transformation.
- void nema\_mat4x4\_rotate\_X (nema\_matrix4x4\_t m, float angle\_degrees)
  Apply rotate transformation around X axis.
- void nema\_mat4x4\_rotate\_Y (nema\_matrix4x4\_t m, float angle\_degrees)
  Apply rotate transformation around Y axis.
- void nema\_mat4x4\_rotate\_Z (nema\_matrix4x4\_t m, float angle\_degrees)
  Apply rotate transformation around Z axis.
- void nema\_mat4x4\_load\_perspective (nema\_matrix4x4\_tm,floatfovy\_degrees,floataspect,float nearVal, float farVal)
  Set up a perspective projection matrix.
- void nema\_mat4x4\_load\_ortho (nema\_matrix4x4\_t m, float left, float right, float bottom, float top, float nearVal, float far-Val)
  - Set up an orthographic projection matrix.

• void nema\_mat4x4\_load\_ortho\_2d (nema\_matrix4x4\_tm, float left, float right, float bottom, float top)

Set up a 2D orthographic projection matrix.

int nema\_mat4x4\_obj\_to\_win\_coords (nema\_matrix4x4\_t mvp, float x\_orig, float y\_orig, float width, float height, float nearVal, float farVal, float \*x, float \*z, float \*z, float \*w)

Convenience Function to calculate window coordinates from object coordinates.

# 19.12.7.1 Function Documentation

#### 19.12.7.1.1 void nema\_mat4x4\_load\_identity (nema\_matrix4x4\_t m)

Load a 4x4 Identity Matrix.

Parameters

*m* Matrix to be loaded

# 19.12.7.1.2 void nema\_mat4x4\_load\_ortho (nema\_matrix4x4\_t m, float left, float right, float bottom, float top, float nearVal, float farVal)

Set up an orthographic projection matrix.

Parameters

| m       | A 4x4 Matrix                                                          |
|---------|-----------------------------------------------------------------------|
| left    | Left vertical clipping plane                                          |
| right   | Right vertical clipping plane                                         |
| bottom  | bottom horizontal clipping plane                                      |
| top     | Top horizontal clipping plane                                         |
| nearVal | Distance from the viewer to the near clipping plane (always positive) |
| farVal  | Distance from the viewer to the far clipping plane (always positive)  |

# 19.12.7.1.3 void nema\_mat4x4\_load\_ortho\_2d (nema\_matrix4x4\_t m, float left, float right, float bottom, float top)

Set up a 2D orthographic projection matrix.

Parameters

| m      | A 4x4 Matrix                     |
|--------|----------------------------------|
| left   | Left vertical clipping plane     |
| right  | Right vertical clipping plane    |
| bottom | bottom horizontal clipping plane |
| top    | Top horizontal clipping plane    |

#### 19.12.7.1.4 void nema\_mat4x4\_load\_perspective (nema\_matrix4x4\_t m, float fovy\_degrees, float aspect, float nearVal, float farVal)

Set up a perspective projection matrix.

| m            | A 4x4 Matrix                                                          |
|--------------|-----------------------------------------------------------------------|
| fovy_degrees | Field of View in degrees                                              |
| aspect       | Aspect ratio that determines the field of view in the x direction.    |
| nearVal      | Distance from the viewer to the near clipping plane (always positive) |
| farVal       | Distance from the viewer to the far clipping plane (always positive)  |

#### 19.12.7.1.5 void nema\_mat4x4\_mul (nema\_matrix4x4\_t m, nema\_matrix4x4\_tm\_l, nema\_matrix4x4\_t m\_r)

Multiply two 4x4 matrices.

Parameters

| m         | Result Matrix |
|-----------|---------------|
| I         | Left operand  |
| <u></u> r | Right operand |

# 19.12.7.1.6void nema\_mat4x4\_mul\_vec (nema\_matrix4x4\_t m, float \* x, float \* y, float \* z, float \* w)

Multiply a 4x1 vector with a 4x4 matrix.

Parameters

| m | Matrix to be multiplied |
|---|-------------------------|
| X | Vector first element    |
| У | Vector second element   |
| Z | Vector third element    |
| W | Vector forth element    |

# 19.12.7.1.7 intnema\_mat4x4\_obj\_to\_win\_coords(nema\_matrix4x4\_tmvp, floatx\_orig, float y\_orig, float width, float height, float nearVal, float farVal, float \* x, float \* y, float \* z, float \* w)

Convenience Function to calculate window coordinates from object coordinates.

| mvp     | Model, View and Projection Matrix                                     |
|---------|-----------------------------------------------------------------------|
| x_orig  | Window top left X coordinate                                          |
| y_orig  | Window top left Y coordinate                                          |
| width   | Window width                                                          |
| height  | Window height                                                         |
| nearVal | Distance from the viewer to the near clipping plane (always positive) |
| farVal  | Distance from the viewer to the far clipping plane (always positive)  |
| X       | X object coordinate                                                   |
| У       | Y object coordinate                                                   |
| Z       | Z object coordinate                                                   |

w W object coordinate

Returns

1 if vertex is outside frustum (should be clipped)

#### 19.12.7.1.8 void nema\_mat4x4\_rotate\_X (nema\_matrix4x4\_t m, float angle\_degrees)

Apply rotate transformation around X axis.

Parameters

| т             | Matrix to apply transformation |
|---------------|--------------------------------|
| angle_degrees | Angle to rotate in degrees     |

#### 19.12.7.1.9 void nema\_mat4x4\_rotate\_Y (nema\_matrix4x4\_t m, float angle\_degrees)

Apply rotate transformation around Y axis.

Parameters

| т             | Matrix to apply transformation |
|---------------|--------------------------------|
| angle_degrees | Angle to rotate in degrees     |

#### 19.12.7.1.10 void nema\_mat4x4\_rotate\_Z (nema\_matrix4x4\_t m, float angle\_degrees)

Apply rotate transformation around Z axis.

Parameters

| т             | Matrix to apply transformation |
|---------------|--------------------------------|
| angle_degrees | Angle to rotate in degrees     |

#### 19.12.7.1.11 void nema\_mat4x4\_scale (nema\_matrix4x4\_t m, float sx, float sy, float sz)

Apply scale transformation.

Parameters

| m  | Matrix to apply transformation |
|----|--------------------------------|
| SX | X scaling factor               |
| sy | Y scaling factor               |
| SZ | Z scaling factor               |

#### 19.12.7.1.12 void nema\_mat4x4\_translate (nema\_matrix4x4\_t m, float tx, float ty, float tz)

Apply translate transformation.

Parameters

*m* Matrix to apply transformation

| tx | Х | translation factor |
|----|---|--------------------|
| ty | Y | translation factor |
| tz | Z | translation factor |

#### 19.12.8 nema\_programHW.h File Reference

#### Enumerations

- enum nema\_tex\_t{
  - NEMA\_NOTEX, NEMA\_TEX0, NEMA\_TEX1, NEMA\_TEX2, NEMA\_TEX3}
- enumnema\_tex\_format\_t {
  - NEMA\_RGBX8888,
  - NEMA\_RGBA8888,
  - NEMA\_XRGB8888,
  - NEMA\_ARGB8888,
  - NEMA\_RGB565,
  - NEMA\_RGBA5650,
  - NEMA\_RGBA5551,
  - NEMA\_RGBA4444,
  - NEMA\_RGBA0800,
  - NEMA\_A8,
  - NEMA\_RGBA0008,
  - NEMA\_L8,
  - NEMA\_RGBA3320,
  - NEMA\_BW1,
  - NEMA\_UYVY,
  - NEMA\_ABGR8888,
  - NEMA\_BGRA8888,
  - NEMA\_BGRX8888,
  - NEMA\_TSC4,
  - NEMA\_TSC6,
  - NEMA\_TSC6A,
  - NEMA\_RY,

NEMA\_GU, NEMA\_BV, NEMA\_YUV, NEMA\_Z24\_8, NEMA\_Z16, NEMA\_UV, NEMA\_L2, NEMA\_L4, NEMA\_L4, NEMA\_ASTC4\_4, NEMA\_ASTC8\_8 }

enum nema\_tex\_mode\_t {
 NEMA\_FILTER\_PS,
 NEMA\_FILTER\_BL,
 NEMA\_TEX\_CLAMP,
 NEMA\_TEX\_REPEAT,
 NEMA\_TEX\_BORDER,
 NEMA\_TEX\_MIRROR,
 NEMA\_TEX\_RANGE\_0\_1,
 NEMA\_TEX\_LEFT\_HANDED }

enum nema\_tri\_cull\_t {
 NEMA\_CULL\_NONE,
 NEMA\_CULL\_CW,
 NEMA\_CULL\_CCW,
 NEMA\_CULL\_ALL}

# Functions

int nema\_checkGPUPresence (void)

Check if a known GPU is present.

• void nema\_bind\_tex (nema\_tex\_t texid, uint32\_t addr\_gpu, uint32\_t width, uint32\_t height, nema\_tex\_format\_t format, int32\_t stride, nema\_tex\_mode\_t wrap\_mode)

Program a Texture Unit.

void nema\_set\_tex\_color (uint32\_t color)

Set Texture Mapping default color.

void nema\_set\_matrix (nema\_matrix3x3\_t m)

Load GPU's Matrix Multiplier with a given 3x3 matrix.

- void nema\_set\_matrix\_scale (float src\_xres, float src\_yres, float dst\_xres, float dst\_yres, float dst\_x, float dst\_y)
  Load GPU's Matrix Multiplier for scaling.
- void nema\_set\_matrix\_translate (float dst\_x, float dst\_y)
  - Load GPU's Matrix Multiplier for a simple Blit (affine translation)
- void nema\_set\_const\_reg (int reg, uint32\_t value)
  Write a value to a Constant Register of the GPU.
- void nema\_set\_clip (uint32\_t x, uint32\_t y, uint32\_t w, uint32\_t h)
  - Sets the drawing area's Clipping Rectangle.
- void nema\_load\_frag\_shader (const uint32\_t \*cmd, uint32\_t count, uint32\_t codeptr)
  Load a precompiled Shader to the GPU's internal memory.
- void nema\_set\_frag\_ptr (uint32\_t ptr)

Set the Internal Memory address of the fragment shader to be executed.

- void nema\_load\_frag\_shader\_ptr (const uint32\_t cmd, uint32\_t count, uint32\_t codeptr, uint32\_t ptr)
  Load a precompiled Shader to the GPU's internal memc?y and set fragment pointer.
- void nema\_set\_rop\_blend\_mode (uint32\_t bl\_mode)
  Set ROP blending mode.
- void nema\_set\_rop\_dst\_color\_key (uint32\_t rgba)
  - Set ROP destination color key.
- void nema\_set\_rop\_const\_color (uint32\_t rgba)
  Set ROP constant color.
- void nema\_tri\_cull (nema\_tri\_cull\_t cull)
  Set triangle/quadrilateral culling mode.
- void nema\_set\_raster\_color (uint32\_t rgba8888)
  - Set the color which will be used when drawing primitives (lines, rectangles etc)
- void nema\_raster\_pixel (int x, int y)
  - Program Rasterizer to generate a pixel.
- void nema\_raster\_line (int x0, int y0, int x1, int y1)
  - Program Rasterizer to draw a line.
- void nema\_raster\_triangle\_fx (int x0fx, int y0fx, int x1fx, int y1fx, int x2fx, int y2fx)
  Program Rasterizer to draw a triangle.
- void nema\_raster\_rect (int x, int y, int w, int h)
  - Program Rasterizer to generate a triangle.
- void nema\_raster\_rounded\_rect (int x0, int y0, int w, int h, int r)

Program Rasterizer to draw a rectangle with rounded edges.

- void nema\_raster\_triangle\_f (float x0, float y0, float z0, float w0, float x1, float y1, float z1, float w1, float x2, float y2, float z2, float w2)
- void nema\_raster\_quad\_fx(intx0fx, inty0fx, intx1fx, inty1fx, intx2fx, inty2fx, intx3fx, inty3fx)
  Program Rasterizer to draw a quadrilateral.
- void nema\_get\_dirty\_region (int \*minx, int \*miny, int \*maxx, int \*maxy)

Returns the bounding rectangle of all the pixels that have been modified since its previous call.

int nema\_format\_size (nema\_tex\_format\_t format)

Return pixel size in bytes.

int nema\_stride\_size (nema\_tex\_format\_t format, int width)

Return stride in bytes.

• int nema\_texture\_size (nema\_tex\_format\_t format, int width, int height)

Return texture size in bytes.

- void nema\_enable\_tiling (uint32\_t enable)
- void nema\_set\_depth\_range (float min\_depth, floatmax\_depth)

Set maximum and minimum values for depth buffer. Available ony for Nema|T.

void nema\_set\_viewport (float x, float y, float w, floath)

Sets Viewport parameters for vertex shader. Available only for Nema|T.

#### 19.12.8.1 Enumeration Type Documentation

#### 19.12.8.1.1 enum nema\_tex\_format\_t

#### Enumerator

NEMA\_RGBX8888 RGBX8888

NEMA\_RGBA8888 RGBA8888

NEMA\_XRGB8888 XRGB8888

NEMA\_ARGB8888 ARGB8888

NEMA\_RGB565 RGBA5650

NEMA\_RGBA5650 RGBA5650

NEMA\_RGBA5551 RGBA5551

NEMA\_RGBA4444 RGBA4444

NEMA\_RGBA0800 RGBA0800

NEMA\_A8 RGBA0008

NEMA\_RGBA0008 RGBA0008

*NEMA\_L8* L8

*NEMA\_RGBA3320* RGBA3320 (source only)

*NEMA\_BW1* BW1 (source only)

NEMA\_UYVY UYVY NEMA\_ABGR8888 ABGR8888 NEMA\_BGRA8888 BGRA NEMA\_BGRX8888 BGRX NEMA\_TSC4 TSC4 NEMA\_TSC6 TSC6 NEMA\_TSC6A TSC6A NEMA\_RY RY NEMA\_GU GU NEMA\_BV BV NEMA YUV YUV NEMA\_Z24\_8 Z24\_8 NEMA\_Z16 Z16 NEMA\_UV UV NEMA\_L2 L2 NEMA\_L4 L4 NEMA\_ASTC4\_4 ASTC4\_4 NEMA\_ASTC8\_8 ASTC8\_8

# 19.12.8.1.2 enum nema\_tex\_mode\_t

#### Enumerator

NEMA\_FILTER\_PS Point Sampling. NEMA\_FILTER\_BL Bilinear filtering. NEMA\_TEX\_CLAMP Clamp NEMA\_TEX\_REPEAT Repeat NEMA\_TEX\_BORDER Border NEMA\_TEX\_MIRROR Mirror NEMA\_TEX\_RANGE\_0\_1 Interpolated Coordinates range: 0-1 NEMA\_TEX\_LEFT\_HANDED (0,0) is bottom leftcorner

# 19.12.8.1.3 enum nema\_tex\_t

# Enumerator NEMA\_NOTEX No Texture NEMA\_TEX0 Texture 0

NEMA\_TEX1 Texture 1

NEMA\_TEX2 Texture 2 NEMA\_TEX3 Texture 3

# 19.12.8.1.4 enum nema\_tri\_cull\_t

#### Enumerator

NEMA\_CULL\_NONE Disable Triangle/Quadrilateral Culling NEMA\_CULL\_CW Cull clockwise Triangles/Quadrilaterals NEMA\_CULL\_CCW Cull anti-clockwise Triangles/Quadrilaterals NEMA\_CULL\_ALL Cull all

#### 19.12.8.2 Function Documentation

# 19.12.8.2.1 void nema\_bind\_tex (nema\_tex\_t texid, uint32\_t addr\_gpu, uint32\_t width, uint32\_t height, nema\_tex\_format\_t format, int32\_t stride, nema\_tex\_mode\_t wrap\_mode)

Program a Texture Unit.

Parameters

| texid     | Texture unit to be programmed                               |
|-----------|-------------------------------------------------------------|
| addr_gpu  | Texture's address as seen by the GPU                        |
| width     | Texture's width                                             |
| height    | Texture's height                                            |
| format    | Texture's format                                            |
| stride    | Texture's stride. If stride < 0, it's left to be calculated |
| wrap_mode | Wrap/Repeat mode to be used                                 |

# 19.12.8.2.2 int nema\_checkGPUPresence (void)

Check if a known GPU is present. Returns -1 if no known GPU is present

#### 19.12.8.2.3 void nema\_enable\_tiling (uint32\_t enable)

#### 19.12.8.2.4 int nema\_format\_size (nema\_tex\_format\_t format)

Return pixel size in bytes.

Parameters

| forma | Color format |
|-------|--------------|
| t     |              |

Returns

Pixel size in bytes

# 19.12.8.2.5 void nema\_get\_dirty\_region (int ? minx,int ? miny,int ? maxx,int ? maxy)

Returns the bounding rectangle of all the pixels that have been modified since its previous call.

Parameters

| minx | x coordinate of the upper left corner of the dirty region  |
|------|------------------------------------------------------------|
| miny | y coordinate of the upper left corner of the dirty region  |
| maxx | x coordinate of the lower right corner of the dirty region |
| maxy | y coordinate of the lower right corner of the dirty region |

?

# 19.12.8.2.6 void nema\_load\_frag\_shader (const uint32\_tcma, uint32\_t count, uint32\_t codeptr)

Load a precompiled Shader to the GPU's internal memory.

Parameters

| cmd     | Pointer to the shader                                |
|---------|------------------------------------------------------|
| count   | Number of commands                                   |
| codeptr | Internal Memory address to be written (default is 0) |

# 19.12.8.2.7 void nema\_load\_frag\_shader\_ptr (const uint32\_tcn<sup>?</sup>1, uint32\_t count, uint32\_t codeptr, uint32\_t ptr)

Load a precompiled Shader to the GPU's internal memory and set fragment pointer.

Parameters

| cmd     | Pointer to the shader                                |
|---------|------------------------------------------------------|
| count   | Number of commands                                   |
| codeptr | Internal Memory address to be written (default is 0) |
| ptr     | Internal Memory address of the fragment shader       |

#### 19.12.8.2.8 void nema\_raster\_line (int x0, int y0, int x1, int y1)

Program Rasterizer to draw a line.

| xO | x coordinate at the beginning of the line |
|----|-------------------------------------------|
| y0 | y coordinate at the beginning of the line |
| x1 | x coordinate a the en o the line          |
|    | t d f                                     |
| y1 | y coordinate a the en o the line          |
|    | t d f                                     |

#### 19.12.8.2.9 void nema\_raster\_pixel (int x, int y)

Program Rasterizer to generate a pixel.

#### Parameters

| X | x coordinate of the pixel |
|---|---------------------------|
| У | y coordinate of the pixel |

# 19.12.8.2.10 void nema\_raster\_quad\_fx (int x0fx, int y0fx, int x1fx, int y1fx, int x2fx, int y2fx, int x3fx, int y3fx)

Program Rasterizer to draw a quadrilateral.

Parameters

| xOfx | x coordinate at the first vertex of the quadrilateral (fixed point 16.16)  |
|------|----------------------------------------------------------------------------|
| yOfx | y coordinate at the first vertex of the quadrilateral (fixed point 16.16)  |
| x1fx | x coordinate at the second vertex of the quadrilateral (fixed point 16.16) |
| y1fx | y coordinate at the second vertex of the quadrilateral (fixed point 16.16) |
| x2fx | x coordinate at the third vertex of the quadrilateral (fixed point 16.16)  |
| y2fx | y coordinate at the third vertex of the quadrilateral (fixed point 16.16)  |
| x3fx | x coordinate at the fourth vertex of the quadrilateral (fixed point 16.16) |
| y3fx | y coordinate at the fourth vertex of the quadrilateral (fixed point 16.16) |

#### 19.12.8.2.11 void nema\_raster\_rect (int x, int y, int w, int h)

Program Rasterizer to generate a triangle.

Parameters

| X | x coordinate of the upper left vertex of the rectangle |
|---|--------------------------------------------------------|
| У | y coordinate at the upper left vertex of the rectangle |
| W | width of the rectangle                                 |
| h | height of the rectangle                                |

#### 19.12.8.2.12 void nema\_raster\_rounded\_rect (int x0, int y0, int w, int h, int r)

Program Rasterizer to draw a rectangle with rounded edges.

| х0 | x coordinate of the upper left vertex of the rectangle |
|----|--------------------------------------------------------|
| y0 | y coordinate at the upper left vertex of the rectangle |
| W  | width of the rectangle                                 |
| h  | height of the rectangle                                |
| r  | corner radius                                          |

19.12.8.2.13 void nema\_raster\_triangle\_f (float x0,float y0,float z0,float w0,float x1, float y1, float z1, float w1, float x2, float y2, float z2, float w2)

# 19.12.8.2.14 void nema\_raster\_triangle\_fx (int x0fx, int y0fx, int x1fx, int y1fx, int x2fx, int y2fx)

Program Rasterizer to draw a triangle.

Parameters

| x0fx | x coordinate at the first vertex of the triangle (fixed point 16.16)  |
|------|-----------------------------------------------------------------------|
| yOfx | y coordinate at the first vertex of the triangle (fixed point 16.16)  |
| x1fx | x coordinate at the second vertex of the triangle (fixed point 16.16) |
| y1fx | y coordinate at the second vertex of the triangle (fixed point 16.16) |
| x2fx | x coordinate at the third vertex of the triangle (fixed point 16.16)  |
| y2fx | y coordinate at the third vertex of the triangle (fixed point 16.16)  |

#### 19.12.8.2.15 void nema\_set\_clip (uint32\_t x, uint32\_t y, uint32\_t w, uint32\_t h)

Sets the drawing area's Clipping Rectangle.

Parameters

| X | Clip Window top-left x coordinate |  |  |  |  |  |
|---|-----------------------------------|--|--|--|--|--|
| У | Clip Window minimum y             |  |  |  |  |  |
| W | Clip Window width                 |  |  |  |  |  |
| h | Clip Window height                |  |  |  |  |  |

#### 19.12.8.2.16 void nema\_set\_const\_reg (int reg, uint32\_t value)

Write a value to a Constant Register of the GPU.

Parameters

| reg   | Constant Register to be written |
|-------|---------------------------------|
| value | Value to be written             |

#### 19.12.8.2.17 void nema\_set\_depth\_range (float min\_depth, float max\_depth)

Set maximum and minimum values for depth buffer. Available ony for Nema|T.

| min_depth | Minimum value |
|-----------|---------------|
| max_depth | Maximum value |

#### 19.12.8.2.18 void nema\_set\_frag\_ptr (uint32\_t ptr)

SettheInternalMemoryaddressofthefragmentshadertobeexecuted.

#### Parameters

*ptr* Internal Memory address of the fragment shader

#### 19.12.8.2.19 void nema\_set\_matrix (nema\_matrix3x3\_t m)

Load GPU's Matrix Multiplier with a given 3x3 matrix.

Parameters

| m | Matrix | t | b | loaded |
|---|--------|---|---|--------|
|   |        | 0 | е |        |

# 19.12.8.2.20 void nema\_set\_matrix\_scale (float src\_xres, float src\_yres, float dst\_xres, float dst\_yres, float dst\_x, float dst\_y)

Load GPU's Matrix Multiplier for scaling.

Parameters

| src_xres | Width of source rectangular area                     |
|----------|------------------------------------------------------|
| src_yres | Height of source rectangular area                    |
| dst_xres | Width of destination rectangular area                |
| dst_yres | Height of destination rectangular area               |
| dst_x    | X coordinate of upper-left vertex of the destination |
| dst_y    | Y coordinate of upper-left vertex of the destination |

#### 19.12.8.2.21 void nema\_set\_matrix\_translate (float dst\_x, float dst\_y)

Load GPU's Matrix Multiplier for a simple Blit (affine translation)

#### Parameters

| dst_x | X coordinate of upper-left vertex of the destination |
|-------|------------------------------------------------------|
| dst_y | Y coordinate of upper-left vertex of the destination |

#### 19.12.8.2.22 void nema\_set\_raster\_color (uint32\_t rgba8888)

Set the color which will be used when drawing primitives (lines, rectangles etc)

#### Parameters

| rgba8888 | Color to be used |
|----------|------------------|
|          |                  |

See Also

nema\_rgba()

#### 19.12.8.2.23 void nema\_set\_rop\_blend\_mode (uint32\_t bl\_mode)

Set ROP blending mode.

Parameters

*bl\_mode* Blending mode

#### 19.12.8.2.24 void nema\_set\_rop\_const\_color (uint32\_t rgba)

Constant color

Set ROP constant color.

Parameters

rgba

See Also

nema\_rgba()

#### 19.12.8.2.25 void nema\_set\_rop\_dst\_color\_key (uint32\_t rgba)

Set ROP destination color key.

Parameters

rgba Destination Color Key

See Also

nema\_rgba()

#### 19.12.8.2.26 void nema\_set\_tex\_color (uint32\_t color)

Set Texture Mapping default color.

Parameters

*color* default color in 32-bit RGBA format

See Also

nema\_rgba()

# 19.12.8.2.27 void nema\_set\_viewport (float x, float y, float w, float h)

Sets Viewport parameters for vertex shader. Available only for Nema|T.

#### Parameters

| X | Start X coordinate     |
|---|------------------------|
| У | Start Y coordinate     |
| W | Width of the Viewport  |
| h | Height of the Viewport |

#### 19.12.8.2.28 int nema\_stride\_size (nema\_tex\_format\_t format, int width)

Return stride in bytes.

#### Parameters

| format | Color format         |
|--------|----------------------|
| width  | Texture color format |

#### Returns

Stride in bytes

# 19.12.8.2.29 int nema\_texture\_size (nema\_tex\_format\_t format, int width, int height)

Return texture size in bytes.

Parameters

| format | Texture color format |
|--------|----------------------|
| width  | Texture width        |
| height | Texture height       |

Returns

Texture size in bytes

#### 19.12.8.2.30 void nema\_tri\_cull (nema\_tri\_cull\_t cull)

Set triangle/quadrilateral culling mode.

#### Parameters

*cull* Culling mode

# 20. PDM-to-PCM Converter (PDM)

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

# 20.1 PDM Clock Configuration

There are 4 clocks or frequencies of relevance for the PDM interface:

- Interface core master clock, PDM\_CLK, optimally set to 24.576 MHz.
- PDM output clock, PDM\_CKO, whose frequency is selected to meet microphone specification.
- Desired sampling frequency at the microphone, Fsin, also determined by the requirements of the microphone.
- Desired sample rate, Fsout, of the PCM stream from the interface determined by the requirements of the application.

The Oversample Ratio (OSR), which is the ratio of the input sample frequency divided by the output sample frequency, or  $F_{sin} / F_{sout}$ , is an important parameter to determine and is derived based on several factors. By definition, the PDM\_CKO clock to the microphone(s) is the same as  $F_{sin}$ . The Nyquist Sampling Rate requires sampling at, or greater than, twice the bandwidth (BW) of the signal. For example, if a bandwidth of 8 kHz is targeted, then a sampling rate of 16 kHz is required. Oversampling is sampling beyond the Nyquist rate such that OSR =  $F_{sin} / Nyquist$  Rate =  $F_{sin} / (2 \times BW) = F_{sin} / F_{sout}$ . For example, 8 kHz BW, sampled at 768 kHz, has an OSR of 768 kHz / 16 kHz = 48.

The OSR is selected by setting the SINC decimation rate in the SINCRATE field in the PDMn\_CORECFG0 register, such that OSR =  $2 \times SINCRATE$ . The PDM\_CLK is determined by dividing down the PDM\_APB\_CLK, which is 96 MHz. The divider is selected in the CLKSEL field of the PDMn\_CTRL register. If the PDMn\_CLK is desired to be 24.576 MHz, then CLKSEL should set to 1.

The microphone clock frequency, PDM\_CKO or  $F_{sin}$ , is determined by the requirements of the microphone. This clock is typically set to 768 kHz for lower power operation or to 1536 kHz for slightly better microphone performance. As mentioned,  $F_{sout}$  is determined by the requirements of the application, typically set for 16 kHz or 32 kHz.

The PDM\_CKO is configured through the SINCRATE and MCLKDIV fields of the PDMn\_CORECFG0 register, and the DIVMCLKQ field of the PDMn\_CORECFG1 register. See PDM chapter of the Apollo4 datasheet and the PDM register set for the relationships between the clocks and their dividers.

The following table shows settings which are recommended to produce commonly used PDM bit data clock frequencies (PDMA\_CKO) and PCM sample rates as a function of  $F_S$  and OSR for the various operating modes, at a PDM\_CLK of 24.576 MHz.
| OPERATING<br>MODE             | F <sub>PDMA_CKO</sub><br>(MHz) | F <sub>S</sub><br>(kHz) | OSR | DIV_<br>MCLKQ<br>[1:0] | MCLKDIV<br>[3:0] | SINCRATE<br>[6:0] | SINAD<br>(dB) | DR (dB) |
|-------------------------------|--------------------------------|-------------------------|-----|------------------------|------------------|-------------------|---------------|---------|
|                               | 6.144                          | 96                      | 64  | 1                      | 1                | 32                | 103           | 110.8   |
|                               | 3.072                          | 48                      | 64  | 1                      | 3                | 32                | 105.5         | 108.7   |
| High Perfor-<br>mance Mode    | 3.072                          | 24                      | 128 | 1                      | 3                | 64                | 122.8         | 120.9   |
|                               | 3.072                          | 16                      | 192 | 1                      | 3                | 96                | 116.1         | 120.4   |
|                               | 1.536                          | 16                      | 96  | 1                      | 7                | 48                | 115.4         | 120.5   |
|                               | 3.072                          | 96                      | 32  | 1                      | 3                | 16                | 86            | 87.9    |
| Reduced Perfor-<br>mance Mode | 1.536                          | 48                      | 32  | 1                      | 7                | 16                | 83.2          | 88.8    |
|                               | 0.768                          | 16                      | 48  | 1                      | 15               | 24                | 89.7          | 97.2    |
| Mid                           | 1.536                          | 24                      | 64  | 1                      | 7                | 32                | 101           | 107.6   |
| Mode                          | 1.024                          | 16                      | 64  | 1                      | 11               | 32                | 100           | 106.8   |

Table 120: PDMA\_CKO and OSR Settings for Different Sampling Frequencies

Notes:

1. Assumes PDM\_CLK of 24.576 MHz.

- 2. The above frequency combinations are recommended values, where **DIV\_MCLKQ = 2'b01**. User may determine other proper values according to actual master clock rate and digital microphones implemented in system design.
- 3. **SINAD** means ratio of signal to noise plus the first N harmonics of THD.
- 4. **DR** means dynamic range, which is measured as SINAD (-60dB) in this table.

To derive the settings, use the steps and equations below:

- For a PDM\_CLK = 24.576 MHz, set PDMx\_CTRL\_CLKSEL to 1.
- For oversampling where OSR = F<sub>sin</sub> / F<sub>sout</sub>, OSR should typically be between 32 and 128, or 192.
- To select the SINC rate to be equal to OSR / 2, set PDMx\_CORECFG0\_SINCRATE field to typically be between 16 and 64, or 96 ONLY for OSR of 192.
- To select the recommended setting of 1 for DIV\_MCLKQ where PDMn\_MCLKQ is the same clock frequency as PDMn\_CLK, set the PDMx\_CORECFG1\_DIVMCLKQ field to 1.
- To select the PDM output clock, PDMn\_CKO, select the divider of the MCLKQ clock by setting the PDMx\_CORECFG0\_MCLKDIV field to get the desired value of PDMn\_CKO, where PDMn\_CLKO = PDMn\_MCLKQ / (PDMn\_MCLKDIV+1).

# 20.2 Operating Modes

The PDM may operate in stereo or mono mode in normal operation, and system reset or power down mode when not in use. Each mode can be programmed by control registers. Note that mono mode operation always uses left channel conversion path.

| Mode                | CORE-<br>CFG1_<br>PCM-<br>CHSET | CORE-<br>CFG0_<br>LRSWAP | CTRL_EN | PDMA<br>_CKO | CTRL_<br>CLKEN | CTRL_<br>RSTB | PCMA_<br>DATA_L            | PCMA_<br>DATA_R           | PCMA<br>_VALID |
|---------------------|---------------------------------|--------------------------|---------|--------------|----------------|---------------|----------------------------|---------------------------|----------------|
| Stereo<br>Non- swap | 2'b11                           | 0                        | 1       | Active       | Active         | 1             | Normal<br>Left             | Normal<br>Right           | Normal         |
| Stereo<br>Swap      | 2'b11                           | 1                        | 1       | Active       | Active         | 1             | Normal<br>Right<br>Swapped | Normal<br>Left<br>Swapped | Normal         |
| Mono Left           | 2'b01                           | 0                        | 1       | Active       | Active         | 1             | Normal<br>Left             | 0                         | Normal         |
| Mono<br>Right       | 2'b10                           | 1                        | 1       | Active       | Active         | 1             | Normal<br>Right<br>Swapped | 0                         | Normal         |
| System<br>Reset     | х                               | х                        | х       | х            | х              | 0             | 0                          | 0                         | 0              |
| Power<br>Down       | х                               | х                        | 0       | 0            | х              | х             | 0                          | 0                         | 0              |

 Table 121: PDM-to-PCM Converter Operating Mode

Notes:

- 2. Reasonable phase delay between PDMA\_CKO and PDM\_CLK is allowable.
- 3. RSTB is asynchronous reset signal.
- 4. At Power-Down mode, internal master clocks (MCLK\_L and MCLK\_R) are set to "0" to avoid toggling.
- 5. PCMA\_VALID normal output pulse width is one PDM\_CLK clock period.

# 20.3 Supported Data Format

The PDM module supports only 24-bit unpacked data format. The following table shows the word sequence for single-channel (A), dual-channel (A and B) and 8-channel (A-H) audio.

<sup>1. &</sup>quot;Active" means clock toggling at normal frequency.

| Word     | Single-         | channel          | Dual-c          | hannel           | 8-channel       |                  |  |  |  |
|----------|-----------------|------------------|-----------------|------------------|-----------------|------------------|--|--|--|
| Sequence | Upper 8<br>bits | Lower 24<br>bits | Upper 8<br>bits | Lower 24<br>bits | Upper 8<br>bits | Lower 24<br>bits |  |  |  |
| 1        | 0x00            | A_t0[23:0]       | 0x00            | A_t0[23:0]       | 0x00            | A_t0[23:0]       |  |  |  |
| 2        | 0x00            | A_t1[23:0]       | 0x00            | B_t0[23:0]       | 0x00            | B_t0[23:0]       |  |  |  |
| 3        | 0x00            | A_t2[23:0]       | 0x00            | A_t1[23:0]       | 0x00            | C_t0[23:0]       |  |  |  |
| 4        | 0x00            | A_t3[23:0]       | 0x00            | B_t1[23:0]       | 0x00            | D_t0[23:0]       |  |  |  |
| 5        | 0x00            | A_t4[23:0]       | 0x00            | A_t2[23:0]       | 0x00            | E_t0[23:0]       |  |  |  |
| 6        | 0x00            | A_t5[23:0]       | 0x00            | B_t2[23:0]       | 0x00            | F_t0[23:0]       |  |  |  |
| 7        | 0x00            | A_t6[23:0]       | 0x00            | A_t3[23:0]       | 0x00            | G_t0[23:0]       |  |  |  |
| 8        | 0x00            | A_t7[23:0]       | 0x00            | B_t3[23:0]       | 0x00            | H_t0[23:0]       |  |  |  |

| Table | 122: | 24-bit | <b>Unpacked Dat</b> | ta |
|-------|------|--------|---------------------|----|
|       |      |        |                     |    |

# 20.4 Digital Volume Control

The PDM supports digital volume-control with a PGA gain range from -12dB to +34.5dB in 1.5dB/step. It is programmed by register PGAL/PGAR for left/right channel respectively. PGA gain may be changed on-the-fly during normal operation. In order to reduce zipper or clip noise during gain transition, a built-in volume smoother is implemented with fine gain steps that enables softly ramp up or ramp down volume levels. The fine gain is set by register bit SELSTEP to 0.13dB or 0.26dB internally.

| Table | 123: | PGA | Gain | Control |
|-------|------|-----|------|---------|
|-------|------|-----|------|---------|

| Register             | Default | Description                                                                                                             |
|----------------------|---------|-------------------------------------------------------------------------------------------------------------------------|
| CORECFG0_            | 01000   | Left Channel PGA Gain: +1.5dB/step, -12dB ~ +34.5dB                                                                     |
| PGAL[4:0]            |         | 00000 = -12dB; 00001 = -10.5dB;                                                                                         |
|                      |         | 01000 = 0dB;                                                                                                            |
|                      |         | 11111= +34.5dB                                                                                                          |
| CORECFG0_            | 01000   | Right Channel PGA Gain: +1.5dB/step, -12dB ~ +34.5dB                                                                    |
| PGAR[4:0]            |         | 00000 = -12dB; 00001 = -10.5dB;                                                                                         |
|                      |         | 01000 = 0dB;                                                                                                            |
|                      |         | 11111= +34.5dB                                                                                                          |
| CORECFG1_<br>SELSTEP | 0       | Fine grain step size for smooth PGA or Softmute attenuation transition:<br>0 = 0.13dB step size<br>1 = 0.26dB step size |

### 20.4.1 Soft Mute

The PDM contains a built-in software controlled mute function that digitally attenuates signals to imperceptible levels or zero. When mute function is enabled by setting SOFTMUTE = 1, the corresponding digital signal output is decreased from current level to mute state through predefined granular gain step per time constant transition. The time constant is set through register SCYCLES[2:0]. The slope of mute transition process is determined by SCYCLES and gain step. During soft-mute, the PDM is still on and clocks are toggling while user may desire to mute the recording tentatively. The soft mute configuration delivers smooth transition and reduces clip/zipper noise during the mute transition.

When the mute function is disabled by setting SOFTMUTE = 0, the mute function is off and the PDM goes back to normal operation when the output signal level returns to normal with the existing PGA gain.

| Register/Field        | Default | Description                                                                                                                                                                                                                                                                          |
|-----------------------|---------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| CORECFG0_SOFTMUTE     | 0       | Sets Soft-Mute function:<br>0 = Disable<br>1 = Enable                                                                                                                                                                                                                                |
| CORECFG0_SCYCLES[2:0] | 001     | Sets number of steps (PDMA_CKO cycles) during PGA gain setting changes or soft mute operation.         000 = 64 steps       100 = 192 steps         001 = 96 steps       101 = 256 steps         010 = 128 steps       110 = 384 steps         011 = 160 steps       111 = 512 steps |
| CORECFG1_SELSTEP      | 1       | Sets fine gain step for smooth PGA or Soft-Mute attenuation transition.<br>0 = 0.13dB<br>1 = 0.26dB                                                                                                                                                                                  |

#### Table 124: SOFTMUTE Register Configuration

## 20.5 Low Pass Filter (LPF)

The PDM's internal low pass filters attenuate the out-of-band noise at predefined band width and corners. Table 125 below describes the parameters of LPF performance.

Table 125: LPF Parameters

| Parameter                  | Symbol | Test<br>Condition | Min  | Тур      | Мах | Unit |
|----------------------------|--------|-------------------|------|----------|-----|------|
| Pass band corner frequency | Fpass  |                   | 0.02 | 0.417*Fs | 20  | kHz  |
| Pass band ripple           |        | Peak-to-<br>Peak  |      | 0.3      | 0.5 | dB   |
| Stop band corner frequency | Fstcut |                   |      | 0.6      |     | Fs   |
| Stop band rejection        | STA    |                   |      | -70      |     | dB   |

# 20.6 High Pass Filter (HPF)

The PDM's high pass filter blocks DC offset and low frequency noise in the signal band. The filter response for the high pass filter is characterized as:

$$H(Z) = (1 - Z^{-1}) / [1 - (1 - 2^{-HPGAIN}) Z^{-1}]$$

In default mode, HPGAIN = 1011, so the high pass filter can be formulated by the polynomial:

 $H(Z) = (1 - Z^{-1}) / [1 - 0.99951Z^{-1}]$ 

The user may tune the HPGAIN value to adjust the high pass filter cutoff corner frequency for better system configuration.

# 21. Low Power Analog Audio Interface

Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

NOTE

The Apollo4 Lite SoC and Apollo4 Blue Lite SoC do not include the Audio ADC (AUDADC) module. This chapter and its content do not apply to these SoCs.

# 21.1 Automatic Sample Accumulation and Scaling

The Audio ADC block offers a facility for the automatic accumulation of samples without requiring core involvement. Thus up to 128 samples per slot can be accumulated without waking the core. This facilitates averaging algorithms to smooth out the data samples. Each slot can request from 1 to 128 samples in power-of-2 increments to be accumulated before producing a result in the FIFO.

NOTE Each slot can independently specify how many samples to accumulate so results can enter the FIFO from different slots at different rates.

All slots write their accumulated results to the FIFO in exactly the same format regardless of how many samples were accumulated to produce the results. The precision mode for each determines the format for the FIFO data. 12-bit, 10-bit and 8-bit precision modes correspond to 12.6, 10.6 and 8.6 formats, respectively.

Each slot contains a 21-bit accumulator as shown in Table 126. When the Audio ADC is triggered for the last sample of an accumulation, the accumulator is cleared and the FIFO will be written with the final average value. When each active slot obtains a sample from the Audio ADC, it is added to the value in its accumulator.

If a slot is set to accumulate 128 samples per result, then the accumulator could reach a maximum value of:

 $128^{*}(2^{14}-1) = 128^{*}16383 = 2097024 = 2^{21} - 128$ , hence the 21 bit accumulator.

|        |        |        |        |        |        |        |        |        |        |        | -      |        |        |        |        |        |        |        |        |        |
|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|
| 2<br>0 | 1<br>9 | 1<br>8 | 1<br>7 | 1<br>6 | 1<br>5 | 1<br>4 | 1<br>3 | 1<br>2 | 1<br>1 | 1<br>0 | 0<br>9 | 0<br>8 | 0<br>7 | 0<br>6 | 0<br>5 | 0<br>4 | 0<br>3 | 0<br>2 | 0<br>1 | 0<br>0 |
|        |        |        |        |        |        |        |        |        | Ассі   | umu    | lator  | •      |        |        |        |        |        |        |        |        |

#### Table 126: Per Slot Sample Accumulator

Table 127 shows the maximum possible accumulated values. Note that for the 128 sample accumulation case, the LSB of the accumulator is discarded when the results are written to the FIFO.

| # Samples | 1<br>6 | 1<br>5 | 1<br>4 | 1<br>3     | 1<br>2 | 1<br>1 | 1<br>0 | 0<br>9 | 0<br>8 | 0<br>7 | 0<br>6 | 0<br>5 | 0<br>4 | 0<br>3 | 0<br>2 | 0<br>1 | 0<br>0 |
|-----------|--------|--------|--------|------------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|
| 128       |        |        |        |            |        |        |        | 10     | .6     |        |        |        |        |        |        |        | 0      |
| 64        | Х      |        |        |            |        |        |        |        | 10     | ).6    |        |        |        |        |        |        |        |
| 32        | Х      | Х      |        |            |        |        |        |        |        | 10.5   |        |        |        |        |        |        |        |
| 16        | Х      | Х      | Х      |            |        |        |        |        |        | 10     | ).4    |        |        |        |        |        |        |
| 8         | Х      | Х      | Х      | Х          |        |        |        |        |        |        | 10.3   |        |        |        |        |        |        |
| 4         | Х      | Х      | Х      | x x x 10.2 |        |        |        |        |        |        |        |        |        |        |        |        |        |
| 2         | Х      | Х      | Х      | X X X 10.1 |        |        |        |        |        |        |        |        |        |        |        |        |        |
| 1         | Х      | Х      | Х      | Х          | Х      | Х      |        |        |        |        | 1      | 0      |        |        |        |        |        |

Table 127: Accumulator Scaling

# 21.2 Sixteen Entry Result FIFO

All results written to the FIFO have exactly the same format as shown in Table 128. The properly scaled accumulation results are written in the lower half word in the aforementioned format. Since each slot can produce results at a different rate, the slot number generating the result is also written to the FIFO along with the total valid entry count within the FIFO.

#### Table 128: FIFO Register

| 3           | 3 | 2            | 2   | 2 | 2 | 2  | 2  | 2   | 2  | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1   | 0   | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|-------------|---|--------------|-----|---|---|----|----|-----|----|---|---|---|---|---|---|---|---|---|---|---|-----|-----|---|---|---|---|---|---|---|---|---|
| 1           | 0 | 9            | 8   | 7 | 6 | 5  | 4  | 3   | 2  | 1 | 0 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0   | 9   | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
| R<br>S<br>V | N | Slot<br>umbe | er. |   |   | FI | FO | Cou | nt |   |   |   |   |   |   |   |   |   |   | F | IFO | DAT | Ā |   |   |   |   |   |   |   |   |

| Table 129: 1 | 2-bit FIFO | Data Format |
|--------------|------------|-------------|
|--------------|------------|-------------|

| # Samples | 1<br>9 | 1<br>8 | 1<br>7       | 1<br>6 | 1<br>5 | 1<br>4 | 1<br>3 | 1<br>2 | 1<br>1 | 1<br>0 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
|-----------|--------|--------|--------------|--------|--------|--------|--------|--------|--------|--------|---|---|---|---|---|---|---|---|---|---|
| 128       | 0      | 0      | 12.6         |        |        |        |        |        |        |        |   |   |   |   |   |   |   |   |   |   |
| 64        | 0      | 0      | 12.6         |        |        |        |        |        |        |        |   |   |   |   |   |   |   |   |   |   |
| 32        | 0      | 0      | 12.5         |        |        |        |        |        |        |        |   |   |   |   | Х |   |   |   |   |   |
| 16        | 0      | 0      |              |        |        |        |        | 12     | 2.4    |        |   |   |   |   |   |   |   |   | Х | Х |
| 8         | 0      | 0      |              |        |        |        |        | 12     | 2.3    |        |   |   |   |   |   |   |   | Х | Х | Х |
| 4         | 0      | 0      | 12.2         |        |        |        |        |        |        |        | Х | Х | Х | Х |   |   |   |   |   |   |
| 2         | 0      | 0      |              |        |        |        |        | 12     | 2.1    |        |   |   |   |   |   | Х | Х | Х | Х | Х |
| 1         | 0      | 0      | 12 X X X X X |        |        |        |        |        |        | Х      | Х |   |   |   |   |   |   |   |   |   |

| # Samples | 1<br>9 | 1<br>8 | 1<br>7 | 1<br>6 | 1<br>5 | 1<br>4 | 1<br>3 | 1<br>2 | 1<br>1 | 1<br>0 | 9    | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
|-----------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|------|---|---|---|---|---|---|---|---|---|
| 128       | 0      | 0      | 0      | 0      |        |        |        |        | 10     | ).6    |      |   |   |   |   |   |   |   |   |   |
| 64        | 0      | 0      | 0      | 0      | 10.6   |        |        |        |        |        |      |   |   |   |   |   |   |   |   |   |
| 32        | 0      | 0      | 0      | 0      | 10.5   |        |        |        |        |        | 10.5 |   |   |   |   |   |   |   |   | Х |
| 16        | 0      | 0      | 0      | 0      |        | 10.4   |        |        |        |        |      |   |   |   |   | Х | Х |   |   |   |
| 8         | 0      | 0      | 0      | 0      |        |        |        |        | 10     | ).3    |      |   |   |   |   |   |   | х | Х | Х |
| 4         | 0      | 0      | 0      | 0      |        | 10.2   |        |        |        |        |      |   |   | Х | Х | Х | Х |   |   |   |
| 2         | 0      | 0      | 0      | 0      | 10.1   |        |        |        |        |        | 1    | Х | Х | х | Х | Х |   |   |   |   |
| 1         | 0      | 0      | 0      | 0      | 10 X X |        |        |        |        |        | Х    | х | Х | Х |   |   |   |   |   |   |

#### Table 130: 10-bit FIFO Data Format

 Table 131: 8-bit FIFO Data Format

| # Samples | 1<br>9 | 1<br>8 | 1<br>7 | 1<br>6 | 1<br>5 | 1<br>4 | 1<br>3 | 1<br>2 | 1<br>1 | 1<br>0 | 9  | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
|-----------|--------|--------|--------|--------|--------|--------|--------|--------|--------|--------|----|---|---|---|---|---|---|---|---|---|
| 128       | 0      | 0      | 0      | 0      | 0      | 0      |        |        |        | 8      | .6 |   |   |   |   |   |   |   |   |   |
| 64        | 0      | 0      | 0      | 0      | 0      | 0      | 0 8.6  |        |        |        |    |   |   |   |   |   |   |   |   |   |
| 32        | 0      | 0      | 0      | 0      | 0      | 0      | 8.5    |        |        |        |    |   |   |   |   |   | Х |   |   |   |
| 16        | 0      | 0      | 0      | 0      | 0      | 0      | 8.4    |        |        |        |    |   |   |   |   | Х | Х |   |   |   |
| 8         | 0      | 0      | 0      | 0      | 0      | 0      |        |        |        | 8      | .3 |   |   |   |   |   |   | Х | Х | Х |
| 4         | 0      | 0      | 0      | 0      | 0      | 0      | 8.2    |        |        |        |    |   | Х | Х | Х | Х |   |   |   |   |
| 2         | 0      | 0      | 0      | 0      | 0      | 0      | 8.1    |        |        |        |    |   | Х | Х | Х | Х | Х |   |   |   |
| 1         | 0      | 0      | 0      | 0      | 0      | 0      | 8      |        |        |        |    | Х | Х | Х | Х | Х | Х |   |   |   |

Software accesses the contents of the FIFO through the FIFO register. This register will be written by the Audio ADC digital controller simultaneous with the conversion complete interrupt (if enabled) after accumulating the number of samples to average configured for the slot. The FIFO register contains the earliest written data, the number of valid entries within the FIFO and the slot number associated with the FIFO data. Thus the interrupt handler servicing Audio ADC interrupts can easily distribute results to different RTOS tasks by simply looking up the target task using the slot number from the FIFO register.

Three other features greatly simplify the task faced by firmware developers of interrupt service routines for the Audio ADC block:

- 1. The FIFO count bit field is not really stored in the FIFO. Instead it is a live count of the number of valid entries currently residing in the FIFO. If the interrupt service routine was entered because of a conversion then this value will be at least one. When the interrupts routine is entered it can pull successive sample values from the FIFO until this bit field goes to zero. Thus avoiding wasteful re-entry of the interrupt service routine. Note that no further I/O bus read is required to determine the FIFO depth.
- 2. This FIFO has no read side effects. This is important to firmware for a number of reasons. One important result is that the FIFO register can be freely read repetitively by a debugger without affecting the state of the FIFO. In order to pop this FIFO and look at the next result, if any, one simply writes any value to this register. Any time the FIFO is read, then the compiler has gone to the trouble of generat-

ing an address for the read. To pop the FIFO, one simply writes to that same address with any value. This give firmware a positive handshake mechanism to control exactly when the FIFO pops.

3. When a conversion completes resulting in hardware populating the 12th valid FIFO entry, the FIFOOVR1 (FIFO 75% full) interrupt status bit will be set. When a conversion completes resulting in hardware populating the 16th valid FIFO entry, the FIFOOVR2 interrupt status bit will be set. In a FIFO full condition with 16 valid entries, the Audio ADC will not overwrite existing valid FIFO contents. Before subsequent conversions will populate the FIFO with conversion data, software must free an open FIFO entry by writing to the FIFO Register or by resetting the Audio ADC by disabling and enabling the Audio ADC using the CFG register.

## 21.3 DMA

When enabled, the Audio ADC can use DMA to keep its FIFO serviced and transfers samples to SRAM. Generally, DMA should be used when the desired use case is autonomous recording of samples to a preallocated buffer in SRAM. The buffer may be byte-aligned but must be a word-multiple in size.

The general steps to enabling Audio ADC DMA are as follows:

- 1. Ensure SRAM target(s) are powered up.
- 2. Power up the Audio ADC if it's not already on.
- 3. Configure ADC slots and CFG register.
- 4. Set DMATOTCOUNT to the total amount of data to transfer. While the DMA is in progress, this register contains a live count of the remaining data to transfer.
- 5. Configure DMATARGADDR, the SRAM target byte address, for the location in memory of the first sample to be written by DMA.
- 6. Select a DMA trigger level by configuring DMATRIGEN to either FIFO 100% full or FIFO 75% full. This defines what conditions will initiate a DMA transfer.
- 7. Configure DMACFG, including setting DMAEN.
- 8. Trigger the Audio ADC multiple times, using either the timer trigger (when using repeat mode), multiple SW triggers, or multiple external triggers.

Each time the FIFO fills to the appropriate level, the DMA will start and the FIFO will be drained. During this time, depending on the particular use case, it may be appropriate to put the SoC to sleep or deepsleep.

To monitor progress of the DMA, there is a DMASTAT status register. When the DMA is actively transferring data from the Audio ADC FIFO to SRAM, DMATIP will be asserted. At the end of an entire transfer (DMATOTCOUNT reaches 0), then DMACPL will be set. Last, but not least, if an error occurs due to the DMA being asked to perform an illegal operation, DMAERR will be asserted. Causes of a DMA error include:

- DMA transfers to address outside SRAM memory region
- Popping from the FIFO while the DMA is underway

Care must be taken to avoid powering down SRAM that the DMA wants to write to.

If the DMA complete interrupt is enabled, this can be used to wake the SoC from sleep or deepsleep and communicate that the SRAM buffer has been filled and is ready for processing. The DMA error interrupt may also be used to signal the SoC that there is a problem with the DMA configuration.

To recover from a DMA error, disable any repeating trigger, disable the DMA via DMACFG's DMAEN field, and manually drain the Audio ADC FIFO. Then follow the procedure described above for enabling Audio ADC DMA while correcting the configuration issue.

Some additional capabilities of the DMA include:

- Audio ADC auto-power-off upon DMA completion: This feature, enabled via the DMACFG register's DPWROFF field, allows the Audio ADC to power off once DMATOTCOUNT reaches zero. Note that this feature is incompatible with waking the SoC from sleep or deepsleep using the DMA complete interrupt.
- Masking FIFOCNT and SLOTNUM data from FIFO data: The DMA engine can be configured to write only samples to SRAM without the FIFOCNT and SLOTNUM data. This allows the SoC to skip the manual process of masking the potentially undesirable upper bits of each data value written to SRAM.

## 21.4 Window Comparator

A window comparator is provided which can generate an interrupt whenever a sample is determined to be inside the window limits or outside the window limits. These are two separate interrupts with separate interrupt enables. Thus one can request an interrupt any time a specified slot makes an excursion outside the window comparator limits.

The window comparison function has an option for comparing the contents of the limits registers directly with the FIFO data (default) or for scaling the limits register depending on the precision mode selected for the slots.

Firmware has to participate in the determination of whether an actual excursion occurred. The window comparator interrupts set their corresponding interrupt status bits continuously whenever the inside or outside condition is true. Thus if one enables and receives an "*excursion*" interrupt then the status bit can't be usefully cleared while the Audio ADC slot is sampling values outside the limits. That is, if one receives an excursion interrupt and clears the status bit, it will immediately set again if the next Audio ADC sample is still outside the limits. Thus firmware should reconfigure the interrupt enables upon receiving an excursion interrupt so that the next interrupt will occur when an Audio ADC sample ultimately goes back inside the window limits. Firmware may also want to change the windows comparator limit at that time to utilize a little hysteresis in these window comparator decisions.

| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1  | 1    | 0     | 0   | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|---|---|---|---|---|---|---|---|----|------|-------|-----|---|---|---|---|---|---|---|---|
| 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1  | 0    | 9     | 8   | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
|   |   |   |   |   |   |   |   | Lo | ower | · Lin | nit |   |   |   |   |   |   |   |   |

Table 132: Window Comparator Lower Limit Register

| Table | 133: | Window | Com   | parator | Upper | Limit | Reaister |
|-------|------|--------|-------|---------|-------|-------|----------|
|       |      |        | ••••• | paiatoi |       |       |          |

| 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1   | 0     | 0   | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|---|---|---|---|---|---|---|---|---|-----|-------|-----|---|---|---|---|---|---|---|---|
| 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0   | 9     | 8   | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
|   |   |   |   |   |   |   |   | U | ppe | · Lin | nit |   |   |   |   |   |   |   |   |

The determination of whether a sample is *inside* or *outside* of the window limits is made by comparing the data format of the slot result written to the FIFO with the 20 bit window limits. An Audio ADC sample is inside if the following relation is true:

14.6 Lower Limit <= ADC SAMPLE <= 14.6 Upper Limit

Thus setting both limits to the same value, say 700.0 (0x2BC << 6 = 0xAF00), will only produce an inside interrupt when the Audio ADC sample is exactly 700.0 (0xAF00). Furthermore, note that if the lower limit is set to zero (0x00000) and the upper limit is set to 0xFFFFF then all accumulated results from the Audio ADC will be inside the window limits and no excursion interrupts can ever by generated. In fact, in this

case, the incursion interrupt status bit will be set for every sample from any active slot with its window comparator bit enabled. If the incursion interrupt is enabled then an interrupt will be generated for every such sample written to the FIFO.

The window comparator limits are a shared resource and apply to all active slots which have their window comparator bits enabled. If window limits are enabled for multiple enabled slots with different precision modes, the window comparison function can be configured to automatically scale the 14.6 upper and lower limits value to match the corresponding precision mode format for the enabled slots through the SCWLIM register.

# 21.5 Operating Modes and the Mode Controller

The mode controller is a sophisticated state machine that manages not only the time slot conversions but also the power state of the Audio ADC analog components and the hand shake with the clock generator to start the HFRC clock source if required. Thus once the various control registers are initialized, the core can go to sleep and only wake up when there are valid samples in the FIFO for the interrupt service routine to distribute. Firmware does not have to keep track of which block is using the HFRC clock source since the devices in conjunction with the clock generator manage this automatically. The Audio ADC block's mode controller participates in this clock management protocol.

From a firmware perspective, the Audio ADC mode controller is controlled from bit fields in the Audio ADC configuration register and from the various bit fields in the eight slot configuration registers.

The most over-riding control is the PWRENAUDADC enable bit in the PWRCTRL\_AUDSSPWREN register of the power control block. This bit must be set to '1' to enable power to the Audio ADC subsystem. Furthermore, the ADCEN bit in the Audio ADC configuration register is a global functional enable bit for general Audio ADC operation. Setting this bit to zero has many of the effects of a software reset, such as reseting the FIFO pointers. Setting this bit to one enables the mode controller to examine its inputs and proceed to autonomously handle analog to digital conversions.

An Audio ADC scan is the process of sampling the analog voltages at each input of the Audio ADC following a trigger event. If the Audio ADC is enabled and one or more slots are enabled, a scan is initiated after the Audio ADC receives a trigger through one of the configured trigger sources. The scan flowchart diagram can be found in Figure 115

An Audio ADC conversion is the process of averaging measurements following one or more scans for each slot that is enabled.



Figure 115. Scan Flowchart

## 21.5.1 Single Mode

In single mode, one trigger event produces one scan of all enabled slots. Depending on the settings of the accumulate and scale bit field for the active slots, this may or may not result in writing a result to the FIFO. When the trigger source is an external pin then one external pin transition of the proper polarity will result in one complete scan of all enabled slots. If the external pin is connected to a repetitive pulse source then repeating scans of all enabled slots are run at the input trigger rate.

## 21.5.2 Repeat Mode

Counter/Timer 7 has a bit in its configuration register that allows it to be a source of repetitive triggers for the Audio ADC. If counter/timer 7 is initialized for this purpose then one only needs to turn on the RPTEN bit in the Audio ADC configuration register to enable this mode in the Audio ADC.

#### NOTE

The mode controller does **not** process these repetitive triggers from the counter/ timer until a first triggering event occurs from the normal trigger sources. Thus one can select software triggering in the TRIGSEL field and set up all of the other Audio ADC registers for the desired sample acquisitions. Then one can write to the software trigger register and the mode controller will enter REPEAT mode. In repeat mode, the mode controller waits only for each successive counter/timer 3A input to launch a scan of all enabled slots.

#### 21.5.3 Low Power Modes

An application may use the Audio ADC in one of three power modes. Each mode has different implications from overall energy perspective relative to the startup latency from trigger-to-data as well as the standby power consumed. The table below is intended to provide guidance on which mode may be more effective based on latency tolerance. This table should only be used as a reference.

| LPMODE | Definition                                                                                                                                     | Entry Latency                              |
|--------|------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------|
| 0      | Audio ADC is kept active continuously (used in continuous sampling scenarios)                                                                  | 0<br>(requires initial<br>calibration)     |
| 1      | Audio ADC is mostly powered off between samples, HFRC is<br>duty cycled between samples. No calibration required after<br>initial calibration) | <70μs<br>(shorter for lower<br>resolution) |
| 2      | Audio ADC is completely powered off between samples,<br>HFRC is duty cycled between samples. Requires recalibra-<br>tion for each conversion.  | <660µs                                     |

#### Table 134: Audio ADC Power Modes

### 21.5.3.1 Low Power Mode 0

Low Power Mode 0 (LPMODE0) enables the lowest latency from trigger to conversion data available. This mode leaves the reference buffer powered on between scans to bypass any startup latency between triggers<sup>1</sup>.

### 21.5.3.2 Low Power Mode 1

Low power mode 1 (LPMODE1) is a power mode whereby the Audio ADC Digital Controller will automatically power off the Audio ADC clocks, analog Audio ADC and reference buffer between scans while maintaining Audio ADC calibration data. This mode may operate autonomously without CPU interaction, even while the CPU is in sleep or deepsleep mode for repeat mode triggers or hardware triggers. While operating in this mode, the Audio ADC Digital Controller may be used to burst through multiple scans enabling max sample rate data collection if the triggers are running at a rate at least 2x the maximum sample rate until the final scan has completed. When a scan completes without a pending trigger latched, the Audio ADC subsystem will enter a low power state until the next trigger event.

<sup>1.</sup> The reference buffer will not be powered on when the Audio ADC is configured for external reference

## 21.5.3.3 Low Power Mode 2

If desirable, for applications requiring infrequent conversions, software may choose to operate the Audio ADC in LPMODE2, whereby the full Audio ADC Analog and Digital subsystem remains completely powered off between samples. In this use case, the software configures the power control Audio ADC enable register followed by configuring the Audio ADC slots and the Audio ADC configuration register between conversion data collections, followed by disabling the Audio ADC in the power control Audio ADC enable register. Although this mode provides extremely low power operation, using the Audio ADC in this mode will result in a cold start latency including reference buffer stabilization delay and a calibration sequence 100's of microseconds, nominally. IMPORTANT: the Audio ADC Digital Control logic is powered off when the Power Control Audio ADC enable bit (PWRCTRL\_AUDSSPWREN\_PWRENAUDADC) is set to zero. This clears any previously configured registers in the Audio ADC Digital Module necessitating reconfiguration for any subsequent Audio ADC operation. In this mode, the Audio ADC must be reconfigured prior to any subsequent Audio ADC operation.

## 21.6 Interrupts

The Audio ADC has 8 interrupt status bits with corresponding interrupt enable bits, as follows:

- 1. Conversion Complete Interrupt
- 2. Scan Complete Interrupt
- 3. FIFO Overflow Level 1
- 4. FIFO Overflow Level 2
- 5. Window Comparator Excursion Interrupt (a.k.a. outside interrupt)
- 6. Window Comparator Incursion Interrupt (a.k.a. inside interrupt)
- 7. DMA Complete (DCMP)
- 8. DMA Error (DERR)
- 9. DMA transfer complete
- 10. DMA error condition

The window comparator interrupts are discussed above. See "Window Comparator" on page 369.

There are two interrupts based on the *fullness* of the FIFO. When the respective interrupts are enabled, Overflow 1 fires when the FIFO reaches 75% full, viz. 6 entries. Overflow 2 fires when the FIFO is completely full.

When enabled, the conversion complete interrupt fires when a single slot completes its conversion and the resulting conversion data is pushed into the FIFO.

When enabled, the scan complete interrupt indicates that all enabled slots have sampled their respective channels following a trigger event.

When a single slot is enabled and programmed to average over exactly one measurement and the scan complete and conversion complete interrupts are enabled, a trigger event will result in the conversion complete and scan complete interrupts firing simultaneously upon completion of the Audio ADC scan. Again, if both respective interrupts are enabled and a single slot is enabled and programmed to average over 128 measurements, 128 trigger events result in 128 scan complete interrupts and exactly one conversion complete interrupt following the 128 Audio ADC scans. When multiple slots are enabled with different settings for the number of measurements to average, the conversion complete interrupt signifies that one or more of the conversions have completed and the FIFO contains valid data for one or more of the slot conversions.

The DMA transfer complete interrupt is triggered upon completion of the currently configured DMA.

The DMA error interrupt is triggered if the DMA has been instructed to perform an illegal operation such as:

Writing outside SRAM

- Writing to powered-down SRAM
- Popping from the FIFO while DMA is underway

## 21.7 Generating the Sample Rate for the Audio ADC

TIMER6 of the Timer Module has a special function which allows it to function as the sample trigger generator for the Audio ADC. If the TIMER\_GLOBEN\_AUDADCEN bit is set, the output of the timer is sent to the Audio ADC which uses it as a trigger. Typically, TIMER6 is configured in Repeatable Up-counter Compare (UPCOUNT - FN = 2) mode. INTEN\_TMR60INT may be set to generate an interrupt whenever the trigger occurs, but typically the Audio ADC interrupt will be used for this purpose.

# 22. Inter-IC Sound (I<sup>2</sup>S)



Figure 116. I<sup>2</sup>S Block Diagram



Please refer to this module's registers in the applicable SoC's register set, which is included in the AmbiqSuite SDK.

# 22.1 I2S Clock Management

Clock Management consists of master clock source selection and the two-stage divider configuration.



The Receive FIFO and Transmit FIFO decouple the I<sup>2</sup>S clock domain and the back-end bus interface clock domain. All FIFO depths are configurable, and their status and current number of samples are accessible by means of memory mapped registers and ports. The back-end interface supports blocking transactions.

The user-accessible register set contains configuration and information registers such as the number of words currently in the FIFO, its maximum capacity, the programmable RXUPPERLIMIT register, etc.

## 22.2 DMA

DMA is also provided to allow the automatic transfer of a block of data to and from the module. The interface multiplexes the full duplex  $I^2S$  data between the system memory and the external pins using a single DMA interface internally. Data is provided to the DMA logic by either the  $I^2S$  module or the system fabric, and sent to either the  $I^2S$  module or system fabric.

Enabling the DMA operation requires setup in both sub modules. The  $I^2S$  module must be programmed with the  $I^2S$  setup, and to activate the FIFO watermark signals to the interface logic. The interface logic must be updated with the DMA addressing, priority and size parameters, and then enabled for the desired directions. Interrupts should also be enabled to alert the processor on various conditions.

Once programmed and enabled, the DMA data transfer is performed using a series of transfers in a fixed block size of 8 32-bit samples. The internal DMA/APB bus is used for this transfer and, if both TX and RX DMA are enabled, will arbitrate between these as needed to store or fetch the appropriate data.

Arbitration is done in a round robin fashion between the TX and RX when both are active simultaneously. The arbitration occurs before each block of DMA (8 samples) is transferred, and will grant either the alternate direction between transfers if there are active requests for the transfer. There is no mandated order for which DMA direction is done other than the alternate direction is selected if the request is active and was not selected for the previous block of data transfer.

DMA request are created within the I<sup>2</sup>S module based on FIFO space availability (TX) or filled level (RX). These two FIFOs in the I<sup>2</sup>S module are separate and independent. Two registers, TXLOWERLIMIT register and the RXUPPERLIMIT register, are used to program the interrupt condition. During TX operation, if the TX FIFO falls below the number of samples programmed into TXLOWERLIMIT, it will assert a request to the DMA logic to fetch data and store this into the TX FIFO for transmission out of the chip. Similarly, when there is the same or more samples in the RX FIFO as programmed into RXUPPERLIMIT, an RX transfer request will be asserted to transfer data from the RX FIFO to the internal system memory.

Due to the fixed size block transfer of 8 samples (8 32-bit words), TXLOWERLIMIT must not be programmed to more than TX FIFO SIZE - 8, as this could cause overruns. RXUPPERLIMIT must not be programmed to a value less the 8, as this could cause underruns. The FIFO size for both the TX and RX FIFOs is 64 samples.

The address and size of the DMA must be programmed into the interface registers prior to enabling the DMA operation. DMA operations for TX and RX are independent and can operate alone or simultaneously with the other I<sup>2</sup>S DMA operation. Once the DMA is enabled, the system will continue to transfer data in blocks of 8 samples until the TOTCNT number of samples (TXTOTCNT in the TXDMATOTCNT register or RXTOTCNT in the RXDMATOTCNT register) is transferred. Note that the TOTCNT value need not be a multiple of 8. Transfers will be done in units of 8, and the last transfer of the DMA will be 8 or fewer samples, as needed to complete the TOTCNT to zero.

The DMA ADDRESS registers and the TXDMATOTCNT/RXDMATOTCNT registers are updated with each block of data transferred and can be read at any time by the processor without affecting ongoing DMA operations. However, the registers cannot be written unless the DMA is disabled by writing the TXDMAEN

or RXDMAEN bits in the DMACFG register. Once the DMA is disabled, it will stop making requests and transferring data. The maximum size of a single DMA is 4K samples (32-bit words), which would require 16 kB of storage space.

For each direction (TX or RX), interrupts can be used to alert the processor(s) to the completion of a DMA operation, or after a programmable number of blocks have been transferred.

Once completed, software must first write the DMACFG register to 0 prior to making any updates to either DMAADDR register or either DMATOTCNT register.

## 22.3 Interrupts

The interface control logic provides interrupts for DMA operations, and forwards the interrupt from the I<sup>2</sup>S module through the single interrupt signal from the module instance. There are 5 main interrupts available, of 3 different types.

The RX/TX DMACPL interrupt asserts when the programmed DMA completes, or end with an error condition. An error condition is indicated with a 1 in the DMA status registers (TXDMASTAT and RXDMASTAT). If an error condition did occur during a DMA operation, the DMA must first be disabled, then the DMAERR bit cleared by writing a '0' to it. Once this is done, the DMA operation can be reprogrammed and restarted again.

The TX/RX REQCNT interrupt will assert every time the DMA engine transfers a certain number of data blocks (8 samples) of the DMA. The count is programmed into the DMACFG register prior to starting the DMA. If the count register is written with 0 or 1, it will assert for each transfer of 8 samples for the DMA. If programmed to '3' for example, it would assert the interrupt on the completion of every 3rd transfer block.

The last interrupt, IPB, is a level interrupt from the I<sup>2</sup>S module and is controlled with register settings within this module. This is a level based interrupt, and is used within the edge trigger interrupt system, so care must be taken to ensure that the level is de-asserted prior to re-enabling this interrupt.

## 22.4 Data Configurations

Table 135 lists the configurations supported by the I<sup>2</sup>S module.

| Feature                         | Non-ASRC mode                                                               |
|---------------------------------|-----------------------------------------------------------------------------|
| Audio sample sizes (bits)       | 8, 16, 24, 32                                                               |
| Maximum number of chan-<br>nels | 8                                                                           |
| Master/Slave modes              | master or slave                                                             |
| I <sup>2</sup> S/TDM modes      | Any supported by the I <sup>2</sup> S, with the limitations indicated above |

### Table 135: Configurations Supported by the I<sup>2</sup>S Module

Figure 117 and Figure 118 present the packing of the audio samples in the DMA, for all possible audio sample sizes. Those figures illustrate the waveforms for the  $I^2S$  mode (slots of 32 bits, left justification and one delay bit). However, the module can support all  $I^2S$  and TDM modes described in this datasheet, with the limitations indicated in Table 135.



Figure 117. DMA Configurations for 16 Bits and 24 Bits



Figure 118. DMA Configurations for 8 Bits and 32 Bits

# 22.5 Configuration and Control

The I<sup>2</sup>S instance has several configuration and control options in the register set for software driver development and use. Three aspects of configuration and control are important: master/slave configurations, I<sup>2</sup>S initialization, and data flow control using receive and transmit FIFOs.

## 22.5.1 Master/Slave Configurations

The I<sup>2</sup>S instance can be configured to operate in slave or master mode. At power-up, they are configured to act as slaves to avoid signal collisions with other modules on the chip. The operation mode can be changed by writing the appropriate value to the MSL (Master Slave Configuration) bit in the I2S IO Configuration (I2SIOCFG) register.

### 22.5.2 Module Initialization

The following steps outline the procedure for initializing the  $I^2S$  interface. Unless otherwise noted, referenced registers are in the  $I^2S$  register documentation.

- 1. Reset the serial receiver or transmitter by asserting bits RXRST and/or TXRST in the I2SCTL register.
- 2. Specify the polarity of the I<sup>2</sup>S IO signals by writing the desired configuration to the I<sup>2</sup>S IO Configuration (I2SIOCFG) register.
- 3. Specify the serial data format in the I<sup>2</sup>S Data Format Configuration (I2SDATACFG) register.
- 4. To generate interrupts triggered by the number of samples in the FIFOs, do as follows:
  - A. Read the Receive FIFO Size register (RXFIFOSIZE) or the Transmit FIFO Size register (TXFI-FOSIZE) to know its size, if not known already.
  - B. Write the desired values to the Receive FIFO Upper Limit register (RXUPPERLIMIT) and/or the Transmit FIFO Lower Limit register (TXLOWERLIMIT).
  - C. Unmask the RXFFI or TXFFI interrupt bits by asserting the RXFFM or TXFFM mask bits in the IPB Interrupt Mask register (IPBIRPT).
- 5. To generate interrupts when the next channel to be received becomes uncertain, due to a Transmit FIFO underrun or Receive FIFO overrun condition, unmask the TXEI and RXFI interrupt bits by asserting the TXEM or RXFM mask bits in the IPB Interrupt Mask register (IPBIRPT).
- 6. Write samples to the Transmit FIFO so that the transmit FIFO has samples above the FIFO Lower Limit value.
- 7. To start receiving and/or transmitting audio streams, enable the receiver and/or transmitter by asserting the RXEN and/or TXEN bits in the I<sup>2</sup>S Control register (I2SCTL). Note that for full-duplex operation, the TXEN and RXEN bits must be asserted in a single register access.

## 22.5.3 Data Flow Control

### 22.5.3.1 Receiver Data Flow Control

- 1. Poll the number of samples field or the Empty bit in the Receive FIFO Status register (RXFIFOSTA-TUS), and take action to prevent the FIFO from becoming full.
- 2. Alternatively, set the RXFFM bit in the IPB Interrupt Mask register (IPBIRPT) to enable the RXFFI interrupt bit when the number of samples in the Receive FIFO exceeds the value in the Receive FIFO Upper Limit register (RXUPPERLIMIT). The RXFFI interrupt bit will clear after the number of samples in the Receive FIFO is below the value in RXUPPERLIMIT.
- 3. If the Receive FIFO does become full, the newly received samples are dropped, and an overrun condition is triggered by means of the RXFI interrupt bit, if the RXFM mask bit is enabled. The overrun condition means that the order of the received channels (channel synchronization) has become uncertain.

- 4. To regain synchronicity, disable the receiver by de-asserting the RXEN bit, reset the receiver by asserting the RXRST bit, and re-enable it by asserting the RXEN bit; these bits are all in the I<sup>2</sup>S Control register (I2SCTL). After this operation, the module will resume receiving samples starting from the first channel of a new frame.
- 5. During normal operation (before an overrun condition has occurred) use the Receive Channel ID register (RXCHANID) to know the last channel received. After an overrun condition has occurred, the contents of RXCHANID become meaningless.

## 22.5.3.2 Transmitter Data Flow Control

- 1. Poll the number of samples field or the Full bit in the Transmit FIFO Status register (TXFIFOSTATUS), and take action to prevent the Transmit FIFO from becoming empty.
- Alternatively, set the TXFFM bit in the IPB Interrupt Mask register (IPBIRPT) to enable the TXFFI interrupt bit when the number of samples in the Transmit FIFO drops below the value in the Transmit FIFO Lower Limit register (TXLOWERLIMIT). The TXFFI interrupt bit will clear after the number of samples in the Transmit FIFO is above the value in TXLOWERLIMIT.
- 3. If the Transmit FIFO does become empty, a zero is transmitted, and an underrun condition is triggered by means of the TXEI interrupt bit, if the TXEM mask bit is enabled. The underrun condition means that the order of the transmitted channels (channel synchronization) has become uncertain.
- 4. To regain synchronicity, disable the transmitter by de-asserting the TXEN bit, reset the transmitter by asserting the TXRST bit, clear the TXRST bit to take the transmitter out of the reset state, write samples to the Transmit FIFO until it contains more samples than the value in the Transmit FIFO Lower Limit register (TXLOWERLIMIT), and finally re-enable the transmitter by asserting the TXEN bit; the TXEN and TXRST bits are all in the I<sup>2</sup>S Control register (I2SCTL). After this operation, the module will resume receiving samples starting from the first channel of a new frame.
- 5. During normal operation (before an overrun condition has occurred) use the Transmit Channel Identification register to know the next channel to be transmitted. After an underrun condition has occurred, the contents of the Transmit Channel Identification register become meaningless.

## 22.6 Serial Audio Interface

#### 22.6.1 TDM Serial Audio Format

Figure 119 illustrates the relationship between the behavior of the frame sync, fsync, signal and the configuration parameters FPER (Frame Period) and FWID (fsync width) in the I<sup>2</sup>S IO Configuration (I2SIOCFG) register. The fsync signal comes out of the Apollo4 on one of the pins offering I2Sx\_WS functionality. Refer to the Pin Mapping Table in the GPIO chapter.

The FPER parameter has a maximum value of 4095 and measures the distance in sclk cycles between two consecutive fsync assertions. The FWID parameter has a maximum value of 255 and indicates the number of sclk cycles during which fsync will stay asserted. When the module is configured as a slave, these registers are not used.



Figure 119. Programmable Frame Period and Width

A timing diagram of a single channel 8-bit sample audio serial interface signal can be observed in Figure 120. A frame starts in the first clock cycle in which the frame synchronization, fsync, is active. The data contents can be delayed with respect to the frame start using the DATADLY parameter in the Control register, which can be configured for 0-bit, 1-bit or 2-bit delay as shown in the figure.



Figure 120. Data Delay

For example, for a 1-bit delay, the module must be configured as follows [see the I<sup>2</sup>S Data Format Configuration (I2SDATACFG) and I<sup>2</sup>S IO Configuration (I2SIOCFG) registers]:

- PH = 0, specifying a single-phase frame
- FRLEN1 = 0, specifying one channel per frame
- WDLEN1 = 1h, specifying 12 bits per channel
- DATADLY = 1, specifying a 1-bit data delay
- SSZ1 = 0, specifying an 8 bit audio sample for each channel

- FPER = 0Bh, specifying a frame period of 12 cycles
- FWID = 0h, specifying a 1-cycle frame sync pulse
- PRx = 0, specifying sclk rising edge sampling
- JUST = 0, specifying that the receive data is left-justified
- FSP = 1, specifying that an active high fsync signal is used

## NOTE In the examples in this section, 12 bits per channel sample size is used as the WDLEN1 parameter, even though only 8, 16, 24 and 32 bits per channel sample size is supported in the $I^2S$ modules on this device.



## Figure 121. Dual-phase Frame Example

Figure 121 shows a TDM example with two phases. The module must be configured as follows [see the I<sup>2</sup>S Data Format Configuration (I2SDATACFG) and I<sup>2</sup>S IO Configuration (I2SIOCFG) registers]:

- PH = 1, specifying a two-phase frame
- FRLEN1 = 1, specifying two channels in phase 1
- FRLEN2 = 3, specifying four channels in phase 2
- WDLEN1 = 1h, specifying 12 bits per channel in phase 1
- WDLEN2 = 0h, specifying 8 bits per channel in phase 2
- DATADLY = 1, specifying a 1-bit data delay
- FPER = 38h, specifying a frame period of 56 cycles
- FWID = 0h, specifying a 1-cycle frame sync pulse
- PRx = 0, specifying sclk rising edge sampling
- JUST = 0, specifying that the receive data is left-justified
- FSP = 1, specifying that an active high fsync signal is used



### Figure 122. Right Justification

Figure 122 shows an audio frame using right justification. The module must be configured as follows [see the I<sup>2</sup>S Data Format Configuration (I2SDATACFG) and I<sup>2</sup>S IO Configuration (I2SIOCFG) registers]:

- PH = 0, specifying a single-phase frame
- FRLEN1 = 0, specifying one channel per frame
- WDLEN1 = 1h, specifying 12 bits per channel
- DATADLY = 0, specifying a 0-bit data delay
- SSZ1 = 0, specifying an 8-bit audio sample for each channel
- FPER = Bh, specifying a frame period of 12 cycles
- FWID = 0h, specifying a 1-cycle frame sync pulse
- PRx = 0, specifying sclk rising edge sampling
- JUST = 1, specifying that the receive data is left-justified
- FSP = 1, specifying that an active high fsync signal is used

## 22.6.2 I<sup>2</sup>S Serial Audio Format

Figure 123 shows an I<sup>2</sup>S formatted audio frame.



### Figure 123. I<sup>2</sup>S Formatted Audio Frame

The module must be configured as follows [see the I<sup>2</sup>S Data Format Configuration (I2SDATACFG) and I<sup>2</sup>S IO Configuration (I2SIOCFG) registers]:

- PH = 0, specifying a single-phase frame
- FRLEN1 = 1, specifying two channels per frame
- WDLEN1 = 5, specifying 32 bits per channel
- DATADLY = 1, specifying a 1-bit data delay
- SSZ1 = 0, specifying an 8 bit audio sample for each channel
- FPER = 3Fh, specifying a frame period of 64 cycles
- FWID = 1Fh, specifying a frame sync pulse of 32 bits
- PRx = 0, specifying sclk rising edge sampling
- JUST = 0, specifying that the receive data is left-justified
- FSP = 0, specifying that active low fsync signals are used

### 22.6.3 Left-justified Serial Audio Format

Figure 124 shows a left-justified formatted audio frame.



#### Figure 124. Left-justified Audio Frame

The module must be configured as follows [see the I<sup>2</sup>S Data Format Configuration (I2SDATACFG) and I<sup>2</sup>S IO Configuration (I2SIOCFG) registers]:

- PH = 0, specifying a single-phase frame
- FRLEN1 = 1, specifying two channels per frame
- WDLEN1 = 5, specifying 32 bits per channel
- DATADLY = 0, specifying a 0-bit data delay
- SSZ1 = 0, specifying an 8 bit audio sample for each channel
- FPER = 3Fh, specifying a frame period of 64 cycles
- FWID = 1Fh, specifying a frame sync pulse of 32 bits
- PRx = 0, specifying sclk falling edge sampling
- JUST = 0, specifying that the receive data is left-justified
- FSP = 1, specifying that active high fsync signals are used

#### 22.6.4 Right-justified Serial Audio Format

Figure 125 shows a right-justified formatted audio frame.



#### Figure 125. Right-justified Audio Frame

The module must be configured as follows [see the I<sup>2</sup>S Data Format Configuration (I2SDATACFG) and I<sup>2</sup>S IO Configuration (I2SIOCFG) registers]:

- PH = 0, specifying a single-phase frame
- FRLEN1 = 1, specifying two channels per frame
- WDLEN1 = 5, specifying 32 bits per channel
- DATADLY = 0, specifying a 0-bit data delay
- SSZ1 = 0, specifying an 8 bit audio sample for each channel
- FPER = 3Fh, specifying a frame period of 64 cycles
- FWID = 1Fh, specifying a frame sync pulse of 32 bits
- PRx = 0, specifying sclk falling edge sampling
- JUST = 1, specifying that the receive data is right-justified
- FSP = 1, specifying that active high fsync signals are used

# 23. Ordering Information

| Device Name      | Orderable<br>Part Number | MRAM | RAM    | Package                         | Packing       | Temperature<br>Range |
|------------------|--------------------------|------|--------|---------------------------------|---------------|----------------------|
| Apollo4 SoC      | AMAP42KK-KBR-<br>B2      | 2 MB | 1.8 MB | 5.0 x 5.0<br>146-pin BGA        | Tape and Reel | -20 to 60°C          |
| Apollo4 Blue SoC | AMA4B2KK-KBR-<br>B2      | 2 MB | 1.8 MB | 4.7 x 4.7<br>131-pin SiP<br>BGA | Tape and Reel | -20 to 60°C          |

## Table 136: Apollo4 SoC Ordering Information

## Table 137: Apollo4 Plus SoC Ordering Information

| Device Name              | Orderable<br>Part Number | MRAM | RAM     | Package<br>(mm)          | Packing       | Temperature<br>Range |
|--------------------------|--------------------------|------|---------|--------------------------|---------------|----------------------|
| Apollo4 Plus SoC         | AMAP42KP-KBR             | 2 MB | 2.75 MB | 5.0 x 5.0<br>146-pin BGA | Tape and Reel | -20 to 60°C          |
| Apollo4 Blue Plus<br>SoC | AMA4B2KP-KBR             | 2 MB | 2.75 MB | 4.7 x 4.7<br>131-pin BGA | Tape and Reel | –20 to 60°C          |
| Apollo4 Blue Plus<br>SoC | AMA4B2KP-KXR             | 2 MB | 2.75 MB | 4.7 x 4.7<br>131-pin BGA | Tape and Reel | –20 to 60°C          |

#### Table 138: Apollo4 Lite SoC Ordering Information

| Device Name              | Orderable<br>Part Number | MRAM | RAM      | Package<br>(mm)          | Packing       | Temperature<br>Range |
|--------------------------|--------------------------|------|----------|--------------------------|---------------|----------------------|
| Apollo4 Lite SoC         | AMAP42KL-KBR             | 2 MB | 1.375 MB | 5.0 x 5.0<br>146-pin BGA | Tape and Reel | -20 to 60°C          |
| Apollo4 Blue Lite<br>SoC | AMA4B2KL-KBR             | 2 MB | 1.375 MB | 4.7 x 4.7<br>131-pin BGA | Tape and Reel | –20 to 60°C          |



©2023 Ambiq Micro, Inc. All rights reserved. Ambiq Micro, Inc. 6500 River Place Boulevard, Building 7, Suite 200, Austin, TX 78730-1156

www.ambiq.com/

sales@ambiqmicro.com

https://support.ambiqmicro.com

+1 (512) 879-2850

PG-A4-8p0 Version 8.0 July 2023