IC Design: ARM protocols

When designing an SoC, the following criteria drive the system-level choices:

• Choosing the right bus for your system

• Peripherals selection

• Performance requirements based on your application

ARM

First, learn the three major ARM microprocessor families:

Cortex-M, Cortex-A and Cortex-R.

Cortex-M: For the microcontroller domain, with low power consumption and real-time applications.

Cortex-A: For the application domain; generally used to run an operating system (for example, a Linux distribution with a specific kernel).

Cortex-R: For real-time applications, such as the automotive area.

Know that the instruction set is RISC; a main goal of this choice is to reduce the power consumption of the SoC that embeds the ARM microprocessor.

APB is mainly intended for connecting simple, low-power peripherals. It groups narrow-bus peripherals so that they do not load the system bus. APB does not support burst transfers, but AHB does.

APB is a simple protocol, mostly useful for register reads and writes, with no fancy signals: one operation completes after another. AHB, on the other hand, supports SINGLE, INCR and WRAP transfers with various burst lengths, thus giving better performance.
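To make the difference between incrementing and wrapping bursts concrete, here is a minimal Python sketch of how the beat addresses of an AHB-style burst could be computed. The function name and parameters are illustrative and not taken from any ARM deliverable, and the sketch assumes the start address is aligned to the transfer size.

```python
def ahb_burst_addresses(start, beats, size_bytes, wrap=False):
    """Return the address of each beat in an AHB-style burst.

    Assumes `start` is aligned to `size_bytes` (hypothetical helper).
    """
    # A wrapping burst stays inside a block of beats * size_bytes bytes.
    block = beats * size_bytes
    base = (start // block) * block
    addrs = []
    addr = start
    for _ in range(beats):
        addrs.append(addr)
        addr += size_bytes
        # At the block boundary a WRAP burst returns to the block base.
        if wrap and addr >= base + block:
            addr = base
    return addrs


incr4 = ahb_burst_addresses(0x38, beats=4, size_bytes=4)
wrap4 = ahb_burst_addresses(0x38, beats=4, size_bytes=4, wrap=True)
print([hex(a) for a in incr4])
print([hex(a) for a in wrap4])
```

A four-beat word burst starting at 0x38 keeps counting upward as INCR4, while WRAP4 wraps at the 16-byte boundary and continues from 0x30. APB has no equivalent: every APB access is a single transfer.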

The APB has been designed to implement as simple an interface as possible. This simplicity makes it much easier to connect new APB peripherals and makes the system performance easier to analyze.

AHB is ARM's most popular protocol. It was invented in an era when the computing power of integrated circuits was very primitive compared to what we have now (2019).

As computing demands rose, AHB started to fall short of the needs of systems that were ever hungrier for bandwidth.

One of the major problems with the AHB protocol is its inability to support what are called 'outstanding' transactions. An outstanding transaction is simply one that has been issued but whose response is still awaited. This mostly concerns 'read' transactions, as 'write' transactions can live without a response. AHB does support 'split' transactions, which one may argue are its form of outstanding-transaction support, but they never took off, nor did they serve the purpose of 'outstanding' transactions.

APB

1. When should an APB slave sample address/data for a read/write transaction from the APB master?

For a read or write transaction, the APB master asserts PSEL = 1 in the first clock cycle (SETUP phase) and then PENABLE = 1 in the next clock cycle (ACCESS phase).

During these two clock cycles, PADDR, PWDATA and PWRITE do not change. If the APB slave drives PREADY always HIGH, PENABLE returns to 0 at the end of the second clock cycle, as per the protocol. For a read transfer, the slave must drive PREADY, PSLVERR and PRDATA in the same clock cycle, which is the ACCESS phase here. Control signals should be sampled by the peripheral at the end of the setup phase.

For write transfers the peripheral can then sample PWDATA either at the end of the setup phase (when PSEL = 1 and PENABLE = 0) or at the end of the access phase (when PSEL = 1 and PENABLE = 1).

PWDATA is guaranteed to be stable throughout the APB access, so it doesn't really matter when you sample it. Sampling at the end of the access phase is presumably the intention of the protocol, as it allows the peripheral one cycle to detect the transfer and another cycle to prepare to sample the data.

For read transfers the PRDATA bus is only valid during the access phase, that is, when PSEL = 1 and PENABLE = 1 (and PREADY = 1), so the master samples read data only just before PENABLE goes low at the end of the access.

Consider a slave design that behaves as follows:

First clock: PSEL = 1, PWRITE = 0/1 (read/write; PREADY is LOW by default)

Second clock: PSEL = 1, PENABLE = 1, PWRITE = 0/1

(for a write, mem[paddr] <= pwdata; for a read, prdata <= mem[paddr])

Third clock: PREADY = 1

For write transfers, the address and data are sampled when PSEL = 1 and PENABLE = 1 (second clock).

PREADY = 1 is then driven in the next clock cycle (the third), when the data is written to the internal registers, so that the data and PREADY occur at the same time. This takes three cycles to complete one APB write or read transfer. The same is done for a read transfer, where PRDATA and PREADY = 1 are driven in the third clock cycle.

If PREADY = 1 by default, the APB master completes the transfer in two clock cycles, as per the protocol. But in this design PRDATA is only available in the third clock cycle, by which time PSEL and PENABLE have gone LOW, so PRDATA is no longer valid for the master.

2. Why is there no wait signal on the APB?

The original APB has no wait signal; PREADY was added in APB3. PREADY can remain high forever if you don't want to add wait states. If you are connecting an APB2 peripheral (with no PREADY output) to an APB3 master, you would just tie the PREADY input on the master high. PREADY is only checked during the ACCESS phase of an APB access, so when PSEL is low, or when PSEL is high but PENABLE is low (the transfer SETUP phase), PREADY is not checked and may take any value, including high.

Typically the driver software will first access a status register to determine that data is available and only then access the data register. Both of these accesses are possible without the addition of wait states and therefore the peripheral can easily be accessed as an APB device.

Peripherals which do require wait states can be designed as AHB slaves. In the rare case that a design includes a large number of such peripherals, a secondary stub AHB can be used to reduce the loading on the main system bus.

3. How should AHB to APB bridges handle accesses that are not 32 bits wide?

The bridge should simply pass the entire 32-bit data bus through. When transfers narrower than 32 bits are performed to an APB slave, it is important to ensure that the peripheral is located on the appropriate bits of the APB data bus.
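As a rough illustration of what "the appropriate bits" means, the following Python sketch computes which bits of a 32-bit data bus a narrow access would occupy. It assumes a little-endian bus and an access aligned to its own size; both are assumptions made for this example, and the helper names are hypothetical.

```python
def byte_lane_bits(addr, size_bytes, bus_bytes=4):
    """Return (msb, lsb) of the data-bus bits used by a narrow access.

    Assumes a little-endian bus and a size-aligned access
    (illustrative assumptions, not stated in the text above).
    """
    offset = addr % bus_bytes
    if offset % size_bytes != 0 or offset + size_bytes > bus_bytes:
        raise ValueError("access is not aligned to its size")
    lsb = 8 * offset
    msb = lsb + 8 * size_bytes - 1
    return msb, lsb


def place_on_bus(value, addr, size_bytes):
    """Shift a narrow value onto the byte lanes selected by the address."""
    _, lsb = byte_lane_bits(addr, size_bytes)
    mask = (1 << (8 * size_bytes)) - 1
    return (value & mask) << lsb


print(byte_lane_bits(0x1001, 1))             # byte at offset 1
print(hex(place_on_bus(0xAB, 0x1001, 1)))    # same byte on the 32-bit bus
```

Under these assumptions, an 8-bit register decoded at byte offset 1 would sit on bits [15:8], so a peripheral wired only to bits [7:0] would never see that data.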

4. For the first-clock and second-clock description above, why can't PREADY be high during the second clock?

If the memory just needs an enable signal to know when PWDATA should be sampled, that enable only needs to be active when PSEL & PWRITE & PENABLE is 1'b1 if the data input is a latch, or on the falling edge of the PSEL & PWRITE & PENABLE logic (both probably also including some PADDR decoding).

If this logic is used during the second clock only to know when to sample data, and the data is not written to the actual memory until one cycle later, then you could drive PREADY one cycle later as described, but that is something your design requires, not something the protocol requires.

For reads, the memory access can start as soon as PENABLE goes high, so if the memory being accessed can generate valid PRDATA during that single-cycle "second clock", PREADY could again be driven high in this cycle. If PRDATA cannot be driven valid during the second clock cycle, that is again a design issue requiring an extra cycle, not something mandated by the APB spec.

If PRDATA is not valid until the third clock cycle, that sounds like an access time limitation of the register or memory being accessed. PREADY lets you stall the APB access until the peripheral can drive back valid read data. But since the address being read is known at the end of the first clock cycle, the protocol allows the peripheral to start returning read data immediately, to be sampled by the APB master at the end of the second cycle, so 3 cycles are not required.

The basic access timing for APB is 2 cycles (1st: SETUP phase, 2nd: ACCESS phase). PREADY allows you to extend this access timing, but only if the peripheral being accessed needs additional access time.
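The effect of PREADY on access time can be sketched with a small master-side model in Python. This is only an illustration: `apb_transfer` and its argument (the PREADY value the slave drives in each successive ACCESS cycle) are names invented for this example.

```python
def apb_transfer(pready_per_access_cycle):
    """Trace one APB transfer as seen by the master.

    pready_per_access_cycle: PREADY value driven by the slave in each
    successive ACCESS cycle (hypothetical input for this sketch).
    Returns a list of (phase, PSEL, PENABLE, PREADY) tuples, one per clock.
    """
    # SETUP always lasts exactly one cycle; PREADY is not checked here.
    trace = [("SETUP", 1, 0, None)]
    for pready in pready_per_access_cycle:
        trace.append(("ACCESS", 1, 1, pready))
        if pready:
            break  # the transfer completes in this cycle
    else:
        raise ValueError("slave never asserted PREADY")
    return trace


no_wait = apb_transfer([1])       # PREADY high in the first ACCESS cycle
one_wait = apb_transfer([0, 1])   # one wait state inserted by the slave
print(len(no_wait), len(one_wait))
```

With PREADY high in the first ACCESS cycle the transfer takes the basic two cycles; each cycle in which the slave holds PREADY low adds one more ACCESS cycle.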

For write transfers, 2-cycle completion can be achieved by driving PREADY = 1 by default, or by driving it HIGH in the second clock cycle (PSEL = 1, PWRITE = 1, PENABLE = 1), in which PADDR and PWDATA are sampled. This ensures that the data is written to the specified address in the slave's internal register or memory.

For read transfers, if PADDR is sampled only in the ACCESS phase, PRDATA cannot be returned in the same clock cycle; it takes an additional cycle to return the data. In that case PREADY = 1 is driven only when valid data is available (third clock).

To return valid PRDATA with PREADY = 1 in the second clock cycle, the slave must instead sample PADDR in the SETUP phase (first clock cycle, PSEL = 1 and PENABLE = 0). This cannot be achieved when the address is sampled in the ACCESS phase: because these are nonblocking assignments (NBA), the registered data needs one more cycle to appear. Write data can still be sampled in the access phase.

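5. In some designs PREADY is driven HIGH all the time. In such scenarios, how does the designer ensure PRDATA is valid in the second clock cycle?

Control signals should be sampled by the peripheral at the end of the setup phase. This allows the peripheral to start driving PRDATA at the start of the access phase, so the transfer can complete at the end of a single-cycle access phase if the peripheral's access time allows PRDATA to be valid by the end of that cycle. This is how read accesses can be completed in 2 cycles, and it is how accesses had to work in the first releases of APB, where there was no PREADY signal.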
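A minimal behavioral sketch in Python of such a zero-wait-state slave is shown below. The class and helper names are hypothetical, and the model only captures the idea of registering read data on the clock edge that ends the SETUP phase; it is not a description of any particular peripheral.

```python
class ApbRegSlave:
    """Zero-wait-state APB slave model with PREADY tied HIGH (sketch)."""

    def __init__(self, depth=16):
        self.mem = [0] * depth
        self.prdata = 0
        self.pready = 1  # never inserts wait states

    def posedge(self, psel, penable, pwrite, paddr, pwdata=0):
        """Apply the values seen during one cycle at the PCLK edge ending it."""
        if psel and not penable and not pwrite:
            # End of SETUP: register read data so that it is valid
            # throughout the following ACCESS cycle.
            self.prdata = self.mem[paddr]
        elif psel and penable and pwrite:
            # End of ACCESS: commit the write data.
            self.mem[paddr] = pwdata


def apb_write(slave, addr, data):
    slave.posedge(1, 0, 1, addr, data)  # SETUP cycle
    slave.posedge(1, 1, 1, addr, data)  # ACCESS cycle
    return 2  # clock cycles used


def apb_read(slave, addr):
    slave.posedge(1, 0, 0, addr)        # SETUP cycle
    data = slave.prdata                 # PRDATA as driven during ACCESS
    slave.posedge(1, 1, 0, addr)        # ACCESS cycle ends, master samples
    return data, 2


slave = ApbRegSlave()
apb_write(slave, 3, 0xCAFE)
print(apb_read(slave, 3))
```

Because the address is captured at the end of SETUP, the read data is already stable during ACCESS and the transfer finishes in two cycles. Had the model registered PRDATA at the end of ACCESS instead, the data would only appear in a third cycle.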

6. In APB, why do we use an enable signal?

In APB, PSELx selects a slave on the APB bus and PENABLE indicates the second cycle of an APB transfer. In the original APB there is no ready signal, so master and slave both use the enable signal to know that the data transfer is done. The ready signal was added in APB 3.0; from then on, ready has to be used along with enable to know whether the data transfer is done.

The APB bridge generates one PSELx signal for each slave, and all the slaves share the same PENABLE signal. If you are designing a slave, its PSEL should come from an address decoder that selects which APB slave is accessed. If the APB slave is not selected, it simply ignores PENABLE.

Example (writes are qualified by the access phase; reads may start as soon as the slave is selected): assign select = PWRITE ? (PSEL & PENABLE) : PSEL;


AMBA protocols never specify max frequencies because that frequency will depend on the silicon library you are targeting, the complexity of the system you are designing, and how much effort you put into synthesis (not just going for simple timing budgets).

9. What is the difference between transfers with and without wait states (read/write)? What are the advantages of each in APB?

Use a wait state if your peripheral's access timing requires it; don't use one if you don't need it. A transfer with no wait states completes in the basic two cycles, and each wait state adds one cycle.

10. What is an error condition in an APB transfer, and when is it valid?

PSLVERR indicates an error condition on an APB transfer. Error conditions can occur on both read and write transactions. PSLVERR is only considered valid during the last cycle of an APB transfer, when PSEL, PENABLE and PREADY are all HIGH.

It is recommended, but not mandatory, that you drive PSLVERR LOW when it is not being sampled, that is, when any of PSEL, PENABLE or PREADY is LOW.

Transactions that receive an error, might or might not have changed the state of the peripheral. This is peripheral-specific and either is acceptable. When a write transaction receives an error this does not mean that the register within the peripheral has not been updated. Read transactions that receive an error can return invalid data. There is no requirement for the peripheral to drive the data bus to all 0s for a read error.

APB peripherals are not required to support the PSLVERR pin. This is true for both existing and new APB peripheral designs. Where a peripheral does not include this pin then the appropriate input to the APB bridge is tied LOW.
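The sampling rule for PSLVERR can be written as a one-line qualifier. The Python sketch below uses a hypothetical helper name and simply masks PSLVERR with the condition that marks the last cycle of a transfer.

```python
def pslverr_sampled(psel, penable, pready, pslverr):
    """Return the error flag a master would record in this cycle.

    PSLVERR only counts in the last cycle of a transfer, when PSEL,
    PENABLE and PREADY are all HIGH; otherwise it is ignored.
    """
    last_cycle = bool(psel and penable and pready)
    return bool(pslverr) if last_cycle else False


# (PSEL, PENABLE, PREADY, PSLVERR) for SETUP, a wait state, and the final cycle
cycles = [(1, 0, 0, 1), (1, 1, 0, 1), (1, 1, 1, 1)]
print([pslverr_sampled(*c) for c in cycles])
```

Only the final cycle reports an error here, even though the slave in this made-up trace drives PSLVERR HIGH throughout, which the protocol discourages but does not forbid.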

11. PSLVERR mapping when bridging

When bridging:

From AXI to APB: An APB error is mapped back to RRESP/BRESP = SLVERR. This is achieved by mapping PSLVERR to the AXI signals RRESP[1] for reads and BRESP[1] for writes.

From AHB to APB: PSLVERR is mapped back to HRESP = ERROR for both reads and writes. This is achieved by mapping PSLVERR to the AHB signal HRESP[0].
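The bit mappings above can be checked with a short Python sketch. The function names are hypothetical; the response encodings used (AXI OKAY = 0b00, SLVERR = 0b10; AHB OKAY = 0, ERROR = 1 on HRESP[0]) are the standard ones, and the two-cycle timing of an AHB ERROR response is deliberately not modeled.

```python
AXI_OKAY = 0b00
AXI_SLVERR = 0b10
AHB_OKAY = 0
AHB_ERROR = 1


def axi_resp_from_pslverr(pslverr):
    """Drive bit 1 of RRESP/BRESP from PSLVERR; bit 0 stays 0."""
    return (1 if pslverr else 0) << 1


def ahb_hresp_from_pslverr(pslverr):
    """Drive HRESP[0] from PSLVERR."""
    return 1 if pslverr else 0


print(bin(axi_resp_from_pslverr(1)), ahb_hresp_from_pslverr(1))
```

Driving only bit 1 on the AXI side yields exactly OKAY or SLVERR, and driving only bit 0 on the AHB side yields exactly OKAY or ERROR, so no extra encoding logic is needed in the bridge for this path.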

12. Operational activity of the APB

IDLE: This is the default state of the APB.

SETUP: When a transfer is required the bus moves into the SETUP state, where the appropriate select signal, PSELx, is asserted. The bus only remains in the SETUP state for one clock cycle and always moves to the ACCESS state on the next rising edge of the clock.

ACCESS: The enable signal, PENABLE, is asserted in the ACCESS state. The address, write, select, and write data signals must remain stable during the transition from the SETUP to the ACCESS state. Exit from the ACCESS state is controlled by the PREADY signal from the slave:

If PREADY is held LOW by the slave then the peripheral bus remains in the ACCESS state.

If PREADY is driven HIGH by the slave then the ACCESS state is exited and the bus returns to the IDLE state if no more transfers are required. Alternatively, the bus moves directly to the SETUP state if another transfer follows.
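The three states and their transitions can be sketched as a small next-state function in Python. The name `transfer_pending` is an assumption standing in for "a transfer is required"; it is not an APB signal.

```python
IDLE, SETUP, ACCESS = "IDLE", "SETUP", "ACCESS"


def next_state(state, transfer_pending, pready):
    """Next APB bus state at a rising clock edge."""
    if state == IDLE:
        return SETUP if transfer_pending else IDLE
    if state == SETUP:
        return ACCESS  # SETUP always lasts exactly one cycle
    if state == ACCESS:
        if not pready:
            return ACCESS  # slave is inserting a wait state
        return SETUP if transfer_pending else IDLE
    raise ValueError("unknown state: %r" % (state,))


def run(inputs, state=IDLE):
    """Apply a sequence of (transfer_pending, pready) pairs."""
    states = [state]
    for pending, pready in inputs:
        state = next_state(state, pending, pready)
        states.append(state)
    return states


# One transfer with a wait state, followed back-to-back by a second transfer.
seq = run([(True, 0), (False, 0), (False, 0), (True, 1), (False, 1), (False, 1)])
print(seq)
```

In this run the first transfer spends two cycles in ACCESS because PREADY is LOW once, then the bus goes straight from ACCESS to SETUP for the second transfer and finally returns to IDLE.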

13. In APB there are two phases, SETUP and ACCESS. The ACCESS phase is indicated by the assertion of the PENABLE signal. Why do we require these phases?

In principle, PENABLE could be driven high at the same time as the data and control signals are driven by the master, and the slave could assert PREADY when it detects PSEL and PENABLE.

14. Why is there a delay between the assertion of the data/control signals and PENABLE? Is there any reason for this delay?

The APB protocol was designed to be simple, with no complex pipelining or timing. Two phases allow the APB master to say what it wants (the setup phase) and then the required transfer to occur (the access phase). Doing everything in one cycle wouldn't work in synchronous logic, because the peripheral being accessed won't know there is a transfer request until the end of the first cycle, which means a read access will always need a second cycle.

Even for write accesses, a peripheral that needs to signal wait states won't know to drive PREADY low until it has sampled PSEL, PADDR, PWRITE etc., so a second cycle is again required for this decision to be made and sampled synchronously. If you need higher-performance, lower-latency accesses, use a more complex "system" bus such as AHB or AXI, and accept more complex interface designs. Control signals (PSEL, PADDR, PWRITE, PPROT etc.) are driven at the start of the "setup" phase, and PENABLE is driven one cycle later at the start of the "access" phase.

It is only PREADY, PSLVERR and PRDATA that change during the "access" phase, so PENABLE does go high at a time when PREADY, PSLVERR and PRDATA can be changing. But PENABLE, PREADY and PSLVERR only need to be valid for the next PCLK rising edge, and PRDATA only needs to be valid when PREADY is driven high at the end of the "access" phase. So simplicity of design is the answer.
