The 10 steps to model and design a complex SoC

Sondrel
Friday, 18 March, 2022


The 10 steps to model and design a complex SoC

The shrinking of nodes means that chips are becoming ever more complex and require ever larger teams of engineers.

To ensure that the design time of a chip does not increase as well, complex SoC designer Sondrel invests heavily in R&D to create novel modelling tools and flows to streamline the production of ASICs. Sondrel has codified this into its 10 steps required to design an SoC architecture.

“This complexity means that companies are increasingly outsourcing their SoC design to specialist companies such as ourselves who can deploy teams of multidisciplinary engineers,” explained Graham Curren, Sondrel’s CEO. “Because Sondrel has created hundreds of ASICs across many application areas, we already have the experience in-house that ensures we can use the latest and best design techniques, many of which we have designed ourselves.”

Overall, the strategy for successfully designing an SoC architecture is:

  • Break the problem into stages using a divide and conquer methodology.
  • As the level of detail increases at each stage, consider fewer options for exploration.
  • Use modelling to answer specific questions.
  • Focus resources as process progresses:
    • Avoid wasted simulation time.
    • Reduce dataset to be analysed.
    • Reduce simulation size and count to offset slowdown.
  • Consider issues in a logical order to simplify cause and effect analysis, and to reduce iterations required.
  • Iterate to identify issues and fix before advancing to reduce wasted design effort.
  • Balance the trade-off of accuracy versus speed.
     

The 10 steps are as follows:

1. Determine what the data is and what are the I/O constraints — such as burstiness, latency, timing and data formatting — to decide on the buffer requirements that are captured in a spreadsheet.

2. Break the processing down into subtasks and group parts of the SoC into common pieces of functionality.

3. Identify what third-party IP blocks will be required to perform the steps of an algorithm and how much memory and compute power they require from their datasheets that can be fed into the modelling environment to give a more accurate representation of what all the IP blocks will be doing.

4. Cover the method of exchanging data in between parts of an algorithm, such as on-chip SRAM or external DDR memory as well as FIFO, which are small spaces of memory on chip. The decision between SRAM and DDR depends on the size of the data and how often it needs to be accessed, with large pieces of data going to external memory and small pieces of data to SRAM or FIFO.

5. This is when a software representation is created of what the different stages are, with the conceptual view of the algorithm and actual simulation objects that correspond to the different software stages of the algorithm. These require settings such as latency and processing cycles, and are joined by objects known as channels that indicate what the sequencing is.

6. Having constructed all the simulation objects for the full algorithm, simulations can be run to see if the right sequencing of the algorithm has been captured.

7. Use models of the hardware platform with VPUs (virtual processor units) that will run the software of step 5, each with its own local memory. Here the interface timing can be considered and communication domains defined with their assigned channels and evaluated. It also enables the configuration of the VPUs to be verified as correct.

8. Take the memory available to each VPU and remodel it as being connected to external memory via a common memory controller. This gives a more accurate representation of the connectivity of all the VPUs and memories in the final system.

9. Add the interconnect fabric. Instead of the direct connections between the VPUs and the memory controller, these are replaced by the interconnect fabric and the effects on the timing and performance evaluated. The interconnect fabric is then adjusted to meet the performance required, with previous stages being redone to achieve the required results.

10. This is a good working model so, by simply adjusting settings, various simulations can be run to identify bottlenecks, what constraints there are in the system, and which parameters should be adjusted to improve the throughput and reduce the latency of the SoC. These take a few minutes to an hour to run so that it is straightforward and quick to test variants.

The first four steps can be done on paper or on a spreadsheet by calculation to understand the input/output dataflows into the SoC and what their characteristics are. The last six steps are simulation-based, where software models are constructed and simulations run to generate results that inform about the system.

An article covering this in more depth is available at www.sondrel.com/solutions/white-papers.

Image credit: ©stock.adobe.com/au/3dartists

Please follow us and share on Twitter and Facebook. You can also subscribe for FREE to our weekly newsletter and bimonthly magazine.

Originally published here.

Related Articles

Faster space communication with record-sensitive receiver

Researchers have developed a new system featuring a silent amplifier and a sensitive receiver,...

New record set for wireless data transfer

Researchers from University College London have set a new record for wireless transmission speed,...

Electrically modulated light antennas for faster computer chips

Researchers have achieved electrically controlled modulation of light antennas, paving the way...


  • All content Copyright © 2024 Westwick-Farrow Pty Ltd