Author Topic: Here comes the KiloCore chip with wormhole routing  (Read 429 times)

0 Members and 1 Guest are viewing this topic.

Online Elderberry

  • TBR Contributor
  • *****
  • Posts: 24,520
Here comes the KiloCore chip with wormhole routing
« on: July 02, 2016, 08:13:01 pm »
Here comes the KiloCore chip with wormhole routing

by  Jan Buiting Elektor 6/28/2016

https://www.elektormagazine.com/news/here-comes-the-kilocore-chip-with-wormhole-routing?utm_source=Elektor+United+States+%28English%29&utm_campaign=cb5a7a1940-A_1000_core_micro_16x16_LEDs_for_Arduino6_30_2016&utm_medium=email&utm_term=0_8b7374950c-cb5a7a1

A team from Davis University, California, has designed a processor with 1000* cores, boasting a throughput rate of 1.78 trillion instructions per second and containing 621 million transistors.

As opposed to a number of other attempts, some reaching 300 or so processors, the KiloCore chip has been fabricated and run; it was built by IBM (who else) using its 32-nm PD-SOI CMOS technology (what else).

The basic architecture used is MIMD (multiple instruction/multiple data) and each of the seven-stage-pipelined cores has a 72-instruction set, single instruction/cycle. None of the instructions is ‘algorithm-specific’ – setting the KiloCore apart from GPU-class devices. The terrific throughput is achieved at a clock speed of a mere 1.78 GHz, at 1.1 V. Running at 0.84 V and 1 GHz the beast consumes 13.1 W, while peak power efficiency of 5.8 pJ/Op is quoted at 0.56 V and 115 MHz.

Each core is independently powered and can shut down to leakage-only power if it has no task to perform. Rather than a cache architecture, every processor can store instructions and data in a hierarchy of locations; local memory, one or more nearby processors, on-chip independent memory modules, or off-chip memory.

The ‘wormhole’ routing employed implies, among others, that messages from an adjacent or nearby core will be routed via the ‘circuit’ network; those from further away in the processor matrix will travel via the packet network. If that’s a veritable can of worms to programmers remains to be seen. Each core has north-south-east-west comms buffers plus a fifth channel for host-processor traffic; maximum throughput is 45.5 Gbps per router and 9.1 Gbps per port at 1.1 V.

* as a niggling detail, K in my computerized editor's dictionary is for kilo = 1024. Sure, k is also for kilo, but meaning 1000 in old money, like in kHz.