365NEWSX
365NEWSX
Subscribe

Welcome

Tachyum Publishes Prodigy Universal Processor Whitepaper: Up To 6x Faster Than NVIDIA H100 & 30x Faster Than Intel Xeon 8380, Available In 2H 2023 - Wccftech

Tachyum Publishes Prodigy Universal Processor Whitepaper: Up To 6x Faster Than NVIDIA H100 & 30x Faster Than Intel Xeon 8380, Available In 2H 2023 - Wccftech

Tachyum Publishes Prodigy Universal Processor Whitepaper: Up To 6x Faster Than NVIDIA H100 & 30x Faster Than Intel Xeon 8380, Available In 2H 2023 - Wccftech
Oct 05, 2022 1 min, 54 secs

Tachyum has officially published the whitepaper of its 5nm Prodigy Universal Processor which was unveiled all the way back in 2018.

The Tachyum Prodigy CPUs utilize a universal processor design which means that they can execute CPU, GPU, and TPU tasks on the same chip, saving costs over competing products and also offering really high performance.

The company aims to tackle all three chip giants, AMD, Intel & NVIDIA with its Prodigy lineup and in their presentations, Tachyum has estimated a 4x performance uplift over Intel's Xeon CPUs, on the HPC front, a 3x increase over NVIDIA's H100 and a 6x increase in raw performance in AI & inference workloads.

The Tachyum Prodigy makes use of an OOD (Out-Of-Order) architecture that can decode and retire up to 8 instructions per clock, issue up to 11 instructions per clock, with an instruction queue that supports up to 48 instructions and a scheduler that supports 12 queues that are 15 entries deep.

In addition to on-chip cache coherency, Prodigy also supports hardware coherency between Prodigy devices that enables both 2-socket and 4-socket platforms to be fully coherent.

The vector and matrix subsystems support a wide range of data types, including FP64, FP32, TF32, BF16, Int8, FP8, as well as TAI, or Tachyum AI, a new data type that will be announced later this year and will deliver higher performance than FP8.

Now that's just one chip and Tachyum will allow full hardware coherency that supports 2 and 4-socket systems.

The Prodigy Platform will come in various rack solutions such as an air-cooled 2U server that will be able to house up to four Tachyum Prodigy chips, 64 16 GB DDR5 DIMMs, and 2x200 GbE RoCE NICs.

The company claims up to a 4x SPECrate 2017 Integer and 30x Raw Floating Point performance (FP64) increase versus the competition.

The company claims up to a 4x SPECrate 2017 Integer and 30x Raw Floating Point performance (FP64) increase versus the competition.

The company uses an Air-cooled Prodigy rack which is estimated to deliver up to 6.2 PetaFlops of HPC FP64 horsepower versus an NVIDIA H100 DGX POD rack which offers 960 TFLOPs of FP64 HPC performance

The next-gen upgrade to Prodigy, known as Prodigy 2 is also listed in Tachyum's roadmap and will be offering a new 3nm architecture with even more cores, higher memory bandwidth, PCIe 6.0 + CXL support, and enhanced connectivity

Summarized by 365NEWSX ROBOTS

RECENT NEWS

SUBSCRIBE

Get monthly updates and free resources.

CONNECT WITH US

© Copyright 2024 365NEWSX - All RIGHTS RESERVED