RapidMind is all about achieving the full performance potential of modern multi-core processors. Generally, total performance is a combination of two factors:
total performance = scalability across cores × per-core performance
It’s probably no surprise that RapidMind aims to provide excellent scalability across cores. What might surprise you is that we spend significant amounts of effort on single-core performance as well, and RapidMind-enabled apps often outperform code written in C/C++ without RapidMind significantly. Some of our customer case studies show improvements like “25x faster than non-RapidMind-code on 8 cores” – that 25x number is made up of perfect scaling across 8 cores, combined with a 3x performance advantage even on a single core.
How’s that possible? I’m going to explore this in my next few blog posts. Read on for the first reason why we get such good performance:
Reason 1: Our programming model
(more…)