# pi calculation benchmark [http://fab.cba.mit.edu/classes/MAS.864/text/benchmark.pdf](http://fab.cba.mit.edu/classes/864.17/text/benchmark.pdf) |estimated GFlops|code|description|system|date| |---|---|---|---|---| |8,668,954|[cudampipi.cu](CUDA/cudampipi.cu)|C++, CUDA+MPI
1024 nodes, 6144 ranks, GPUs
nvcc -arch=sm_70 -std=c++11|Summit
Oak Ridge OLCF
IBM AC922|October, 2020| |88,333|[mpimppi.c](hybrid/mpimppi.c)|C, MPI+OpenMP
1024 nodes, 64 cores/node, 4 threads/core
cc mpimppi.c -o mpimppi -O3 -ffast-math -fopenmp|Theta
Argonne ALCF
Cray XC40|October, 2019| |12,589|[cudapit.cu](CUDA/cudapit.cu)|C++, CUDA, 8 GPUs, 5120 cores/GPU|NVIDIA V100|March, 2020| |11,083|[mpithreadpi.cpp](hybrid/mpithreadpi.cpp)|C++, MPI+threads, 128 nodes, 64 cores/node, 4 threads/core
CC mpithreadpi.cpp -o mpithreadpi -O3 -ffast-math -std=c++11|Argonne ALCF Theta
Cray XC40|March, 2020| |2,117|[mpipi2.c](MPI/mpipi2.c)|C, MPI, 10 nodes, 96 cores/node
mpicc mpipi2.c -o mpipi2 -O3 -ffast-math|Intel 2x Xeon Platinum 8175M|October, 2019| |2,102|[mpipi2.py](Python/mpipi2.py)|Python, Numba, MPI
10 nodes, 96 cores/node|Intel 2x Xeon Platinum 8175M|February, 2020| |1,635|[cudapi.cu](CUDA/cudapi.cu)|C++, CUDA, 5120 cores|NVIDIA V100|March, 2020| |1,090|[numbapig.py](Python/numbapig.py)|Python, Numba, CUDA, 5120 cores|NVIDIA V100|March, 2020| |315|[numbapip.py](Python/numbapip.py)|Python, Numba, parallel, fastmath
96 cores|Intel 2x Xeon Platinum 8175M|February, 2020| |272|[threadpi.c](C/threadpi.c)|C, 96 threads
gcc threadpi.c -o threadpi -O3 -ffast-math -pthread|Intel 2x Xeon Platinum 8175M|June, 2019| |267|[threadpi.cpp](C++/threadpi.cpp)|C++, 96 threads
g++ threadpi.cpp -o threadpi -O3 -ffast-math -pthread|Intel 2x Xeon Platinum 8175M|March, 2020| |211|[mpipi2.c](MPI/mpipi2.c)|C, MPI, 1 node, 96 cores
mpicc mpipi2.c -o mpipi2 -O3 -ffast-math|Intel 2x Xeon Platinum 8175M|October, 2019| |180|[mpipi2.py](Python/mpipi2.py)|Python, Numba, MPI
mpirun -np 96 python mpipi2.py|Intel 2x Xeon Platinum 8175M|February, 2020| |173|[mppi.c](OpenMP/mppi.c)|C, OpenMP, 96 threads
gcc mppi.c -o mppi -O3 -ffast-math -fopenmp|Intel 2x Xeon Platinum 8175M|July, 2019| |152|[pi.html](https://pub.pages.cba.mit.edu/pi/JavaScript/pi.html)|JavaScript, 96 workers|Intel 2x Xeon Platinum 8175M|June, 2019| |93.2|[threadpi.c](C/threadpi.c)|C, 56 threads
gcc threadpi.c -o threadpi -O3 -ffast-math -pthread|Intel 2x E5-2680|December, 2018| |71.4|[pi.html](https://pub.pages.cba.mit.edu/pi/JavaScript/pi.html)|JavaScript, 56 workers|Intel 2x E5-2680|November, 2018| |46.9|[mpipi.c](MPI/mpipi.c)|C, MPI
mpicc mpipi.c -o mpipi -O3 -ffast-math
mpirun -np 6 mpipi|Intel i7-8700T|November, 2018| |44.6|[threadpi.c](C/threadpi.c)|C, 6 threads
gcc threadpi.c -o threadpi -O3 -ffast-math -pthread|Intel i7-8700T|December, 2018| |23.3|[mpipi2.py](Python/mpipi2.py)|Python, Numba, MPI
mpirun -np 6 python mpipi2.py|Intel i7-8700T|February, 2020| |16.1|[pi.html](https://pub.pages.cba.mit.edu/pi/JavaScript/pi.html)|JavaScript, 6 workers|Intel i7-8700T|November, 2018| |15.7|[clusterpi.js](Node/clusterpi.js)|Node, 6 workers|Intel i7-8700T|December, 2018| |9.37|[pi.c](C/pi.c)|C
gcc pi.c -o pi -lm -O3 -ffast-math|Intel i7-8700T|November, 2018| |4.87|[numbapi.py](Python/numbapi.py)|Python, Numba|Intel i7-8700T|February, 2020| |3.73|[pi.html](https://pub.pages.cba.mit.edu/pi/JavaScript/pi.html)|JavaScript, 1 worker|Intel i7-8700T|November, 2018| |3.47|[pi.html](https://pub.pages.cba.mit.edu/pi/JavaScript/pi.html)|JavaScript, 1 worker|Intel 2x E5-2680|November, 2018| |3.29|[pi.js](Node/pi.js)|Node|Intel i7-8700T|December, 2018| |3.12|[clusterpi.js](Node/clusterpi.js)|Node, 1 worker|Intel i7-8700T|December, 2018| |1.78|[threadpi.c](C/threadpi.c)|C, 4 threads
gcc threadpi.c -o threadpi -O3 -ffast-math -pthread|Raspberry Pi 4|December, 2020| |0.57|[pi.c](C/pi.c)|C
gcc pi.c -o pi -lm|Intel i7-8700T|November, 2018| |0.47|[numpi.py](Python/numpi.py)|Python, NumPy|Intel i7-8700T|November, 2018| |0.029|[pi.py](Python/pi.py)|Python|Intel i7-8700T|November, 2018| |estimated GFlops|estimated GFlops/W|code|description|system|date| |---|---|---|---|---|---| |0.0168|0.233
calculated|[pi.c](https://gitlab.cba.mit.edu/zfredin/samd51_benchmarks)|C
floats, -O3, gcc-arm-none-eabi, 160 MHz|SAMD51J20A
ARM Cortex M4F|October, 2019| |0.0128|0.171
calculated|[pi.c](https://gitlab.cba.mit.edu/zfredin/stm32f412_core/tree/master/nucleo-f412zg/pi)|C
floats, -O3, gcc-arm-none-eabi, 84 MHz|STM32F412
ARM Cortex M4F|October, 2019|