Benchmarking and performance

Konstantinos.Poulios · December 5, 2023, 6:11pm

Finding performance bottlenecks is important. All cases inside test_assembly.cc are quite optimized but this does not cover all possible cases.

Profiling a specific program that is based on GetFEM, only requires that the program (if compiled at all) and GetFEM itself are compiled with the debug flag “-g”. Then the program can be started with perf, as in the examples:

OMP_NUM_THREADS=1 perf record --call-graph dwarf ./test_assembly

or

OMP_NUM_THREADS=1 perf record --call-graph dwarf python3 gf_benchmark.py

The profiled program will run at its normal speed and perf will create a quite big file called perf.data with all profiling information. To visualize this file, just start hotspot in the same folder with

hotspot

by default it will search for a file called perf.data and will provide a visualization like this

Topic		Replies	Views
Parallelization Development	4	152	June 24, 2024
Problem during boundary condition assembly Questions and issues	133	242	June 23, 2025
Resources for Learning GetFEM++ with Python and GWFL General	12	77	April 1, 2025
GetFEM in action General	5	130	May 29, 2023
Parallel mumps solver and custom solver General	7	7	July 10, 2025

Benchmarking and performance

Related topics