Numeric Integration - Calculating Areas

Overview

Teaching: 15 min
Exercises: 5 min

Questions

How can we calculate integrals?

Objectives

Learn about critical sections

In this section, we will use the problem of numeric integration, i.e. calculating areas under curves, to look at how to control access to global variables. As our example, let’s say we wanted to integrate the sine function from 0 to Pi. This is the same as the area under the first half of a sine curve. The single-threaded version is below.

#include <stdio.h>
#include <stdlib.h>
#include <math.h>

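int main(int argc, char **argv) {
   int steps = 1000;               /* number of rectangles */
   double delta = M_PI/steps;      /* width of each rectangle */
   double total = 0.0;             /* running sum of rectangle areas */
   int i;
   printf("Using %d steps\n", steps);
   for (i=0; i<steps; i++) {
      /* height sin(delta*i) times width delta */
      total = total + sin(delta*i) * delta;
   }
   printf("The integral of sine from 0 to Pi is %.12f\n", total);
   return 0;
}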

Compiling with math

In order to include the math functions, you need to link in the math library. In GCC, you would use the following:
gcc -o pi-serial pi.c -lm
./pi-serial

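The exact answer is 2. The computed result will be off by a small amount, mostly because a finite number of steps only approximates the area, and to a lesser degree because of the limits of floating-point representation of numbers.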

Step size

How would you change the step size? What happens if you do?

Solution

You can decrease the step size by increasing the steps variable. We normally expect this to increase the accuracy of the result. Does it? Is there a noticeable effect on the run time?

To see what happens to the time this program takes, we’ll use a new tool. Since we just want to see the total time, we can use the program time.

Timing

You can use the time utility to get the amount of time it takes for a program to run.
$ time ./pi-serial
Using 1000 steps
The integral of sine from 0 to Pi is 1.999998355066

real    0m0.005s
user    0m0.000s
sys     0m0.002s
The real output is the useful one; this example took 0.005 seconds to run. The user and sys lines describe how much time was spent in “user” code and how much in “system” code, a distinction that doesn’t interest us today.

Parallelizing numerical integration

How would you parallelize this code to get it to run faster?

Obviously, we could add #pragma omp parallel for. But do we make total private, or not?

The data dependency on total leads to what we call a race condition. Since every thread updates the same shared variable, there is a race between the threads as to who reads and then writes the value of total. Several threads can read the current value before another thread has written back the result of its addition. When they then write their own results, the earlier additions are overwritten and lost. This can be handled by adding a critical section. A critical section allows only one thread at a time to run a given block of code.

#include <stdio.h>
#include <stdlib.h>
#include <math.h>
#include <omp.h>

int main(int argc, char **argv) {
   int steps = 1000;
   double delta = M_PI/steps;      /* double, as in the serial version */
   double total = 0.0;             /* shared by all threads */
   int i;
   #pragma omp parallel for
   for (i=0; i<steps; i++) {
      /* only one thread at a time may update total */
      #pragma omp critical
      total = total + sin(delta*i) * delta;
   }
   printf("The integral of sine from 0 to Pi is %.12f\n", total);
   return 0;
}

The critical pragma is a very general construct that ensures a statement or block is executed by only one thread at a time. However, computing a sum is such a common operation that OpenMP provides a specific mechanism for this case: reduction variables. We’ll look at those in the next section.

Key Points

You can use a critical section to control access to a global variable

ACENET Summer School - OpenMP
