Combining C and Python#

The standard Python interpreter is written in C, and it’s possible to write Python modules in C or C++ to improve performance. This not strictly within the scope of the course, but if you’re interested, you can find more information in the Python documentation.

A concrete example#

As a (relatively) simple example, let’s look at a C++ function which takes a list of numbers and returns the sum of the squares of the numbers. We’ll then write a wrapper to call this function from Python.

The C++ code file#

The core C++ function#

Here’s a version of simple a C++ function to calculate the sum of the squares of a bunch of numbers.

// 
// This is written in C style passing in an array and its length.

int sum_of_squares(int* numbers, int length){
    int sum = 0;
    for(int i=0; i<length; i++){
        sum += numbers[i]*numbers[i];
    }
    return sum;
}

The Python wrapper#

The Python wrapper is a little more complicated, but not much. We need to include the Python header file, and then define a function which takes a Python object as an argument, and returns a Python object back to the interpreter. The core function itself is very simple, it just converts the Python object into a C array, calls the C function, and then converts the result back into a Python object.

// These two lines are required to use the Python/C API
#define PY_SSIZE_T_CLEAN
#include <Python.h>

// A Python wrapper for the sum_of_squares function

static PyObject *
pysum_of_squares(PyObject *self, PyObject *args)
{
    PyObject *py_list;
    int length
    int sum;

    // Parse the Python input arguments (assumed to be a list of integers)
    if (!PyArg_ParseTuple(args, "O", &py_list))
        return NULL;

    // Get the length of the list and assign memory for the C array.

    length = PyList_Size(py_list);
    int *input = new int[length];


    // Convert the Python list to a C array

    for(int i=0; i<length; i++){
        input[i] = PyLong_AsLong(PyList_GetItem(py_list, i));
    }


    // Call the C function and get the result

    sum = sum_of_squares(input, length);


    // clean up the memory used by the C array
    delete[] input;

    return PyLong_FromLong(sum);
}

The method table#

The next step is to define a method table which tells Python how to call the function. This is a little more complicated than the previous examples, but not much. We need to define a method table which tells Python the name of the function, the C function to call, the type of the arguments, and a docstring to display when the user asks for help.

static PyMethodDef SquareSumMethods[] = {
    {"sum_of_squares", // Python name
     pysum_of_squares , // C function
     METH_VARARGS, // Argument type (in Python.h)
     "Sum the squares of the values in an iterable."}, // This is the docstring
    {NULL, NULL, 0, NULL}        // Sentinel
};

The module definition#

Finally, we need to define the module itself. This is mostly boilerplate, but we need to tell Python the name of the module, the docstring, and the method table we just defined.

To do this, we define a struct with the required information, and then pass it to the PyModule_Create function returned from a PyMODINIT_FUNC function. These are both defined inside the Python header file.

static struct PyModuleDef square_sum_module = {
    PyModuleDef_HEAD_INIT, // struct base from Python.h
    "square_sum",   // module name 
    "A module that sums the squares of the values in an iterable.", // docstring
    -1, // size of per-interpreter state of the module, or -1 if the module keeps state in global variables.
    SquareSumMethods // the array of methods created above
};

PyMODINIT_FUNC PyInit_square_sum(void) {
    Py_Initialize();
    return PyModule_Create(&square_sum_module);
}

The Python Setup File#

The most convenient way to build the module is to use a Python setup file. This is a Python script which uses the setuptools module to build the module. The setup file for this example is shown below.

from setuptools import Extension, setup

setup(
    ext_modules=[
        Extension(
            name="square_sum",  # as it would be imported
                               # may include packages/namespaces separated by `.`
                               # don't need to use the same name as the module

            sources=["sum_of_squares.cpp"], # all sources are compiled into a single binary file
        ),
    ]
)

Building the module#

To build the module, we need to run the setup.py script. This can be done using the command

python setup.py build_ext --inplace

This will create a file called something likesquare_sum.so,or square_sum.pyd on Windows, which can be directly imported into Python (note that most versions actually include more details on the version of Python the import will work with) . The --inplace option tells Python to put the file in the current directory, rather than in a subdirectory.

For this to work, you must have the Python development headers installed. On Ubuntu/WSL, this can be done using the command

sudo apt install python3-dev

Using the module#

Once the module is built, it can be imported into Python in the usual way a regular module file would. For example, if the module is called square_sum, then we can use it as follows from a Python script/interpreter window.

>>> import square_sum
>>> square_sum.sum_of_squares([1,2,3,4,5])
55