Functions which cannot fail #43

Closed
vstinner opened this issue Nov 16, 2023 · 6 comments
Comments

@vstinner (Contributor) commented Nov 16, 2023
The PyTuple_Check(obj) function cannot fail and the caller is not expected to check for errors. Expected usage:

    if (PyTuple_Check(v)) {
        ...
    }

Here, "cannot fail" means the function always succeeds on valid input; if you pass a NULL pointer, the code simply crashes.

I'm fine with strongly suggesting to check for errors in the general case. I just ask for exceptions for specific cases.

Issue #5 requires all new functions to force the caller to always check for errors. IMO we need exceptions to that rule.

For example, PR python/cpython#112096 proposes adding a Py_hash_t PyHash_Pointer(const void *ptr) function which cannot fail. It would be annoying to have to check for a hypothetical error when the current implementation cannot fail, and it's unlikely that the function will ever change to report errors.

What matters here is providing a convenient API, more than strict correctness. Obviously, if we enforce checking the result, it would be easier to change the API later; IMO it's just not worth it here.

For example, a function is unlikely to fail, now and in the future, if:

  • It does not allocate memory.
  • Argument types are primitive C types such as uint64_t or double.

Some functions are also designed so that they cannot fail. For example, PyUnicode_EqualToUTF8() cannot fail. But if the first argument is not a Unicode object, the function crashes. That's a design choice. Example of usage:

    if (PyUnicode_EqualToUTF8(key, kwlist[i])) {
        match = 1;
        break;
    }

Having to check for errors on such a basic operation as comparing two strings sounds really annoying. The C strcmp() function is the same: it cannot fail. Obviously, if you pass a NULL pointer or strings not terminated by NUL, strcmp() crashes. That's the trade-off for a convenient API.

By the way, I think that for some cases it's fine to log exceptions with sys.unraisablehook: for example, an error in a callback that a function doesn't call directly, where the error cannot be reported because an API abstracts the callback away. A weakref can have a callback which fails: when a Python object is finalized, errors in weakref callbacks cannot be reported to the finalizer, since the finalizer knows neither these callbacks nor how to handle their errors.


Examples which cannot fail:

  • int _Py_popcount32(uint32_t x)
  • Py_hash_t PyHash_Pointer(const void *ptr)
  • void Py_SET_REFCNT(PyObject *ob, Py_ssize_t refcnt)
  • void PyErr_SetObject(PyObject *, PyObject *)

Counter-examples: functions which cannot report errors even though they can fail:

  • void Py_SetRecursionLimit(int): sys.setrecursionlimit() adds an additional check which cannot be implemented in the C function
  • void PyFrame_FastToLocals(PyFrameObject *): int PyFrame_FastToLocalsWithError(PyFrameObject *f) had to be added later
  • PyObject* PyDict_GetItem(PyObject *op, PyObject *key): silently ignores errors :-( PyDict_GetItemRef() and other functions were added to report errors to the caller.

Corner cases:

  • Destructors such as PyTypeObject.tp_dealloc functions: if they raise an exception, they must log it with PyErr_FormatUnraisable() which calls sys.unraisablehook.
  • void PyOS_AfterFork_Child(void): it's hard to report errors around a fork() call.
  • void PyObject_ClearWeakRefs(PyObject *)
  • void Py_ReprLeave(PyObject *), but int Py_ReprEnter(PyObject *) can fail
@vstinner (Author)
Other examples of functions which cannot fail, in the proposed PR python/cpython#112135 which adds a public API for the PyTime functions:

  • PyTime_t PyTime_GetSystemClock(void) cannot fail: if reading the clock fails, the error is silently ignored and 0 is returned. In practice, I have never seen such a function fail.
  • double PyTime_AsSecondsDouble(PyTime_t t) cannot fail. The function basically does (double)t * 1e-9: it cannot fail.

@encukou (Contributor) commented Nov 16, 2023

> it's unlikely that [PyHash_Pointer] will change to report errors.

If PyHash_Pointer was added today, I'd rather provide a way for it to report errors. Why?

  • Warnings, including deprecation warnings, are errors with -W.
  • In the stable ABI, deprecations need to be emitted at runtime.

So:

  • If a function is added to the limited API, it cannot be removed.
  • If it's not in the limited API yet, its signature will need to change when it is added (so that it can be deprecated in the future).

I would rather design a new PyHash_Pointer without known issues that would prevent it from being added to limited API.

If performance is critical, let's add an Unstable variant -- or better, let the compiler know an error can't be raised when compiling for the version-specific API, so it can optimize the error checking away. (AFAIK this can only be done with compiler-specific extensions, but IMO that's fine.)

I don't buy the ease-of-use argument. A function that isn't used too often, and happens to not require error handling, will be painful to review (after we make the API consistent enough that reviewers don't need to look up every single function).


> Examples which cannot fail:
>
> int _Py_popcount32(uint32_t x)

Yes, for underscored and unstable functions, which we can simply remove/replace if their behaviour/signature needs to change, this guideline isn't useful.

> Py_hash_t PyHash_Pointer(const void *ptr)

See above :)

> void Py_SET_REFCNT(PyObject *ob, Py_ssize_t refcnt)

I'd rather remove this from the API altogether, but, yes: reference counting is a good exception:

  • It's the most used operation and needs to be fast.
  • It's used often enough that people know the exception.

> void PyErr_SetObject(PyObject *, PyObject *)

This always sets an exception, so, it has a way of signaling failure.
(That could be against some wordings of the future guideline, but it's good in spirit.)
Thanks for bringing this one up!

> Counter-examples: functions which cannot report errors even though they can fail:
>
> void Py_SetRecursionLimit(int): sys.setrecursionlimit() adds an additional check which cannot be implemented in the C function
>
> void PyFrame_FastToLocals(PyFrameObject *): int PyFrame_FastToLocalsWithError(PyFrameObject *f) had to be added later
>
> PyObject* PyDict_GetItem(PyObject *op, PyObject *key): silently ignores errors :-( PyDict_GetItemRef() and other functions were added to report errors to the caller.

> Corner cases:
>
> Destructors such as PyTypeObject.tp_dealloc functions: if they raise an exception, they must log it with PyErr_FormatUnraisable() which calls sys.unraisablehook.

I don't think this is a corner case. If we add new API for this, the dealloc function should raise normally and the caller should take care of “handling” the exception.

> void PyOS_AfterFork_Child(void): it's hard to report errors around a fork() call.

The interpreter might not be in a consistent enough state that exceptions can be raised, right?
Still, it would be good if this function did have a way to report whatever exceptions it can (when it's not necessary to abort the process).

> void PyObject_ClearWeakRefs(PyObject *)

If this was designed today, it should raise an ExceptionGroup.

> void Py_ReprLeave(PyObject *), but int Py_ReprEnter(PyObject *) can fail

I don't think this is a corner case -- it should have a way to report exceptions.

@vstinner (Author)

> If PyHash_Pointer was added today, I'd rather provide a way for it to report errors. Why?
>
> • Warnings, including deprecation warnings, are errors with -W.
> • In the stable ABI, deprecations need to be emitted at runtime.

I don't know where this new constraint comes from. Would you mind creating an issue to elaborate on the ability to deprecate any function?

Sure, the ability to emit warnings and treat them as errors is appealing. But I don't think it overrides other aspects of the API design.

@encukou (Contributor) commented Nov 20, 2023

> Would you mind creating an issue to elaborate on the ability to deprecate any function?

That's #5 :)

@vstinner (Author)

The Py_HashPointer() function was added without error handling, whereas the PyTime_Time() function was added with error handling.

> I'm fine with strongly suggesting to check for errors in the general case. I just ask for exceptions for specific cases.

I'm no longer sure of the purpose of this issue. It doesn't propose anything; it was more of a general discussion. IMO we should discuss error handling on a case-by-case basis. In general, functions should have a way to report errors to the caller; "cannot fail" should remain the exception.
