gh-126220: Adapt `_lsprof` to Argument Clinic #126233

sobolevn · 2024-10-31T11:08:22Z

I decided to go with the AC, because:

It is supported in this module
It solves the problem
It fixes multiple signatures

But, I believe it will be slower. If we really want to preserve speed in this case, I can add manual size checks for args.

Refs #103534

Issue: _lsprof.Profiler._creturn_callback() segfaults #126220

…h 0 args

ZeroIntensity

LGTM. I'm always happy with fixes via AC, because they're much more scalable.

gaogaotiantian · 2024-10-31T12:53:29Z

What's the perf impact here?

sobolevn · 2024-10-31T13:39:46Z

@gaogaotiantian I've never had experience with profiling cProfile. Do you have any links / suggestions on how to do that?

gaogaotiantian · 2024-10-31T13:48:51Z

Just do a quick fib() with cprofile and compare it with the previous implementation would do. Maybe also with one without cprofile.

sobolevn · 2024-10-31T14:12:14Z

@gaogaotiantian maybe I did something wrong, but looks like we also got a speedup from my numbers 😅 (I have little idea about how cProfile works).

# Test code: ex.py
def test():
    def fib():
        start, nextn = 0, 1
        while True:
            yield start
            start, nextn = nextn, start + nextn

    gen = fib()
    for _ in range(100000):
        next(gen)

import cProfile
cProfile.run(test.__code__)

Running it on main:

» ./python.exe ex.py
         200003 function calls in 0.188 seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall filename:lineno(function)
        1    0.017    0.017    0.188    0.188 ex.py:1(test)
   100000    0.147    0.000    0.147    0.000 ex.py:2(fib)
        1    0.000    0.000    0.188    0.188 {built-in method builtins.exec}
   100000    0.024    0.000    0.171    0.000 {built-in method builtins.next}
        1    0.000    0.000    0.000    0.000 {method 'disable' of '_lsprof.Profiler' objects}

Running on this PR:

» ./python.exe ex.py                                                     
         200003 function calls in 0.185 seconds

   Ordered by: standard name

   ncalls  tottime  percall  cumtime  percall filename:lineno(function)
        1    0.016    0.016    0.185    0.185 ex.py:1(test)
   100000    0.145    0.000    0.145    0.000 ex.py:2(fib)
        1    0.000    0.000    0.185    0.185 {built-in method builtins.exec}
   100000    0.024    0.000    0.168    0.000 {built-in method builtins.next}
        1    0.000    0.000    0.000    0.000 {method 'disable' of '_lsprof.Profiler' objects}

Am I missing something?

gaogaotiantian · 2024-10-31T14:50:56Z

Sorry I meant a recursive fib().

import time
def fib(n):
    if n <= 1:
        return 1
    return fib(n - 1) + fib(n - 2)
start = time.time()
# enable profiler
fib(24)
# disable profiler
print(time.time() - start)

We need to consider the worst case for cprofile, which is when the code has a lot of function calls. We need to know the overhead of the profiler, and how that is compared with before.

You can also refer to #103533, there's a profiling code listed. That might give some extra information.

Moving to sys.monitoring gave us a 20%+ improvement on overhead, just want to check what's the regression with clinic and whether it's acceptable. Profiler is a bit sensitive to performance because the more overhead there is, the less useful it is.

sobolevn · 2024-10-31T14:57:48Z

It still gives me a perf boost for some reason 🤔

import time
import cProfile

profiler = cProfile.Profile()

def fib(n):
    if n <= 1:
        return 1
    return fib(n - 1) + fib(n - 2)
start = time.perf_counter()
profiler.enable()
fib(30)  # 100th
profiler.disable()
print(time.perf_counter() - start)
# Old: 0.9349823750017094
# New: 0.9253051670020795
# 1% faster

gaogaotiantian · 2024-10-31T15:07:41Z

Is the boost stable(reproducible)?

sobolevn · 2024-10-31T15:09:17Z

Yes, it is pretty much the same on m2 macos:

(.venv) ~/Desktop/cpython2  main ✗                                                        
» ./python.exe ex.py                                                     
0.9349823750017094
                                                                                           
(.venv) ~/Desktop/cpython2  main ✗                                                        
» ./python.exe ex.py
0.9357958749969839
                                                                                           
(.venv) ~/Desktop/cpython2  main ✗                                                        
» ./python.exe ex.py
0.9352227920026053
                                                                                           
(.venv) ~/Desktop/cpython2  main ✗                                                        
» ./python.exe ex.py
0.924946416002058
                                                                                           
(.venv) ~/Desktop/cpython2  main ✗                                                        
» ./python.exe ex.py
0.926337541997782
                                                                                           
(.venv) ~/Desktop/cpython2  main ✗                                                        
» ./python.exe ex.py
0.9347994170000311

(.venv) ~/Desktop/cpython2  issue-126220 ✗  
» ./python.exe ex.py
0.9370929580036318
                                                                                           
(.venv) ~/Desktop/cpython2  issue-126220 ✗                                                
» ./python.exe ex.py
0.9262957079990883
                                                                                           
(.venv) ~/Desktop/cpython2  issue-126220 ✗                                                
» ./python.exe ex.py
0.9328266249940498
                                                                                           
(.venv) ~/Desktop/cpython2  issue-126220 ✗                                                
» ./python.exe ex.py
0.9382220000011148
                                                                                           
(.venv) ~/Desktop/cpython2  issue-126220 ✗                                                
» ./python.exe ex.py
0.9285237919984502
                                                                                           
(.venv) ~/Desktop/cpython2  issue-126220 ✗                                                
» ./python.exe ex.py
0.9296539580027456

gaogaotiantian · 2024-10-31T15:21:03Z

Yeah from the result I don't think there's an observable boost. On the other hand, that's good, because no observable regression either.

gaogaotiantian

Just a side note - this needs to be backported to 3.12 as well :) I added the tags so you should be able to just merge it.

sobolevn · 2024-10-31T15:24:58Z

@gaogaotiantian sorry, I forgot to highlight it initially. Please, double check params names that I introduced. Are they correct?

gaogaotiantian · 2024-10-31T15:46:23Z

Actually, I have some doubts about the changes for functions that are not crashing. This is a bug fix, which would be backported. I don't think we should mix in the code polish to the PR. Even though I think changing this in main would be fine, I would prefer having only the crash fix (could be with AC) in one PR, and the rest in the other. One thing that bothers me immediately is that I can't quickly convince myself the changes to __init__ is purely equivalent.

As for the argument name, the obj in the callbacks should be either unused or instruction_offset(which is what it is).

gaogaotiantian · 2024-10-31T15:47:39Z

Can we split this PR into 2? One with the changes to only the 4 callbacks, and the other with all the other methods?

sobolevn · 2024-10-31T15:57:45Z

Fair enough, I would split this PR later! 👍
Thank you!

gaogaotiantian

LGTM!

sobolevn · 2024-10-31T21:56:54Z

@erlend-aasland maybe you would be interested in double checking the AC part? :)

erlend-aasland · 2024-11-01T07:56:24Z

We generally do not backport Argument Clinic adaptions; would it be possible to solve the issue without Argument Clinic first, and backport that bugfix; then apply Argument Clinic?

sobolevn · 2024-11-01T08:01:20Z

@erlend-aasland I think that this case is rather special. This is not just "convert X to use AC" type of PR. This PR solves a crash that was caused by incorrect function args handling. Basically, I can use PyArg_ParseTuple* to do the same, but I don't see a point in doing that and then converting it to AC in 3.14 only.

I can go with simple:

if (size < REQUIRED_ARGS) {
    PyErr_Format(PyExc_TypeError, "got %d arguments, expected %d", ...);
    return NULL;
}

This will kinda solve this case, but will be rather ugly.

sobolevn · 2024-11-01T08:21:27Z

@erlend-aasland sorry for bothering, but I cannot decode your 👍 reaction :)
Does it mean:

Let's go with if (size)?
Let's keep it as is?

:)

erlend-aasland · 2024-11-01T08:26:13Z

This will kinda solve this case, but will be rather ugly.

I see, and thanks for the explanation. I think I still would feel better with such workarounds¹ for 3.13 and 3.12, and then introduce the Argument Clinic adaptions in 3.14. Mixing features and bugfixes is seldom a good thing.

IMO, they are not that ugly ↩

sobolevn · 2024-11-01T09:02:53Z

Ok, I will open a new PR with the fix we can backport and then rebase this one and apply all AC fixes from the initial version. Thanks for the feedback! 👍

erlend-aasland · 2024-11-01T22:06:07Z

Please amend the title to reflect the actual change :)

Modules/_lsprof.c

Co-authored-by: Erlend E. Aasland <erlend.aasland@protonmail.com>

pythongh-126220: Fix crash on calls to _lsprof.Profiler methods wit…

299f63f

…h 0 args

sobolevn requested review from markshannon and gaogaotiantian October 31, 2024 11:08

bedevere-app bot added the awaiting core review label Oct 31, 2024

bedevere-app bot mentioned this pull request Oct 31, 2024

_lsprof.Profiler._creturn_callback() segfaults #126220

Open

Fix CI

af78745

ZeroIntensity approved these changes Oct 31, 2024

View reviewed changes

gaogaotiantian added needs backport to 3.12 bug and security fixes needs backport to 3.13 bugs and security fixes labels Oct 31, 2024

gaogaotiantian approved these changes Oct 31, 2024

View reviewed changes

bedevere-app bot added awaiting merge and removed awaiting core review labels Oct 31, 2024

sobolevn added 2 commits October 31, 2024 22:06

Only fix 4 crashers

3fddc30

Use instruction_offset arg name

4ffecdd

sobolevn requested a review from gaogaotiantian October 31, 2024 19:08

gaogaotiantian approved these changes Oct 31, 2024

View reviewed changes

sobolevn mentioned this pull request Nov 1, 2024

gh-126220: Fix crash on calls to _lsprof.Profiler methods with 0 args (backportable) #126271

Merged

Merge

2b6ecc9

Convert all methods

e3160e3

sobolevn changed the title ~~gh-126220: Fix crash on calls to _lsprof.Profiler methods with 0 args~~ gh-126220: Convert _lsprof to AC Nov 1, 2024

sobolevn added the skip news label Nov 1, 2024

sobolevn requested a review from gaogaotiantian November 1, 2024 22:09

erlend-aasland reviewed Nov 1, 2024

View reviewed changes

Modules/_lsprof.c Outdated Show resolved Hide resolved

Modules/_lsprof.c Outdated Show resolved Hide resolved

Modules/_lsprof.c Outdated Show resolved Hide resolved

Modules/_lsprof.c Outdated Show resolved Hide resolved

erlend-aasland changed the title ~~gh-126220: Convert _lsprof to AC~~ gh-126220: Adapt _lsprof to Argument Clinic Nov 1, 2024

sobolevn and others added 3 commits November 2, 2024 08:30

Apply suggestions from code review

3cea189

Co-authored-by: Erlend E. Aasland <erlend.aasland@protonmail.com>

Address review

b84dde3

Typo

bf3e120

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gh-126220: Adapt `_lsprof` to Argument Clinic #126233

gh-126220: Adapt `_lsprof` to Argument Clinic #126233

sobolevn commented Oct 31, 2024 •

edited by bedevere-app bot

Loading

ZeroIntensity left a comment

gaogaotiantian commented Oct 31, 2024

sobolevn commented Oct 31, 2024

gaogaotiantian commented Oct 31, 2024

sobolevn commented Oct 31, 2024

gaogaotiantian commented Oct 31, 2024 •

edited

Loading

sobolevn commented Oct 31, 2024

gaogaotiantian commented Oct 31, 2024

sobolevn commented Oct 31, 2024 •

edited

Loading

gaogaotiantian commented Oct 31, 2024

gaogaotiantian left a comment

sobolevn commented Oct 31, 2024

gaogaotiantian commented Oct 31, 2024

gaogaotiantian commented Oct 31, 2024

sobolevn commented Oct 31, 2024

gaogaotiantian left a comment

sobolevn commented Oct 31, 2024

erlend-aasland commented Nov 1, 2024

sobolevn commented Nov 1, 2024

sobolevn commented Nov 1, 2024

erlend-aasland commented Nov 1, 2024

sobolevn commented Nov 1, 2024

erlend-aasland commented Nov 1, 2024

gh-126220: Adapt _lsprof to Argument Clinic #126233

Are you sure you want to change the base?

gh-126220: Adapt _lsprof to Argument Clinic #126233

Conversation

sobolevn commented Oct 31, 2024 • edited by bedevere-app bot Loading

ZeroIntensity left a comment

Choose a reason for hiding this comment

gaogaotiantian commented Oct 31, 2024

sobolevn commented Oct 31, 2024

gaogaotiantian commented Oct 31, 2024

sobolevn commented Oct 31, 2024

gaogaotiantian commented Oct 31, 2024 • edited Loading

sobolevn commented Oct 31, 2024

gaogaotiantian commented Oct 31, 2024

sobolevn commented Oct 31, 2024 • edited Loading

gaogaotiantian commented Oct 31, 2024

gaogaotiantian left a comment

Choose a reason for hiding this comment

sobolevn commented Oct 31, 2024

gaogaotiantian commented Oct 31, 2024

gaogaotiantian commented Oct 31, 2024

sobolevn commented Oct 31, 2024

gaogaotiantian left a comment

Choose a reason for hiding this comment

sobolevn commented Oct 31, 2024

erlend-aasland commented Nov 1, 2024

sobolevn commented Nov 1, 2024

sobolevn commented Nov 1, 2024

erlend-aasland commented Nov 1, 2024

Footnotes

sobolevn commented Nov 1, 2024

erlend-aasland commented Nov 1, 2024

gh-126220: Adapt `_lsprof` to Argument Clinic #126233

gh-126220: Adapt `_lsprof` to Argument Clinic #126233

sobolevn commented Oct 31, 2024 •

edited by bedevere-app bot

Loading

gaogaotiantian commented Oct 31, 2024 •

edited

Loading

sobolevn commented Oct 31, 2024 •

edited

Loading