gh-143421: Move `JitOptContext` from stack allocation to per-thread heap allocation to avoid stack overflow. #143536

cocolato · 2026-01-08T03:22:17Z

Fidget-Spinner

Pretty close!

Python/pystate.c

Python/optimizer_analysis.c

Include/internal/pycore_tstate.h

cocolato · 2026-01-08T11:15:15Z

Updated!

Fidget-Spinner

Great, thanks!

Include/internal/pycore_optimizer.h

Co-authored-by: Kumar Aditya <kumaraditya@python.org>

markshannon · 2026-01-08T12:10:30Z

Can we avoid unnecessary pointer chasing by embedding the array in the jit state struct, and allocate the entire struct together instead of having part of it embedded in the thread state, and part in a new allocation?

ie. instead of

typedef struct _PyJitOptState {
    struct _JitOptContext *opt_context;
} _PyJitOptState;

typedef struct _PyJitTracerState {
    _PyUOpInstruction *code_buffer;
    _PyJitOptState opt_state;
    _PyJitTracerInitialState initial_state;
    _PyJitTracerPreviousState prev_state;
    _PyJitTracerTranslatorState translator_state;
} _PyJitTracerState;

we would have

typedef struct _PyJitTracerState {
    _PyJitTracerInitialState initial_state;
    _PyJitTracerPreviousState prev_state;
    _PyJitTracerTranslatorState translator_state;
     _JitOptContext opt_context;
    _PyUOpInstruction code_buffer[UOP_MAX_TRACE_LENGTH];
} _PyJitTracerState;

Doing so would also decouple the thread state header from the JIT headers, as only a pointer to an opaque struct would need to be declared in the thread state header.

I'm also concerned about the amount of allocation and freeing required to compile a small trace. Maybe allocate once and only free when the thread is destroyed?

cocolato · 2026-01-08T12:26:21Z

we would have

typedef struct _PyJitTracerState {
    _PyJitTracerInitialState initial_state;
    _PyJitTracerPreviousState prev_state;
    _PyJitTracerTranslatorState translator_state;
     _JitOptContext opt_context;
    _PyUOpInstruction code_buffer[UOP_MAX_TRACE_LENGTH];
} _PyJitTracerSt

I can try implementing it this way, but I'd like to know if this should be done in a new PR/issue. Perhaps we could use benchmarks to determine potential performance issues from allocating _PyJitTracerState once.

Fidget-Spinner · 2026-01-08T12:38:52Z

I'm also concerned about the amount of allocation and freeing required to compile a small trace. Maybe allocate once and only free when the thread is destroyed?

Isnt that already being done?

Fidget-Spinner · 2026-01-08T12:40:45Z

@cocolato I think we can apply Mark's changes, it means we would need no new allocations anymore as its all heap allocated with the _PyThreadStateImpl struct.

Include/internal/pycore_tstate.h

Python/pystate.c

cocolato · 2026-01-08T16:07:12Z

I'm not sure if the current implementation of the new pycore_optimizer_types.h is correct. I'd appreciate any suggestions for improvements.

Fidget-Spinner

Very cool, thank you!

Fidget-Spinner · 2026-01-09T10:47:12Z

@markshannon

This caused a 100% slowdown in bench_thread_pool benchmark https://doesjitgobrrr.com/run/2026-01-08

We need to lazily allocate it and pointer chase, otherwise we're slowing down significantly the spawning of threads.

…o per-thread heap allocation (pythonGH-143536)" This reverts commit aeb3403.

cocolato · 2026-01-09T11:01:10Z

@markshannon

This caused a 100% slowdown in bench_thread_pool benchmark https://doesjitgobrrr.com/run/2026-01-08

We need to lazily allocate it and pointer chase, otherwise we're slowing down significantly the spawning of threads.

So should we revert to the first version?4e7a918

Fidget-Spinner · 2026-01-09T11:05:44Z

So should we revert to the first version?4e7a918

I will have a PR up.

cocolato · 2026-01-09T11:09:01Z

Sorry, I will run a local benchmark first before making important changes in the next time.

Fidget-Spinner · 2026-01-09T11:12:34Z

Sorry, I will run a local benchmark first before making important changes in the next time.

No it's fine. That's the point of having a benchmark runner, to catch things like this! It's hard to predict what a change will affect, so that's the point of having benchark runners to save human time.

Fidget-Spinner · 2026-01-09T11:14:35Z

PR up at #143597

move JitOptContext to _PyThreadStateImpl

8b38039

cocolato requested review from Fidget-Spinner, ZeroIntensity, ericsnowcurrently, markshannon and tomasr8 as code owners January 8, 2026 03:22

bedevere-app bot added the awaiting review label Jan 8, 2026

cocolato changed the title ~~Move JitOptContext from stack allocation to per-thread heap allocation to avoid stack overflow.~~ gh143421: Move JitOptContext from stack allocation to per-thread heap allocation to avoid stack overflow. Jan 8, 2026

cocolato changed the title ~~gh143421: Move JitOptContext from stack allocation to per-thread heap allocation to avoid stack overflow.~~ gh-143421: Move JitOptContext from stack allocation to per-thread heap allocation to avoid stack overflow. Jan 8, 2026

bedevere-app bot mentioned this pull request Jan 8, 2026

JIT optimizer cleanups #143421

Open

remove redundant func parameter

9717629

Fidget-Spinner reviewed Jan 8, 2026

View reviewed changes

Python/pystate.c Outdated Show resolved Hide resolved

Python/optimizer_analysis.c Outdated Show resolved Hide resolved

Include/internal/pycore_tstate.h Outdated Show resolved Hide resolved

add _PyJitOptState

4e7a918

Fidget-Spinner added the skip news label Jan 8, 2026

Fidget-Spinner approved these changes Jan 8, 2026

View reviewed changes

bedevere-app bot added awaiting merge and removed awaiting review labels Jan 8, 2026

kumaraditya303 reviewed Jan 8, 2026

View reviewed changes

Include/internal/pycore_optimizer.h Outdated Show resolved Hide resolved

Update Include/internal/pycore_optimizer.h

f25f6c8

Co-authored-by: Kumar Aditya <kumaraditya@python.org>

cocolato commented Jan 8, 2026

View reviewed changes

Include/internal/pycore_tstate.h Outdated Show resolved Hide resolved

cocolato closed this Jan 8, 2026

cocolato reopened this Jan 8, 2026

Fidget-Spinner reviewed Jan 8, 2026

View reviewed changes

Python/pystate.c Outdated Show resolved Hide resolved

Embed JitOptContext and code_buffer in _PyThreadStateImpl

32dc7fb

Fidget-Spinner approved these changes Jan 8, 2026

View reviewed changes

Fidget-Spinner merged commit aeb3403 into python:main Jan 8, 2026
62 checks passed

bedevere-app bot removed the awaiting merge label Jan 8, 2026

Fidget-Spinner added a commit to Fidget-Spinner/cpython that referenced this pull request Jan 9, 2026

Revert "pythongh-143421: Move JitOptContext from stack allocation t…

738696f

…o per-thread heap allocation (pythonGH-143536)" This reverts commit aeb3403.

Uh oh!

gh-143421: Move JitOptContext from stack allocation to per-thread heap allocation to avoid stack overflow. #143536

gh-143421: Move JitOptContext from stack allocation to per-thread heap allocation to avoid stack overflow. #143536

Conversation

cocolato commented Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Fidget-Spinner left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cocolato commented Jan 8, 2026

Uh oh!

Fidget-Spinner left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

markshannon commented Jan 8, 2026

Uh oh!

cocolato commented Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Fidget-Spinner commented Jan 8, 2026

Uh oh!

Fidget-Spinner commented Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

cocolato commented Jan 8, 2026

Uh oh!

Fidget-Spinner left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Fidget-Spinner commented Jan 9, 2026

Uh oh!

cocolato commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Fidget-Spinner commented Jan 9, 2026

Uh oh!

cocolato commented Jan 9, 2026

Uh oh!

Fidget-Spinner commented Jan 9, 2026

Uh oh!

Fidget-Spinner commented Jan 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

gh-143421: Move `JitOptContext` from stack allocation to per-thread heap allocation to avoid stack overflow. #143536

gh-143421: Move `JitOptContext` from stack allocation to per-thread heap allocation to avoid stack overflow. #143536

cocolato commented Jan 8, 2026 •

edited

Loading

cocolato commented Jan 8, 2026 •

edited

Loading

Fidget-Spinner commented Jan 8, 2026 •

edited

Loading

cocolato commented Jan 9, 2026 •

edited

Loading