gh-140232: Do not track frozenset objects with immutables #140234

eendebakpt · 2025-10-16T21:01:34Z

In the PR we untrack frozen tuples for the normal constructors. There are a few methods shared between the set and frozenset (for example set_intersection in setobject.c) where we have not added the untracking. (this is possible, but I am not sure this is worthwhile to do).

Here is a small script to test the idea:

import gc
import time
from statistics import mean

number_of_iterations = 20
number_of_gc_iterations = 50

deltas = []

gc.disable()
gc.collect()
for kk in range(number_of_iterations):
    t0 = time.perf_counter()
    for jj in range(number_of_gc_iterations):
        gc.collect()
    dt = time.perf_counter() - t0
    deltas.append(dt)
print(f"time per collection: mean {1e3 * mean(deltas) / number_of_iterations:.3f} [ms], min {1e3 * min(deltas) / number_of_iterations:.3f} [ms]")

sets = [frozenset([ii]) for ii in range(10_000)]
deltas = []
print("---")
gc.disable()
gc.collect()
for kk in range(number_of_iterations):
    t0 = time.perf_counter()
    for jj in range(number_of_gc_iterations):
        gc.collect()
    dt = time.perf_counter() - t0
    deltas.append(dt)
print(f"time per collection: mean {1e3 * mean(deltas) / number_of_iterations:.3f} [ms], min {1e3 * min(deltas) / number_of_iterations:.3f} [ms]")

#%% Show statistics of frozen containers

gc.collect()

def candidate(obj):
    return all(not gc.is_tracked(x) for x in obj)

for immutable_type in (tuple, frozenset):
    number_of_objects_tracked = 0
    number_of_candidates = 0
    number_of_immutable_candidates = 0

    for obj in gc.get_objects():
        number_of_objects_tracked += 1
        if type(obj) is immutable_type:
            number_of_candidates += 1
            # print(f"{type(obj)} = {obj}")
            if candidate(obj):
                number_of_immutable_candidates += 1

    print(f"type {immutable_type}")
    print(f"  {number_of_objects_tracked=}")
    print(f"  {number_of_candidates=}")
    print(f"  {number_of_immutable_candidates=}")

It measures the performance of garbage collection, and outputs some statistics for the numbers of frozen containers.

Main:

time per collection: mean 1.311 [ms], min 1.301 [ms]
---
time per collection: mean 2.467 [ms], min 2.272 [ms]
type <class 'tuple'>
  number_of_objects_tracked=18330
  number_of_candidates=546
  number_of_immutable_candidates=1
type <class 'frozenset'>
  number_of_objects_tracked=18330
  number_of_candidates=10059
  number_of_immutable_candidates=10057

PR

time per collection: mean 1.285 [ms], min 1.251 [ms]
---
time per collection: mean 1.424 [ms], min 1.396 [ms]
type <class 'tuple'>
  number_of_objects_tracked=8273
  number_of_candidates=546
  number_of_immutable_candidates=6
type <class 'frozenset'>
  number_of_objects_tracked=8273
  number_of_candidates=2
  number_of_immutable_candidates=0

Note: generative ai was used in creating the PR

Issue: Disable tracking of frozenset objects with immutables in the GC #140232

Objects/setobject.c

Co-authored-by: Mikhail Efimov <efimov.mikhail@gmail.com>

sergey-miryanov · 2025-10-17T06:03:52Z

Maybe it is worth to change tp_alloc for something like:

PyObject *
PyFrozenSet_Alloc(PyTypeObject *type, Py_ssize_t nitems)
{
    PyObject *obj = PyType_GenericAlloc(type, nitems);
    if (obj == NULL) {
        return NULL;
    }

    _PyFrozenSet_MaybeUntrack(obj);
    return obj;
}

eendebakpt · 2025-10-17T06:52:35Z

Maybe it is worth to change tp_alloc for something like:

The tp_alloc is used in make_new_set, which in turn is called by make_new_set. The last one is used set_intersection which modifies a frozenset. So adding _PyFrozenSet_MaybeUntrack to tp_alloc would mean we have to add a _PyFrozenSet_MaybeTrack to the end of set_intersection. This is a complication I do not want to tackle (certainly not in this PR).

Lib/test/test_sys.py

sergey-miryanov

Code looks good to me.

…cpython into frozenset_immutable_tracking

Modules/_testcapimodule.c

vstinner · 2025-10-24T12:51:02Z

Would it be possible to write tests in Python rather than in C?

eendebakpt · 2025-10-24T22:15:30Z

Would it be possible to write tests in Python rather than in C?

I tried, but it is not easy. We have to expose PySet_Add (frozenset().add does not exist on the python side). I added pyset_add on the _testcapi module (with pyset_add just calling PySet_Add). But running this on a frozenset from the python side does not work: when calling _testcapi.pyset_add(frozen_set, item) there too many references to the frozen_set and PySet_Add will fail with an internal error here:

cpython/Objects/setobject.c

Line 2778 in d78d7a5

PyErr_BadInternalCall();

And when calling _testcapi.pyset_add(frozenset(), item) we do not have the frozenset available to test whether tracking has been enabled.

sergey-miryanov · 2025-10-26T08:07:22Z

And when calling _testcapi.pyset_add(frozenset(), item) we do not have the frozenset available to test whether tracking has been enabled.

IIUC, if you return the first argument from pyset_add then you can test it on the python side.

eendebakpt · 2025-10-26T19:20:57Z

And when calling _testcapi.pyset_add(frozenset(), item) we do not have the frozenset available to test whether tracking has been enabled.

IIUC, if you return the first argument from pyset_add then you can test it on the python side.

Ok, I gave it another try. The first attempt failed, but by using the vectorcall convention I can keep the reference count at 1 also from the Python side.

…cpython into frozenset_immutable_tracking

Modules/_testcapimodule.c

Modules/_testlimitedcapi/set.c

Objects/setobject.c

Modules/_testlimitedcapi/set.c

Objects/setobject.c

Co-authored-by: Victor Stinner <vstinner@python.org>

vstinner

LGTM. With one last comment :-)

Objects/setobject.c

eendebakpt · 2026-01-27T15:53:47Z

LGTM. With one last comment :-)

Nice. Give me one second to double check something.

efimov-mikhail

LGTM

Objects/setobject.c

eendebakpt · 2026-01-27T18:35:16Z

LGTM. With one last comment :-)

Nice. Give me one second to double check something.

The _PyFrozenSet_MaybeUntrack is only used in two places and in one of then the argument is guaranteed to be an exact frozenset. I checked whether we can simplify code by changing the if (!PyFrozenSet_CheckExact(op)) ... into an assert. It can be done, but then we have to add other check in make_new_frozenset so overall it is not clear improvement.

PR is ready from my side!

sergey-miryanov

Two nitpicks. Otherwise LGTM.

Modules/_testlimitedcapi/set.c

eendebakpt added 3 commits October 16, 2025 21:36

Do not track frozenset objects with immutables

a3292c2

cleanup

cd294a6

cleanup

7e28cf2

eendebakpt requested a review from rhettinger as a code owner October 16, 2025 21:01

bedevere-app bot mentioned this pull request Oct 16, 2025

Disable tracking of frozenset objects with immutables in the GC #140232

Open

bedevere-app bot added the awaiting review label Oct 16, 2025

eendebakpt and others added 3 commits October 16, 2025 23:08

Merge branch 'main' into frozenset_immutable_tracking

30057a5

fix test

c4deb03

📜🤖 Added by blurb_it.

607237a

efimov-mikhail reviewed Oct 17, 2025

View reviewed changes

Objects/setobject.c Outdated Show resolved Hide resolved

Update Objects/setobject.c

2735a71

Co-authored-by: Mikhail Efimov <efimov.mikhail@gmail.com>

sergey-miryanov reviewed Oct 17, 2025

View reviewed changes

Lib/test/test_sys.py Show resolved Hide resolved

sergey-miryanov approved these changes Oct 17, 2025

View reviewed changes

bedevere-app bot added awaiting core review and removed awaiting review labels Oct 17, 2025

eendebakpt mentioned this pull request Oct 22, 2025

gh-140476: Optimize PySet_Add() for frozenset in free-threading #140440

Merged

eendebakpt added 2 commits October 24, 2025 12:41

make sure PySet_Add tracks frozensets if needed

c05db54

Merge branch 'frozenset_immutable_tracking' of github.com:eendebakpt/…

7f6bc4b

…cpython into frozenset_immutable_tracking

sergey-miryanov reviewed Oct 24, 2025

View reviewed changes

Modules/_testcapimodule.c Outdated Show resolved Hide resolved

sergey-miryanov reviewed Oct 24, 2025

View reviewed changes

Modules/_testcapimodule.c Outdated Show resolved Hide resolved

eendebakpt added 2 commits October 24, 2025 14:30

review comment

0b97604

Merge branch 'main' into frozenset_immutable_tracking

948daed

use _testcapi for testing

08e22c3

whitespace

62afc76

eendebakpt added 7 commits January 24, 2026 15:35

revert to fastcall

c62fa9a

Merge branch 'frozenset_immutable_tracking' of github.com:eendebakpt/…

69b728f

…cpython into frozenset_immutable_tracking

fix header

71d9eba

Fully write test in C

5c19de2

adjust tests

e7dd248

cleanup tests

075b582

cleanup tests

3dd4c95

vstinner reviewed Jan 26, 2026

View reviewed changes

Modules/_testcapimodule.c Outdated Show resolved Hide resolved

eendebakpt added 4 commits January 26, 2026 17:31

rework

8d864ef

refactor code

e262963

cleanup

6027391

Merge branch 'main' into frozenset_immutable_tracking

6269f68

eendebakpt commented Jan 27, 2026

View reviewed changes

Modules/_testlimitedcapi/set.c Outdated Show resolved Hide resolved

eendebakpt added 2 commits January 27, 2026 11:32

Update Modules/_testlimitedcapi/set.c

37af64f

Merge branch 'main' into frozenset_immutable_tracking

ea3ba39

eendebakpt commented Jan 27, 2026

View reviewed changes

Modules/_testlimitedcapi/set.c Show resolved Hide resolved

Update Modules/_testlimitedcapi/set.c

ff61155

vstinner reviewed Jan 27, 2026

View reviewed changes

Objects/setobject.c Outdated Show resolved Hide resolved

Objects/setobject.c Outdated Show resolved Hide resolved

Objects/setobject.c Outdated Show resolved Hide resolved

Modules/_testlimitedcapi/set.c Outdated Show resolved Hide resolved

Objects/setobject.c Outdated Show resolved Hide resolved

Apply suggestions from code review

6bb0d58

Co-authored-by: Victor Stinner <vstinner@python.org>

vstinner approved these changes Jan 27, 2026

View reviewed changes

Objects/setobject.c Outdated Show resolved Hide resolved

bedevere-app bot added awaiting merge and removed awaiting core review labels Jan 27, 2026

Update Objects/setobject.c

7a4859b

efimov-mikhail approved these changes Jan 27, 2026

View reviewed changes

Objects/setobject.c Outdated Show resolved Hide resolved

review comment

a2bcfdb

sergey-miryanov approved these changes Jan 27, 2026

View reviewed changes

Modules/_testlimitedcapi/set.c Outdated Show resolved Hide resolved

Modules/_testlimitedcapi/set.c Show resolved Hide resolved

eendebakpt added 2 commits January 27, 2026 20:17

whitespace

88c3f4a

Merge branch 'main' into frozenset_immutable_tracking

be20046

Uh oh!

gh-140232: Do not track frozenset objects with immutables #140234

Are you sure you want to change the base?

gh-140232: Do not track frozenset objects with immutables #140234

Conversation

eendebakpt commented Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

sergey-miryanov commented Oct 17, 2025 • edited by efimov-mikhail Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eendebakpt commented Oct 17, 2025

Uh oh!

Uh oh!

sergey-miryanov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

vstinner commented Oct 24, 2025

Uh oh!

eendebakpt commented Oct 24, 2025

Uh oh!

sergey-miryanov commented Oct 26, 2025

Uh oh!

eendebakpt commented Oct 26, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vstinner left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

eendebakpt commented Jan 27, 2026

Uh oh!

efimov-mikhail left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

eendebakpt commented Jan 27, 2026

Uh oh!

sergey-miryanov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

eendebakpt commented Oct 16, 2025 •

edited

Loading

sergey-miryanov commented Oct 17, 2025 •

edited by efimov-mikhail

Loading