cpython

Commit Graph

Author	SHA1	Message	Date
T. Wouters	de2f7da77d	gh-115999: Add free-threaded specialization for FOR_ITER (#128798 ) Add free-threaded versions of existing specialization for FOR_ITER (list, tuples, fast range iterators and generators), without significantly affecting their thread-safety. (Iterating over shared lists/tuples/ranges should be fine like before. Reusing iterators between threads is not fine, like before. Sharing generators between threads is a recipe for significant crashes, like before.)	2025-03-12 16:21:46 +01:00
Mark Shannon	2bef8ea8ea	GH-127705: Use `_PyStackRef`s in the default build. (GH-127875)	2025-03-10 14:06:56 +00:00
Mark Shannon	89df62c120	GH-128534: Fix behavior of branch monitoring for `async for` (GH-130847) * Both branches in a pair now have a common source and are included in co_branches	2025-03-07 14:30:31 +00:00
Tomasz Pytel	aeb2327386	gh-130574: renumber RESUME opcode from 149 to 128 (GH-130685)	2025-03-06 08:59:36 +00:00
mpage	d7bb7c7817	gh-118331: Fix a couple of issues when list allocation fails (#130811 ) * Fix use after free in list objects Set the items pointer in the list object to NULL after the items array is freed during list deallocation. Otherwise, we can end up with a list object added to the free list that contains a pointer to an already-freed items array. * Mark `_PyList_FromStackRefStealOnSuccess` as escaping I think technically it's not escaping, because the only object that can be decrefed if allocation fails is an exact list, which cannot execute arbitrary code when it is destroyed. However, this seems less intrusive than trying to special cases objects in the assert in `_Py_Dealloc` that checks for non-null stackpointers and shouldn't matter for performance.	2025-03-05 10:42:09 -08:00
Mark Shannon	54965f3fb2	GH-130296: Avoid stack transients in four instructions. (GH-130310) * Combine _GUARD_GLOBALS_VERSION_PUSH_KEYS and _LOAD_GLOBAL_MODULE_FROM_KEYS into _LOAD_GLOBAL_MODULE * Combine _GUARD_BUILTINS_VERSION_PUSH_KEYS and _LOAD_GLOBAL_BUILTINS_FROM_KEYS into _LOAD_GLOBAL_BUILTINS * Combine _CHECK_ATTR_MODULE_PUSH_KEYS and _LOAD_ATTR_MODULE_FROM_KEYS into _LOAD_ATTR_MODULE * Remove stack transient in LOAD_ATTR_WITH_HINT	2025-02-28 18:00:38 +00:00
Mark Shannon	2a18e80695	GH-128534: Instrument branches for `async for` loops. (GH-130569)	2025-02-27 09:36:41 +00:00
Mark Shannon	72f56654d0	GH-128682: Account for escapes in `DECREF_INPUTS` (GH-129953) * Handle escapes in DECREF_INPUTS * Mark a few more functions as escaping * Replace DECREF_INPUTS with PyStackRef_CLOSE where possible	2025-02-12 17:44:59 +00:00
Irit Katriel	a1417b211f	gh-100239: replace BINARY_SUBSCR & family by BINARY_OP with oparg NB_SUBSCR (#129700 )	2025-02-07 22:39:54 +00:00
Mark Shannon	75b628adeb	GH-128563: Generate `opcode = ...` in instructions that need `opcode` (GH-129608) * Remove support for GO_TO_INSTRUCTION	2025-02-03 15:09:21 +00:00
Mark Shannon	808071b994	GH-128682: Make `PyStackRef_CLOSE` escaping. (GH-129404)	2025-02-03 12:41:32 +00:00
Irit Katriel	4815131910	gh-100239: specialize bitwise logical binary ops on ints (#128927 )	2025-01-29 09:28:21 +00:00
Brandt Bucher	828b27680f	GH-126599: Remove the PyOptimizer API (GH-129194)	2025-01-28 16:10:51 -08:00
Mark Shannon	75b4962157	GH-128914: Remove all but one conditional stack effects (GH-129226) * Remove all 'if (0)' and 'if (1)' conditional stack effects * Use array instead of conditional for BUILD_SLICE args * Refactor LOAD_GLOBAL to use a common conditional uop * Remove conditional stack effects from LOAD_ATTR specializations * Replace conditional stack effects in LOAD_ATTR with a 0 or 1 sized array. * Remove conditional stack effects from CALL_FUNCTION_EX	2025-01-27 16:24:48 +00:00
Sam Gross	a10f99375e	Revert "GH-128914: Remove conditional stack effects from `bytecodes.c` and the code generators (GH-128918)" (GH-129202) The commit introduced a ~2.5-3% regression in the free threading build. This reverts commit `ab61d3f430`.	2025-01-23 09:26:25 +00:00
Mark Shannon	470a0a68eb	GH-128682: Change a couple of functions to only steal references on success. (GH-129132) Change PyTuple_FromStackRefSteal and PyList_FromStackRefSteal to only steal on success to avoid escaping	2025-01-22 10:51:37 +00:00
Mark Shannon	ab61d3f430	GH-128914: Remove conditional stack effects from `bytecodes.c` and the code generators (GH-128918)	2025-01-20 17:09:23 +00:00
Irit Katriel	3893a92d95	gh-100239: specialize long tail of binary operations (#128722 )	2025-01-16 15:22:13 +00:00
mpage	b5ee0258bf	gh-115999: Specialize `LOAD_ATTR` for instance and class receivers in free-threaded builds (#128164 ) Finish specialization for LOAD_ATTR in the free-threaded build by adding support for class and instance receivers.	2025-01-14 11:56:11 -08:00
Mark Shannon	517dc65ffc	GH-128682: Stronger checking of `PyStackRef_CLOSE` and `DEAD`. (GH-128683)	2025-01-13 12:37:48 +00:00
Mark Shannon	39fc7ef4fe	GH-124483: Mark `Py_DECREF`, etc. as escaping for the JIT (GH-128678)	2025-01-13 11:42:45 +00:00
Mark Shannon	ddd959987c	GH-128685: Specialize (rather than quicken) LOAD_CONST into LOAD_CONST_[IM]MORTAL (GH-128708)	2025-01-13 10:30:28 +00:00
Mark Shannon	f826beca0c	GH-128375: Better instrument for `FOR_ITER` (GH-128445)	2025-01-06 17:54:47 +00:00
Neil Schemenauer	1b15c89a17	gh-115999: Specialize `STORE_ATTR` in free-threaded builds. (gh-127838) * Add `_PyDictKeys_StringLookupSplit` which does locking on dict keys and use in place of `_PyDictKeys_StringLookup`. * Change `_PyObject_TryGetInstanceAttribute` to use that function in the case of split keys. * Add `unicodekeys_lookup_split` helper which allows code sharing between `_Py_dict_lookup` and `_PyDictKeys_StringLookupSplit`. * Fix locking for `STORE_ATTR_INSTANCE_VALUE`. Create `_GUARD_TYPE_VERSION_AND_LOCK` uop so that object stays locked and `tp_version_tag` cannot change. * Pass `tp_version_tag` to `specialize_dict_access()`, ensuring the version we store on the cache is the correct one (in case of it changing during the specalize analysis). * Split `analyze_descriptor` into `analyze_descriptor_load` and `analyze_descriptor_store` since those don't share much logic. Add `descriptor_is_class` helper function. * In `specialize_dict_access`, double check `_PyObject_GetManagedDict()` in case we race and dict was materialized before the lock. * Avoid borrowed references in `_Py_Specialize_StoreAttr()`. * Use `specialize()` and `unspecialize()` helpers. * Add unit tests to ensure specializing happens as expected in FT builds. * Add unit tests to attempt to trigger data races (useful for running under TSAN). * Add `has_split_table` function to `_testinternalcapi`.	2024-12-19 10:21:17 -08:00
Mark Shannon	d2f1d917e8	GH-122548: Implement branch taken and not taken events for sys.monitoring (GH-122564)	2024-12-19 16:59:51 +00:00
Donghee Na	48c70b8f7d	gh-115999: Enable BINARY_SUBSCR_GETITEM for free-threaded build (gh-127737)	2024-12-19 11:08:17 +09:00
mpage	2de048ce79	gh-115999: Specialize loading attributes from modules in free-threaded builds (#127711 ) We use the same approach that was used for specialization of LOAD_GLOBAL in free-threaded builds: _CHECK_ATTR_MODULE is renamed to _CHECK_ATTR_MODULE_PUSH_KEYS; it pushes the keys object for the following _LOAD_ATTR_MODULE_FROM_KEYS (nee _LOAD_ATTR_MODULE). This arrangement avoids having to recheck the keys version. _LOAD_ATTR_MODULE is renamed to _LOAD_ATTR_MODULE_FROM_KEYS; it loads the value from the keys object pushed by the preceding _CHECK_ATTR_MODULE_PUSH_KEYS at the cached index.	2024-12-13 10:17:16 -08:00
Donghee Na	7c2bd9b226	gh-115999: Use light-weight lock for UNPACK_SEQUENCE_LIST (gh-127514)	2024-12-03 00:14:40 +09:00
Donghee Na	e2713409cf	gh-115999: Add partial free-thread specialization for BINARY_SUBSCR (gh-127227)	2024-12-02 10:38:17 +09:00
mpage	193890c1cc	gh-126612: Include stack effects of uops when computing maximum stack depth (#126894 )	2024-11-26 00:53:49 +00:00
Kirill Podoprigora	27486c3365	gh-115999: Add free-threaded specialization for `UNPACK_SEQUENCE` (#126600 ) Add free-threaded specialization for `UNPACK_SEQUENCE` opcode. `UNPACK_SEQUENCE_TUPLE/UNPACK_SEQUENCE_TWO_TUPLE` are already thread safe since tuples are immutable. `UNPACK_SEQUENCE_LIST` is not thread safe because of nature of lists (there is nothing preventing another thread from adding items to or removing them the list while the instruction is executing). To achieve thread safety we add a critical section to the implementation of `UNPACK_SEQUENCE_LIST`, especially around the parts where we check the size of the list and push items onto the stack. --------- Co-authored-by: Matt Page <mpage@meta.com> Co-authored-by: mpage <mpage@cs.stanford.edu>	2024-11-22 19:00:35 +02:00
Mark Shannon	faa3272fb8	GH-125837: Split `LOAD_CONST` into three. (GH-125972) * Add LOAD_CONST_IMMORTAL opcode * Add LOAD_SMALL_INT opcode * Remove RETURN_CONST opcode	2024-10-29 11:15:42 +00:00
Mark Shannon	25441592db	GH-125515: Reduce number of compiler warnings in generated code (GH-125697)	2024-10-28 10:30:31 +00:00
Mark Shannon	06ca33020e	GH-125323: Convert DECREF_INPUTS_AND_REUSE_FLOAT into a function that takes PyStackRefs. (GH-125439)	2024-10-14 14:18:57 +01:00
mpage	f978fb4f8d	gh-115999: Refactor `LOAD_GLOBAL` specializations to avoid reloading {globals, builtins} keys (gh-124953) Each of the `LOAD_GLOBAL` specializations is implemented roughly as: 1. Load keys version. 2. Load cached keys version. 3. Deopt if (1) and (2) don't match. 4. Load keys. 5. Load cached index into keys. 6. Load object from (4) at offset from (5). This is not thread-safe in free-threaded builds; the keys object may be replaced in between steps (3) and (4). This change refactors the specializations to avoid reloading the keys object and instead pass the keys object from guards to be consumed by downstream uops.	2024-10-09 15:18:25 +00:00
Mark Shannon	da071fa3e8	GH-119866: Spill the stack around escaping calls. (GH-124392) * Spill the evaluation around escaping calls in the generated interpreter and JIT. * The code generator tracks live, cached values so they can be saved to memory when needed. * Spills the stack pointer around escaping calls, so that the exact stack is visible to the cycle GC.	2024-10-07 14:56:39 +01:00
Irit Katriel	78aeb38f7d	gh-124285: Fix bug where bool() is called multiple times for the same part of a boolean expression (#124394 )	2024-09-25 15:51:25 +01:00
Victor Stinner	f1a0d96f41	gh-123091: Use _Py_IsImmortalLoose() (#123511 ) Use _Py_IsImmortalLoose() in bytesobject.c, typeobject.c and ceval.c.	2024-09-02 14:25:19 +02:00
Mark Shannon	5d3201fe3f	GH-123040: Specialize shadowed `LOAD_ATTR`. (GH-123219)	2024-08-23 10:22:35 +01:00
Mark Shannon	bb1d30336e	GH-118093: Make `CALL_ALLOC_AND_ENTER_INIT` suitable for tier 2. (GH-123140) * Convert CALL_ALLOC_AND_ENTER_INIT to micro-ops such that tier 2 supports it * Allow inexact arguments for CALL_ALLOC_AND_ENTER_INIT.	2024-08-20 16:52:58 +01:00
Mark Shannon	c13e7d98fb	GH-118093: Specialize `CALL_KW` (GH-123006)	2024-08-16 17:11:24 +01:00
Brandt Bucher	f84754b705	GH-118093: Turn some DEOPT_IFs into EXIT_IFs (GH-122998)	2024-08-14 07:54:42 -07:00
Mark Shannon	eec7bdaf01	GH-120024: Remove `CHECK_EVAL_BREAKER` macro. (GH-122968) * Factor some instructions into micro-ops to isolate CHECK_EVAL_BREAKER for escape analysis * Eliminate CHECK_EVAL_BREAKER macro	2024-08-14 12:04:05 +01:00
Mark Shannon	7a65439b93	GH-122390: Replace `_Py_GetbaseOpcode` with `_Py_GetBaseCodeUnit` (GH-122942)	2024-08-13 14:22:57 +01:00
Mark Shannon	df13a1821a	GH-118095: Add tier two support for BINARY_SUBSCR_GETITEM (GH-120793)	2024-08-01 16:19:05 -07:00
Mark Shannon	a9d56e38a0	GH-122155: Track local variables between pops and pushes in cases generator (GH-122286)	2024-08-01 09:27:26 +01:00
Mark Shannon	95a73917cd	GH-122029: Break INSTRUMENTED_CALL into micro-ops, so that its behavior is consistent with CALL (GH-122177)	2024-07-26 14:35:57 +01:00
Mark Shannon	afb0aa6ed2	GH-121131: Clean up and fix some instrumented instructions. (GH-121132) * Add support for 'prev_instr' to code generator and refactor some INSTRUMENTED instructions	2024-07-26 12:24:12 +01:00
Brandt Bucher	d9efa45d74	GH-118093: Add tier two support for BINARY_OP_INPLACE_ADD_UNICODE (GH-122253)	2024-07-25 14:45:07 -07:00
Brandt Bucher	5f6001130f	GH-118093: Add tier two support for LOAD_ATTR_PROPERTY (GH-122283)	2024-07-25 10:45:28 -07:00

1 2 3

145 Commits