cpython

Commit Graph

Author	SHA1	Message	Date
Serhiy Storchaka	dee7beeb4f	bpo-34392: Add sys._is_interned() (GH-8755)	2023-12-04 11:09:06 +02:00
Victor Stinner	11e83488c5	gh-111089: Revert PyUnicode_AsUTF8() changes (#111833) * Revert "gh-111089: Use PyUnicode_AsUTF8() in Argument Clinic (#111585)" This reverts commit `d9b606b3d0`. * Revert "gh-111089: Use PyUnicode_AsUTF8() in getargs.c (#111620)" This reverts commit `cde1071b2a`. * Revert "gh-111089: PyUnicode_AsUTF8() now raises on embedded NUL (#111091)" This reverts commit `d731579bfb`. * Revert "gh-111089: Add PyUnicode_AsUTF8() to the limited C API (#111121)" This reverts commit `d8f32be5b6`. * Revert "gh-111089: Use PyUnicode_AsUTF8() in sqlite3 (#111122)" This reverts commit `37e4e20eaa`.	2023-11-07 22:36:13 +00:00
Victor Stinner	d9b606b3d0	gh-111089: Use PyUnicode_AsUTF8() in Argument Clinic (#111585) Replace PyUnicode_AsUTF8AndSize() with PyUnicode_AsUTF8() to remove the explicit check for embedded null characters. The change avoids to have to include explicitly <string.h> to get the strlen() function when using a recent version of the limited C API.	2023-11-01 16:34:42 +01:00
Furkan Onder	32c37fe1ba	gh-67565: Remove redundant C-contiguity checks (GH-105521) Co-authored-by: Stefan Krah <skrah@bytereef.org>	2023-10-23 12:54:46 +03:00
Victor Stinner	be5e8a0103	gh-110964: Remove private _PyArg functions (#110966) Move the following private functions and structures to pycore_modsupport.h internal C API: * _PyArg_BadArgument() * _PyArg_CheckPositional() * _PyArg_NoKeywords() * _PyArg_NoPositional() * _PyArg_ParseStack() * _PyArg_ParseStackAndKeywords() * _PyArg_Parser structure * _PyArg_UnpackKeywords() * _PyArg_UnpackKeywordsWithVararg() * _PyArg_UnpackStack() * _Py_ANY_VARARGS() Changes: * Python/getargs.h now includes pycore_modsupport.h to export functions. * clinic.py now adds pycore_modsupport.h when one of these functions is used. * Add pycore_modsupport.h includes when a C extension uses one of these functions. * Define Py_BUILD_CORE_MODULE in C extensions which now include directly or indirectly (via code generated by Argument Clinic) pycore_modsupport.h: * _csv * _curses_panel * _dbm * _gdbm * _multiprocessing.posixshmem * _sqlite.row * _statistics * grp * resource * syslog * _testcapi: bad_get() no longer uses METH_FASTCALL calling convention but METH_VARARGS. Replace _PyArg_UnpackStack() with PyArg_ParseTuple(). * _testcapi: add PYTESTCAPI_NEED_INTERNAL_API macro which is defined by _testcapi sub-modules which need the internal C API (pycore_modsupport.h): exceptions.c, float.c, vectorcall.c, watchers.c. * Remove Include/cpython/modsupport.h header file. Include/modsupport.h no longer includes the removed header file. * Fix mypy clinic.py	2023-10-17 14:30:31 +02:00
Donghee Na	0362cbf908	gh-109595: Add -Xcpu_count=<n> cmdline for container users (#109667) --------- Co-authored-by: Victor Stinner <vstinner@python.org> Co-authored-by: Gregory P. Smith [Google LLC] <greg@krypto.org>	2023-10-10 19:00:09 +09:00
Victor Stinner	a0773b89df	gh-108753: Enhance pystats (#108754) Statistics gathering is now off by default. Use the "-X pystats" command line option or set the new PYTHONSTATS environment variable to 1 to turn statistics gathering on at Python startup. Statistics are no longer dumped at exit if statistics gathering was off or statistics have been cleared. Changes: * Add PYTHONSTATS environment variable. * sys._stats_dump() now returns False if statistics are not dumped because they are all equal to zero. * Add PyConfig._pystats member. * Add tests on sys functions and on setting PyConfig._pystats to 1. * Add Include/cpython/pystats.h and Include/internal/pycore_pystats.h header files. * Rename '_py_stats' variable to '_Py_stats'. * Exclude Include/cpython/pystats.h from the Py_LIMITED_API. * Move pystats.h include from object.h to Python.h. * Add _Py_StatsOn() and _Py_StatsOff() functions. Remove '_py_stats_struct' variable from the API: make it static in specialize.c. * Document API in Include/pystats.h and Include/cpython/pystats.h. * Complete pystats documentation in Doc/using/configure.rst. * Don't write "all zeros" stats: if _stats_off() and _stats_clear() or _stats_dump() were called. * _PyEval_Fini() now always call _Py_PrintSpecializationStats() which does nothing if stats are all zeros. Co-authored-by: Michael Droettboom <mdboom@gmail.com>	2023-09-06 15:54:59 +00:00
Victor Stinner	ad73674283	gh-107603: Argument Clinic: Only include pycore_gc.h if needed (#108726) Argument Clinic now only includes pycore_gc.h if PyGC_Head is needed, and only includes pycore_runtime.h if _Py_ID() is needed. * Add 'condition' optional argument to Clinic.add_include(). * deprecate_keyword_use() includes pycore_runtime.h when using the _PyID() function. * Fix rendering of includes: comments start at the column 35. * Mark PC/clinic/_wmimodule.cpp.h and "Objects/stringlib/clinic/.h.h" header files as generated in .gitattributes. Effects: 42 header files generated by AC no longer include the internal C API, instead of 4 header files before. For example, Modules/clinic/_abc.c.h no longer includes the internal C API. * Fix _testclinic_depr.c.h: it now always includes pycore_runtime.h to get _Py_ID().	2023-08-31 23:42:34 +02:00
Victor Stinner	8ba4714611	gh-106320: Remove private AC converter functions (#108505) Move these private functions to the internal C API (pycore_abstract.h): * _Py_convert_optional_to_ssize_t() * _PyNumber_Index() Argument Clinic now emits #include "pycore_abstract.h" when these functions are used. The parser of the c-analyzer tool now uses a list of files which use the limited C API, rather than a list of files using the internal C API.	2023-08-26 04:05:17 +02:00
Victor Stinner	4e5a7284ee	gh-108444: Argument Clinic uses PyLong_AsInt() (#108458) Argument Clinic now uses the new public PyLong_AsInt(), rather than the old name _PyLong_AsInt().	2023-08-25 00:51:22 +02:00
Nikita Sobolev	db6dc6ce41	gh-107526: Revert "gh-100357: Convert several functions in bltinsmodule to AC" (#107542)	2023-08-20 17:54:10 -07:00
Victor Stinner	cbb9ba844f	gh-92536: Argument Clinic no longer emits PyUnicode_READY() (#105208) Since Python 3.12, PyUnicode_READY() does nothing and always returns 0. Argument Clinic now also checks for .cpp files (PC/_wmimodule.cpp).	2023-06-02 01:31:58 +02:00
Pablo Galindo Salgado	9216e69a87	gh-105069: Add a readline-like callable to the tokenizer to consume input iteratively (#105070)	2023-05-30 22:43:34 +01:00
Marta Gómez Macías	6715f91edc	gh-102856: Python tokenizer implementation for PEP 701 (#104323) This commit replaces the Python implementation of the tokenize module with an implementation that reuses the real C tokenizer via a private extension module. The tokenize module now implements a compatibility layer that transforms tokens from the C tokenizer into Python tokenize tokens for backward compatibility. As the C tokenizer does not emit some tokens that the Python tokenizer provides (such as comments and non-semantic newlines), a new special mode has been added to the C tokenizer mode that currently is only used via the extension module that exposes it to the Python layer. This new mode forces the C tokenizer to emit these new extra tokens and add the appropriate metadata that is needed to match the old Python implementation. Co-authored-by: Pablo Galindo <pablogsal@gmail.com>	2023-05-21 01:03:02 +01:00
Eddie Elizondo	ea2c001650	gh-84436: Implement Immortal Objects (gh-19474) This is the implementation of PEP683 Motivation: The PR introduces the ability to immortalize instances in CPython which bypasses reference counting. Tagging objects as immortal allows up to skip certain operations when we know that the object will be around for the entire execution of the runtime. Note that this by itself will bring a performance regression to the runtime due to the extra reference count checks. However, this brings the ability of having truly immutable objects that are useful in other contexts such as immutable data sharing between sub-interpreters.	2023-04-22 13:39:37 -06:00
Mark Shannon	411b169281	GH-103082: Implementation of PEP 669: Low Impact Monitoring for CPython (GH-103083) * The majority of the monitoring code is in instrumentation.c * The new instrumentation bytecodes are in bytecodes.c * legacy_tracing.c adapts the new API to the old sys.setrace and sys.setprofile APIs	2023-04-12 12:04:55 +01:00
Eric Snow	89ac665891	gh-98627: Add an Optional Check for Extension Module Subinterpreter Compatibility (gh-99040) Enforcing (optionally) the restriction set by PEP 489 makes sense. Furthermore, this sets the stage for a potential restriction related to a per-interpreter GIL. This change includes the following: * add tests for extension module subinterpreter compatibility * add _PyInterpreterConfig.check_multi_interp_extensions * add Py_RTFLAGS_MULTI_INTERP_EXTENSIONS * add _PyImport_CheckSubinterpIncompatibleExtensionAllowed() * fail iff the module does not implement multi-phase init and the current interpreter is configured to check https://github.com/python/cpython/issues/98627	2023-02-15 18:16:00 -07:00
Gregory P. Smith	052f53d65d	gh-39615: Add warnings.warn() skip_file_prefixes support (#100840) `warnings.warn()` gains the ability to skip stack frames based on code filename prefix rather than only a numeric `stacklevel=` via a new `skip_file_prefixes=` keyword argument.	2023-01-27 18:35:14 -08:00
Steve Dower	b5d4347950	gh-86682: Adds sys._getframemodulename as an alternative to using _getframe (GH-99520) Also updates calls in collections, doctest, enum, and typing modules to use _getframemodulename first when available.	2023-01-13 11:31:06 +00:00
Shantanu	a2141882f2	gh-100776: Fix misleading default value in help(input) (#100788)	2023-01-08 13:27:41 +05:30
Nikita Sobolev	bdfb694386	gh-100357: Convert several functions in `bltinsmodule` to AC (#100358)	2022-12-24 06:45:47 -08:00
Jeremy Paige	1cf3d78c92	Correct CVE-2020-10735 documentation (#100306)	2022-12-18 12:41:21 +05:30
Serhiy Storchaka	a87c46eab3	bpo-15999: Accept arbitrary values for boolean parameters. (#15609) builtins and extension module functions and methods that expect boolean values for parameters now accept any Python object rather than just a bool or int type. This is more consistent with how native Python code itself behaves.	2022-12-03 11:52:21 -08:00
Brandt Bucher	276d77724f	GH-98686: Quicken everything (GH-98687)	2022-11-02 10:42:57 -07:00
Erlend E. Aasland	723ebe76e7	gh-96143: Improve perf profiler docs (#96445)	2022-10-27 14:06:48 +01:00
Dong-hee Na	a04656ec32	gh-97841: Add methoddef for _filters_mutated (gh-98115)	2022-10-09 20:16:33 +09:00
Michael	07b8e85d0e	gh-96526: Clarify format and __format__ docstrings (gh-96648)	2022-10-03 15:28:02 -07:00
Nikita Sobolev	873a2f2527	Fix typos in `bltinmodule.c`. (GH-97766)	2022-10-03 12:09:03 -07:00
Gregory P. Smith	511ca94520	gh-95778: CVE-2020-10735: Prevent DoS by very large int() (#96499) Integer to and from text conversions via CPython's bignum `int` type is not safe against denial of service attacks due to malicious input. Very large input strings with hundred thousands of digits can consume several CPU seconds. This PR comes fresh from a pile of work done in our private PSRT security response team repo. Signed-off-by: Christian Heimes [Red Hat] <christian@python.org> Tons-of-polishing-up-by: Gregory P. Smith [Google] <greg@krypto.org> Reviews via the private PSRT repo via many others (see the NEWS entry in the PR). * Issue: gh-95778 I wrote up [a one pager for the release managers](https://docs.google.com/document/d/1KjuF_aXlzPUxTK4BMgezGJ2Pn7uevfX7g0_mvgHlL7Y/edit#). Much of that text wound up in the Issue. Backports PRs already exist. See the issue for links.	2022-09-02 09:35:08 -07:00
Pablo Galindo Salgado	6d791a9736	gh-96143: Allow Linux perf profiler to see Python calls (GH-96123) ⚠️ ⚠️ Note for reviewers, hackers and fellow systems/low-level/compiler engineers ⚠️ ⚠️ If you have a lot of experience with this kind of shenanigans and want to improve the first version, please make a PR against my branch or reach out by email or suggest code changes directly on GitHub. If you have any refinements or optimizations please, wait until the first version is merged before starting hacking or proposing those so we can keep this PR productive.	2022-08-30 10:11:18 -07:00
Pablo Galindo Salgado	e34c82abeb	GH-93503: Add thread-specific APIs to set profiling and tracing functions in the C-API (#93504) * gh-93503: Add APIs to set profiling and tracing functions in all threads in the C-API * Use a separate API * Fix NEWS entry * Add locks around the loop * Document ignoring exceptions * Use the new APIs in the sys module * Update docs	2022-08-24 23:21:39 +01:00
Erlend E. Aasland	f07adf82f3	gh-90928: Improve static initialization of keywords tuple in AC (#95907)	2022-08-13 12:09:40 +02:00
Eric Snow	6f6a4e6cc5	gh-90928: Statically Initialize the Keywords Tuple in Clinic-Generated Code (gh-95860) We only statically initialize for core code and builtin modules. Extension modules still create the tuple at runtime. We'll solve that part of interpreter isolation separately. This change includes generated code. The non-generated changes are in: * Tools/clinic/clinic.py * Python/getargs.c * Include/cpython/modsupport.h * Makefile.pre.in (re-generate global strings after running clinic) * very minor tweaks to Modules/_codecsmodule.c and Python/Python-tokenize.c All other changes are generated code (clinic, global strings).	2022-08-11 15:25:49 -06:00
Oleg Iarygin	41e0585ffa	gh-91102: Port 8-argument _warnings.warn_explicit to Argument Clinic (#92891) Co-authored-by: Erlend E. Aasland <erlend.aasland@protonmail.com>	2022-07-20 22:24:51 +02:00
Mark Shannon	6f8875eba3	GH-93841: Allow stats to be turned on and off, cleared and dumped at runtime. (GH-93843)	2022-06-21 15:40:54 +01:00
larryhastings	5021064390	gh-92203: Add closure support to exec(). (#92204) Add a closure keyword-only parameter to exec(). It can only be specified when exec-ing a code object that uses free variables. When specified, it must be a tuple, with exactly the number of cell variables referenced by the code object. closure has a default value of None, and it must be None if the code object doesn't refer to any free variables.	2022-05-06 10:09:35 -07:00
Victor Stinner	b270b82f11	gh-91320: Argument Clinic uses _PyCFunction_CAST() (#32210) Replace "(PyCFunction)(void(*)(void))func" cast with _PyCFunction_CAST(func).	2022-05-03 20:25:41 +02:00
Kumar Aditya	6f3b9e2243	Use FASTCALL for __import__ (GH-31752)	2022-03-11 18:46:55 +02:00
Irit Katriel	c590b581bb	bpo-46328: Add sys.exception() (GH-30514)	2022-01-13 12:35:58 +00:00
Eric Snow	79cf20e48d	bpo-21736: Set __file__ on frozen stdlib modules. (gh-28656) Currently frozen modules do not have __file__ set. In their spec, origin is set to "frozen" and they are marked as not having a location. (Similarly, for frozen packages __path__ is set to an empty list.) However, for frozen stdlib modules we are able to extrapolate __file__ as long as we can determine the stdlib directory at runtime. (We now do so since gh-28586.) Having __file__ set is helpful for a number of reasons. Likewise, having a non-empty __path__ means we can import submodules of a frozen package from the filesystem (e.g. we could partially freeze the encodings module). This change sets __file__ (and adds to __path__) for frozen stdlib modules. It uses sys._stdlibdir (from gh-28586) and the frozen module alias information (from gh-28655). All that work is done in FrozenImporter (in Lib/importlib/_bootstrap.py). Also, if a frozen module is imported before importlib is bootstrapped (during interpreter initialization) then we fix up that module and its spec during the importlib bootstrapping step (i.e. imporlib._bootstrap._setup()) to match what gets set by FrozenImporter, including setting the file info (if the stdlib dir is known). To facilitate this, modules imported using PyImport_ImportFrozenModule() have __origname__ set using the frozen module alias info. __origname__ is popped off during importlib bootstrap. (To be clear, even with this change the new code to set __file__ during fixups in imporlib._bootstrap._setup() doesn't actually get triggered yet. This is because sys._stdlibdir hasn't been set yet in interpreter initialization at the point importlib is bootstrapped. However, we do fix up such modules at that point to otherwise match the result of importing through FrozenImporter, just not the __file__ and __path__ parts. Doing so will require changes in the order in which things happen during interpreter initialization. That can be addressed separately. Once it is, the file-related fixup code from this PR will kick in.) Here are things this change does not do: * set __file__ for non-stdlib modules (no way of knowing the parent dir) * set __file__ if the stdlib dir is not known (nor assume the expense of finding it) * relatedly, set __file__ if the stdlib is in a zip file * verify that the filename set to __file__ actually exists (too expensive) * update __path__ for frozen packages that alias a non-package (since there is no package dir) Other things this change skips, but we may do later: * set __file__ on modules imported using PyImport_ImportFrozenModule() * set co_filename when we unmarshal the frozen code object while importing the module (e.g. in FrozenImporter.exec_module()) -- this would allow tracebacks to show source lines * implement FrozenImporter.get_filename() and FrozenImporter.get_source() https://bugs.python.org/issue21736	2021-10-14 15:32:18 -06:00
Eric Snow	08285d563e	bpo-45020: Identify which frozen modules are actually aliases. (gh-28655) In the list of generated frozen modules at the top of Tools/scripts/freeze_modules.py, you will find that some of the modules have a different name than the module (or .py file) that is actually frozen. Let's call each case an "alias". Aliases do not come into play until we get to the (generated) list of modules in Python/frozen.c. (The tool for freezing modules, Programs/_freeze_module, is only concerned with the source file, not the module it will be used for.) Knowledge of which frozen modules are aliases (and the identity of the original module) normally isn't important. However, this information is valuable when we go to set __file__ on frozen stdlib modules. This change updates Tools/scripts/freeze_modules.py to map aliases to the original module name (or None if not a stdlib module) in Python/frozen.c. We also add a helper function in Python/import.c to look up a frozen module's alias and add the result of that function to the frozen info returned from find_frozen(). https://bugs.python.org/issue45020	2021-10-05 11:26:37 -06:00
Eric Snow	c3d9ac8b34	bpo-45324: Capture data in FrozenImporter.find_spec() to use in exec_module(). (gh-28633) Before this change we end up duplicating effort and throwing away data in FrozenImporter.find_spec(). Now we do the work once in find_spec() and the only thing we do in FrozenImporter.exec_module() is turn the raw frozen data into a code object and then exec it. We've added _imp.find_frozen(), add an arg to _imp.get_frozen_object(), and updated FrozenImporter. We've also moved some code around to reduce duplication, get a little more consistency in outcomes, and be more efficient. Note that this change is mostly necessary if we want to set __file__ on frozen stdlib modules. (See https://bugs.python.org/issue21736.) https://bugs.python.org/issue45324	2021-10-05 10:01:27 -06:00
Eric Snow	a65c86889e	bpo-45020: Add -X frozen_modules=[on|off] to explicitly control use of frozen modules. (gh-28320) Currently we freeze several modules into the runtime. For each of these modules it is essential to bootstrapping the runtime that they be frozen. Any other stdlib module that we later freeze into the runtime is not essential. We can just as well import from the .py file. This PR lets users explicitly choose which should be used, with the new "-X frozen_modules=[on|off]" CLI flag. The default is "off" for now. https://bugs.python.org/issue45020	2021-09-14 17:31:45 -06:00
Eric Snow	a2d8c4b81b	bpo-45019: Do some cleanup related to frozen modules. (gh-28319) There are a few things I missed in gh-27980. This is a follow-up that will make subsequent PRs cleaner. It includes fixes to tests and tools that reference the frozen modules. https://bugs.python.org/issue45019	2021-09-13 16:18:37 -06:00
Pablo Galindo Salgado	a24676bedc	Add tests for the C tokenizer and expose it as a private module (GH-27924)	2021-08-24 17:50:05 +01:00
Batuhan Taskaya	9af34c9351	bpo-20201: variadic arguments support for AC (GH-18609) Implement support for `*args` in AC, and port `print()` to use it.	2021-07-16 18:43:02 +03:00
Pablo Galindo Salgado	4cb7263f0c	Remove sys._deactivate_opcache() now that is not needed (GH-27154)	2021-07-15 14:43:59 +01:00
Erik Welch	6af4e6b266	bpo-43918: document signature and default argument of `anext` builtin (#25551) Co-authored-by: Dennis Sweeney <36520290+sweeneyde@users.noreply.github.com>	2021-06-22 14:00:51 -07:00
Mark Shannon	001eb520b5	bpo-44187: Quickening infrastructure (GH-26264) * Add co_firstinstr field to code object. * Implement barebones quickening. * Use non-quickened bytecode when tracing. * Add NEWS item * Add new file to Windows build. * Don't specialize instructions with EXTENDED_ARG.	2021-06-07 18:38:06 +01:00
Joshua Bronson	f0a6fde882	bpo-31861: Add aiter and anext to builtins (#23847) Co-authored-by: jab <jab@users.noreply.github.com> Co-authored-by: Daniel Pope <mauve@mauveweb.co.uk> Co-authored-by: Justin Wang <justin39@gmail.com>	2021-03-23 15:47:21 -07:00

107 Commits