cpython

Commit Graph

Author	SHA1	Message	Date
R David Murray	97f43c019f	#15160 : Extend the new email parser to handle MIME headers. This code passes all the same tests that the existing RFC mime header parser passes, plus a bunch of additional ones. There are a couple of commented out tests where there are issues with the folding. The folding doesn't normally get invoked for headers parsed from source, and the cases are marginal anyway (headers with invalid binary data) so I'm not worried about them, but will fix them after the beta. There are things that can be done to make this API even more convenient, but I think this is a solid foundation worth having. And the parser is a full RFC parser, so it handles cases that the current parser doesn't. (There are also probably cases where it fails when the current parser doesn't, but I haven't found them yet ;) Oh, yeah, and there are some really ugly bits in the parser for handling some 'postel' cases that are unfortunately common. I hope/plan to to eventually refactor a lot of the code in the parser which should reduce the line count...but there is no escaping the fact that the error recovery is welter of special cases.	2012-06-24 05:03:27 -04:00
Alexander Belopolsky	76935b9c8c	Issue #14653 : email.utils.mktime_tz() no longer relies on system mktime() when timezone offest is supplied.	2012-06-21 20:48:23 -04:00
R David Murray	82ffabdfa4	#2658 : Add test for issue fixed by fix for #1079 .	2012-06-03 12:27:07 -04:00
R David Murray	07ea53cb21	#1079 : Fix parsing of encoded words. This is a behavior change: before this leading and trailing spaces were stripped from ASCII parts, now they are preserved. Without this fix we didn't parse the examples in the RFC correctly, so I think breaking backward compatibility here is justified. Patch by Ralf Schlatterbeck.	2012-06-02 17:56:49 -04:00
R David Murray	1be413e366	Don't use metaclasses when class decorators can do the job. Thanks to Nick Coghlan for pointing out that I'd forgotten about class decorators.	2012-05-31 18:00:45 -04:00
R David Murray	56517e5cb9	Make parameterized tests in email less hackish. Or perhaps more hackish, depending on your perspective. But at least this way it is now possible to run the individual tests using the unittest CLI.	2012-05-30 21:53:40 -04:00
R David Murray	026ba312d4	#10839 : add new test file that was omitted from checkin	2012-05-29 12:31:11 -04:00
R David Murray	6bed342b58	Refactor test_email/test_pickleable and add tests for date headers	2012-05-28 21:09:04 -04:00
R David Murray	a7c9ddb59c	Regularize test_email/test_headerregistry's references to policy.	2012-05-28 20:22:37 -04:00
R David Murray	d41595b920	Refactor test_email/test_defect_handling.	2012-05-28 20:14:10 -04:00
R David Murray	7ef3ff3f2e	#12515 : email now registers a defect if the MIME end boundary is missing. This commit also restores the news item for 167256 that it looks like Terry inadvertently deleted. (Either that, or I don't understand now merging works...which is equally possible.)	2012-05-27 22:20:42 -04:00
R David Murray	80e0aee95b	#1672568 : email now registers defects for base64 payload format errors. Which also means that it is now producing something for any base64 payload, which is what leads to the couple of older test changes in test_email. This is a slightly backward incompatible behavior change, but the new behavior is so much more useful than the old (you can now reliably detect errors, and any program that was detecting errors by sniffing for a base64 return from get_payload(decode=True) and then doing its own error-recovery decode will just get the error-recovery decode right away). So this seems to me to be worth the small risk inherent in this behavior change. This patch also refactors the defect tests into a separate test file, since they are no longer just parser tests.	2012-05-27 21:23:34 -04:00
R David Murray	adbdcdbd95	#14925 : email now registers a defect for missing header/body separator. This patch also deprecates the MalformedHeaderDefect. My best guess is that this defect was rendered obsolete by a refactoring of the parser, and the corresponding defect for the new parser (which this patch introduces) was overlooked.	2012-05-27 20:45:01 -04:00
R David Murray	ea9766897b	Make headerregistry fully part of the provisional api. When I made the checkin of the provisional email policy, I knew that Address and Group needed to be made accessible from somewhere. The more I looked at it, though, the more it became clear that since this is a provisional API anyway, there's no good reason to hide headerregistry as a private API. It was designed to ultimately be part of the public API, and so it should be part of the provisional API. This patch fully documents the headerregistry API, and deletes the abbreviated version of those docs I had added to the provisional policy docs.	2012-05-27 15:03:38 -04:00
R David Murray	032eed3c4a	Recognize '<>' as a special case of an angle-addr in header_value_parser. Although '<>' is invalid according to RFC 5322, SMTP uses it for various things, and it sometimes ends up in email headers. This patch changes get_angle_addr to recognize it and just register a Defect instead of raising a parsing error.	2012-05-26 14:31:12 -04:00
R David Murray	d2d521eafd	#665194 : Add a localtime function to email.utils. Without this function people would be tempted to use the other date functions in email.utils to compute an aware localtime, and those functions are not as good for that purpose as this code. The code is Alexander Belopolsy's from his proposed patch for issue 9527, with a fix (and additional tests) by Brian K. Jones.	2012-05-25 23:22:59 -04:00
R David Murray	dcaf2ece6c	#12586 : Fix a small oversight in the new email policy header setting code. This is a danger of focusing on unit tests: sometimes you forget to do the integration tests.	2012-05-25 22:53:12 -04:00
R David Murray	0b6f6c82b5	#12586 : add provisional email policy with new header parsing and folding. When the new policies are used (and only when the new policies are explicitly used) headers turn into objects that have attributes based on their parsed values, and can be set using objects that encapsulate the values, as well as set directly from unicode strings. The folding algorithm then takes care of encoding unicode where needed, and folding according to the highest level syntactic objects. With this patch only date and time headers are parsed as anything other than unstructured, but that is all the helper methods in the existing API handle. I do plan to add more parsers, and complete the set specified in the RFC before the package becomes stable.	2012-05-25 18:42:14 -04:00
R David Murray	c27e52265b	#14731 : refactor email policy framework. This patch primarily does two things: (1) it adds some internal-interface methods to Policy that allow for Policy to control the parsing and folding of headers in such a way that we can construct a backward compatibility policy that is 100% compatible with the 3.2 API, while allowing a new policy to implement the email6 API. (2) it adds that backward compatibility policy and refactors the test suite so that the only differences between the 3.2 test_email.py file and the 3.3 test_email.py file is some small changes in test framework and the addition of tests for bugs fixed that apply to the 3.2 API. There are some additional teaks, such as moving just the code needed for the compatibility policy into _policybase, so that the library code can import only _policybase. That way the new code that will be added for email6 will only get imported when a non-compatibility policy is imported.	2012-05-25 15:01:48 -04:00
R David Murray	42243c4dca	#14380 : Make actual default match docs, fix __init__ order. Éric pointed out that given that the default was documented as None, someone would reasonably pass that to get the default behavior. In fixing the code to use None, I noticed that the change to _charset was being done after it had already been passed to MIMENonMultipart. The change to the test verifies that the order is now correct.	2012-03-22 22:40:44 -04:00
R David Murray	8680bcc5db	#14380 : Have MIMEText defaults to utf-8 when passed non-ASCII unicode Previously it would just accept the unicode, which would wind up as unicode in the transfer-encoded message object, which is just wrong. Patch by Jeff Knupp.	2012-03-22 22:17:51 -04:00
R David Murray	80e22b56d3	Merge #11686 : add missing entries to email __all__ lists. Original patch by Steffen Daode Nurpmeso	2012-03-16 22:46:14 -04:00
R David Murray	7104c7295c	#12788 : fix error in test_policy when run under refleak detection	2012-03-16 21:39:57 -04:00
R David Murray	b53319f509	#12818 : remove escaping of () in quoted strings in formataddr The quoting of ()s inside quoted strings is allowed by the RFC, but is not needed. There seems to be no reason to add needless escapes.	2012-03-14 15:31:47 -04:00
R David Murray	8d8f110492	#14062 : fix BytesParser handling of Header objects This is a different fix than the 3.2 fix, but the new tests are the same. This also affected smtplib.SMTP.send_message, which calls BytesParser.	2012-03-14 14:24:22 -04:00
R David Murray	e2922835b0	Merge #14291 : if a header has non-ascii unicode, default to CTE using utf-8 In Python2, if a unicode string was assigned as the value of a header, email would automatically CTE encode it using the UTF8 charset. This capability was lost in the Python3 translation, and this patch restores it. Patch by Ali Ikinci, assisted by R. David Murray. I also added a fix for the mailbox test that was depending (with a comment that it was a bad idea to so depend) on non-ASCII causing message_from_string to raise an error. It now uses support.patch to induce an error during message serialization.	2012-03-14 03:03:27 -04:00
R David Murray	910df329fd	#8315 : add automatic unittest test discovery in test.test_email	2012-03-13 18:02:22 -04:00
Ezio Melotti	d8b509b192	#13012 : use splitlines(keepends=True/False) instead of splitlines(0/1).	2011-09-28 17:37:55 +03:00
R David Murray	875048bd4c	#665194 : support roundtripping RFC2822 date stamps in the email.utils module	2011-07-20 11:41:21 -04:00
R David Murray	749073af13	#1874 : detect invalid multipart CTE and report it as a defect.	2011-06-22 13:47:53 -04:00
R David Murray	e76ff4081a	merge #11584 : make Header and make_header handle binary unknown-8bit input	2011-06-18 13:02:42 -04:00
R David Murray	7df08379c6	merge #11584 : make decode_header handle Header objects correctly This updates 12e39cd7a0e4 (merge of b21fdfa0019c), which fixed this bug incorrectly.	2011-06-18 12:32:27 -04:00
R David Murray	3edd22ac95	#11731 : simplify/enhance parser/generator API by introducing policy objects. This new interface will also allow for future planned enhancements in control over the parser/generator without requiring any additional complexity in the parser/generator API. Patch reviewed by Éric Araujo and Barry Warsaw.	2011-04-18 13:59:37 -04:00
R David Murray	f3299989a2	Merge: #11492 : rewrite header folding algorithm. Less code, more passing tests.	2011-04-18 10:11:06 -04:00
R David Murray	e2c4cfce54	Merge: Improve message.py test coverage to 100%. coverage.py reports 99% on branch coverage, but that appears to be a bug or limitation in coverage.py.	2011-04-16 09:21:49 -04:00
R David Murray	b35c850a3f	#11684 : Complete parser bytes interface by adding BytesHeaderParser Patch by Steffen Daode Nurpmeso.	2011-04-13 16:46:05 -04:00
R David Murray	eb9e074dca	Use stock assertEqual instead of custom ndiffAssertEqual. Eventually I'll actually replace the calls in the tests themselves.	2011-04-10 15:28:29 -04:00
R David Murray	7ede59d77a	Merge #11492 : fix header truncation on folding when there are runs of split chars. Not a complete fix for this issue.	2011-04-07 21:00:33 -04:00
R David Murray	63d320b44f	Merge: Improve test coverage of _split_ascii method.	2011-04-07 20:42:28 -04:00
R David Murray	8debacb51c	#1690608 : make formataddr RFC2047 aware. Patch by Torsten Becker.	2011-04-06 09:35:57 -04:00
R David Murray	a0b1c77a19	Merge #11605 : don't use set/get_payload in feedparser; they do conversions.	2011-04-06 08:16:13 -04:00
R David Murray	a46ed1186f	Move assertBytesEqual to base test class, improve it, and hook into assertEqual	2011-03-31 13:11:40 -04:00
R David Murray	a256bacf91	Move infrastructure into __init__ to lay groundwork for splitting test_email. The split probably won't happen for a while, but I might as well lay the groundwork now since I'll be adding new test modules before too long.	2011-03-31 12:20:23 -04:00
R David Murray	28346b8077	Only a few files were opened using findfile; consistently don't use it.	2011-03-31 11:40:20 -04:00
R David Murray	1ebdd714ac	Add a __main__.py to test_email so it works with -m like it did before move.	2011-03-29 09:59:45 -04:00
R David Murray	961355a56a	Merge #11584 : Since __getitem__ returns headers, make decode_header handle them.	2011-03-25 15:31:52 -04:00
R David Murray	73bd0448b9	Merge #11606 : improved body_encode algorithm, no longer produces overlong lines	2011-03-24 12:28:39 -04:00
R David Murray	5839b9635c	Merge #11590 : fix quoprimime decode handling of empty strings and line endings.	2011-03-23 15:37:26 -04:00
R David Murray	3dcf745a61	Merge #11589 : add additional tests for the email quoprimime module.	2011-03-23 14:29:49 -04:00
R David Murray	39bc38c39c	Fix rename spelling error.	2011-03-21 17:36:15 -04:00

1 2

51 Commits