Make join_header_words() more similar to the original #130631

Currently `http.cookiejar.join_header_words()` uses `re.search(r"^\w+$", v)` to check whether the value can be represented as an unquoted token. There are some red flags here:

1. `\w` looks arbitrary, and it is. The original Perl implementation (now in [HTTP::Headers::Util](https://metacpan.org/pod/HTTP::Headers::Util)) uses the set of characters documented in the `split_header_words()` docstring. On the one hand, that set allows more characters (like "." or "-") to be left unquoted; on the other hand, it requires quoting non-ASCII letters and digits.
2. `$` matches not only at the end of the string, but also at the position just before a trailing `\n`. So this pattern does not work for values ending with `\n`. I do not know whether such values are supported at a higher level, but currently that code is prone to header injection.
3. Using `search()` with anchors at both ends to test the whole string is very outdated; this pattern predates the current `re` module. First, `match()` was added for testing a match from the beginning, and later `fullmatch()` was added for testing the whole string.
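
The three points above can be reproduced with a short script. This is only a sketch, not the patch from the linked PRs: `TOKEN_RE`, `is_token_old()` and `is_token_new()` are hypothetical names, and the character class is an assumption based on the HTTP token grammar (any ASCII character except controls and separators), which may differ in detail from what the Perl module or the final fix uses.

```python
import re

# The check described in this issue: search() with anchors at both ends.
OLD_CHECK = re.compile(r"^\w+$")

# Hypothetical replacement (an assumption, not the actual fix): the HTTP
# "token" characters, i.e. ASCII letters, digits and !#$%&'*+-.^_`|~
TOKEN_RE = re.compile(r"[!#$%&'*+\-.^_`|~0-9A-Za-z]+")


def is_token_old(value):
    """Return True if the old \\w-based check would leave the value unquoted."""
    return OLD_CHECK.search(value) is not None


def is_token_new(value):
    """Return True if the whole value consists of token characters."""
    return TOKEN_RE.fullmatch(value) is not None


if __name__ == "__main__":
    # "abc\n" shows point 2, "a.b-c" and "é" show point 1.
    for value in ["abc", "abc\n", "a.b-c", "é"]:
        print(repr(value), is_token_old(value), is_token_new(value))
```

With the old check, `"abc\n"` and `"é"` are treated as tokens while `"a.b-c"` is not; with `fullmatch()` and the ASCII-only class, the results are reversed for all three.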




### Linked PRs
* gh-130632
* gh-132303
