htmlpurifier

mirrors/htmlpurifier

mirror of https://github.com/ezyang/htmlpurifier.git synced 2024-12-23 08:51:53 +00:00

Commit Graph

Author	SHA1	Message	Date
Edward Z. Yang	a44187a5c1	Cleanup after data validation.	2012-10-27 02:30:58 -07:00
Edward Z. Yang	e76f4b45d0	Dramatically rewrite null host URI handling.	2011-01-25 18:56:46 +00:00
Edward Z. Yang	97125ed18b	Implement data URI scheme.	2010-03-07 21:45:39 -05:00

Edward Z. Yang

a44187a5c1

Cleanup after data validation.

Signed-off-by: Edward Z. Yang <ezyang@mit.edu>

2012-10-27 02:30:58 -07:00

Edward Z. Yang

e76f4b45d0

Dramatically rewrite null host URI handling.

Basically, browsers don't parse what should be valid URIs correctly, so
we have to go through some backbends to accommodate them.  Specifically,
for browseable URIs, the following URIs have unintended behavior:

    - ///example.com
    - http:/example.com
    - http:///example.com

Furthermore, if the path begins with //, modifying these URLs must
be done with care: if you remove the host-name component, the
parse tree changes.
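
To illustrate the parse-tree hazard described above, here is a minimal sketch using Python's standard `urllib.parse.urlsplit` rather than HTML Purifier's own parser. The helper `naive_recompose` is hypothetical and exists only to show what goes wrong when an empty host is simply omitted during reassembly.

```python
from urllib.parse import urlsplit


def naive_recompose(scheme: str, host: str, path: str) -> str:
    """Hypothetical helper: glue components together, omitting an empty host."""
    out = scheme + ":" if scheme else ""
    if host:
        out += "//" + host
    return out + path


# A URI whose path begins with "//".
original = urlsplit("http://example.com//foo/bar")

# Drop the host and rebuild naively: the leading "//" of the path is now
# read as the start of an authority component.
rebuilt = naive_recompose(original.scheme, "", original.path)
reparsed = urlsplit(rebuilt)

print(original.netloc, original.path)  # host and path before the edit
print(rebuilt)                         # the naively rebuilt string
print(reparsed.netloc, reparsed.path)  # "foo" has become the host

# The three forms listed above all parse with an empty host under the
# generic URI grammar; what used to look like a host is part of the path.
for candidate in ("///example.com", "http:/example.com", "http:///example.com"):
    parts = urlsplit(candidate)
    print(repr(candidate), repr(parts.netloc), repr(parts.path))
```

The point of the sketch is only that removing a host is not a local edit: the remaining string has to be re-checked, or rejected, so that a path segment cannot be promoted to a host.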

I've modified the engine to follow correct URI semantics as much
as possible while outputting browser-compatible code, and invalidate
the URI in cases where we can't deal with it.  There has been a
refactoring of URIScheme so that this important check is always
performed, introducing a new member variable allow_empty_host which
is true on data, file, mailto and news schemes.
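
The per-scheme check described above can be sketched roughly as follows. This is an illustration in Python, not HTML Purifier's PHP implementation: the table `ALLOW_EMPTY_HOST` and the function `host_is_acceptable` are hypothetical names. Only the `True` entries for data, file, mailto and news come from the message; the `False` entries for http, https and ftp are an assumption inferred from it.

```python
from typing import Optional

# Hypothetical lookup table standing in for a per-scheme allow_empty_host flag.
ALLOW_EMPTY_HOST = {
    "data": True,     # stated in the commit message
    "file": True,     # stated in the commit message
    "mailto": True,   # stated in the commit message
    "news": True,     # stated in the commit message
    "http": False,    # assumption: not listed, so an empty host is rejected
    "https": False,   # assumption
    "ftp": False,     # assumption
}


def host_is_acceptable(scheme: str, host: Optional[str]) -> bool:
    """Return False when the scheme is unknown or forbids an empty host."""
    allow_empty = ALLOW_EMPTY_HOST.get(scheme.lower())
    if allow_empty is None:
        return False  # unknown scheme: reject in this sketch
    if not host:
        return allow_empty  # None and "" both count as an empty host
    return True


if __name__ == "__main__":
    for scheme, host in [("http", ""), ("http", "example.com"),
                         ("mailto", None), ("data", "")]:
        print(scheme, repr(host), host_is_acceptable(scheme, host))
```

Keeping the flag next to the scheme definition, rather than in each caller, is what lets the check run on every URI regardless of which filter touched it.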

This also fixes bypass bugs on URI.Munge.

Signed-off-by: Edward Z. Yang <ezyang@mit.edu>

2011-01-25 18:56:46 +00:00

Edward Z. Yang

97125ed18b

Implement data URI scheme.

Signed-off-by: Edward Z. Yang <ezyang@mit.edu>

2010-03-07 21:45:39 -05:00

3 Commits