mirror of
https://github.com/ezyang/htmlpurifier.git
synced 2024-12-23 00:41:52 +00:00
e440f25bce
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@417 48356398-32a2-884e-a903-53898d9a118a
50 lines
1.9 KiB
Plaintext
TODO List

Ongoing
- Lots of profiling, make it faster!
- Plugins for major CMSes (very tricky issue)

1.1 release
- Allow HTML 4.01 output (cosmetic changes to the generator)
- Formatters for plaintext
- Auto-paragraphing (be sure to leverage the fact that we know when things
  shouldn't be paragraphed, such as lists and tables).

1.2 release
- Additional support for poorly written HTML
- Implement all non-essential attribute transforms
- Microsoft Word HTML cleaning (e.g. MsoNormal)

1.3 release
- Make URI validation routines tighter (especially mailto)
- More extensive URI filtering schemes
- Allow for background-image and list-style-image (see above)
- Distinguish between different types of URIs, for instance, a mailto URI
  in IMG SRC is nonsensical

2.0 release
- Add various "levels" of cleaning
- Related: Allow strict (X)HTML

3.0 release
- Extended HTML capabilities based on namespacing and tag transforms
- Hooks for adding custom processors to custom namespaced tags and
  attributes, offer default implementation
- Lots of documentation and samples

Unknown release (on a scratch-an-itch basis)
- Silently drop content in between SCRIPT tags (can be generalized to allow
  specification of elements that, when detected as foreign, trigger removal
  of children, although unbalanced tags could wreak havoc (or at least delete
  the rest of the document)).
- Fixes for Firefox's inability to handle COL alignment props (Bug 915)
- Automatically add non-breaking spaces to empty table cells when
  empty-cells:show is applied to have compatibility with Internet Explorer
- Pretty-printing HTML (adds dependency of Generator to HTMLDefinition)
- Non-lossy dumb alternate character encoding transformations, achieved by
  numerically encoding all non-ASCII characters
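
The "dumb" non-lossy transformation in the last item above can be sketched
briefly. This is a minimal illustration in Python, not the library's own code,
and the function name to_ascii_with_char_refs is hypothetical. It replaces
every non-ASCII character with a decimal numeric character reference, so the
output is plain ASCII yet still decodes back to the same text.

```python
import html


def to_ascii_with_char_refs(text: str) -> str:
    """Replace each non-ASCII character with a decimal reference such as &#233;.

    Hypothetical helper for illustration only, not part of HTML Purifier.
    """
    # The 'xmlcharrefreplace' error handler emits &#NNNN; for every character
    # that the ASCII codec cannot represent; ASCII characters pass through.
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")


sample = "caf\u00e9 \u2603"
encoded = to_ascii_with_char_refs(sample)
print(encoded)  # caf&#233; &#9731;

# Non-lossy: decoding the references restores the original string.
assert html.unescape(encoded) == sample
```

The sketch assumes the input is already well-formed HTML in which literal
ampersands are escaped; it does not escape them itself.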

Wontfix
- Non-lossy smart alternate character encoding transformations