TODO List

= KEY ====================
    # Flagship
    - Regular
    ? Maybe I'll Do It
==========================

If no interest is expressed for a feature that may require a considerable
amount of effort to implement, it may get endlessly delayed. Do not be
afraid to cast your vote for the next feature to be implemented!

UPCOMING RELEASE
----------------

IMPORTANT
 - Release candidate, because of the major changes

DOCUMENTATION
 - Update French translation of README

IMPORTANT FEATURES
 - Get everything into configuration objects (filters, I'm looking at you)
 - Factor out command line parser into its own class, and unit test it
 - Figure out autoload and PEAR

CONFIGDOC
 - Properly integrate new ConfigSchema system into configdoc. DESCRIPTIONS
   ARE CURRENTLY BROKEN AND NEED TO BE FIXED!!! (Configdoc
   should directly read the configuration files, or at the very least should
   not use static functions)
 - Have configdoc use version and deprecated information (hide deprecated
   info, for example)
 - Implement source code sniffing for configdoc, so we can easily figure out
   which files use what configuration (we'll rely on the $config convention)

NICE FEATURES
 - Factor demo.php into a set of Printer classes, and then create a stub
   file for users here (inside the actual HTML Purifier library)
 - Support exporting configuration, so users can easily tweak settings
   in the demo, and then copy-paste into their own setup

BUGS
 - Style attribute height/width limiting for images
 - Easy way to blacklist elements and attributes
 - Investigate iconv error emitting
 - Investigate UTF-8 optimization <http://htmlpurifier.org/phorum/read.php?3,1496>
 - Figure out what to do about target="" and name="", since they show up so often
 - Update htmLawed comparison

FUTURE VERSIONS
---------------

3.2 release [Error'ed]
 # Error logging for filtering/cleanup procedures
 - XSS-attempt detection

3.3 release [Do What I Mean, Not What I Say]
 # Additional support for poorly written HTML
    - Microsoft Word HTML cleaning (e.g. MsoNormal, but research essential!)
    - Friendly strict handling of <address> (block -> <br>)
 - Remove redundant tags, ex. <u><u>Underlined</u></u>. Implementation notes:
    1. Analyze which tags should have duplicates removed
    2. Ensure attributes are merged into the parent tag
    3. Extend the tag exclusion system to specify whether the contents
       should be dropped (currently, there's code that could do
       something like this if it didn't drop the inner text too.)
 - Remove <span> tags that don't do anything (no attributes)
 - Remove empty inline tags, ex. <i></i>
 - Append something to duplicate IDs so they're still usable (impl. note: the
   dupe detector would also need to detect the suffix)
 - Externalize inline CSS to promote clean HTML

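The duplicate-ID item above could be sketched roughly as follows. This is
illustrative Python, not HTML Purifier code: the "__dupN" suffix format and
the dedupe_ids() helper are assumptions made up for this example. It shows
why the detector has to recognize its own suffix: an ID that already looks
like a generated one must not collide with, or be double-suffixed by, a
later rename.

```python
import re

# Hypothetical suffix format "__dupN"; not an HTML Purifier convention.
SUFFIX_RE = re.compile(r"^(?P<base>.+?)__dup(?P<n>\d+)$")


def dedupe_ids(ids):
    """Return the IDs in order, renaming repeats so every value is unique."""
    seen = set()
    result = []
    for original in ids:
        candidate = original
        if candidate in seen:
            # Detect an existing suffix so "a__dup1" is renamed to
            # "a__dup2" rather than "a__dup1__dup1".
            match = SUFFIX_RE.match(candidate)
            base = match.group("base") if match else candidate
            n = 1
            candidate = f"{base}__dup{n}"
            # Skip suffixes already taken, including ones supplied by the user.
            while candidate in seen:
                n += 1
                candidate = f"{base}__dup{n}"
        seen.add(candidate)
        result.append(candidate)
    return result
```

For example, dedupe_ids(["a", "a__dup1", "a"]) leaves the first two IDs
alone and renames the third to "a__dup2", since "a__dup1" is already taken.
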
3.4 release [It's All About Trust] (floating)
 # Implement untrusted, dangerous elements/attributes
    - Objects and Forms are especially wanted
 # Implement IDREF support (harder than it seems, since you cannot have
   IDREFs to non-existent IDs)
 # Frameset XHTML 1.0 and HTML 4.01 doctypes

4.0 release [Beyond HTML]
 # Legit token based CSS parsing (will require revamping almost every
   AttrDef class). Probably will use CSSTidy class
 # More control over allowed CSS properties (maybe modularize it in the
   same fashion!)
 # HTML 5 support
 - Standardize token armor for all areas of processing
 - Convert RTL/LTR override characters to <bdo> tags, or vice versa on demand.
   Also, allow directionality to be disabled
 - Table of Contents generation (XHTML Compiler might be reusable)

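One direction of the RTL/LTR override item above (characters to <bdo>)
might look something like this. It is an illustrative Python sketch, not
HTML Purifier code; overrides_to_bdo() is a made-up name, and the sketch
assumes the input is plain text containing only the override characters
U+202D (LRO) and U+202E (RLO) closed by U+202C (PDF). Embedding characters
(U+202A, U+202B), which PDF also terminates, are not handled.

```python
import html

LRO = "\u202d"  # LEFT-TO-RIGHT OVERRIDE
RLO = "\u202e"  # RIGHT-TO-LEFT OVERRIDE
PDF = "\u202c"  # POP DIRECTIONAL FORMATTING


def overrides_to_bdo(text):
    """Replace override characters in plain text with nested <bdo> tags."""
    out = []
    depth = 0  # number of <bdo> tags currently open
    for ch in text:
        if ch == LRO:
            out.append('<bdo dir="ltr">')
            depth += 1
        elif ch == RLO:
            out.append('<bdo dir="rtl">')
            depth += 1
        elif ch == PDF:
            # A stray PDF with nothing open is dropped.
            if depth > 0:
                out.append("</bdo>")
                depth -= 1
        else:
            # Escape markup characters, since the input is plain text.
            out.append(html.escape(ch, quote=False))
    # Close any overrides left open at the end of the text.
    out.append("</bdo>" * depth)
    return "".join(out)
```

The reverse direction, and the option to strip directionality altogether,
would need the tokenized document rather than a flat string, so they are
left out of this sketch.
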
5.0 release [To XML and Beyond]
 - Extended HTML capabilities based on namespacing and tag transforms (COMPLEX)
    - Hooks for adding custom processors to custom namespaced tags and
      attributes, offer default implementation
    - Lots of documentation and samples

Ongoing
 - More refactoring to take advantage of PHP5's facilities
 - Lots of profiling, make it faster!
 - Plugins for major CMSes (COMPLEX)
    - phpBB
    - more! (look for ones that use WYSIWYGs)
 - Complete basic smoketests

AutoFormat
 - Smileys
 - Syntax highlighting with <pre> and possibly <?php
 - Look at http://drupal.org/project/Modules/category/63 for ideas

Unknown release (on a scratch-an-itch basis)
 # CHMOD install script for PEAR installs
 ? Have 'lang' attribute be checked against official lists, achieved by
   encoding all characters that have string entity equivalents
 - Abstract ChildDef_BlockQuote to work with all elements that only
   allow blocks in them, required or optional
 - Reorganize Unit Tests
 - Advanced URI filtering schemes (see docs/proposal-new-directives.txt)
 - Implement lenient <ruby> child validation
 - Explain how to use HTML Purifier in non-PHP languages / create
   a simple command line stub (or complicated?)
 - Fixes for Firefox's inability to handle COL alignment props (Bug 915)
 - Automatically add non-breaking spaces to empty table cells when
   empty-cells:show is applied to have compatibility with Internet Explorer
 - Distinguish between default settings and explicitly set settings, so
   configurations can be merged
 - Nested configuration namespaces
 - Allow scoped="scoped" attribute in <style> tags; may be troublesome
   because regular CSS has no way of uniquely identifying nodes, so we'd
   have to generate IDs
 - Time PHPT tests

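The "default vs. explicitly set" item above can be pictured with a small
sketch. This is illustrative Python, not the HTML Purifier configuration
API: the Config class, its methods and the "Demo.*" directive names are all
hypothetical. The point it tries to show is that a merge can only be done
safely if the object remembers which values the user actually set, since a
value explicitly set to its default should still win over another
configuration's explicit value.

```python
class Config:
    """Keeps schema defaults and explicitly set values in separate dicts."""

    def __init__(self, defaults):
        self._defaults = dict(defaults)
        self._explicit = {}

    def set(self, key, value):
        # Only directives known to the schema may be set.
        if key not in self._defaults:
            raise KeyError(key)
        self._explicit[key] = value

    def get(self, key):
        # Explicit values take precedence over schema defaults.
        if key in self._explicit:
            return self._explicit[key]
        return self._defaults[key]

    def is_explicit(self, key):
        return key in self._explicit

    def merge(self, other):
        """Return a new Config; other's explicit values override ours.

        Assumes both configurations share the same defaults.
        """
        merged = Config(self._defaults)
        merged._explicit = {**self._explicit, **other._explicit}
        return merged


# Hypothetical directive names, used only for this example.
defaults = {"Demo.Width": 100, "Demo.Color": "red"}
site = Config(defaults)
site.set("Demo.Width", 200)
page = Config(defaults)
page.set("Demo.Color", "blue")
merged = site.merge(page)
```

Here merged keeps the site's explicit width and the page's explicit color;
comparing values against the defaults alone could not tell an untouched
setting from one deliberately set back to its default.
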
Requested

Wontfix
 - Non-lossy smart alternate character encoding transformations (unless
   patch provided)
 - Pretty-printing HTML: users can run Tidy on the output of the entire page
 - Native content compression, whitespace stripping (don't rely on Tidy, make
   sure we don't remove from <pre> or related tags): use gzip if this is
   really important