Edward Z. Yang
|
0e715bdda6
|
Fix PHP 5.0 bug involving isset and DOM.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@354 48356398-32a2-884e-a903-53898d9a118a
|
2006-09-01 14:44:50 +00:00 |
|
Edward Z. Yang
|
0ac97774d4
|
More refactoring: bundling charset and entity stuff together makes little sense, so new HTMLPurifier/EntityParser.php.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@341 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-30 02:21:39 +00:00 |
|
Edward Z. Yang
|
89376a11e3
|
Remove a huge swath of duplicated function calls by factoring them into a normalize() function. Also made DirectLex's variable names consistent with the rest of the classes.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@340 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-29 20:05:26 +00:00 |
|
Edward Z. Yang
|
1de3088276
|
Refactor encoding- and entity-specific processing into HTMLPurifier_Encoder. The escaping also needs to be refactored into this class.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@339 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-29 19:36:40 +00:00 |
|
Edward Z. Yang
|
7588068b7b
|
Hacky full document parse thingy removed from DOMLex, fixes barfing on full HTML documents.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@328 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-27 22:06:58 +00:00 |
|
Edward Z. Yang
|
973cc43b64
|
Malformed UTF-8 and non-SGML character detection and cleaning implemented
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@303 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-19 17:53:59 +00:00 |
|
Edward Z. Yang
|
53808ee34a
|
Attempt to fix inconsistent DOM behavior regarding insertion of P tags.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@302 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-19 16:24:17 +00:00 |
|
Edward Z. Yang
|
5690c9e0a2
|
Further optimization: execution time reduced from 20% to 12%. Also fixed broken benchmarks.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@266 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-15 21:19:45 +00:00 |
|
Edward Z. Yang
|
acd7ceb940
|
Major optimization on tokenizeDOM(), reduce execution time from 75% to 20% by passing tokens by reference and using a token factory.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@265 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-15 20:19:16 +00:00 |
|
Edward Z. Yang
|
9a35dfa6b9
|
Add support for full document parsing, i.e. discard everything outside the body element, if applicable.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@258 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-15 00:53:24 +00:00 |
|
Edward Z. Yang
|
d7140f2e05
|
Outfit a bunch of other classes so they can accept a configuration object. Put in basic scaffolding for extractBody() functionality.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@257 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-15 00:31:12 +00:00 |
|
Edward Z. Yang
|
299236f695
|
Fix DOM bug where default encoding for HTML docs is not UTF-8.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@252 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-14 13:27:18 +00:00 |
|
Edward Z. Yang
|
b267b0c202
|
Add an attribute entity parse test to Lexer and change PEARSax3 to a proof of concept.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@158 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-04 02:59:15 +00:00 |
|
Edward Z. Yang
|
609977f9f5
|
Add CDATA support to the Lexers, as well as give PEARSax3 entity replacement.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@106 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-23 23:04:34 +00:00 |
|
Edward Z. Yang
|
5ce0ae7056
|
Implement EntityLookup and put it in the Lexer. Some behavior was migrated, since it looks like it will have to be used in all Lexers, not just DirectLex (which is currently the only one that uses it).
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@105 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-23 21:07:30 +00:00 |
|
Edward Z. Yang
|
5a18dad64f
|
Typo, space between comma and word.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@103 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-23 18:57:13 +00:00 |
|
Edward Z. Yang
|
bcc2b09ac7
|
Finish documenting PEARSax3, touch up the other docs. Nuke the original lexer.txt document.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@102 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-23 18:56:00 +00:00 |
|
Edward Z. Yang
|
2fa1161d3d
|
- Implemented special entity conversion.
- Optimized and documented DirectLex.
- Rearranged test cases.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@100 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-23 18:13:04 +00:00 |
|
Edward Z. Yang
|
14f481bcf6
|
svn:eol-style = native
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@97 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-23 00:11:03 +00:00 |
|
Edward Z. Yang
|
ca1aefe271
|
Commit various optimizations to the Lexer, and add stub file for profiling the lexer.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@92 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-22 22:48:07 +00:00 |
|
Edward Z. Yang
|
ac1e62e043
|
Optimize next*() functions in DirectLex, add test for offset. Update Lexer documents.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@90 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-22 18:55:34 +00:00 |
|
Edward Z. Yang
|
1ab3ae160a
|
Move classes into Zend style setup.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@88 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-22 15:38:41 +00:00 |
|