Edward Z. Yang
|
973cc43b64
|
Malformed UTF-8 and non-SGML character detection and cleaning implemented
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@303 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-19 17:53:59 +00:00 |
|
Edward Z. Yang
|
53808ee34a
|
Attempt to fix inconsistent DOM behavior regarding insertion of P tags.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@302 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-19 16:24:17 +00:00 |
|
Edward Z. Yang
|
5690c9e0a2
|
Further optimization: 20% - 12%. Also fixed broken benchmarks.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@266 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-15 21:19:45 +00:00 |
|
Edward Z. Yang
|
acd7ceb940
|
Major optimization on tokenizeDOM(), reduce execution time from 75% to 20% by passing tokens by reference and using a token factory.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@265 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-15 20:19:16 +00:00 |
|
Edward Z. Yang
|
9a35dfa6b9
|
Add support for full document parsing, aka discard everything that's not in-between body if applicable.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@258 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-15 00:53:24 +00:00 |
|
Edward Z. Yang
|
d7140f2e05
|
Outfit a bunch of other classes so they can accept a configuration object. Put in basic scaffolding for extractBody() functionality.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@257 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-15 00:31:12 +00:00 |
|
Edward Z. Yang
|
299236f695
|
Fix DOM bug where default encoding for HTML docs is not UTF-8.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@252 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-14 13:27:18 +00:00 |
|
Edward Z. Yang
|
b267b0c202
|
Add an attribute entity parse test to Lexer and change PEARSax3 to a proof of concept.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@158 48356398-32a2-884e-a903-53898d9a118a
|
2006-08-04 02:59:15 +00:00 |
|
Edward Z. Yang
|
609977f9f5
|
Add CDATA support to the Lexers, as well as give PEARSax3 entity replacement.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@106 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-23 23:04:34 +00:00 |
|
Edward Z. Yang
|
5ce0ae7056
|
Implement EntityLookup and put in the Lexer. Some behavior was migrated, since it looks like it will have to be used in all Lexers, not just DirectLex (which is the only one that uses it).
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@105 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-23 21:07:30 +00:00 |
|
Edward Z. Yang
|
5a18dad64f
|
Typo, space between comma and word.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@103 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-23 18:57:13 +00:00 |
|
Edward Z. Yang
|
bcc2b09ac7
|
Finish documenting PEARSax3, touch up the other docs. Nuke the original lexer.txt document.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@102 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-23 18:56:00 +00:00 |
|
Edward Z. Yang
|
2fa1161d3d
|
- Implemented special entity conversion.
- Optimized and documented DirectLex.
- Rearranged test cases.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@100 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-23 18:13:04 +00:00 |
|
Edward Z. Yang
|
14f481bcf6
|
svn:eol-style = native
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@97 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-23 00:11:03 +00:00 |
|
Edward Z. Yang
|
ca1aefe271
|
Commit various optimizations to the Lexer, and add stub file for profiling the lexer.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@92 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-22 22:48:07 +00:00 |
|
Edward Z. Yang
|
ac1e62e043
|
Optimize next*() functions in DirectLex, add test for offset. Update Lexer documents.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@90 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-22 18:55:34 +00:00 |
|
Edward Z. Yang
|
1ab3ae160a
|
Move classes into Zend style setup.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@88 48356398-32a2-884e-a903-53898d9a118a
|
2006-07-22 15:38:41 +00:00 |
|