Edward Z. Yang
5ce0ae7056
Implement EntityLookup and put in the Lexer. Some behavior was migrated, since it looks like it will have to be used in all Lexers, not just DirectLex (which is the only one that uses it).
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@105 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 21:07:30 +00:00
Edward Z. Yang
7268987846
Replace the HTML entity files with the XHTML files. They're the same, but being valid XML declarations they should be easier to parse.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@104 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 19:43:19 +00:00
Edward Z. Yang
5a18dad64f
Typo, space between comma and word.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@103 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 18:57:13 +00:00
Edward Z. Yang
bcc2b09ac7
Finish documenting PEARSax3, touch up the other docs. Nuke the original lexer.txt document.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@102 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 18:56:00 +00:00
Edward Z. Yang
48cf55eae4
Switch benchmark to relative percents: makes comparisons easier.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@101 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 18:18:11 +00:00
Edward Z. Yang
2fa1161d3d
- Implemented special entity conversion.
...
- Optimized and documented DirectLex.
- Rearranged test cases.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@100 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 18:13:04 +00:00
Edward Z. Yang
dc24215874
Token.php: Document the code.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@99 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 13:20:15 +00:00
Edward Z. Yang
728848c4c7
Begin adding Doxygen documentation.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@98 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 03:43:53 +00:00
Edward Z. Yang
14f481bcf6
svn:eol-style = native
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@97 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 00:11:03 +00:00
Edward Z. Yang
39c16f5cfd
svn:keywords = Id
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@96 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 00:06:34 +00:00
Edward Z. Yang
042eda0463
Hook new test file in and delete duplicated tests.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@95 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 00:03:11 +00:00
Edward Z. Yang
6a5ff6d2a9
Branch the LexerTest in order to partition DirectLex specific unit tests.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@94 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 00:01:11 +00:00
Edward Z. Yang
cf403eb257
Add Lexer test directory.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@93 48356398-32a2-884e-a903-53898d9a118a
2006-07-23 00:00:36 +00:00
Edward Z. Yang
ca1aefe271
Commit various optimizations to the Lexer, and add stub file for profiling the lexer.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@92 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 22:48:07 +00:00
Edward Z. Yang
de5ab5e6a0
Kick the auto-prepend habit.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@91 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 19:01:59 +00:00
Edward Z. Yang
ac1e62e043
Optimize next*() functions in DirectLex, add test for offset. Update Lexer documents.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@90 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 18:55:34 +00:00
Edward Z. Yang
eac83995e1
Remove dud test file.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@89 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 16:04:22 +00:00
Edward Z. Yang
1ab3ae160a
Move classes into Zend style setup.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@88 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 15:38:41 +00:00
Edward Z. Yang
758e70dd94
Add test HTMLPurifier folder.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@87 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 15:27:03 +00:00
Edward Z. Yang
067a38f988
Add HTMLPurifier folder.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@86 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 15:22:48 +00:00
Edward Z. Yang
5da6634d86
We're adopting Zend's library organization, so create a library folder.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@85 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 15:19:40 +00:00
Edward Z. Yang
9180d6256f
Make PHP5 lexer default DOMLex.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@84 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 15:07:53 +00:00
Edward Z. Yang
5bcb3c60cd
Update docs, add lexer.txt
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@83 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 14:57:12 +00:00
Edward Z. Yang
d22140b9a6
Fix benchmarks to work in PHP4 by excluding DOMLex.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@82 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 14:28:51 +00:00
Edward Z. Yang
f0d74a3bf4
Fix parse error in PHP4.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@81 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 14:00:52 +00:00
Edward Z. Yang
4bf3305dff
Build another lexer from PHP5's DOM library. Extremely fast!
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@80 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 13:50:05 +00:00
Edward Z. Yang
3e982c7f2c
Rename Lexer, separate files. Also augmented benchmarks and benchmarker,
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@79 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 12:53:04 +00:00
Edward Z. Yang
e52890276b
Branch out the SAX impl.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@78 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:28:28 +00:00
Edward Z. Yang
49fea8f8e9
Begin separating Lexer implementations by copying the Lexer file to the Lexer directory.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@77 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:26:09 +00:00
Edward Z. Yang
a77a720377
Add Lexer folder to separate Lexer implementations.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@76 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:22:39 +00:00
Edward Z. Yang
cafa24482d
Set up include web. Unit tests should now be able to be run independently.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@75 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:13:08 +00:00
Edward Z. Yang
1d8bfebb3a
Update todo (rm renaming and refactoring)
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@74 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:04:27 +00:00
Edward Z. Yang
025b648c99
Split out code in Definition.php.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@73 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:02:48 +00:00
Edward Z. Yang
f00a80c561
(Test) Branch Definition to ChildDef in order to split out code in the file.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@72 48356398-32a2-884e-a903-53898d9a118a
2006-07-22 00:01:38 +00:00
Edward Z. Yang
6d72d061a9
Branch Definition to AttrDef in order to split out code in the file.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@71 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:58:45 +00:00
Edward Z. Yang
fb4efc52bf
Branch Definition to ChildDef in order to split out code in the file.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@70 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:58:12 +00:00
Edward Z. Yang
404ca68c87
Rename PureHTMLDefinition -> HTMLPurifier_Definition, and internal classes too.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@69 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:38:44 +00:00
Edward Z. Yang
436873e227
Move tester, complete HTML_Purifier -> HTMLPurifier move.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@68 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:27:00 +00:00
Edward Z. Yang
1836b73c63
HTML_Purifier to HTMLPurifier. May want to rethink schematics for this though.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@67 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:13:22 +00:00
Edward Z. Yang
ccebe77e70
Rename HTML_Generator to HTMLPurifier_Generator.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@66 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:10:30 +00:00
Edward Z. Yang
6a6afaccc5
Rename HTML_Lexer to HTMLPurifier_Lexer. However, some more refactoring still needs to be done (namely making the lexer an interface).
...
Also fixed broken benchmarks.
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@65 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 23:07:47 +00:00
Edward Z. Yang
cf4776cfbd
Add a todo list.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@64 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 11:31:43 +00:00
Edward Z. Yang
23dba8b55e
Rename MarkupFragment.php to Token.php, change internal class names and rewire the classes. We also started adding more dependence on the Lexer and Generator in unrelated tests.
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@63 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 11:27:54 +00:00
Edward Z. Yang
8bde230c99
Renamed remotely
...
git-svn-id: http://htmlpurifier.org/svnroot/htmlpurifier/trunk@62 48356398-32a2-884e-a903-53898d9a118a
2006-07-21 02:16:36 +00:00
Edward Z. Yang
0ad09e5d94
Commit entity information from the HTML 4.01 spec. This should be parsed into corresponding mapping files, even though they will be versioned.
...
git-svn-id: http://htmlpurifier.org/svnroot/html_purifier/trunk@61 48356398-32a2-884e-a903-53898d9a118a
2006-07-20 22:47:26 +00:00
Edward Z. Yang
3427c6c079
Begin getting parsing of character data into shape, not done yet.
...
git-svn-id: http://htmlpurifier.org/svnroot/html_purifier/trunk@60 48356398-32a2-884e-a903-53898d9a118a
2006-07-20 02:43:06 +00:00
Edward Z. Yang
dadfa87acc
Update documentation (esp. revamp status)
...
git-svn-id: http://htmlpurifier.org/svnroot/html_purifier/trunk@59 48356398-32a2-884e-a903-53898d9a118a
2006-07-20 00:40:04 +00:00
Edward Z. Yang
ff8f24458d
Finish implementing fixNesting(). Removed security-in-depth check for optimization reasons, since the info array will never cause such a condition.
...
git-svn-id: http://htmlpurifier.org/svnroot/html_purifier/trunk@58 48356398-32a2-884e-a903-53898d9a118a
2006-07-20 00:30:35 +00:00
Edward Z. Yang
3e6bcb7a0f
Implemented good behavior structure for fixNesting. Goes into infinite loop if bad stuff is passed.
...
Remove dud test and note which tests need to be added. Also, we're only running one test at a time to ease debugging.
git-svn-id: http://htmlpurifier.org/svnroot/html_purifier/trunk@57 48356398-32a2-884e-a903-53898d9a118a
2006-07-19 00:21:21 +00:00
Edward Z. Yang
2a2d8cbd76
Optimize files by removing is_a() calls in favor of static type member variables.
...
git-svn-id: http://htmlpurifier.org/svnroot/html_purifier/trunk@56 48356398-32a2-884e-a903-53898d9a118a
2006-07-18 23:40:20 +00:00