| Tuesday 31 October 2006 12:04:37 am 
                                                                 Hi 
>Never easy.>I'd first download the html page, tidy them and parse the content I need to import. Then start >with some kind of xml import (have a look at the contrib).
 Ok - I wrote a php script what convert mambo content in the eZ structures (Folders & Articles) - but mambo articles include very UGLY html content - and I found 3 error during run this script with eZPublisg 3.8.5  1. Fatal error: Call to a member function on a non-object in /srv/www/htdocs/ezportal/kernel/classes/datatypes/ezxmltext/input/ezxmlsimplifiedinputparser.php on line 611 orig: 	if ( $parent->nodeName == 'line' && !count( $parent->Children ) ) I am replace: 	if ( $parent->nodeName == 'line' && !count( $parent->Children ) && is_object($parent->parentNode) ) 
2. Fatal error: Call to a member function on a non-object in /srv/www/htdocs/ezportal/kernel/classes/datatypes/ezxmltext/ezxmlinputparser.php on line 772orig:		function &processSubtree( &$element, &$lastHandlerResult )
 {
 $ret = null;
 $tmp = null;
 
I am replace:	function &processSubtree( &$element, &$lastHandlerResult ){
 $ret = null;
 $tmp = null;
 
 if(!is_object($element)) return $ret;
 but after this patch I got the fatal error - segmentation fault (ugly) Can somebody recommdate me algoritm/soft/etc. what convert HTML in to eZXML ? 
I was trying use tidy form http://tidy.sf.net - but has same result If I use tidy options such as <b>--clean true, --word-2000 true</b> - I got empty content
     |