Amar,
You really need to have a look at the code behind the sites you want to
'scrape'. If they are XHTML compliant, then 'meaningful' XML is only an
XSLT transformation away. If, like most sites, they are NOT, then you'll
need to write some kind of parser to ignore or repair malformed tags.
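For the non-compliant case, here is a rough sketch of the kind of parser meant above, using only Python's standard library. It is an illustration under stated assumptions, not a complete HTML repair tool: the class name TolerantBuilder, the function html_to_xml and the wrapper element "document" are all made up for this example, and the repair strategy (drop stray closing tags, auto-close anything left open) is just one simple choice.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# HTML elements that never take a closing tag.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TolerantBuilder(HTMLParser):
    """Builds a well-nested element tree from possibly malformed HTML.

    Hypothetical helper for illustration only.
    """

    def __init__(self):
        super().__init__(convert_charrefs=True)
        # "document" is an assumed wrapper so the output has a single root.
        self.root = ET.Element("document")
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        # Attributes without a value (e.g. <input disabled>) get an empty string.
        element = ET.SubElement(self.stack[-1], tag,
                                {name: value or "" for name, value in attrs})
        if tag not in VOID_TAGS:
            self.stack.append(element)

    def handle_startendtag(self, tag, attrs):
        # Self-closed tags such as <br/> never stay open.
        ET.SubElement(self.stack[-1], tag,
                      {name: value or "" for name, value in attrs})

    def handle_endtag(self, tag):
        open_tags = [element.tag for element in self.stack[1:]]
        if tag not in open_tags:
            return  # stray closing tag: ignore it
        # Close the nearest matching element and everything opened inside it.
        last = len(open_tags) - 1 - open_tags[::-1].index(tag)
        del self.stack[last + 1:]

    def handle_data(self, data):
        current = self.stack[-1]
        if len(current):
            # Text that follows a child element is stored as that child's tail.
            child = current[-1]
            child.tail = (child.tail or "") + data
        else:
            current.text = (current.text or "") + data


def html_to_xml(html_text):
    """Return a well-formed XML string for the given HTML fragment."""
    builder = TolerantBuilder()
    builder.feed(html_text)
    builder.close()  # flushes any buffered text; unclosed tags are closed implicitly
    return ET.tostring(builder.root, encoding="unicode")


if __name__ == "__main__":
    # Unclosed <b> and <p>, plus a stray </i>.
    print(html_to_xml("<p>Hello <b>world<p>Next</i>"))
```

The result is well-formed but only as 'meaningful' as the original markup. Turning it into something an expert system can use would still need site-specific rules, for example an XSLT stylesheet or XPath queries run over this output.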
Tony
I am about to start designing an expert system.
Most of the knowledge is on the net, and I want to feed that
knowledge into the PC. XML seems the natural choice of format.
What suggestions do folks have for converting HTML data
to semi-meaningful XML or some other format?
Or is this plainly not possible?
Are there any sites out there that stream out XML?
For more information: http://www.automatedhome.co.uk
Post message: ukha_d@xxxxxxx
Subscribe: ukha_d-subscribe@xxxxxxx
Unsubscribe: ukha_d-unsubscribe@xxxxxxx
List owner: ukha_d-owner@xxxxxxx
Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/