XML for Beginner
- By Richard Richi
- Published 03/26/2007
- XML
- Unrated
Richard Richi
View all articles by Richard Richi
I have two credit cards from two different credit unions and a checking bank account. Almost every day I logon to their Web sites to check the account statements and spend 30 to 40 minutes (I do lots of shopping!) consolidating them and see the status as to where I am. Being a smart programmer, I wrote a Visual Basic application which connected to all three Web sites, downloaded account statements in HTML format, parsed them and generated a nice merged report for me. Few days later one credit union announced their new redesigned Web site and my application stopped working immediately - because I was getting data tagged inside totally different HTML. I took one day to update and test the code to parse hundreds of lines of HTML text just to get account data tagged inside HTML page. I was happy again until my bank redesigned their Web site making my application useless one more time.
All the three Web sites have my account data, then why isn't there an easy way to get to it and use it the way I want. I then realized that it's all because of HTML - which is only focussed at presentation (how the page should look) rather than the data structure and content. It would be really great to separate the actual data content from the presentation. The benefits include: easy access to data, easily change the presentation, get data in multiple formats, easy
This is where XML comes into picture. XML, like HTML is a markup language. That said, here also you'll have to deal with tags, but unlike HTML, XML does not have any fixed set of tags (hence is extensible!). One more important difference is that XML is case-sensitive. Hence in an XML document <account>, <Account> and <ACCOUNT> are three different tags. Let's now have a little formal introduction to XML.
Extensible Markup Language (XML) is a powerful mechanism of data exchange. It is an ideal solution for transferring structured data from server-to-client, server-to-server or an application-to-application. The W3C

