From: DOlivastro@ChiResearch.Com Newsgroups: comp.text.sgml Subject: A novice needs help or at least pointers Date: Fri, 30 Apr 1999 20:30:43 GMT Organization: Deja News - The Leader in Internet Discussion Lines: 51 Message-ID: <firstname.lastname@example.org> NNTP-Posting-Host: 184.108.40.206 X-Article-Creation-Date: Fri Apr 30 20:30:43 1999 GMT X-Http-User-Agent: Mozilla/3.0 (Win95; I) X-Http-Proxy: 1.0 x5.dejanews.com:80 (Squid/1.1.22) for client 220.127.116.11
Can anyone help me out? Here is the problem:
A data supplier used to send us data in a proprietary format. I was able to read the file, according to the format, and take out the fields that I needed for further work.
Now, this data supplier has decided to change to SGML. No one here has ever come across this before, so we are, so to speak, SGML-idiots. For example, in the old days, the file said:
Number: 05704062 Author: Olivastro; Dominic
and so on. Now I get something like this:
<ENTDOC> <SDOBI> <B100> <DNUM> <PDAT> 05704062 </PDAT> </DNUM> </B100> <B200> <AUT> <NAM> <FNM> Dominic </FNM> <SNM> Olivastro </SNM> </NAM> </AUT/ </B200>
and so on (and on and on and on). The problem is that I can not dream up any clear programmatic way to extract the data I need.
Are there tools for this? Ideally, I want a program that will just take this file and change it to something like the first file. Any ideas or pointers?
-- Dominic Olivastro mailto: DOlivastro@ChiResearch.com -----------== Posted via Deja News, The Discussion Network ==---------- http://www.dejanews.com/ Search, Read, Discuss, or Start Your Own