Re: What can I parse abiword with?

From: Lars Eighner <abbynormal_at_larseighner.com>
Date: Thu Feb 01 2007 - 11:13:41 CET

Anyone interested in my progress on this project, my "conjured
documentation" of AbiWord, or the development of a usable DTD can follow
any or all of it at <http://larseighner.com/AbbyWriter/> (unless I get a
real typewriter first, in which case I will be doing real work).

On Tue, 30 Jan 2007, Dominic Lachowicz wrote:

> I don't know if this addresses your question or not, but our DTD is
> hopelessly out of date, and trying to validate against it is probably a lost
> cause. However, the ABW format itself is well-formed XML, and any conforming
> XML parser should be able to parse ABW documents.
>
> There's no API for generating documents to speak of. The best advice I can
> give you at the moment is:
>
> 1) Your documents seem simple enough. Make 1 or 2 sample documents using
> AbiWord, and then craft your program to generate those sorts of documents
> using print() or some XML building API.
> 2) If your documents really are plaintext or something similar (maybe
> HTML?), you can use AbiWord to convert that to whatever format you desire.
> If you use AbiWord 2.5, you don't even need the input or output document to
> hit the disk.
>
> Good luck,
> Dom
>
> On 1/30/07, Lars Eighner <abbynormal@larseighner.com> wrote:
>>
>>
>> It seems I cannot parse abiword documents with ONSGMLS.
>>
>> Attributes id and xid are given type ID, but they are all numeric,
>> including the first character. When I add the numeric characters
>> to NAMESTRT in the sgml declaration, ONSGMLS pukes. If I change
>> the (local) awml.dtd so the type is CDATA, of course ONSGMLS will
>> parse it, but then it does not check id and xid for uniqueness.
>>
>> The reason I want to do this is that I want to filter essentially
>> plain-text documents through a Perl script to produce an AbiWord
>> document. (That way I can compose in a text-mode editor such as JOE,
>> since the AbiWord GUI is practically unusable and I have to build
>> AbiWord templates by hand anyway. The AbiWord format is, of course,
>> very brittle, and AbiWord offers no diagnostics when it rejects a
>> document. It would be helpful to parse the candidate documents first
>> to eliminate the obvious errors.)
>>
>> What will parse abiword documents correctly and throw meaningful
>> error messages?
>>
>> --
>> Lars Eighner
>> http://www.larseighner.com/index.html
>> 8800 N IH35 APT 1191 AUSTIN TX 78753-5266
>>
>> -----------------------------------------------
>> To unsubscribe from this list, send a message to
>> abiword-user-request@abisource.com with the word
>> unsubscribe in the message body.
>>
>
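
For what it is worth, the "any conforming XML parser" route can be
sketched without a DTD at all. The following is only a minimal
illustration, written in Python with the standard xml.etree.ElementTree
module rather than anything AbiWord itself provides. The function name
check_abw is made up for this example, and it assumes that id and xid
values are each meant to be unique across the whole document, which is
what the ID type in awml.dtd implies but which I have not confirmed
against AbiWord's actual behaviour.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def check_abw(source):
    """Return a list of human-readable problems found in an ABW document.

    `source` is a filename or an open (binary) file object.
    The name and the checks are illustrative, not part of AbiWord.
    """
    try:
        tree = ET.parse(source)
    except ET.ParseError as err:
        # The parser's message already carries the line and column.
        return [f"not well-formed: {err}"]

    problems = []
    # Assumption: each of these attributes should be unique document-wide.
    for attr in ("id", "xid"):
        counts = Counter(
            elem.get(attr) for elem in tree.iter() if elem.get(attr) is not None
        )
        for value, n in sorted(counts.items()):
            if n > 1:
                problems.append(f'{attr}="{value}" appears {n} times')
    return problems
```

Calling check_abw("draft.abw") on a candidate file (the filename is a
placeholder) returns an empty list when the document is well-formed and
no id or xid value repeats. This only catches well-formedness errors and
duplicate values; it says nothing about whether AbiWord will accept the
element structure.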

-- 
Lars Eighner
http://www.larseighner.com/index.html
8800 N IH35 APT 1191 AUSTIN TX 78753-5266
Received on Thu Feb 1 11:12:31 2007

This archive was generated by hypermail 2.1.8 : Thu Feb 01 2007 - 11:12:31 CET