Lisp HUG Maillist Archive

Microsoft Word Document Import

Hi,

I have an archive of relatively simple MS Word documents and would like
to read them into Lisp, process them and save into a database. Has
somebody implemented an import of an MS Word document? I'm really
thankful for any answer.

Kind Regards,
Lukas


Re: Microsoft Word Document Import

On Mon, 13 Sep 2004 17:46:22 +0100, Lukas Trejtnar <l.trejtnar@open.ac.uk> wrote:

> I have an archive of relatively simple MS Word documents and would
> like to read them into Lisp, process them and save into a
> database. Has somebody implemented an import of an MS Word document?
> I'm really thankful for any answer.

I doubt that anybody has (in Lisp). This is a rather heavy task and
AFAIK the Word format itself is proprietary and not documented by
Microsoft.

One option might be to use an external program/library like wvWare[1].

Another option would be to convert the Word documents to RTF and write
a Lisp parser for that. That should be straightforward, albeit
tedious. I almost did that some months ago but the project fortunately
went in another direction. Apart from the official RTF specification,
this cheap little book is very helpful:

  <http://www.oreilly.com/catalog/rtfpg/index.html>

HTH,
Edi.

[1] <http://wvware.sourceforge.net/>
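
The RTF route suggested above can be sketched roughly as follows. This
is only a minimal illustration, written in Python rather than Lisp for
brevity; the same tokenize-and-track-groups structure should carry over
to a Lisp parser. The names TOKEN, SKIPPED_DESTINATIONS and rtf_to_text
are made up for this example, the list of skipped destinations is an
assumption, and \uN Unicode escapes and code pages other than
Windows-1252 are not handled.

```python
import re

# One token per match: control word, hex escape, control symbol, brace, or text run.
TOKEN = re.compile(
    r"\\([a-zA-Z]+)(-?\d+)? ?"   # control word, optional numeric parameter, optional space
    r"|\\'([0-9a-fA-F]{2})"      # \'hh hex-escaped byte
    r"|\\([^a-zA-Z])"            # control symbol such as \\ \{ \} \*
    r"|([{}])"                   # group delimiters
    r"|([^\\{}\r\n]+)"           # plain text run (raw CR/LF are ignored in RTF)
)

# Assumed set of destinations whose contents are not body text.
SKIPPED_DESTINATIONS = {"fonttbl", "colortbl", "stylesheet", "info", "pict"}


def rtf_to_text(rtf: str) -> str:
    """Extract plain text from a simple RTF string (illustrative only)."""
    out = []
    stack = []        # "skipping" flags saved when a group opens
    skipping = False  # True while inside a destination we do not want
    for match in TOKEN.finditer(rtf):
        word, _param, hexbyte, symbol, brace, text = match.groups()
        if brace == "{":
            stack.append(skipping)
        elif brace == "}":
            # Leaving a group restores the state of the enclosing group.
            skipping = stack.pop() if stack else False
        elif symbol == "*" or word in SKIPPED_DESTINATIONS:
            skipping = True
        elif skipping:
            continue
        elif word in ("par", "line"):
            out.append("\n")
        elif word == "tab":
            out.append("\t")
        elif hexbyte:
            out.append(bytes.fromhex(hexbyte).decode("cp1252", errors="replace"))
        elif symbol in ("\\", "{", "}"):
            out.append(symbol)  # escaped literal character
        elif text:
            out.append(text)
        # Any other control word (formatting such as \b or \f0) is dropped.
    return "".join(out)


sample = r"{\rtf1\ansi{\fonttbl{\f0 Arial;}}\f0 Hello, \b Lisp\b0 !\par Caf\'e9\par}"
print(rtf_to_text(sample))
```

Formatting control words are simply discarded here; a real importer
would probably want to keep at least some of them (headings, tables)
before storing the result in a database.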


Re: Microsoft Word Document Import

On 13.9.2004, at 18:46, Lukas Trejtnar wrote:

> I have an archive of relatively simple MS Word documents and would
> like to read them into Lisp, process them and save into a
> database. Has somebody implemented an import of an MS Word document?
> I'm really thankful for any answer.

It should be relatively easy to connect to Word through its COM
interface by means of the LispWorks COM/Automation API.

Michal



Updated at: 2020-12-10 08:55 UTC