External Parsers for htdig

The following chemical files can be parsed, along with the fields currently recognised.

  1. JCAMP dx spectra files: TITLE , OWNER, DATA TYPE
  2. PDB: REMARK , TITLE, AUTHOR, JRNL, KEYWDS, EXPDTA, COMPND
  3. VRML 1.0: AsciiText WWWAnchor WWWInline; VRML 2: Anchor text viewpoint Worldinfo
The parser is able to cope with Gzipped files. The contents of the pdb, Jcamp and vrml files are indexed with a weighting of 1

Installation Instructions

The parsers are written in Java and so you need a java JDK installed on your system

unzip these files into the htdig bin directory - they should go in to a directory called chemical. The VRML files go into a directory of this name (upper case) alongside chemical

Add the following line to the Ht://dig config file:

external_parsers: chemical/x-pdb "/usr/java/bin/java chemical.Htdigfront" \

chemical/x-jcamp-dx "/usr/java/bin/java chemical.Htdigfront" \

x-world/x-vrml "/usr/java/bin/java chemical.Htdigfront" \

model/x-vrml "/usr/java/bin/java chemical.Htdigfront"

Making sure that they are pointing towards the java binary!

Ht://dig should now be able to cope with the pdb files and jcamp dx files ( if the server has these files under a different mime type these will have to be changed accordingly

Test Set

A successful index should give the following;
New server: www.ch.ic.ac.uk, 80
0:0:0:http://www.ch.ic.ac.uk/rzepa/vrml/: ++++++++++*+--------------------------
-- size = 30232
1:1:1:http://www.ch.ic.ac.uk/rzepa/vrml/3cro2.wrl:  size = 69475
2:2:1:http://www.ch.ic.ac.uk/rzepa/vrml/brickmann/panel1b.wrl:  size = 450
3:3:1:http://www.ch.ic.ac.uk/rzepa/vrml/panel1a.wrl:  size = 124486
4:4:1:http://www.ch.ic.ac.uk/rzepa/vrml/panel1c.wrl:  size = 155777
5:5:1:http://www.ch.ic.ac.uk/rzepa/vrml/panel2/chloro/scatter.wrl:  size = 4731
6:6:1:http://www.ch.ic.ac.uk/rzepa/vrml/panel3.html: +++-* size = 5056
7:7:1:http://www.ch.ic.ac.uk/rzepa/vrml/vib.wrl:  size = 79844
8:8:1:http://www.ch.ic.ac.uk/rzepa/vrml/mauveine.pdb:  size = 7091
9:9:1:http://www.ch.ic.ac.uk/rzepa/vrml/ir.jdx:  size = 48163
10:10:1:http://www.ch.ic.ac.uk/rzepa/vrml/lab.wrl:  size = 19801
11:11:1:http://www.ch.ic.ac.uk/rzepa/vrml/panel3d.wrl:  size = 139391
12:12:2:http://www.ch.ic.ac.uk/rzepa/vrml/panel3b.html:  size = 302
13:13:2:http://www.ch.ic.ac.uk/rzepa/vrml/panel3d.html:  size = 328
14:14:2:http://www.ch.ic.ac.uk/rzepa/vrml/panel3c.html:  size = 294
htmerge: Sorting...
htmerge: Merging...
htmerge: 100:article
htmerge: 200:chemistry
htmerge: 300:data
htmerge: 400:envisaged
htmerge: 500:gracefully
htmerge: 600:interpolated
htmerge: 700:metaphor
htmerge: 800:over
htmerge: 900:provide
htmerge: 1000:scripting
htmerge: 1100:taken
htmerge: 1200:watching

Last updated 4/06/99

For further information please contact Dr H Rzepa