External Parsers
Supported parser types
Parser is any executable program which converts a file of an arbitrary type into TEXT, HTML or XML format.
mnoGoSearch for Windows can use programs able to read data from an input file and send result to an output file:
parser.exe infile outfile
or read data from an input file and send results to STDOUT, so that it is possible to redirect data from STDOUT to an output file:
parser.exe infile > outfile
Configuring parsers
Since version 3.2.24, mnoGoSearch for Windows is distributed with pre-configured parsers for DOC, XLS, RTF and PDF file formats.
For other file types you can try to find mnoGoSearch compatible parsers on the Internet. For example, for postscript files you can use 'ps2ascii.exe', which reads a postscript file and produces a text file.

In order to add a parser for your file type, please follow these steps.
- Download and install your favourite mnoGoSearch compatible program-converter.
- Run mnoGoSearch Indexer.
- Associate your file extension with a mime type at the "Mime types" tab, for example application/postscript - String - Match - Insensitive - *.ps
- Check that your file type is not disallowed at the "Documents" tab, and remove it if it's necessary. For example, remove *.ps extension from the list of disallowed file extensions.
- Add parser: application/postscript - text/plain - C:\PATH_TO_PARSER\ps2ascii.exe $1 $2
- See parser documentation for details. Meta-variables $1 and $2 will be replaced with the input and the output file names respectively during parser execution.
- On the "Servers" tab, add some resource containing PS files, for example: file:///C:/PS_Files/
- Start indexing
- Search
|