This issue tracks how we configure, or intend to configure, the MG4J index for maximum performance in the experiment.
- [Not implemented] Disable positions.
- Disable scoring.
- [Not verified] Use BitStreamHPIndexReader. Actually, we probably want the in-memory variant, InMemoryHPIndex. Check whether it is used by default; right now the code appears to use QuasiSuccinctIndex.
- [Not verified] Use an in-memory index. See the Javadoc for Index.UriKeys.
- [Not verified] Use a wired index.
- No stemming.
- No stop word elimination.
- ??? Disable advanced queries (e.g. near, WAND, phrase).
- ??? Disable forward index storage for titles.
- ??? Disable forward index storage for BM25F scoring information.
- The exporter for the Partitioned Elias-Fano index generates a frequency of 1 for every posting.
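
For the in-memory item, one possible way to request in-memory loading is through an option on the index URI. The sketch below only builds that URI string, so it compiles without MG4J on the classpath. It assumes the option names `inmemory` and `mapped` correspond to the `Index.UriKeys` values in the MG4J version we use (not verified here); the class `IndexUriSketch`, its methods, and the basename `experiment-text` are hypothetical names used for illustration.

```java
// Minimal sketch: build an MG4J-style index URI carrying one load option.
// Assumption: the option names below match Index.UriKeys (lowercased) in our MG4J version.
final class IndexUriSketch {

    /** Returns "basename?key=value"; rejects a basename that already has a query part. */
    static String withOption(String basename, String key, String value) {
        if (basename.isEmpty() || basename.contains("?")) {
            throw new IllegalArgumentException("Expected a plain index basename: " + basename);
        }
        return basename + "?" + key + "=" + value;
    }

    /** URI asking for the index to be loaded fully into memory (assumed option name). */
    static String inMemory(String basename) {
        return withOption(basename, "inmemory", "1");
    }

    /** URI asking for the index to be memory-mapped instead (assumed option name). */
    static String mapped(String basename) {
        return withOption(basename, "mapped", "1");
    }

    public static void main(String[] args) {
        // "experiment-text" is a hypothetical basename.
        String uri = inMemory("experiment-text");
        System.out.println(uri); // prints "experiment-text?inmemory=1"
        // With MG4J available, the string would then be passed to Index.getInstance(uri);
        // that call is left out so this sketch stays self-contained.
    }
}
```

Logging the concrete class of the loaded index would then tell us whether we actually get InMemoryHPIndex or still end up with QuasiSuccinctIndex.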