diff options
author | Luuk Arts | 2017-10-06 12:22:27 +0200 |
---|---|---|
committer | GitHub | 2017-10-06 12:22:27 +0200 |
commit | d1c52ddd65448ebcbcf0eaf8699306ed91ad4e7a (patch) | |
tree | f7990a1345886b89b28d96e6052809cd5675e7e6 /Plan.md | |
parent | Update installation (diff) |
Update Plan.md
Diffstat (limited to 'Plan.md')
-rw-r--r-- | Plan.md | 6 |
1 files changed, 5 insertions, 1 deletions
@@ -43,8 +43,12 @@ Both Indexes are likely implemented by the Nordlys package that we will describe More information about the way it was indexed can be found [here](https://iai-group.github.io/DBpedia-Entity/index_details.html). +### Our Approach Our hypothesis is that not all of the fields are of similar importance. -As such, our idea is to use some kind of Hill-Climbing algorithm to determine just what combination of fields (or possible weights) produces the best output. +As such, our idea is to use some kind of Hill-Climbing algorithm to determine just what combination of fields (or possible weights) produces the best ranking. + +The quality of the ranking is determined by measuring effectiveness in terms of Normalized Discounted Cumulative Gain (NDCG). +We calculate the NDCG@10, NDCG@100 for (each of) the ranking function(s) (BM25, PRMS, MLM, LM), so the hill climbing algorithm can compare the output for the various combinations of fields and so we can analyze if the effectiveness of our output is an improvement over the base rankings described in the DBpedia-Entity repo. ## Nordlys Nordlyss is a toolkit for entity-oriented and semantic search. It currently supports four entity-oriented tasks, which could be useful for our project. These entity-oriented tasks are: |