From 2f8ac2a790326c1092e40ee895829c8ee3a86da2 Mon Sep 17 00:00:00 2001
From: Camil Staps
Date: Fri, 15 Dec 2017 11:52:00 +0100
Subject: Add results

---
 Implementation.md | 49 ++++++++++++++++++++++++++++++++++++++++++++++++-
 1 file changed, 48 insertions(+), 1 deletion(-)

diff --git a/Implementation.md b/Implementation.md
index b60b8b9..ae527d7 100644
--- a/Implementation.md
+++ b/Implementation.md
@@ -187,4 +187,51 @@ This assumes that you have the following files from Nordlys:
 The system is agnostic with regards to the ranking function (BM25 or another
 method).
 
-## Intermediate Result
+## Intermediate Results
+These are the thirty most important fields as found by our measure:
+
+| Field                        |     Score | Used by Nordlys |
+|------------------------------|----------:|:---------------:|
+| ``                           |   2205.50 |     ![][n]      |
+| ``                           |   2246.90 |     ![][n]      |
+| ``                           |   2357.07 |     ![][n]      |
+| ``                           |   2357.35 |     ![][n]      |
+| ``                           |   2384.14 |     ![][n]      |
+| ``                           |   2440.73 |     ![][n]      |
+| ``                           |   2485.96 |     ![][n]      |
+| ``                           |   2689.86 |     ![][n]      |
+| ``                           |   2691.94 |     ![][n]      |
+| ``                           |   2694.41 |     ![][y]      |
+| ``                           |   2698.75 |     ![][n]      |
+| ``                           |   2836.44 |     ![][n]      |
+| ``                           |   3248.58 |     ![][n]      |
+| ``                           |   3425.58 |     ![][n]      |
+| ``                           |   3430.20 |     ![][n]      |
+| ``                           |   4316.34 |     ![][y]      |
+| ``                           |   4470.38 |     ![][n]      |
+| ``                           |   4480.06 |     ![][n]      |
+| ``                           |   4704.52 |     ![][n]      |
+| ``                           |   8040.36 |     ![][n]      |
+| ``                           |  10999.72 |     ![][n]      |
+| ``                           |  22065.46 |     ![][n]      |
+| ``                           |  23442.34 |     ![][n]      |
+| ``                           |  24697.75 |     ![][n]      |
+| ``                           |  25500.42 |     ![][y]      |
+| ``                           |  32860.37 |     ![][y]      |
+| ``                           |  86218.71 |     ![][y]      |
+| ``                           | 105358.89 |     ![][y]      |
+| ``                           | 514446.08 |     ![][n]      |
+| ``                           | 581355.57 |     ![][n]      |
+
+We see that many of the relevant fields are actually [not used by
+Nordlys](https://iai-group.github.io/DBpedia-Entity/index_details.html).
+However, this is not yet an indication that these fields should be added to the
+index. After all, adding more fields means more computation time to build the
+index and to retrieve search results.
+
+In fact, we expect that many of the fields not used actually display
+similarities with fields that *are* indexed. For example, the ``
+field will probably match because the title is repeated in the abstract.
+
+[y]: http://i.stack.imgur.com/iro5J.png
+[n]: http://i.stack.imgur.com/asAya.png
--
cgit v1.2.3