Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • For Instances coded as English, the default behavior is to apply word-stemming in exact phrase searches. Evaluation will occur to determine if this is expected behavior.

Actions required: None

Relevance sorting (UXPROD-4200)

Description: There is a new option to sort Inventory results by relevance. This sorting uses the Okapi BM25 algorithm, which takes into account the following factors:

  • Term Frequency (TF): First, it looks at how often your search words appear in each instance. If an instance has your search words many times, it gets a higher score because it's more likely to be a good match
  • Inverse Document Frequency (IDF): Then, it checks how common or rare your search words are across all the instances in the library. If your words are rare, they get a higher score. If they are common, they get a lower score. This helps give importance to unique words.
  • Document Length (DL): BM25 also considers how long each instance is. If an instance is very long, it might dilute the importance of your search words, so it gets a lower score.
  • Parameter Tuning: BM25 has a few parameters that you can adjust to fine-tune your search. These parameters help you control how much importance you want to give to term frequency, inverse document frequency, and document length.
  • Calculation: Finally, BM25 combines all these factors using a mathematical formula to calculate a score for each instance. The instance with the highest score is considered the best match for your search.

Image Added

View file
nameRelevanceSortDemo.webm
height400

Changes/additions: None

Known limitations: Inability to tune algorithm at this time

Actions required: None

MARC authority

...

Single record automated linking (UXPROD-3874)

...