Search Relevancy 101
Big Data means more than just Volume; it can also show up as Big Variety. So many science datasets are now available that searching for and finding the right one gets harder every day. One solution is to rank search results so that the most relevant datasets appear at the top. Why is that so difficult? Well, dataset relevancy is a little different from the ordinary relevancy rankings one would use for web pages. Dataset versioning, temporal overlap, spatial overlap, and download frequency are all potential signals for surfacing the datasets most likely to be useful to a user.
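
To make the idea concrete, here is a minimal sketch of how the four signals named above might be combined into one ranking score. Everything in it is an assumption made for illustration: the `Dataset` fields, the helper names, the weighted-sum formula, and the weights are hypothetical and do not describe any particular search system.

```python
import math
from dataclasses import dataclass


@dataclass
class Dataset:
    # Hypothetical metadata fields, chosen only for this illustration.
    name: str
    version: int
    latest_version: int
    start_year: float
    end_year: float
    bbox: tuple  # (west, south, east, north) in degrees
    downloads: int


def interval_overlap(d_start, d_end, q_start, q_end):
    """Fraction of the query interval covered by the dataset interval."""
    q_length = q_end - q_start
    if q_length <= 0:
        return 0.0
    covered = min(d_end, q_end) - max(d_start, q_start)
    return max(0.0, covered) / q_length


def bbox_overlap(d_bbox, q_bbox):
    """Fraction of the query box covered by the dataset box.

    Simplification: treats longitude/latitude as a flat plane and
    ignores boxes that cross the antimeridian.
    """
    width = max(0.0, min(d_bbox[2], q_bbox[2]) - max(d_bbox[0], q_bbox[0]))
    height = max(0.0, min(d_bbox[3], q_bbox[3]) - max(d_bbox[1], q_bbox[1]))
    q_area = (q_bbox[2] - q_bbox[0]) * (q_bbox[3] - q_bbox[1])
    return (width * height) / q_area if q_area > 0 else 0.0


# Assumed weights; a real system would tune these against user behavior.
WEIGHTS = {"temporal": 0.35, "spatial": 0.35, "version": 0.2, "popularity": 0.1}


def relevancy(ds, q_start, q_end, q_bbox, max_downloads):
    """Weighted sum of the four signals, each scaled to the range [0, 1]."""
    temporal = interval_overlap(ds.start_year, ds.end_year, q_start, q_end)
    spatial = bbox_overlap(ds.bbox, q_bbox)
    # Favor the newest version of a dataset over superseded ones.
    version = 1.0 if ds.version == ds.latest_version else 0.0
    # Log scaling keeps a few very popular datasets from dominating.
    if max_downloads > 0:
        popularity = math.log1p(ds.downloads) / math.log1p(max_downloads)
    else:
        popularity = 0.0
    return (WEIGHTS["temporal"] * temporal
            + WEIGHTS["spatial"] * spatial
            + WEIGHTS["version"] * version
            + WEIGHTS["popularity"] * popularity)


def rank(datasets, q_start, q_end, q_bbox):
    """Return the datasets sorted from most to least relevant."""
    max_downloads = max((d.downloads for d in datasets), default=0)
    return sorted(
        datasets,
        key=lambda d: relevancy(d, q_start, q_end, q_bbox, max_downloads),
        reverse=True,
    )
```

Calling `rank(datasets, 2005, 2010, (-10, -10, 10, 10))` would order a list of `Dataset` records for a query covering 2005 to 2010 over a small box around the equator. A weighted sum is only one possible way to combine the signals; it is used here because it is easy to inspect and adjust.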