<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Search Nuggets &#187; relevancy tuning</title>
	<atom:link href="http://blog.comperiosearch.com/blog/tag/relevancy-tuning/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.comperiosearch.com</link>
	<description>A blog about Search as THE solution</description>
	<lastBuildDate>Mon, 13 Jun 2016 08:59:45 +0000</lastBuildDate>
	<language>en-US</language>
		<sy:updatePeriod>hourly</sy:updatePeriod>
		<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=3.9.40</generator>
	<item>
		<title>Enterprise Search Optimization (ESO)</title>
		<link>http://blog.comperiosearch.com/blog/2015/01/10/enterprise-search-optimization_eso/</link>
		<comments>http://blog.comperiosearch.com/blog/2015/01/10/enterprise-search-optimization_eso/#comments</comments>
		<pubDate>Sat, 10 Jan 2015 11:45:54 +0000</pubDate>
		<dc:creator><![CDATA[Christoffer Vig]]></dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[English]]></category>
		<category><![CDATA[enterprise search]]></category>
		<category><![CDATA[enterprise search optimization]]></category>
		<category><![CDATA[ESO]]></category>
		<category><![CDATA[findability]]></category>
		<category><![CDATA[relevance]]></category>
		<category><![CDATA[relevancy tuning]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=3153</guid>
		<description><![CDATA[So, you got your enterprise search engine, but still can&#8217;t find what you&#8217;re looking for?  It&#8217;s time to stop your sobbing and learn to play the exciting game of Enterprise Search Optimization (ESO). Enterprise search differs from web search in some fundamental ways. But there are also similarities. Since we all know how successful web search [...]]]></description>
				<content:encoded><![CDATA[<p>So, you got your enterprise search engine, but still can&#8217;t find what you&#8217;re looking for?  It&#8217;s time to stop your sobbing and learn to play the exciting game of Enterprise Search Optimization (ESO).</p>
<p><span id="more-3153"></span></p>
<p><a href="http://en.wikipedia.org/wiki/Enterprise_search">Enterprise search</a> differs from <a href="http://en.wikipedia.org/wiki/Web_search_engine">web search</a> in some fundamental ways. But there are also similarities. Since we all know how successful web search is, let&#8217;s see if there is something to learn by examining the differences.</p>
<p>&nbsp;</p>
<table>
<tbody>
<tr>
<th></th>
<th>Web search</th>
<th> Enterprise search</th>
</tr>
<tr>
<td>Search Engine</td>
<td>Google, Bing, Baidu&#8230;</td>
<td>SharePoint, Elasticsearch, Solr, Virtualworks, Autonomy, GSA&#8230;</td>
</tr>
<tr>
<td>Sources</td>
<td>Web pages, web applications (++)</td>
<td>databases, file shares, intranet, web pages, email, SAP, CRM&#8230;</td>
</tr>
<tr>
<td>Content ambitions</td>
<td>everything</td>
<td>limited</td>
</tr>
<tr>
<td>Authority ranking</td>
<td>Pagerank</td>
<td>custom</td>
</tr>
<tr>
<td>Control of search engine</td>
<td>web search company</td>
<td>tech department, power users</td>
</tr>
<tr>
<td>Control of content</td>
<td>user</td>
<td>user</td>
</tr>
<tr>
<td>Writer/reader ratio</td>
<td>low</td>
<td>high</td>
</tr>
</tbody>
</table>
<p>The most striking similarity is that both solutions involve content produced by a user.</p>
<p>The main differences are the search engine and who controls it, the different types of content sources, and the use of the pagerank algorithm.</p>
<h3><strong>Web content and web search</strong></h3>
<p>What makes web search so successful?  Web search was revolutionized when Google introduced their web search using the pagerank algorithm. Pagerank uses the natural structure of the world wide web, and assigns high weight to pages with many incoming links. It rests on the assumption that pages with correct and important information will be used as references on other pages. Along with pagerank, there is a large number of other factors used to drive relevancy; content quality, keywords, social media sharing etc. Most of the details are not publicly available.</p>
<p>Content publishers on the world wide web can use the tricks of <a href="http://en.wikipedia.org/wiki/Search_engine_optimization">Search Engine Optimization</a> (SEO) to make sure their content gets optimal visibility on the web. SEO is the art of combining knowledge of two things;</p>
<p>- how web search engines work</p>
<p>- what search terms people use.</p>
<p>Both of these areas involve a lot of guessing. Some information can be found in guides such as the <a href="https://support.google.com/webmasters/answer/35769?hl=en&amp;ref_topic=6002025">Google Webmaster Guidelines</a>, which explains what a webmaster can do to make sure her web sites are properly indexed. Parts of this read almost like an instruction on how to create a nice school paper: &#8220;Create a useful, information-rich site, and write pages that clearly and accurately describe your content.&#8221;</p>
<p>By following these guidelines, you are helping web search engines understand your content.</p>
<p><strong>Enterprise content and enterprise search </strong></p>
<p>Enterprise search is a different story. Content is gathered from different sources, with varying degrees of  structure, and mostly without links that could be used for pagerank.</p>
<p>Content publishers in an enterprise search solution are on their own, with no official guidelines describing the rules to follow to win top ranking on the intranet search. More often than not, nobody knows how the enterprise search engine really works. Compare this to the web search situation, and it should not come as a surprise if enterprise search sucks.</p>
<p>A solution to this dilemma requires taking a step back from the idea that enterprise search is a box that you can plug in to your intranet  and &#8220;there was search&#8221;.</p>
<p>Search tech guys often pride themselves in the abilities of their search engines, and will rather fix relevancy problems created by bad content by doing tricks on the technical side of things. On the other side of the story, content producers expect search to &#8220;just work&#8221;, and put all the responsibility on technology and the implementer.</p>
<p>Creating a great enterprise search solution requires cooperation between the makers of content and the makers of search solutions.</p>
<p>Content producers should know how their content will end up in search. They should know what factors affect findability. Search solutions should have documentation targeted towards the end user, which in the enterprise also might be a content producer.</p>
<h3><strong>ESO</strong></h3>
<p>We can define Enterprise Search Optimization (ESO) as the art of improving Enterprise Search. Where ESO has been applied, we should expect to find a well functioning search solution, where employees and content producers know how to create easily findable content.</p>
<p>Compared to SEO, Enterprise search optimization is a simple procedure, involving little  guesswork in regard to figuring out how the search engine works. It is also a difficult procedure, since ESO needs to be individually tailored and optimized for the specific informational needs for each enterprise.  To develop ESO guidelines, the search technicians need to sit down with the content producers and users to figure out the details of the information model and where the pain of missing information hurts the most.</p>
<p>ESO should result in a list of guidelines, or rules, similar to the lists of SEO. These rules can range from simple and obvious, making sure documents have descriptive titles, correct dates and author, to more complex involving consistent language use, metadata fields for categorization, etc. ESO rules should also explain how structure is imposed on data with less structure.</p>
<p>Recognizing authoritative content is solved in web search engines by using the pagerank algorithm. Enterprise search will rarely be able to use pagerank directly. Authority can often be determined by other means. This can range from simple facts like &#8220;This book is the company procedure bible&#8221; to &#8220;powerpoint is more important than word&#8221;.</p>
<h3><strong>Optimizing  enterprise search </strong></h3>
<p>Enterprise search can suck a little less by applying a customized version of SEO.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2015/01/10/enterprise-search-optimization_eso/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Relevance tuning in the search domain. What is it exactly?</title>
		<link>http://blog.comperiosearch.com/blog/2014/03/07/recipe-app-relevance-tuning-what-is-it-exactly/</link>
		<comments>http://blog.comperiosearch.com/blog/2014/03/07/recipe-app-relevance-tuning-what-is-it-exactly/#comments</comments>
		<pubDate>Fri, 07 Mar 2014 16:26:06 +0000</pubDate>
		<dc:creator><![CDATA[Espen Klem]]></dc:creator>
				<category><![CDATA[English]]></category>
		<category><![CDATA[User Experience]]></category>
		<category><![CDATA[content]]></category>
		<category><![CDATA[filters]]></category>
		<category><![CDATA[ideas]]></category>
		<category><![CDATA[recipe app]]></category>
		<category><![CDATA[relevance]]></category>
		<category><![CDATA[relevancy tuning]]></category>
		<category><![CDATA[user experience]]></category>
		<category><![CDATA[user story]]></category>
		<category><![CDATA[ux]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=2032</guid>
		<description><![CDATA[First thing first! Let&#8217;s get rid of the bullshit bingo lingo: &#8220;Relevancy tuning&#8221; in search is a fancy description for something that&#8217;s not very magical, even if it sounds like just that. It&#8217;s about getting the right results on top of your search result. End of story. If somebody asks you a question, you should [...]]]></description>
				<content:encoded><![CDATA[<p>First thing first! Let&#8217;s get rid of the bullshit bingo lingo: &#8220;Relevancy tuning&#8221; in search is a fancy description for something that&#8217;s not very magical, even if it sounds like just that. It&#8217;s about getting the right results on top of your search result. End of story. If somebody asks you a question, you should start by giving that person the most likely answer first. Most search engines seems to be digressing. It&#8217;s because we haven&#8217;t told in a clear manner what to be expected from them. And because we often use generic tools to solve specific problems.</p>
<p>One generic tool for getting the right results on top is the &#8220;<a href="http://en.wikipedia.org/wiki/Tf%E2%80%93idf">term frequency–inverse document frequency</a>&#8220;, or <a href="https://www.google.no/?gfe_rd=ctrl&amp;ei=_NAZU7qjL8-AwAP6_oCgBg&amp;gws_rd=cr#q=tf-idf&amp;safe=off">tf-idf</a> for short. It&#8217;s a combination of how often a term is mentioned in a document compared to how often it&#8217;s mentioned in all of your documents in the index. So, rare terms within the whole index used often in one document makes it a good search result when searching for that term. But most likely, not good enough. You need to figure out what&#8217;s the characteristics of your content, and what are the most characteristic use cases and user stories for your users. Only then can you achieve great relevancy, &#8230;  I mean get the right result on top of your search result.</p>
<h2>Model for relevance tuning</h2>
<p>We&#8217;ll use our Recipe app as an example&#8230;</p>
<p><img class="alignnone" src="http://farm4.staticflickr.com/3122/12989606394_97436f539e_z.jpg" alt="Model for relevance tuning" width="360" height="640" /></p>
<p>So, for our food recipe app, we have some obvious content characteristics:</p>
<ul>
<li>The more ingredients in-season for one recipe is good. We&#8217;re doing an OR-search on all ingredients in-season so this comes out-of-the-box &#8230; almost.</li>
<li>Quite a lot of recipes doesn&#8217;t stand the test of time. We know that most of the recipes at oppskrift.klikk.no from 2008 or newer are quite good and have nice photos.</li>
<li>We&#8217;re not sure if we need this, but we know whom of the writers to trust. This may be an overkill when we already have a boosting on newer recipes.</li>
</ul>
<p>And we know a lot about our users as well:</p>
<ul>
<li>Most grown up people in Norway have a job, thus limited time to prepare a meal. This means that recipes that takes shorter preparations should be boosted from Monday through Thursday. The verdict on Friday is still not decided.</li>
<li>During the weekend people have more time to make dinner. The recipes that takes a short time to prepare most probably cut some corners, and are not that good compared to recipes that takes a little longer time. So for the weekends, we should do a demotion of really quick recipes, at least for dinners.</li>
</ul>
<p>This is the info we&#8217;re going to use to sort our search result. But we have more knowledge about our users that we can use to auto-set filters for certain times of the day:</p>
<ul>
<li>Most work days, people don&#8217;t plan a breakfast meal or lunch. The whole day we can auto-set the main &#8220;course&#8221; filter.</li>
<li>During the weekend, people may also plan a lunch. We&#8217;ve decided to auto-set the &#8220;light meal&#8221;-filter during weekends up until lunch time. After that the &#8220;main course&#8221; filter is auto-set. We&#8217;ll log if the first thing our users do is to set another filter.</li>
<li>On Friday and Saturday a lot of Norwegians drink beer, wine or liquor. After some hours of drinking, they get hungry. Maybe we should have an &#8220;afterparty, quick and greasy and tasty-meal&#8221;-filter auto-set for late Fridays and Saturdays?</li>
</ul>
<p>So, what&#8217;s the filters we&#8217;ve decided on:</p>
<ul>
<li>Light meals</li>
<li>Starters</li>
<li>Main courses</li>
<li>Deserts</li>
<li>&#8230; and maybe the Afterparty-thingy</li>
</ul>
<p>What do you think?</p>
<p><img class="alignnone" src="http://farm4.staticflickr.com/3143/12988485805_d1a693e95c_z.jpg" alt="Model for relevance tuning" width="640" height="360" /></p>
<p>Sounds nice? This is work in progress, so <a href="http://blog.comperiosearch.com/blog/tag/recipe-app/">check back every now and then for new blog posts</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2014/03/07/recipe-app-relevance-tuning-what-is-it-exactly/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Seasonal Food Recipe Web Application</title>
		<link>http://blog.comperiosearch.com/blog/2014/02/03/the-seasonal-food-recipe-web-application/</link>
		<comments>http://blog.comperiosearch.com/blog/2014/02/03/the-seasonal-food-recipe-web-application/#comments</comments>
		<pubDate>Mon, 03 Feb 2014 19:14:42 +0000</pubDate>
		<dc:creator><![CDATA[Espen Klem]]></dc:creator>
				<category><![CDATA[English]]></category>
		<category><![CDATA[User Experience]]></category>
		<category><![CDATA[design pattern]]></category>
		<category><![CDATA[Elasticsearch]]></category>
		<category><![CDATA[forage]]></category>
		<category><![CDATA[Forage Document Processor]]></category>
		<category><![CDATA[Forage Search Enginge]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[recipe app]]></category>
		<category><![CDATA[relevancy]]></category>
		<category><![CDATA[relevancy tuning]]></category>
		<category><![CDATA[relevant results]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[user experience]]></category>
		<category><![CDATA[ux]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=1942</guid>
		<description><![CDATA[So, what&#8217;s this you ask? It&#8217;s a series of mini-hackathons some of us at Comperio are doing to achieve a small list of goals: Learn more about search, both tech and UX. Show that a nice search user experience doesn&#8217;t need a search input box. Show a search that is light weight. Generally, build something [...]]]></description>
				<content:encoded><![CDATA[<p>So, what&#8217;s this you ask? It&#8217;s a series of mini-hackathons some of us at Comperio are doing to achieve a small list of goals:</p>
<ol>
<li>Learn more about search, both tech and UX.</li>
<li>Show that a nice search user experience doesn&#8217;t need a search input box.</li>
<li>Show a search that is light weight.</li>
<li>Generally, build something nice, quickly, to show off.</li>
</ol>
<p>&#8220;But what is it?&#8221; you ask again. The second answer is that it&#8217;s a search application for recipes containing the most in-season vegetables, fruits, mushrooms, fish, shellfish, poultry and meat. So, a search without a search box giving you the recipes that will make you pick the freshest, most tasty food available at any given time of the year. In Norway, you can get strawberries the whole year around, but it&#8217;s only a month or two during the summer that they taste really good. and are cheap. This goes for a lot of different foods.</p>
<p><a href="http://www.flickr.com/photos/eklem/10562070553/sizes/o/in/photolist-h6knAH-hZUTrM/"><img class="alignnone" src="http://farm3.staticflickr.com/2844/10562070553_436b7db420_c.jpg" alt="Crawling data for the seasonal food recipe web application" width="800" height="574" /></a></p>
<p>We&#8217;ll blog every step of the journey. First small hackathon is this Friday, January 7th. To test our ideas we&#8217;ve already crawled <a href="http://oppskrift.klikk.no/">food recipes from Klikk.no</a>, and transformed them into JSON-format by using Forage Document Processor from <a href="http://blog.comperiosearch.com/blog/2013/11/18/get-the-new-version-of-forage-the-search-server-for-node-js/">Fergus McDowell&#8217;s excellent Forage Search Engine</a>.</p>
<p>Next step is to get an Amazon EC2-server up and running, install <a href="http://www.elasticsearch.org/">Elasticsearch</a>, and feed it recipes in JSON-format. We&#8217;ll also make some rough mockups of how the user interface is going to be, and maybe, hopefully get a simple version of the Seasonal Food Recipe Web Application up and running. If we after a while get enough traffic, we&#8217;ll try to do some machine learning. And we&#8217;ll definitely test out different ways of doing <a href="http://blog.comperiosearch.com/blog/2012/11/21/in-relevance-we-trust/">relevancy tuning</a>.</p>
<p>Sounds nice? This is work in progress, so <a href="http://blog.comperiosearch.com/blog/tag/recipe-app/">check back every now and then for new blog posts</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2014/02/03/the-seasonal-food-recipe-web-application/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
