<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Search Nuggets &#187; enterprise search</title>
	<atom:link href="http://blog.comperiosearch.com/blog/tag/enterprise-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.comperiosearch.com</link>
	<description>A blog about Search as THE solution</description>
	<lastBuildDate>Mon, 13 Jun 2016 08:59:45 +0000</lastBuildDate>
	<language>en-US</language>
		<sy:updatePeriod>hourly</sy:updatePeriod>
		<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=3.9.40</generator>
	<item>
		<title>Search: better user experience with one line of JavaScript</title>
		<link>http://blog.comperiosearch.com/blog/2015/05/18/search-better-user-experience-with-one-line-of-javascript/</link>
		<comments>http://blog.comperiosearch.com/blog/2015/05/18/search-better-user-experience-with-one-line-of-javascript/#comments</comments>
		<pubDate>Mon, 18 May 2015 13:59:01 +0000</pubDate>
		<dc:creator><![CDATA[Espen Klem]]></dc:creator>
				<category><![CDATA[English]]></category>
		<category><![CDATA[User Experience]]></category>
		<category><![CDATA[enterprise search]]></category>
		<category><![CDATA[focus]]></category>
		<category><![CDATA[Javascript]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[search box]]></category>
		<category><![CDATA[user experience]]></category>
		<category><![CDATA[ux]]></category>
		<category><![CDATA[website search]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=3680</guid>
		<description><![CDATA[What&#8217;s the cheapest trick you can do to get a better user experience on your search solution, and make your users do better search queries? Add a small line of JavaScript in your template&#8217;s document ready function: [crayon-69e8777d4c7c5320482021/] This will do two things for the user: It&#8217;ll be easier to see the search box . [...]]]></description>
				<content:encoded><![CDATA[<p>What&#8217;s the cheapest trick you can do to get a better user experience on your search solution, and make your users do better search queries?</p>
<p><img class="alignnone wp-image-3683 size-full" src="http://blog.comperiosearch.com/wp-content/uploads/2015/05/search-box.png" alt="Illustration: A standard search box" width="598" height="66" /></p>
<p>Add a small line of JavaScript in your template&#8217;s document ready function:</p><pre class="crayon-plain-tag">$("#MySearchBox").focus();</pre><p>This will do two things for the user:</p>
<ol>
<li>It&#8217;ll be easier to see the search box .</li>
<li>The user can start typing without having to click inside the search box.</li>
</ol>
<p><a href="http://blog.comperiosearch.com/wp-content/uploads/2015/05/anim-just-cursor.gif"><img class="alignnone wp-image-3682 size-full" src="http://blog.comperiosearch.com/wp-content/uploads/2015/05/anim-just-cursor.gif" alt="Illustration: Better user experience ny setting focus on the search box" width="598" height="66" /></a></p>
<p>Next issue is that most intranet and websites are more than just a search solution. Maybe you don&#8217;t want that much attention on the search box on your homepage. The solution is then to do this on your search result page.</p>
<p><a href="http://blog.comperiosearch.com/wp-content/uploads/2015/05/anim-one-word.gif"><img class="alignnone wp-image-3685 size-full" src="http://blog.comperiosearch.com/wp-content/uploads/2015/05/anim-one-word.gif" alt="Illustration: Better user experience ny setting focus on the search box" width="598" height="66" /></a></p>
<p>This will make it easier for your users to enhance their search query when they&#8217;re not happy with the search result at hand.</p>
<p><a href="http://blog.comperiosearch.com/wp-content/uploads/2015/05/anim-more-words.gif"><img class="alignnone wp-image-3684 size-full" src="http://blog.comperiosearch.com/wp-content/uploads/2015/05/anim-more-words.gif" alt="Illustration: Better user experience ny setting focus on the search box" width="598" height="66" /></a></p>
<p>Do you have any other examples on other quick fixes that could make an even better user experience for your search solution?</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2015/05/18/search-better-user-experience-with-one-line-of-javascript/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ny versjon av Comperio FRONT.NET</title>
		<link>http://blog.comperiosearch.com/blog/2015/05/13/ny-versjon-av-comperio-front-net/</link>
		<comments>http://blog.comperiosearch.com/blog/2015/05/13/ny-versjon-av-comperio-front-net/#comments</comments>
		<pubDate>Wed, 13 May 2015 10:24:54 +0000</pubDate>
		<dc:creator><![CDATA[Christoffer Vig]]></dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Comperio Front]]></category>
		<category><![CDATA[enterprise search]]></category>
		<category><![CDATA[logstash]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=3661</guid>
		<description><![CDATA[Comperio har gjennom tidenes løp levert over 100 søkeprosjekter. Tankegods, svette og erfaringer hentet fra dette arbeidet har krystallisert seg inn i vår egentuviklede programvare for søkeapplikasjoner: FRONT. Tidligere i vår lanserte vi versjon 5 av Java-FRONT, denne gang er det den noe yngre fetteren Comperio FRONT.NET som har fått ligge på operasjonsbordet. Hovedtrekkene i [...]]]></description>
				<content:encoded><![CDATA[<p>Comperio har gjennom tidenes løp levert over 100 søkeprosjekter. Tankegods, svette og erfaringer hentet fra dette arbeidet har krystallisert seg inn i vår egentuviklede programvare for søkeapplikasjoner: FRONT. Tidligere i vår lanserte vi versjon 5 av Java-FRONT, denne gang er det den noe yngre fetteren Comperio FRONT.NET som har fått ligge på operasjonsbordet. Hovedtrekkene i den nye versjonen er nye søkeadaptere, forbedret stabilitet og ytelse, samt forbedret logging.<br />
<span id="more-3661"></span></p>
<h4>Mellomvare for søk</h4>
<p>FRONT.NET opererer som mellomvare, og lar deg konfigurere forretningslogikk for søk uavhengig av både søkemotor og presentasjon. FRONT.NET er laget for å kunne hente og sette sammen informasjon fra ulike kilder, og kan gjerne kalles en søkeorkestrator.</p>
<p>FRONT.NET lar deg skille mellom forretningslogikk og applikasjonslogikk. Applikasjoner som trenger søkefunksjonalitet trenger ikke bry seg med kompliserte søkeuttrykk, men sender simpelthen spørreord over til FRONT.NET. Trenger du å avgrense søket, kan du sende med filter, som for eksempel brukerinformasjon, sted, avdeling, eller lignende. De komplekse spørringene tar FRONT seg av.</p>
<h4>Søkemotoruavhengighet</h4>
<p>FRONT.NET tilbyr et generelt format for spørringer, og søkeresultater. Dataformatet fra FRONT er det samme, uavhengig av om motoren i bakkant er SharePoint, ESP, eller Solr. FRONT.NET har i dag adaptere for Fast ESP, SharePoint 2010 og 2013, Elasticsearch, Solr og Google Search Appliance. Dette gjør det enkelt å sette sammen resultater fra ulike søkemotorer. Dersom du ønsker å bytte ut søkemotoren trenger det ikke innebære endringer i din applikasjon, da det kun er snakk om å bytte ut søkeadapter i FRONT.NET. Nye adaptere utvikles så snart vi ser behovet melde seg.</p>
<h4>Elasticsearch adapter</h4>
<p>Elasticsearch er en søkemotor i stor vekst. Til utvikling av Elasticsearch adapteret har vi kunnet dra nytte av NEST, den offisielle .NET klienten for Elasticsearch. Elasticsearch har enorm fleksibilitet i forhold til hvordan spørringer kan uttrykkes, med mulighet for nestede boolske uttrykk og dynamiske ranking-funksjoner. I utvikling av adapteret har vi valgt å minimere kompleksiteten i FRONT ved å delegere disse mulighetene inn i Elasticsearch via søkemaler (search templates). Dette ivaretar fleksibiliteten, samtidig som APIer og programmeringsgrensesnittene er beholdt.</p>
<h4>Google Search Appliance Adapter</h4>
<p>Comperio ble ifjor partner med Google, og vi har nå utviklet FRONT.NET adapter for Googles intranett søkemotor Google Search Appliance, eller bare GSA for kort. GSA tilbyr enkel integrasjon mot en rekke ulike kilder, søkegrensesnittet er enkelt og forholde seg til og adapteret har støtte for alle vanlige søkeoperasjoner.</p>
<h4>Logging</h4>
<p>For å kunne utvikle en god søkeløsning er det avgjørende at man har tilgang til gode søkelogger som avslører hvordan søkeapplikasjon brukes.<br />
FRONT.NET har nylig fått funksjonalitet for å kunne logge direkte til Logstash. Logstash kombinert med Elasticsearch og Kibana gir deg et kraftig verktøy for dataanalyse.</p>
<h4>FRONTD</h4>
<p>Versjon 5 av FRONT.NET kjører som en frittstående tjeneste i Windows.<br />
Tidligere versjoner opererte som web applikasjon under IIS (Internet information server), men vi ser at når vi kjører frittstående oppnås forenklet administrasjon, samt forbedret stabilitet og ytelse.</p>
<h4>Microsoft, .NET og veien videre</h4>
<p>Microsoft og .NET verdenen er under rivende utvikling for tiden, ikke minst gjennom Microsoft sin nye og kjært velkomne åpning mot open source. Vi liker veldig godt ideen om kryssplattform .NET, og neste versjon av FRONT.NET vil forhåpentligvis kjøre like bra på OS X og Linux som på Microsoft.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2015/05/13/ny-versjon-av-comperio-front-net/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Public 360 connector for SharePoint søk</title>
		<link>http://blog.comperiosearch.com/blog/2015/02/09/public-360-konnektor-sharepoint-sok/</link>
		<comments>http://blog.comperiosearch.com/blog/2015/02/09/public-360-konnektor-sharepoint-sok/#comments</comments>
		<pubDate>Mon, 09 Feb 2015 12:37:50 +0000</pubDate>
		<dc:creator><![CDATA[Pål Christian Kjølberg]]></dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Business 360]]></category>
		<category><![CDATA[connector]]></category>
		<category><![CDATA[enterprise search]]></category>
		<category><![CDATA[Kildeintegrasjon]]></category>
		<category><![CDATA[konnektor]]></category>
		<category><![CDATA[P 360]]></category>
		<category><![CDATA[p360]]></category>
		<category><![CDATA[Public 360]]></category>
		<category><![CDATA[sharepoint 2010]]></category>
		<category><![CDATA[SharePoint 2013]]></category>
		<category><![CDATA[SharePoint 2016]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=3311</guid>
		<description><![CDATA[Software Innovation sine systemer for dokumenthåndtering, Public 360 og Business 360, er brukt av mange offentlige og private aktører både i Norge og Sverige. Comperio gjør det enkelt å finne riktig dokument selv om du ikke er godt kjent med Public 360. Ved å gjøre innholdet i Public 360 tilgjengelig i en søkeindeks kan man [...]]]></description>
				<content:encoded><![CDATA[<p><a title="Software Innovation Public 360" href="http://www.software-innovation.com/no/produkter/360offentlig/pages/default.aspx">Software Innovation</a> sine systemer for dokumenthåndtering, Public 360 og Business 360, er brukt av mange offentlige og private aktører både i Norge og Sverige.</p>
<p>Comperio gjør det enkelt å finne riktig dokument selv om du ikke er godt kjent med Public 360. Ved å gjøre innholdet i Public 360 tilgjengelig i en søkeindeks kan man få søketreff på tvers av kilder. SharePoint 2013 tilbyr en slik søkeindeks. Denne vil kunne gi presise treff fordi spørringen henter treff basert på data både fra dokumentet og metadata til dokumentet. På den måten kan man utforske hva som finnes av innhold og historikk knyttet til et tema eller en sak.<br />
For å få tilgang på dokumentene i Public 360 må man koble kildesystemet sammen med søkemotoren med en konnektor.</p>
<p>Comperio laget i første omgang en konnektor for å kunne indeksere innhold fra Public 360 basert på Fast Search Server for SharePoint 2010. Siden har vi laget en ny versjon for Microsoft SharePoint 2013.<br />
Når vi laget konnektoren så bruke vi Microsofts rammeverk for å lage tilpassede BCS (Business Connectivity Services) konnektorer til SharePoint 2013.</p>
<h2>Så, hvordan fungerer konnektoren for indeksering av innhold?</h2>
<p>FileFinder web servicen er endepunktet i Public 360, og det brukes til å hente Public 360 dokumenter. Fra denne web servicen får vi all informasjon om hvert dokument, som egenskaper, metadata og rettighetsinformasjon. Videre blir hvert dokument og dets egenskaper plukket opp av SharePoint 2013, hvor dokumentet crawles, prosesseres og lagres i søkeindeksen.</p>
<h2>Og hvordan fungerer konnektoren ved søk utført av en bruker?</h2>
<p>Når en bruker gjør et søk i søkeboksen, så legges det til hva slags tilganger brukeren har på spørringen. Disse tilgangene hentes fra Public 360 ved hjelp av en SharePoint security trimmer som settes opp for Public 360. Dette fører til at kun dokumenter som brukeren har rettigheter til blir returnert som en del av søkeresultatet.</p>
<p><a title="Ta kontakt for å høre mer om Public 360 og SharePoint" href="http://www.comperio.no/kontakt-comperio/">Ta kontakt for å høre mer om Public 360 og SharePoint.</a></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2015/02/09/public-360-konnektor-sharepoint-sok/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Enterprise Search Optimization (ESO)</title>
		<link>http://blog.comperiosearch.com/blog/2015/01/10/enterprise-search-optimization_eso/</link>
		<comments>http://blog.comperiosearch.com/blog/2015/01/10/enterprise-search-optimization_eso/#comments</comments>
		<pubDate>Sat, 10 Jan 2015 11:45:54 +0000</pubDate>
		<dc:creator><![CDATA[Christoffer Vig]]></dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[English]]></category>
		<category><![CDATA[enterprise search]]></category>
		<category><![CDATA[enterprise search optimization]]></category>
		<category><![CDATA[ESO]]></category>
		<category><![CDATA[findability]]></category>
		<category><![CDATA[relevance]]></category>
		<category><![CDATA[relevancy tuning]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=3153</guid>
		<description><![CDATA[So, you got your enterprise search engine, but still can&#8217;t find what you&#8217;re looking for?  It&#8217;s time to stop your sobbing and learn to play the exciting game of Enterprise Search Optimization (ESO). Enterprise search differs from web search in some fundamental ways. But there are also similarities. Since we all know how successful web search [...]]]></description>
				<content:encoded><![CDATA[<p>So, you got your enterprise search engine, but still can&#8217;t find what you&#8217;re looking for?  It&#8217;s time to stop your sobbing and learn to play the exciting game of Enterprise Search Optimization (ESO).</p>
<p><span id="more-3153"></span></p>
<p><a href="http://en.wikipedia.org/wiki/Enterprise_search">Enterprise search</a> differs from <a href="http://en.wikipedia.org/wiki/Web_search_engine">web search</a> in some fundamental ways. But there are also similarities. Since we all know how successful web search is, let&#8217;s see if there is something to learn by examining the differences.</p>
<p>&nbsp;</p>
<table>
<tbody>
<tr>
<th></th>
<th>Web search</th>
<th> Enterprise search</th>
</tr>
<tr>
<td>Search Engine</td>
<td>Google, Bing, Baidu&#8230;</td>
<td>SharePoint, Elasticsearch, Solr, Virtualworks, Autonomy, GSA&#8230;</td>
</tr>
<tr>
<td>Sources</td>
<td>Web pages, web applications (++)</td>
<td>databases, file shares, intranet, web pages, email, SAP, CRM&#8230;</td>
</tr>
<tr>
<td>Content ambitions</td>
<td>everything</td>
<td>limited</td>
</tr>
<tr>
<td>Authority ranking</td>
<td>Pagerank</td>
<td>custom</td>
</tr>
<tr>
<td>Control of search engine</td>
<td>web search company</td>
<td>tech department, power users</td>
</tr>
<tr>
<td>Control of content</td>
<td>user</td>
<td>user</td>
</tr>
<tr>
<td>Writer/reader ratio</td>
<td>low</td>
<td>high</td>
</tr>
</tbody>
</table>
<p>The most striking similarity is that both solutions involve content produced by a user.</p>
<p>The main differences are the search engine and who controls it, the different types of content sources, and the use of the pagerank algorithm.</p>
<h3><strong>Web content and web search</strong></h3>
<p>What makes web search so successful?  Web search was revolutionized when Google introduced their web search using the pagerank algorithm. Pagerank uses the natural structure of the world wide web, and assigns high weight to pages with many incoming links. It rests on the assumption that pages with correct and important information will be used as references on other pages. Along with pagerank, there is a large number of other factors used to drive relevancy; content quality, keywords, social media sharing etc. Most of the details are not publicly available.</p>
<p>Content publishers on the world wide web can use the tricks of <a href="http://en.wikipedia.org/wiki/Search_engine_optimization">Search Engine Optimization</a> (SEO) to make sure their content gets optimal visibility on the web. SEO is the art of combining knowledge of two things;</p>
<p>- how web search engines work</p>
<p>- what search terms people use.</p>
<p>Both of these areas involve a lot of guessing. Some information can be found in guides such as the <a href="https://support.google.com/webmasters/answer/35769?hl=en&amp;ref_topic=6002025">Google Webmaster Guidelines</a>, which explains what a webmaster can do to make sure her web sites are properly indexed. Parts of this read almost like an instruction on how to create a nice school paper: &#8220;Create a useful, information-rich site, and write pages that clearly and accurately describe your content.&#8221;</p>
<p>By following these guidelines, you are helping web search engines understand your content.</p>
<p><strong>Enterprise content and enterprise search </strong></p>
<p>Enterprise search is a different story. Content is gathered from different sources, with varying degrees of  structure, and mostly without links that could be used for pagerank.</p>
<p>Content publishers in an enterprise search solution are on their own, with no official guidelines describing the rules to follow to win top ranking on the intranet search. More often than not, nobody knows how the enterprise search engine really works. Compare this to the web search situation, and it should not come as a surprise if enterprise search sucks.</p>
<p>A solution to this dilemma requires taking a step back from the idea that enterprise search is a box that you can plug in to your intranet  and &#8220;there was search&#8221;.</p>
<p>Search tech guys often pride themselves in the abilities of their search engines, and will rather fix relevancy problems created by bad content by doing tricks on the technical side of things. On the other side of the story, content producers expect search to &#8220;just work&#8221;, and put all the responsibility on technology and the implementer.</p>
<p>Creating a great enterprise search solution requires cooperation between the makers of content and the makers of search solutions.</p>
<p>Content producers should know how their content will end up in search. They should know what factors affect findability. Search solutions should have documentation targeted towards the end user, which in the enterprise also might be a content producer.</p>
<h3><strong>ESO</strong></h3>
<p>We can define Enterprise Search Optimization (ESO) as the art of improving Enterprise Search. Where ESO has been applied, we should expect to find a well functioning search solution, where employees and content producers know how to create easily findable content.</p>
<p>Compared to SEO, Enterprise search optimization is a simple procedure, involving little  guesswork in regard to figuring out how the search engine works. It is also a difficult procedure, since ESO needs to be individually tailored and optimized for the specific informational needs for each enterprise.  To develop ESO guidelines, the search technicians need to sit down with the content producers and users to figure out the details of the information model and where the pain of missing information hurts the most.</p>
<p>ESO should result in a list of guidelines, or rules, similar to the lists of SEO. These rules can range from simple and obvious, making sure documents have descriptive titles, correct dates and author, to more complex involving consistent language use, metadata fields for categorization, etc. ESO rules should also explain how structure is imposed on data with less structure.</p>
<p>Recognizing authoritative content is solved in web search engines by using the pagerank algorithm. Enterprise search will rarely be able to use pagerank directly. Authority can often be determined by other means. This can range from simple facts like &#8220;This book is the company procedure bible&#8221; to &#8220;powerpoint is more important than word&#8221;.</p>
<h3><strong>Optimizing  enterprise search </strong></h3>
<p>Enterprise search can suck a little less by applying a customized version of SEO.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2015/01/10/enterprise-search-optimization_eso/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>ComperioFrokost 11. juni</title>
		<link>http://blog.comperiosearch.com/blog/2014/06/11/comperiofrokost-11-juni/</link>
		<comments>http://blog.comperiosearch.com/blog/2014/06/11/comperiofrokost-11-juni/#comments</comments>
		<pubDate>Wed, 11 Jun 2014 15:43:10 +0000</pubDate>
		<dc:creator><![CDATA[Trond Renshusløkken]]></dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Bjørn Olstad]]></category>
		<category><![CDATA[Breakfast]]></category>
		<category><![CDATA[ComperioFrokost]]></category>
		<category><![CDATA[Elasticsearch]]></category>
		<category><![CDATA[enterprise search]]></category>
		<category><![CDATA[Frokostseminar]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[Seminar]]></category>
		<category><![CDATA[The Thief]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=2501</guid>
		<description><![CDATA[Takk til alle 55 deltakerne på vårt frokostseminar på THE THIEF onsdag morgen! Noen utvalgte bilder fra arrangementet:]]></description>
				<content:encoded><![CDATA[<p>Takk til alle 55 deltakerne på vårt frokostseminar på THE THIEF onsdag morgen!</p>
<h3>Noen utvalgte bilder fra arrangementet:</h3>
<p><img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/01.JPG" alt="" width="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/02.JPG" alt="" width="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/03.JPG" alt="" width="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/04.JPG" alt="" width="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/05.JPG" alt="" width="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/06.JPG" alt="" width="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/07.JPG" alt="" width="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/08.JPG" alt="" height="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/09.JPG" alt="" height="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/10.JPG" alt="" width="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/11.JPG" alt="" width="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/12.JPG" alt="" width="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/13.JPG" alt="" width="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/14.JPG" alt="" width="600" /><br />
<img src="http://blog.comperiosearch.com/wp-content/uploads/2014/06/frokost/15.JPG" alt="" width="600" /></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2014/06/11/comperiofrokost-11-juni/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Comperio Search London Breakfast Seminar 14th March</title>
		<link>http://blog.comperiosearch.com/blog/2014/03/17/comperio-search-breakfast-seminar-14th-march/</link>
		<comments>http://blog.comperiosearch.com/blog/2014/03/17/comperio-search-breakfast-seminar-14th-march/#comments</comments>
		<pubDate>Mon, 17 Mar 2014 13:07:27 +0000</pubDate>
		<dc:creator><![CDATA[Job Maelane]]></dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Breakfast]]></category>
		<category><![CDATA[business productivity]]></category>
		<category><![CDATA[collaboration]]></category>
		<category><![CDATA[enterprise search]]></category>
		<category><![CDATA[enterprise search strategy]]></category>
		<category><![CDATA[enterprise social networks]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[search driven applications]]></category>
		<category><![CDATA[search driven apps]]></category>
		<category><![CDATA[search seminar]]></category>
		<category><![CDATA[SharePoint 2013]]></category>
		<category><![CDATA[user experience]]></category>
		<category><![CDATA[Yammer]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=2102</guid>
		<description><![CDATA[Comperio Search would like to thank all the guests who attended the Breakfast seminar last Friday, the 14th of March 2014 in London. Download the presentations: Leveraging Search Based Applications on new Intranet (David Wright, Lead SharePoint Architect, Coutts) Searching Internally Using Enterprise Social Networks (Zane Freame, Technical Sales Professional, Microsoft UK) Enterprise Search Strategy [...]]]></description>
				<content:encoded><![CDATA[<div class="mceTemp">
<div class="mceTemp">
<div class="mceTemp">
<div>
<div class="mceTemp">Comperio Search would like to thank all the guests who attended the Breakfast seminar last Friday, the 14th of March 2014 in London.</div>
</div>
</div>
</div>
</div>
<p>Download the presentations:</p>
<ul>
<li>Leveraging Search Based Applications on new Intranet<br />
(David Wright, Lead SharePoint Architect, Coutts)</li>
<li><a href="http://blog.comperiosearch.com/wp-content/uploads/2014/03/Searching-Internally-Using-Enterprise-Social-Networks.pdf">Searching Internally Using Enterprise Social Networks</a><br />
(Zane Freame, Technical Sales Professional, Microsoft UK)</li>
<li><a href="http://blog.comperiosearch.com/wp-content/uploads/2014/03/Enterprise-Search-Strategy-by-Comperio-Search.pdf">Enterprise Search Strategy by Comperio Search</a><br />
(Job Maelane, John Thompson, Comperio Search)</li>
</ul>
<p>Some pictures from the event&#8230;</p>
<div id="attachment_2111" style="width: 235px" class="wp-caption alignnone"><a href="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_0413-e1395071629414.jpg"><img class="size-medium wp-image-2111" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_0413-e1395071629414-225x300.jpg" alt="" width="225" height="300" /></a><p class="wp-caption-text">Comperio Search welcomes its guests... </p></div>
<div id="attachment_2112" style="width: 310px" class="wp-caption alignnone"><a href="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_0437.jpg"><img class="size-medium wp-image-2112" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_0437-300x225.jpg" alt="" width="300" height="225" /></a><p class="wp-caption-text">Zane Freame, Technical Sales Professional, Microsoft UK</p></div>
<div id="attachment_2117" style="width: 310px" class="wp-caption alignnone"><a href="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_0429.jpg"><img class=" wp-image-2117" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_0429-300x225.jpg" alt="" width="300" height="225" /></a><p class="wp-caption-text">David Wright, Lead SharePoint Architect, Coutts</p></div>
<p><a href="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_0417.jpg"><img class="alignnone size-medium wp-image-2113" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_0417-300x225.jpg" alt="" width="300" height="225" /></a></p>
<div id="attachment_2114" style="width: 310px" class="wp-caption alignnone"><a href="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_0460.jpg"><img class=" wp-image-2114" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_0460-300x225.jpg" alt="" width="300" height="225" /></a><p class="wp-caption-text">Trond Renshuslokken, Comperio CEO</p></div>
<div class="mceTemp"><a href="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_0439.jpg"><img class="alignnone  wp-image-2115" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_0439-300x225.jpg" alt="" width="300" height="225" /></a></div>
<div class="mceTemp"></div>
<div class="mceTemp" style="text-align: left"><a href="http://twitter.com/comperiosearch">Follow us on Twitter so you do not miss the next breakfast seminar.</a></div>
<div class="mceTemp" style="text-align: left"></div>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2014/03/17/comperio-search-breakfast-seminar-14th-march/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ComperioFrokost 12. mars</title>
		<link>http://blog.comperiosearch.com/blog/2014/03/13/comperiofrokost-12-mars/</link>
		<comments>http://blog.comperiosearch.com/blog/2014/03/13/comperiofrokost-12-mars/#comments</comments>
		<pubDate>Thu, 13 Mar 2014 06:28:25 +0000</pubDate>
		<dc:creator><![CDATA[Trond Renshusløkken]]></dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Breakfast]]></category>
		<category><![CDATA[ComperioFrokost]]></category>
		<category><![CDATA[Continental]]></category>
		<category><![CDATA[Elasticsearch]]></category>
		<category><![CDATA[enterprise search]]></category>
		<category><![CDATA[Frokostseminar]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[Seminar]]></category>
		<category><![CDATA[user experience]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=2055</guid>
		<description><![CDATA[Takk til alle 65 deltakerne på vårt frokostseminar på Continental onsdag morgen.
Last ned presentasjonene og se noen utvalgte bilder fra arrangementet.]]></description>
				<content:encoded><![CDATA[<p>Takk til alle 65 deltakerne på vårt frokostseminar på Continental onsdag morgen!</p>
<h3>Last ned presentasjonene:</h3>
<ul>
<li>
<div>
<div><strong><a href="/wp-content/uploads/2014/03/frokost120314_comperio.pdf">Søkemarkedet i 2014</a> </strong>(PDF)</div>
<div>Trond Renshusløkken og Espen Klem, Comperio</div>
</div>
</li>
<li>
<div>
<div><strong><a href="/wp-content/uploads/2014/03/frokost120314_gyldendal.pdf">Bruk av søketeknologi i rettsdata.no</a> </strong>(PDF)</div>
<div>Kristian Skeie, Forlagsdirektør, Gyldendal Rettsdata</div>
</div>
</li>
<li><strong><a href="/wp-content/uploads/2014/03/frokost120314_posten.pdf">Om Postens Tilbudssøk.no</a></strong> (PDF)
<div>
<div>Live Marstein, Prosjektleder, Posten Norge</div>
</div>
</li>
<li>
<div>
<div><strong>Hvordan kan du være innovativ i din virksomhet med Google-teknologi?</strong> (kommer)</div>
<div>Christopher Conradi, Enterprise Sales Manager, Google</div>
</div>
</li>
</ul>
<p>&nbsp;</p>
<h3>Noen utvalgte bilder fra arrangementet:</h3>
<p><img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5323.jpg" alt="" width="600" /><br />
<img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5337.jpg" alt="" width="600" /><br />
Takket være oppussing på Continental fikk alle frokostdeltakerne starte dagen med litt trappetrim.</p>
<p><img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5324.jpg" alt="" width="600" /><br />
<img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5333.jpg" alt="" width="600" /><br />
<img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5335.jpg" alt="" width="350" /><br />
<img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5339.jpg" alt="" width="600" /><br />
Daglig leder i Comperio Trond Renshusløkken ønsker velkommen.</p>
<p><img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5340.jpg" alt="" width="600" /><br />
Trond Renshusløkken, Comperio</p>
<p><img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5346.jpg" alt="" width="600" /><br />
Espen Klem, Comperio</p>
<p><img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5347.jpg" alt="" width="600" /><br />
Kristian Skeie, Gyldendal Rettsdata</p>
<p><img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5351.jpg" alt="" width="600" /><br />
<img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5357.jpg" alt="" width="600" /><br />
Live Marstein, Bring/Posten</p>
<p><img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5361.jpg" alt="" width="350" /><br />
<img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5363.jpg" alt="" width="600" /><br />
<img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5364.jpg" alt="" width="600" /><br />
Christopher Conradi</p>
<p><img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5367.jpg" alt="" width="600" /><br />
<img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5368.jpg" alt="" width="350" /><br />
<img title="Velkommen" src="http://blog.comperiosearch.com/wp-content/uploads/2014/03/IMG_5369.jpg" alt="" width="600" /></p>
<p><a href="http://twitter.com/comperiosearch">Følg oss på Twitter så du ikke går glipp av neste frokostseminar.</a></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2014/03/13/comperiofrokost-12-mars/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Dynamic search ranking using Elasticsearch, Neo4j and Piwik</title>
		<link>http://blog.comperiosearch.com/blog/2014/02/05/dynamic-search-ranking-using-elasticsearch-neo4j-and-piwik/</link>
		<comments>http://blog.comperiosearch.com/blog/2014/02/05/dynamic-search-ranking-using-elasticsearch-neo4j-and-piwik/#comments</comments>
		<pubDate>Wed, 05 Feb 2014 14:49:52 +0000</pubDate>
		<dc:creator><![CDATA[Christian Rieck]]></dc:creator>
				<category><![CDATA[English]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[dynamic rank tuning]]></category>
		<category><![CDATA[dynamic search ranking]]></category>
		<category><![CDATA[Elasticsearch]]></category>
		<category><![CDATA[enterprise search]]></category>
		<category><![CDATA[fast]]></category>
		<category><![CDATA[neo4j]]></category>
		<category><![CDATA[Piwik]]></category>
		<category><![CDATA[rank tuning]]></category>
		<category><![CDATA[ranking]]></category>
		<category><![CDATA[search ranking]]></category>
		<category><![CDATA[Solr]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=1957</guid>
		<description><![CDATA[Getting the correct result at the top of your search results isn’t easy. Anyone working within search quickly realizes this. Tuning the underlying ranking model is a job that just doesn’t end. There is an entire profession about search engine optimization, making sure your site gets as high as possible on Google (and Bing, I [...]]]></description>
				<content:encoded><![CDATA[<div>
<p>Getting the correct result at the top of your search results isn’t easy. Anyone working within search quickly realizes this. Tuning the underlying ranking model is a job that just doesn’t end. There is an entire profession about search engine optimization, making sure your site gets as high as possible on Google (and Bing, I guess). If it is not the top result on Google, it is somehow your fault and not Google&#8217;s.<span id="more-1957"></span></p>
</div>
<div>
<p><strong>Nobody optimizes for an internal enterprise search solution</strong></p>
</div>
<div>
<p>If your document is not the top result in the internal search solution it is somehow the search engine&#8217;s fault, not yours. There is no link cardinality on a file system. All the metadata is wrong and the document your user is trying to find doesn’t even contain the words the user remembers it to contain; the end result being that the target document is not found. As a result of this, trust in the enterprise search diminishes and soon you are left without users. Let’s see how we can use <a title="Piwik" href="http://piwik.org">Piwik</a>, <a title="neo4j" href="http://www.neo4j.org">neo4j </a>and <a title="Elasticsearch" href="http://www.elasticsearch.org">Elasticsearch </a>to remedy this. (Yes, you can use <a title="Solr" href="http://lucene.apache.org/solr/">Solr</a> if you want).</p>
</div>
<div>
<p>This post is made up of three parts. First I’ll talk about gathering the data necessary. Then we’ll tackle getting the ‘right’ documents at the top of your search and lastly we’ll see if we can expand documents with words your users recalls  them by, but are not part of the documents themselves. The journey will be based on the work performed on Comperio’s internal search, at the moment implemented on an old Fast ESP installation.</p>
</div>
<div>
<p><strong>Gathering data</strong></p>
</div>
<div>
<p>First you need to know what your users are searching for and what they end up clicking on. We use Piwik, an open source web analytics platform, for this. Seeing the searches, modifications to the searches and if they ended up clicking on anything that they thought was exciting. For a while we only used this for statistics since Piwik offered better insight than the built in query statistics in Fast ESP. Here is an example of one search session:</p>
</div>
<div></div>
<div>
<p><a href="http://blog.comperiosearch.com/wp-content/uploads/2014/02/piwik.png"><img class="alignnone size-full wp-image-1968" src="http://blog.comperiosearch.com/wp-content/uploads/2014/02/piwik.png" alt="" width="411" height="179" /></a></p>
<p>We see a user entering the site, querying ‘rank order words’ and clicking on a document. Then the same search is executed again. It is reasonable to conclude the clicked document did not contain the wanted information. Lastly ‘boost position term’ is searched. Sadly the session does not end with a click so I guess our search couldn’t deliver. :( [1]</p>
</div>
<div>
<p>In their current form, the statistics aren’t very useful. But what were to happen if we took these chains of activities and created a graph? We used neo4j for this. A small Java program was written to download the Piwik-history as an XML-file and insert it into a newly created neo4j database.</p>
</div>
<div>
<p>The nodes are either the start of a session, a search or a document. They are linked by relationships such as CLICKED, SEARCHED, RETURNED_FROM.  Since a neo4j database isn’t very screen shot friendly, here is a part of the graph as rendered by <a title="Gephi" href="https://gephi.org">Gephi</a>:</p>
<p><a href="http://blog.comperiosearch.com/wp-content/uploads/2014/02/chinese.png"><img class="alignnone size-full wp-image-1964" src="http://blog.comperiosearch.com/wp-content/uploads/2014/02/chinese.png" alt="" width="411" height="141" /></a></p>
</div>
<div></div>
<div>
<p>We see someone looking for help with Chinese query suggestions. S361 marks the beginning of this session and the first search term was ‘chinese’. They then clicked a link for an internal mail archive before refining their search to ‘chinese als’ and so forth. Links that show when a user back tracked are not shown. That was an isolated little island. The more central documents and search terms at your company will create bigger webs.</p>
</div>
<div>
<p> <a href="http://blog.comperiosearch.com/wp-content/uploads/2014/02/web.png"><img class="alignnone size-full wp-image-1971" src="http://blog.comperiosearch.com/wp-content/uploads/2014/02/web.png" alt="" width="605" height="368" /></a></p>
</div>
<div></div>
<div>
<p>Seeing your search history organized like this should give an urge to dive in and explore. It is really interesting, fun and recommended!</p>
</div>
<div>
<p><strong>Finding popular documents</strong></p>
</div>
<div>
<p>The simplest way of finding the popular documents is to track search term -&gt; clicks directly. It is also the most common way of doing it. That wouldn’t utilize our fancy new graph now, would it? Since we can do queries against the database let’s get all search sessions of 8 or less actions that resulted in a click on document X:</p>
<p>&nbsp;</p>
<p><a href="http://blog.comperiosearch.com/wp-content/uploads/2014/02/query.png"><img class="alignnone size-full wp-image-1969" src="http://blog.comperiosearch.com/wp-content/uploads/2014/02/query.png" alt="" width="605" height="40" /></a></p>
</div>
<div></div>
<div>
<p>Page Break(Small disclaimer: As my neo4j skills are very rudimentary there might be more efficient ways of doing this.)</p>
</div>
<div>
<p>Now we iterate over all sessions and give a score to each search term. The closer it is to the clicked document, the higher score it gets. Sum the score across all sessions.  After doing that you get a score indicating how ‘close’ a search term is to any document. This is example data for the single-word search term ‘vpn’:</p>
<p>&nbsp;</p>
</div>
<div><a href="http://blog.comperiosearch.com/wp-content/uploads/2014/02/json.png"><img class="alignnone size-full wp-image-1966" src="http://blog.comperiosearch.com/wp-content/uploads/2014/02/json.png" alt="" width="605" height="66" /></a></div>
<div>
<p>&nbsp;</p>
<p>When the score passes a threshold we add the search-document pair to an Elasticsearch index. For every search executed at our search we first check Elasticsearch to see if the term is boosted. For ‘vpn’ the search logs state</p>
</div>
<div></div>
<div>
<p> <a href="http://blog.comperiosearch.com/wp-content/uploads/2014/02/log.png"><img class="alignnone size-full wp-image-1967" src="http://blog.comperiosearch.com/wp-content/uploads/2014/02/log.png" alt="" width="605" height="50" /></a></p>
</div>
<div>
<p>&nbsp;</p>
<p>We can see how three documents are boosted for ‘vpn’. (By choice we only boost the top three). Using Fast ESP we wrap the original query with boosts for those specific documents.</p>
</div>
<div></div>
<div>
<p> <a href="http://blog.comperiosearch.com/wp-content/uploads/2014/02/fql.png"><img class="alignnone size-full wp-image-1965" src="http://blog.comperiosearch.com/wp-content/uploads/2014/02/fql.png" alt="" width="546" height="179" /></a></p>
</div>
<div>
<p>&nbsp;</p>
<p>In FAST ESP, as well as in Sharepoint Search 2013 the beloved xrank-operator is your friend. In a Lucene based search application use boost queries for this.</p>
</div>
<div>
<p>The search result returns the popular hits (only one shown here) at the top</p>
</div>
<div></div>
<div>
<p> <a href="http://blog.comperiosearch.com/wp-content/uploads/2014/02/topdoc.png"><img class="alignnone size-full wp-image-1970" src="http://blog.comperiosearch.com/wp-content/uploads/2014/02/topdoc.png" alt="" width="605" height="105" /></a></p>
</div>
<div>
<p>&nbsp;</p>
<p>The ugly star and cheesy feedback is me trying to tell the users rather bluntly that things happened behind the scene and that their actions will affect future searches. Currently there is no way of giving negative feedback to say ‘no, this is actually not a good hit’. Oh well.</p>
</div>
<div>
<p>As a bonus all terms that results in boosted documents are, as far as we know, smart things to search for and free of spelling errors. Therefor all such terms are added to a second Elasticsearch index we base our query completion on. (As a side note – if misspelled terms appear often enough to overcome the threshold for them to be taken into account, they could be part of your organization’s tribal language. If the users choose to spell the term “definately” so often that it “makes the cut” then the system should adapt to that. )</p>
</div>
<div>
<p><strong>Expanding documents to increase recall</strong></p>
</div>
<div>
<p>Often a user thinks of one document and searches for what, to them, identifies the document. That term might or might not be present in the document itself. If it doesn’t the document is not returned and the user becomes sad. Hopefully they alter their search and continue to look. Should they end up at their document we have the tools needed to remedy the situation. Here is a concrete example:</p>
<p><a href="http://blog.comperiosearch.com/wp-content/uploads/2014/02/arch.png"><img class="alignnone size-full wp-image-1963" src="http://blog.comperiosearch.com/wp-content/uploads/2014/02/arch.png" alt="" width="594" height="217" /></a></p>
</div>
<div></div>
<div>
<p>Here we can see that the node marked 1 might be tagged with ‘sort order refiner entries’ or at least ‘refiner’, a term used twice when trying to find this document. (As an interesting side note, if you observe a lot of ‘sort X’ followed by ‘sort Y’ you might consider adding a synonym between X and Y.) If a term or phrase is used often enough across different sessions we save this to an Elasticsearch index. Each time a document is indexed we look up the document in our index and add any popular search terms to a low ranking field. This guarantees a recall of the document but it will not automatically top the results for those queries. This is a two-step process. If your search engine supports partial updates of documents, go with that.</p>
</div>
<div>
<p>Before adding the last step we noticed that for some searches we boosted documents that didn’t get recalled and thus were never displayed to the user even though we knew it was a good hit!</p>
</div>
<div>
<p><strong>Closing words</strong></p>
</div>
<div>
<p>As a first step towards dynamic ranking this has shown good results. As long as your search engine supports query time boosting you can implement this.</p>
</div>
<div>
<p><strong>By the way</strong></p>
</div>
<div>
<p>It should be noted that SharePoint will actually do some of this for you. It comes with an interface meant to be used by an end user (as opposed to all search engines I’ve seen) and the UI contains the event listeners on all links, tracking what you do. This is fed into a database and the data does affect ranking. As far as I know only the last search term before a click is associated with the clicked link.</p>
</div>
<div>
<p>[1] One scenario that Piwik and click tracking does not pick up is if the sought information is found in the returned teasers. Search sessions that don’t end in a click might in fact have a happy ending.</p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2014/02/05/dynamic-search-ranking-using-elasticsearch-neo4j-and-piwik/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Big Data and Enterprise Search</title>
		<link>http://blog.comperiosearch.com/blog/2013/02/27/big-data-and-enterprise-search/</link>
		<comments>http://blog.comperiosearch.com/blog/2013/02/27/big-data-and-enterprise-search/#comments</comments>
		<pubDate>Wed, 27 Feb 2013 13:32:02 +0000</pubDate>
		<dc:creator><![CDATA[John Thompson]]></dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[enterprise search]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=1245</guid>
		<description><![CDATA[There have been a number of reports and papers issued recently on Big Data including: Forrester reviewing the Big Data solutions of 2013 The Economist talks about what is Big Data and how can it be used Wall Street Journal talk about what is next for Big Data The Sunday Times reviewing how Big Data [...]]]></description>
				<content:encoded><![CDATA[<p>There have been a number of reports and papers issued recently on Big Data including:</p>
<ul>
<li><a href="http://www.forrester.com/pimages/rws/reprints/document/85601/oid/1-KWYFVB">Forrester</a> reviewing the Big Data solutions of 2013</li>
<li><a href="http://www.economist.com/blogs/graphicdetail/2013/02/elusive-big-data">The Economist</a> talks about what is Big Data and how can it be used</li>
<li><a href="http://blogs.wsj.com/cio/2013/02/20/preparing-for-analytics-3-0/">Wall Street Journal</a> talk about what is next for Big Data</li>
<li><a href="http://www.thesundaytimes.co.uk/sto/public/Appointments/article1158273.ece">The Sunday Times</a> reviewing how Big Data has helped various companies</li>
<li><a href="http://www.intranetfocus.com/archives/915">Martin White</a> discusses if there is a need for enterprise search whilst Big Data lives</li>
<li><a href="http://arnoldit.com/wordpress/2013/01/01/big-data-and-search/">Stephen Arnold</a> discusses his Big Data trends for 2013</li>
<li><a href="http://www.mike-walsh.com/blog/bid/56566/What-s-Your-Big-Data-Strategy-for-2013?utm_campaign=insight-newsletter&amp;utm_source=hubspot_email_marketing&amp;utm_medium=email&amp;utm_content=5673394&amp;_hse=ulvnes%40gmail.com&amp;_hsmi=5673394&amp;_hsh=6a1f115d7615a6eee182fc59d5429e29">Mike Walsh</a> reviews Big Data strategies for 2013</li>
</ul>
<p>As well as software vendors pushing their appetite for Big Data, either via their websites e.g. Microsoft and their <a href="http://www.microsoft.com/en-us/news/presskits/bigdata/Default.aspx">Big Data Week</a> or via taking adverts in the UK national press i.e. IBM running a number of full page adverts in The Times, week commencing Monday 18<sup>th</sup> February 2013.</p>
<p>From reading these reports and papers, what actually is Big Data? Is Big Data a hype? Is it only relevant for a small number of very large organisations where the volume, rate of change, variety and worth of this data is highly relevant?  It is very hard to answer these questions – what may be Big Data to you may not be Big Data to me.</p>
<p>Big Data to me is being able to capture data, whether it is structured, semi-structured or totally unstructured, store it, interpret it, and leverage it to provide insights in order to help the business.</p>
<p>So how does Big Data and enterprise search co-exist? Can traditional search tools work as the “gateway” to explore Big Data by, for instance, preparing the data to help in creating the predictive model?</p>
<p>Based upon the Forrester report mentioned above, SAS and IBM are the leaders in the Big Data space with a large number of tools available to process and analyse the data.   As the first part of any analysis is preparing the data, and with a large proportion of the data being unstructured, could enterprise search use its distinctive capabilities of pre-processing large amounts of both structured and unstructured content – I think not.  Currently, enterprise search tools do not have the capability to traverse some a large amount of data in a timely manner in order to try and produce a smaller, relevant content set without using a very large amount of hardware. The structured data that exists may already be well organised but unstructured content is another matter and trying to interpret meaning between structured and unstructured content can be very complex.</p>
<p>So, if enterprise search cannot help directly with preparing the data for analysis, where can search help? <a href="http://www.siliconrepublic.com/strategy/item/31182-global-enterprise-search-ma">Frost and Sullivan</a> forecast the global enterprise search market to be US$4.68bn by 2019. Search has the capability to bring a wide variety of different content sources together and produce meaning.   Search queries, from simple to complex, can then be run against the search index returning, hopefully the most relevant content based upon the search query terms entered.  But, for the most part, this won’t give the answers to the Big Data questions e.g. – helping to uncover the answers to making the best use of the data available.</p>
<p>However, <a href="http://en.wikipedia.org/wiki/Search-based_application">Search Based Applications</a> (SBA), as defined by Wikipedia , “<em>use </em><em>semantic technologies to aggregate, normalize and classify unstructured, semi-structured and/or structured content across multiple repositories, and employ natural language technologies for accessing the aggregated information</em>.”, can be built to slice and dice the information in the search index on-the-fly – isn’t this close to what the Big Data engines are trying to achieve.  There are a number of search related companies building SBAs which look to build insight in the realms of data that organisations amass.</p>
<p>There are obviously limitations to what search engines can do in terms of the size of the data sets – that’s why it’s called Big Data.  However, there must be a reason why companies like <a href="http://www-03.ibm.com/press/us/en/pressrelease/37491.wss">IBM</a> purchased Vivisimo in 2012 or <a href="http://www.oracle.com/us/corporate/acquisitions/endeca/index.html">Oracle</a> bought Endeca in 2011, with both companies looking to capitalize on the capabilities both Vivisimo and Endeca offered in terms of unlocking the structured and unstructured content within organisations.</p>
<p>Quote from Oracle on the Endeca acquisition, “<em>Oracle with Endeca plans to create a comprehensive technology platform to process, store, manage, search and analyze structured and unstructured information together enabling businesses to make stronger and more profitable decisions”</em> – search and Big Data complementing each other – I think so.</p>
<p>Finally, the latest information from Gartner indicates “<em>Big Data is forecast to drive $34 billion of IT spending in 2013 and create 4.4 million IT jobs by 2015, but it is currently still a solution looking for a problem</em>”.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2013/02/27/big-data-and-enterprise-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SharePoint conference 2012 keynote: raising the bar for Enterprise Search</title>
		<link>http://blog.comperiosearch.com/blog/2012/11/26/sharepoint-conference-2012-keynote-raising-the-bar-for-enterprise-search/</link>
		<comments>http://blog.comperiosearch.com/blog/2012/11/26/sharepoint-conference-2012-keynote-raising-the-bar-for-enterprise-search/#comments</comments>
		<pubDate>Mon, 26 Nov 2012 20:42:16 +0000</pubDate>
		<dc:creator><![CDATA[Marcus Johansson]]></dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[enterprise search]]></category>
		<category><![CDATA[Enterprise Social]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[sharepoint]]></category>
		<category><![CDATA[spc]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=1148</guid>
		<description><![CDATA[A little after 8:30 this morning, Jared Spataro, Senior Director of SharePoint, opened this year&#8217;s SharePoint Conference (SPC) to the sound of 10,000 SharePoint geeks simultaneously tapping away on their devices, furiously making #SPC12 the trending topic on Twitter. The release of SharePoint 2013 is touted by Microsoft as a bridge from the past to [...]]]></description>
				<content:encoded><![CDATA[<p>A little after 8:30 this morning, Jared Spataro, Senior Director of SharePoint, opened this year&#8217;s SharePoint Conference (SPC) to the sound of 10,000 SharePoint geeks simultaneously tapping away on their devices, furiously making #SPC12 the trending topic on Twitter.</p>
<p>The release of SharePoint 2013 is touted by Microsoft as a bridge from the past to the future; a pivotal shift from server- and wave-based releases to a rapid release cycle, combining the muscles from the cloud, fresh design principles and Enterprise Social technology. They&#8217;re proud, and they should be.</p>
<p><img class="alignnone size-full" src="http://blog.comperiosearch.com/wp-content/uploads/2012/11/20121112-123453.jpg" alt="20121112-123453.jpg" /></p>
<p>In the keynote, Microsoft also revealed some of the rationale behind the recent acquisition of Yammer, seeing how it complements SharePoint&#8217;s strong document-management skills with Yammer&#8217;s deep expertise in making the enterprise more social.</p>
<p>Jeff Teper, another high-profile SharePoint executive, told the crowds that over the last three years, Microsoft&#8217;s engineering team has centralized their effort around three important pillars: Experiences, Innovation and Ecosystem. This shows off in a dramatically improved user experience, better development tools and a completely revamped application model paving the way for us partner companies to more rapidly and with more flexibility deploy innovative applications in the cloud.</p>
<p>Another consistent theme throughout the keynote, was how Search now has a bigger role across SharePoint and throughout the Microsoft stack. FAST is now fully integrated, and the new search core drives both content and recommendations throughout the suite.</p>
<p>We now have a much better baseline when implementing Search technology in the enterprise. Hopefully this is finally the time when custom implementations of Enterprise Search will stop ending up as just a search box, and instead unleash the potential of Your organization&#8217;s accumulated knowledge and tackle today&#8217;s overwhelming growth of information.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2012/11/26/sharepoint-conference-2012-keynote-raising-the-bar-for-enterprise-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
