Sitevision – improving search with Nutch

One of Sweden's most popular CMS tools is Sitevision, which is used perhaps mainly by large government agencies and municipalities. These agencies and municipalities probably choose Sitevision because it is very easy for editors and page owners to use and to maintain the information on their pages. This in an environment where [...]

Analysing Solr logs with Logstash

Although I usually write about and work with Apache Solr, I also use the ELK stack on a daily basis on a number of projects. If you’re not familiar with Solr, take a look at some of my previous posts. If you need some more background info on the ELK [...]

Impressions from Berlin Buzzwords 2015

May 31 – June 3, 2015. Stream processing, Internet of Things, real-time analytics, big data, recommendations, machine learning. Berlin Buzzwords undoubtedly lives up to its name by presenting the front lines of data technology trends.

Solr: Indexing SQL databases made easier! – Part 2

Last summer I wrote a blog post about indexing a MySQL database into Apache Solr. I would now like to revisit the post to update it for use with Solr 5 and start diving into how to implement some basic search features such as facets, spellcheck, phonetic search and query completion. Setting up our environment: The [...]

Solr As A Document Processing Pipeline

Recently on a project I got an interesting request. Content owners wanted to enrich new documents submitted to the search index with content from documents already present in the index. We use Solr as the search backend for this particular customer so I started thinking about how to achieve this with Solr. A bit of [...]

Idea: Your life searchable through Norch – NOde seaRCH, IFTTT and Google Drive

First some disclaimers: This has been posted earlier on [...] Even though some of these ideas are not what you’d normally implement in a business environment, some of the concepts can obviously be transferred over to businesses trying to provide an efficient workplace for their employees. Norch is developed by Fergus McDowall, an employee of [...]

Solr: Indexing SQL databases made easier!

Update: Part two is now available! At the beginning of this year Christopher Vig wrote a great post about indexing an SQL database to the internet’s current search engine du jour, Elasticsearch. This first post in a two-part series will show that Apache Solr is a robust and versatile alternative that makes indexing [...]

Search technology: Picking the right horse

For many years, Solr was the only realistic choice for most customers wanting to do an enterprise search project based on open source. Things changed around 2010/2011 when Elasticsearch started to gain traction. Over the last few years, the community around Elasticsearch has been growing rapidly and the software is regularly downloaded approximately half a million [...]

Dynamic search ranking using Elasticsearch, Neo4j and Piwik

Getting the correct result at the top of your search results isn’t easy. Anyone working within search quickly realizes this. Tuning the underlying ranking model is a job that just doesn’t end. There is an entire profession about search engine optimization, making sure your site gets as high as possible on Google (and Bing, I [...]

Search driven websites

Search-driven sites let readers find what they need on their own terms, not the architect’s! This year’s summer internship program has been centered around the concept “Search driven websites”. Over the course of the summer we’ve gone from a concept and a vision to a prototype website, and we’d like to share the potential [...]


Comperio AS
Øvre Slottsgate 27
NO-0157 Oslo
+47 22 33 71 00

Search Provider Sverige AB
Gamla Brogatan 34
SE-11 120 Stockholm
+46 8-21 49 00