<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Search Nuggets &#187; Forage Document Processor</title>
	<atom:link href="http://blog.comperiosearch.com/blog/tag/forage-document-processor/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.comperiosearch.com</link>
	<description>A blog about Search as THE solution</description>
	<lastBuildDate>Mon, 13 Jun 2016 08:59:45 +0000</lastBuildDate>
	<language>en-US</language>
		<sy:updatePeriod>hourly</sy:updatePeriod>
		<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=3.9.40</generator>
	<item>
		<title>Idea: search server running inside browser</title>
		<link>http://blog.comperiosearch.com/blog/2014/04/29/idea-search-server-running-inside-your-browser/</link>
		<comments>http://blog.comperiosearch.com/blog/2014/04/29/idea-search-server-running-inside-your-browser/#comments</comments>
		<pubDate>Tue, 29 Apr 2014 18:15:36 +0000</pubDate>
		<dc:creator><![CDATA[Espen Klem]]></dc:creator>
				<category><![CDATA[English]]></category>
		<category><![CDATA[User Experience]]></category>
		<category><![CDATA[forage]]></category>
		<category><![CDATA[Forage Document Processor]]></category>
		<category><![CDATA[Forage Search Engine]]></category>
		<category><![CDATA[Forage Search Server]]></category>
		<category><![CDATA[Javascript]]></category>
		<category><![CDATA[node]]></category>
		<category><![CDATA[node.js]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[search engine]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=2232</guid>
		<description><![CDATA[Got an idea to use the browser as a virtual machine for Forage Forage is Fergus McDowall&#8217;s pet project: A search server written in JavaScript and based on Node.js and LevelDB. Since it&#8217;s JavaScript, and HTML5 local storage has the same key/value storage as levelDB (HTML5 local storage for Chrome actually is levelDB) it has the [...]]]></description>
				<content:encoded><![CDATA[<p>Got an idea to use the browser as a virtual machine for Forage <a href="https://github.com/fergiemcdowall/forage">Forage is Fergus McDowall&#8217;s pet project</a>: A search server written in JavaScript and based on Node.js and LevelDB. Since it&#8217;s JavaScript, and HTML5 local storage has the same key/value storage as levelDB (HTML5 local storage for Chrome actually is levelDB) it has the possibility to run inside any modern browser. This would mean that the user could get a search server running inside browser.</p>
<p>Forage could then be added with a bookmarklet to any page (A bookmark adding a javascript to the page you&#8217;re on). With some simple UI-stuff you could define the Forage Document Processor Adapter, set up rules for Forage Crawler, crawl, process, index and then search within your indexed documents. All without using any servers, on premises or in the cloud. When the user is satisfied she or he could download the JSON-file with processed documents + scripts for adding a search box,  search result and navigators to a page.</p>
<h2>Possible use cases for search server running inside browser:</h2>
<ul>
<li><strong>Easy site search setup<br />
</strong>One real benefit, and the initial idea, would be that the user would not need any server to test Forage and actually crawl a site. When page crawled the user can download JSON ready to be indexed + setup-files for a search box, navigators and search result. Or add it to a cloud service and there continue the work you started in your browser.</li>
<li><strong>A easy and modern search solution behind the firewall<br />
</strong>Behind the firewall, almost all software looks a bit duller, more beige and basically not modern. But through the browser you could easily combine the strength of  Forage and all the hidden gems behind a firewall. There would be some big issues with security, but for intranet and people search it could be a great solution.</li>
<li><strong>Ad hoc search on a site that is not yours<br />
</strong>Say you&#8217;re looking for something on a site. How about ad hock index it and then search it. Yes, it&#8217;s a bandwidth abuse waiting to happen, but could make a good tool for a lot of situations.</li>
<li><strong>Your life, searchable<br />
</strong>This may need a browser add on, but then again, maybe not. Anyway: How about your whole online life, searchable. Today you have your browser history. It shows you page title and page link. What if all the text and images was searchable?</li>
</ul>
<h2>Some UX sketches of the idea:</h2>
<p><a href="https://www.flickr.com/photos/eklem/14233858701/in/set-72157643790505944">The user finds a page to crawl &#8230;<br />
</a><a href="https://www.flickr.com/photos/eklem/14233858701/in/set-72157643790505944"><img class="alignnone" style="border: 1px solid black" src="https://farm6.staticflickr.com/5492/14233858701_eeae399a87.jpg" alt="Search server running inside browser: The user finds a page to crawl ..." width="500" height="313" /></a></p>
<p><a href="https://www.flickr.com/photos/eklem/14050545130/in/set-72157643790505944/">&#8230; clicks the bookmarklet &#8230;<br />
</a><a href="https://www.flickr.com/photos/eklem/14050545130/in/set-72157643790505944/"><img class="alignnone" style="border: 1px solid black" src="https://farm3.staticflickr.com/2934/14050545130_c6ae387c88.jpg" alt="Search server running inside browser: The user finds a page to crawl ..." width="500" height="313" /></a></p>
<p>.<a href="https://www.flickr.com/photos/eklem/14257334383/in/set-72157643790505944/">.. that adds Forage JavaScript-stuff to the page &#8230;<br />
</a><a href="https://www.flickr.com/photos/eklem/14257334383/in/set-72157643790505944/"><img class="alignnone" style="border: 1px solid black" src="https://farm3.staticflickr.com/2908/14257334383_41deb607f7.jpg" alt="Search server running inside browser: Bookmarklet added" width="500" height="313" /></a></p>
<p><a href="https://www.flickr.com/photos/eklem/14257334533/in/set-72157643790505944/">&#8230; much like a browser plugin or add on &#8230;<br />
</a><a href="https://www.flickr.com/photos/eklem/14257334533/in/set-72157643790505944/"><img class="alignnone" style="border: 1px solid black" src="https://farm6.staticflickr.com/5157/14257334533_ff97de13ef.jpg" alt="Search server running inside browser: Works much like a browser addon or plugin" width="500" height="313" /></a></p>
<p><a href="https://www.flickr.com/photos/eklem/14236885594/in/set-72157643790505944/">&#8230; tests a jQuery selector statement &#8230;<br />
</a><a href="https://www.flickr.com/photos/eklem/14236885594/in/set-72157643790505944/"><img class="alignnone" style="border: 1px solid black" src="https://farm3.staticflickr.com/2899/14236885594_441d1c7f35.jpg" alt="Search server running inside browser: Defining item" width="500" height="313" /></a></p>
<p><a href="https://www.flickr.com/photos/eklem/14237149605/in/set-72157643790505944/">&#8230; and adds the field to the item when satisfied. Repeated until a full item is defined.<br />
<img class="alignnone" src="https://farm6.staticflickr.com/5240/14237149605_b8d0f6be3e.jpg" alt="" width="500" height="313" /></a></p>
<p>Here&#8217;s the <a href="https://github.com/fergiemcdowall/forage/issues/32">feature suggestion at the Forage GitHub page</a>. Ideas or comments are more than welcome! Want to know more about Forage? Check out the <a href="https://github.com/fergiemcdowall/forage">Forage GitHub-pages</a> or <a href="http://blog.comperiosearch.com/blog/tag/forage/">stuff we&#8217;ve written about Forage</a>.</p>
<p>EDIT: <a href="http://blog.comperiosearch.com/blog/2014/05/21/crawl-interfaces-for-forage-running-inside-your-browser/">Drawn some new mock ups on the crawler part: Forage Fetch</a> and written about the <a href="http://blog.comperiosearch.com/blog/2014/05/28/5-reasons-lebron-future-forage-search-engine-will-rock/">killer combo Lebron and what it will mean for search</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2014/04/29/idea-search-server-running-inside-your-browser/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>The Seasonal Food Recipe Web Application</title>
		<link>http://blog.comperiosearch.com/blog/2014/02/03/the-seasonal-food-recipe-web-application/</link>
		<comments>http://blog.comperiosearch.com/blog/2014/02/03/the-seasonal-food-recipe-web-application/#comments</comments>
		<pubDate>Mon, 03 Feb 2014 19:14:42 +0000</pubDate>
		<dc:creator><![CDATA[Espen Klem]]></dc:creator>
				<category><![CDATA[English]]></category>
		<category><![CDATA[User Experience]]></category>
		<category><![CDATA[design pattern]]></category>
		<category><![CDATA[Elasticsearch]]></category>
		<category><![CDATA[forage]]></category>
		<category><![CDATA[Forage Document Processor]]></category>
		<category><![CDATA[Forage Search Enginge]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[recipe app]]></category>
		<category><![CDATA[relevancy]]></category>
		<category><![CDATA[relevancy tuning]]></category>
		<category><![CDATA[relevant results]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[user experience]]></category>
		<category><![CDATA[ux]]></category>

		<guid isPermaLink="false">http://blog.comperiosearch.com/?p=1942</guid>
		<description><![CDATA[So, what&#8217;s this you ask? It&#8217;s a series of mini-hackathons some of us at Comperio are doing to achieve a small list of goals: Learn more about search, both tech and UX. Show that a nice search user experience doesn&#8217;t need a search input box. Show a search that is light weight. Generally, build something [...]]]></description>
				<content:encoded><![CDATA[<p>So, what&#8217;s this you ask? It&#8217;s a series of mini-hackathons some of us at Comperio are doing to achieve a small list of goals:</p>
<ol>
<li>Learn more about search, both tech and UX.</li>
<li>Show that a nice search user experience doesn&#8217;t need a search input box.</li>
<li>Show a search that is light weight.</li>
<li>Generally, build something nice, quickly, to show off.</li>
</ol>
<p>&#8220;But what is it?&#8221; you ask again. The second answer is that it&#8217;s a search application for recipes containing the most in-season vegetables, fruits, mushrooms, fish, shellfish, poultry and meat. So, a search without a search box giving you the recipes that will make you pick the freshest, most tasty food available at any given time of the year. In Norway, you can get strawberries the whole year around, but it&#8217;s only a month or two during the summer that they taste really good. and are cheap. This goes for a lot of different foods.</p>
<p><a href="http://www.flickr.com/photos/eklem/10562070553/sizes/o/in/photolist-h6knAH-hZUTrM/"><img class="alignnone" src="http://farm3.staticflickr.com/2844/10562070553_436b7db420_c.jpg" alt="Crawling data for the seasonal food recipe web application" width="800" height="574" /></a></p>
<p>We&#8217;ll blog every step of the journey. First small hackathon is this Friday, January 7th. To test our ideas we&#8217;ve already crawled <a href="http://oppskrift.klikk.no/">food recipes from Klikk.no</a>, and transformed them into JSON-format by using Forage Document Processor from <a href="http://blog.comperiosearch.com/blog/2013/11/18/get-the-new-version-of-forage-the-search-server-for-node-js/">Fergus McDowell&#8217;s excellent Forage Search Engine</a>.</p>
<p>Next step is to get an Amazon EC2-server up and running, install <a href="http://www.elasticsearch.org/">Elasticsearch</a>, and feed it recipes in JSON-format. We&#8217;ll also make some rough mockups of how the user interface is going to be, and maybe, hopefully get a simple version of the Seasonal Food Recipe Web Application up and running. If we after a while get enough traffic, we&#8217;ll try to do some machine learning. And we&#8217;ll definitely test out different ways of doing <a href="http://blog.comperiosearch.com/blog/2012/11/21/in-relevance-we-trust/">relevancy tuning</a>.</p>
<p>Sounds nice? This is work in progress, so <a href="http://blog.comperiosearch.com/blog/tag/recipe-app/">check back every now and then for new blog posts</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.comperiosearch.com/blog/2014/02/03/the-seasonal-food-recipe-web-application/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
