Crawl interfaces for Forage running inside your browser

A while back I got an idea for how we could use Forage, the JavaScript/Node.js search engine, so that users would have their own search server inside the browser. The main benefit is that you don’t need to install anything to test the search engine. Since last time, I’ve made a quick logo for Forage and drawn some more user interfaces. The mockups are mainly about the crawl interfaces for setting up the crawler, which in Forage terms is called Forage Fetch.

Crawl interfaces, suggested

Initial Crawl-window

To crawl most sites elegantly and easily, you need five pieces of information:

Somewhere to start. Where do you want your crawler to begin? You don’t have to specify the domain; we pick the domain name from the page you’re visiting.
Which links to follow. These are not necessarily the pages you want to crawl. Typically they are pages that list the pages you want to crawl.
Which links not to follow. To keep the crawler from going wild, you set some boundaries. Often the same page is reachable through several URLs.
Which links to crawl. These are the actual pages you’re looking for.
Which links not to crawl. These are pages that match the crawl rule but should still be left out.

A simple illustration of the above rules. Forage Fetch doesn’t have all of these features yet, but they’re suggested as enhancements.
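To make the five rules a bit more concrete, here is a rough sketch of how they could be written down as a configuration object plus a small link classifier. This is not Forage Fetch's actual API: the names crawlRules, startUrl and classifyLink, the URL patterns, and the example.com addresses are all made up for illustration. It only assumes the standard URL class found in browsers and Node.js.

```javascript
// Hypothetical rule set for a blog. None of these names or patterns
// come from Forage Fetch; they only illustrate the five rules.
const crawlRules = {
  start: "/blog/",                    // 1. somewhere to start (path only, no domain)
  follow: [/^\/blog\/page\/\d+\//],   // 2. links to follow (listing pages)
  dontFollow: [/[?&]replytocom=/],    // 3. links not to follow (duplicate URLs)
  crawl: [/^\/blog\/\d{4}\//],        // 4. links to crawl (the actual articles)
  dontCrawl: [/\/feed\/$/],           // 5. links not to crawl
};

// Resolve the start path against the page the user is visiting,
// so the domain never has to be typed in.
function startUrl(rules, currentPageUrl) {
  return new URL(rules.start, currentPageUrl).href;
}

// Decide what to do with one link found on a page.
function classifyLink(href, pageUrl, rules) {
  const url = new URL(href, pageUrl);
  // Stay on the domain of the page we are visiting.
  if (url.origin !== new URL(pageUrl).origin) {
    return { follow: false, crawl: false };
  }
  const target = url.pathname + url.search;
  const matches = (patterns) => patterns.some((re) => re.test(target));
  return {
    follow: matches(rules.follow) && !matches(rules.dontFollow),
    crawl: matches(rules.crawl) && !matches(rules.dontCrawl),
  };
}
```

With these made-up rules, a link to /blog/page/2/ would be followed but not crawled, /blog/2014/forage-fetch/ would be crawled but not followed, and /blog/2014/forage-fetch/feed/ would be skipped. The "not" rules always win over their positive counterparts, which is one simple way to keep the boundaries predictable.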