<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	
	>
<channel>
	<title>Comments on: How FS4SP primary keys work</title>
	<atom:link href="http://blog.comperiosearch.com/blog/2011/12/18/fs4sp-primary-keys-work/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.comperiosearch.com/blog/2011/12/18/fs4sp-primary-keys-work/</link>
	<description>A blog about Search as THE solution</description>
	<lastBuildDate>Mon, 26 Oct 2015 18:07:52 +0000</lastBuildDate>
		<sy:updatePeriod>hourly</sy:updatePeriod>
		<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=3.9.40</generator>
	<item>
		<title>By: Basant</title>
		<link>http://blog.comperiosearch.com/blog/2011/12/18/fs4sp-primary-keys-work/#comment-10524</link>
		<dc:creator><![CDATA[Basant]]></dc:creator>
		<pubDate>Thu, 03 Oct 2013 10:14:34 +0000</pubDate>
		<guid isPermaLink="false">http://nuggets.comperiosearch.com/?p=733#comment-10524</guid>
		<description><![CDATA[Thanks for explian in the depth of FS4SP primaty key concept with respect of internalId and contentid.]]></description>
		<content:encoded><![CDATA[<p>Thanks for explian in the depth of FS4SP primaty key concept with respect of internalId and contentid.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mikael Svenson</title>
		<link>http://blog.comperiosearch.com/blog/2011/12/18/fs4sp-primary-keys-work/#comment-69</link>
		<dc:creator><![CDATA[Mikael Svenson]]></dc:creator>
		<pubDate>Wed, 11 Jan 2012 19:20:10 +0000</pubDate>
		<guid isPermaLink="false">http://nuggets.comperiosearch.com/?p=733#comment-69</guid>
		<description><![CDATA[Christian,

This is somewhat correct but also wrong.

If you add another Content SSA and point it towards a different collection than the first one, for example &quot;sp2&quot; instead of &quot;sp&quot;, this will work just fine as the collection name is appended to the internal id in FS4SP. And you will not get a collision on ID&#039;s in FS4SP. Yes, the same ID will appear in two Content SSA&#039;s, but this works just fine.]]></description>
		<content:encoded><![CDATA[<p>Christian,</p>
<p>This is somewhat correct but also wrong.</p>
<p>If you add another Content SSA and point it towards a different collection than the first one, for example &#8220;sp2&#8243; instead of &#8220;sp&#8221;, this will work just fine as the collection name is appended to the internal id in FS4SP. And you will not get a collision on ID&#8217;s in FS4SP. Yes, the same ID will appear in two Content SSA&#8217;s, but this works just fine.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Christian Marshall Rieck</title>
		<link>http://blog.comperiosearch.com/blog/2011/12/18/fs4sp-primary-keys-work/#comment-68</link>
		<dc:creator><![CDATA[Christian Marshall Rieck]]></dc:creator>
		<pubDate>Wed, 11 Jan 2012 13:07:58 +0000</pubDate>
		<guid isPermaLink="false">http://nuggets.comperiosearch.com/?p=733#comment-68</guid>
		<description><![CDATA[Just to elaborate on &quot;Luckily, SharePoint makes sure to assign the Item IDs so that they’re unique across all collections, hence creating unique internalid:s even though the items are in the same collection.&quot;

SharePoint will generate this unique ID with a counter. The counter is stored in the Content SSA and this is the reason why you cannot have more than one Content SSA, they would generate the same ID for different documents.]]></description>
		<content:encoded><![CDATA[<p>Just to elaborate on &#8220;Luckily, SharePoint makes sure to assign the Item IDs so that they’re unique across all collections, hence creating unique internalid:s even though the items are in the same collection.&#8221;</p>
<p>SharePoint will generate this unique ID with a counter. The counter is stored in the Content SSA and this is the reason why you cannot have more than one Content SSA, they would generate the same ID for different documents.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Marcus Johansson</title>
		<link>http://blog.comperiosearch.com/blog/2011/12/18/fs4sp-primary-keys-work/#comment-67</link>
		<dc:creator><![CDATA[Marcus Johansson]]></dc:creator>
		<pubDate>Fri, 06 Jan 2012 13:53:35 +0000</pubDate>
		<guid isPermaLink="false">http://nuggets.comperiosearch.com/?p=733#comment-67</guid>
		<description><![CDATA[Hi Ben,

Glad it was useful!

I guess you&#039;re thinking of this tool http://gallery.technet.microsoft.com/scriptcenter/14105abb-29da-43fd-90f4-ac12f1a0233a ?

It asks for the internalid and the contentid, so in your case the contentid should be the full URL that was crawled, and the internalid is derived from the contentid as explained in the post above.]]></description>
		<content:encoded><![CDATA[<p>Hi Ben,</p>
<p>Glad it was useful!</p>
<p>I guess you&#8217;re thinking of this tool <a href="http://gallery.technet.microsoft.com/scriptcenter/14105abb-29da-43fd-90f4-ac12f1a0233a" rel="nofollow">http://gallery.technet.microsoft.com/scriptcenter/14105abb-29da-43fd-90f4-ac12f1a0233a</a> ?</p>
<p>It asks for the internalid and the contentid, so in your case the contentid should be the full URL that was crawled, and the internalid is derived from the contentid as explained in the post above.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ben Liang</title>
		<link>http://blog.comperiosearch.com/blog/2011/12/18/fs4sp-primary-keys-work/#comment-66</link>
		<dc:creator><![CDATA[Ben Liang]]></dc:creator>
		<pubDate>Thu, 05 Jan 2012 16:42:57 +0000</pubDate>
		<guid isPermaLink="false">http://nuggets.comperiosearch.com/?p=733#comment-66</guid>
		<description><![CDATA[Timely tip indeed. I was trying to figure out why some items in my index have contentid that is not an integer. Microsoft has a PowerShell script (GetFiXML) that seems to require an integer contentid. So I guess I am out of luck when it comes to getting FiXML for content indexed by FAST Web Crawler.]]></description>
		<content:encoded><![CDATA[<p>Timely tip indeed. I was trying to figure out why some items in my index have contentid that is not an integer. Microsoft has a PowerShell script (GetFiXML) that seems to require an integer contentid. So I guess I am out of luck when it comes to getting FiXML for content indexed by FAST Web Crawler.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mikael Svenson</title>
		<link>http://blog.comperiosearch.com/blog/2011/12/18/fs4sp-primary-keys-work/#comment-65</link>
		<dc:creator><![CDATA[Mikael Svenson]]></dc:creator>
		<pubDate>Sun, 18 Dec 2011 19:48:55 +0000</pubDate>
		<guid isPermaLink="false">http://nuggets.comperiosearch.com/?p=733#comment-65</guid>
		<description><![CDATA[Good explanation Marcus!

The good thing about using MD5&#039;s is that the index can be independent of any crawler framework and still generate an internal ID to represent the item.

The bad part however is that, although minuscule, there is a chance of ID overlap, as it&#039;s a check sum.

Storing an integer in the search index also take less space than md5 and will in most cases be more optimal. Time will tell if we can still use multiple crawler frameworks in the future, or if MS optimize it forcing everything via the SP crawler framework. Having one crawler framework makes maintenance a bit easier imo.]]></description>
		<content:encoded><![CDATA[<p>Good explanation Marcus!</p>
<p>The good thing about using MD5&#8242;s is that the index can be independent of any crawler framework and still generate an internal ID to represent the item.</p>
<p>The bad part however is that, although minuscule, there is a chance of ID overlap, as it&#8217;s a check sum.</p>
<p>Storing an integer in the search index also take less space than md5 and will in most cases be more optimal. Time will tell if we can still use multiple crawler frameworks in the future, or if MS optimize it forcing everything via the SP crawler framework. Having one crawler framework makes maintenance a bit easier imo.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
