[freenet-cvs] r14988 - trunk/plugins/XMLSpider

toad at freenetproject.org toad at freenetproject.org
Thu Sep 6 16:40:37 UTC 2007


Author: toad
Date: 2007-09-06 16:40:36 +0000 (Thu, 06 Sep 2007)
New Revision: 14988

Modified:
   trunk/plugins/XMLSpider/XMLSpider.java
Log:
Make sub-indexes much bigger.
We can't rely on grouping them together in containers, because:
- mostly words which are near to each other in the index are not closely related
- we'd need multiple container support and we don't have it
- the containers would be big chunks to fetch and often wouldn't be reused on a big index
So it makes sense to just use huge sub-indexes.
Long term we want sub-indexes to be split by size rather than number of entries.

Modified: trunk/plugins/XMLSpider/XMLSpider.java
===================================================================
--- trunk/plugins/XMLSpider/XMLSpider.java	2007-09-06 16:38:01 UTC (rev 14987)
+++ trunk/plugins/XMLSpider/XMLSpider.java	2007-09-06 16:40:36 UTC (rev 14988)
@@ -138,7 +138,7 @@
 	 * Lists the allowed mime types of the fetched page. 
 	 */
 	public Set allowedMIMETypes;
-	private static final int MAX_ENTRIES = 20;
+	private static final int MAX_ENTRIES = 200;
 	private static int version = 7;
 	private static final String pluginName = "XML spider "+version;
 	/**




More information about the cvs mailing list