When processing an RSS Feed that has duplicate entries with the same id, no duplicate thread is posted but a new thread and post are indexed. This happens everytime the feed is processed.
The result of this is that the search index grows insanely much (2 new index entries for every duplicate entry everytime the feed is processed) and the post/thread ids are a lot higher than they should be (eg on a forum with 5.000 threads and 60.000 posts, the ids have reached respectively 115.000 and 170.000).
There are 2 problems in the Xenforo_Model_Feed class that contribute to this:
There are 2 entries with id http://forums.daybreakgames.com/ps2/index.php?threads/pts-patch-notes-3-11.216825/
(Note that these are at the bottom, so it is entirely possible that they will no longer be included in the feed soon)
While this is a problem caused by broken feeds and they should fix their stuff, it would be nice if Xenforo was able to deal with this and not blow up its own database.
The result of this is that the search index grows insanely much (2 new index entries for every duplicate entry everytime the feed is processed) and the post/thread ids are a lot higher than they should be (eg on a forum with 5.000 threads and 60.000 posts, the ids have reached respectively 115.000 and 170.000).
There are 2 problems in the Xenforo_Model_Feed class that contribute to this:
- _checkProcessedEntries(array $feedData, array $feed) only keeps track of last entry per id when checking for duplicates. If multiple entries have the same id, only the last one will be removed from the list.
- _insertFeedEntry(array $entryData, array $feedData, array $feed) somehow still triggers indexing the thread and post as well as increasing the ids, despite everything being rolled back no actual thread/post being stored.
There are 2 entries with id http://forums.daybreakgames.com/ps2/index.php?threads/pts-patch-notes-3-11.216825/
(Note that these are at the bottom, so it is entirely possible that they will no longer be included in the feed soon)
While this is a problem caused by broken feeds and they should fix their stuff, it would be nice if Xenforo was able to deal with this and not blow up its own database.