The W3C provides a useful Semantic data extraction tool that allows the efficacy of this approach to be tested.
Take a look at how easily the tool extracts the important content from an example thread here on XenForo.com:
Semantic Data Extraction for 'Multi-page Navigation Enhancement' thread
31.1 KB Views: 10,524
55.6 KB Views: 10,043