Third party Having an issue with the parsing of an unFURL

Gator

Affected version
2.1 Beta 6
Cannot get this one to unfurl

 
There are changes in beta 6 to help account for some of the bigger sites on the internet that do weird things, though it has caught this case. Here, they actually have broken HTML. Their head tag is just this:

Code:
<head>
        <title>[TUT]Building your weapon - Fan Creations and Tutorials - World War 3 Official Forum</title></head>
The rest of the stuff that should be in that tag is actually between the </head> and the <body>.

This actually seems to cause a fatal error in validators, which should give you an idea of how wrong it's considered to be.

I'm not sure if we'd want to make any changes here to account for this, or just accept it as a problem with the target site. We can still pull the title, for example.
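To illustrate why the title is still recoverable while the rest of the metadata is not where a parser expects it, here is a minimal Python sketch using the standard library's `html.parser`. It is not the unfurl implementation; `MetadataScanner` and `scan` are hypothetical names, and the `<meta>` tags in the sample are made-up placeholders standing in for whatever the target site really emits after its `</head>`.

```python
from html.parser import HTMLParser


class MetadataScanner(HTMLParser):
    """Collect the <title> text and <meta> tags, noting where each meta was seen."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.in_head = False
        self.in_title = False
        self.title_parts = []
        self.meta_in_head = []
        self.meta_outside_head = []

    def handle_starttag(self, tag, attrs):
        if tag == "head":
            self.in_head = True
        elif tag == "title":
            self.in_title = True
        elif tag == "meta":
            # Record the tag in the bucket matching its position in the markup.
            bucket = self.meta_in_head if self.in_head else self.meta_outside_head
            bucket.append(dict(attrs))

    def handle_endtag(self, tag):
        if tag == "head":
            self.in_head = False
        elif tag == "title":
            self.in_title = False

    def handle_data(self, data):
        if self.in_title:
            self.title_parts.append(data)

    @property
    def title(self):
        return "".join(self.title_parts).strip()


def scan(html):
    """Run the scanner over a complete HTML string and return it."""
    scanner = MetadataScanner()
    scanner.feed(html)
    scanner.close()
    return scanner


# Sample shaped like the reported page: only <title> is inside <head>.
# The <meta> tags below are invented placeholders, not the site's real markup.
SAMPLE = """<html><head>
        <title>[TUT]Building your weapon - Fan Creations and Tutorials - World War 3 Official Forum</title></head>
<meta property="og:title" content="placeholder title">
<meta name="description" content="placeholder description">
<body><p>Post content</p></body></html>"""

result = scan(SAMPLE)
print("title:", result.title)
print("meta inside head:", len(result.meta_in_head))
print("meta outside head:", len(result.meta_outside_head))
```

A scanner that only trusts what it finds inside the head would get the title and nothing else here, which matches the behaviour described above.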
 
Don't think we're going to change the approach taken unless we see this particular issue with any regularity.

If you have unfurl issues with any other site, then please create a new bug report as the underlying issues are likely to be separate. (The CNN "file size too large" error isn't necessarily a bug BTW -- it would seem to indicate that they're delivering over 1.5 MB of HTML; unfortunately, we don't currently have the scope to parse the content received up to that point.)
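For anyone curious what "parse the content received up to that point" could look like, here is a rough Python sketch. It is not something the unfurler does today, per the reply above; `read_capped`, `title_from_prefix` and `MAX_BYTES` are hypothetical names, and the cap value is only an approximation of the 1.5 MB figure mentioned.

```python
import io
from html.parser import HTMLParser

MAX_BYTES = 1_500_000  # assumption: roughly the "1.5 MB" limit mentioned above


def read_capped(stream, limit=MAX_BYTES, chunk_size=8192):
    """Read at most `limit` bytes; also report whether more data remained."""
    chunks = []
    remaining = limit
    while remaining > 0:
        chunk = stream.read(min(chunk_size, remaining))
        if not chunk:
            # Stream ended before the cap was reached.
            return b"".join(chunks), False
        chunks.append(chunk)
        remaining -= len(chunk)
    # Cap reached: peek one byte to see whether the body was actually longer.
    truncated = bool(stream.read(1))
    return b"".join(chunks), truncated


class TitleFinder(HTMLParser):
    """Collect only the text inside <title>."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.in_title = False
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self.in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self.in_title = False

    def handle_data(self, data):
        if self.in_title:
            self.parts.append(data)


def title_from_prefix(data):
    """Best-effort title from a possibly truncated HTML prefix."""
    finder = TitleFinder()
    # errors="replace" tolerates a multi-byte character cut off at the cap.
    finder.feed(data.decode("utf-8", errors="replace"))
    finder.close()
    return "".join(finder.parts).strip()


# Usage with an in-memory stand-in for an oversized response body.
body = b"<html><head><title>Big page</title></head><body>" + b"x" * 5000
prefix, truncated = read_capped(io.BytesIO(body), limit=1024)
print(truncated, title_from_prefix(prefix))
```

The metadata usually sits near the top of the document, so a capped prefix would often be enough, but that is an assumption about typical pages rather than a guarantee.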
 