Ensure the Content-Type header is pulled from the metadata correctly, ensure the correct charset is pulled only from actual meta tags, and attempt charset conversion sooner and reflect the charset conversion within the HTML source before crawling it.