Content-addressable attachment for deduplication


Well-known member
One advantage of having xf_attachment and xf_attachment_data tables separate is to be able to deduplicate contents. However, its utilization currently is very limited. A content-addressable identifier (e.g., md5, sha1, or sha256 or any other hash) in the xf_attachment_data table can enable true deduplication that would not only save space, but also allow stats on the reuse of certain popular attachments.

See the link below for further reference: