Just wondering whether the contents of external_data are generated from internal_data. In other words, is backing up internal_data sufficient, or should both be backed up?
The thumbnails in the data folder (what you're calling external_data) could be regenerated, but that folder also contains avatars, which can't be. So it's best to back up both the internal_data and data folders. I suggest using rsync in a cron job.
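If it helps, here's a rough sketch of what the cron job could run. It's a small Python wrapper that calls rsync once per folder. The paths /var/www/forum and /backups/forum and the function names are placeholders I made up, so swap in your own; the only rsync option used is -a (archive mode).

```python
from __future__ import annotations

import subprocess
from pathlib import Path

# Hypothetical locations: replace with your real forum root and backup target.
FORUM_ROOT = Path("/var/www/forum")
BACKUP_ROOT = Path("/backups/forum")

# Both folders are backed up, since avatars in data cannot be regenerated.
FOLDERS = ("data", "internal_data")


def build_rsync_command(
    folder: str,
    forum_root: Path = FORUM_ROOT,
    backup_root: Path = BACKUP_ROOT,
) -> list[str]:
    """Return the rsync argument list that copies one folder into the backup root."""
    # The trailing slash on the source tells rsync to copy the folder's
    # contents into the target rather than nesting an extra directory level.
    source = f"{forum_root / folder}/"
    target = f"{backup_root / folder}/"
    # -a (archive) recurses and preserves permissions, timestamps and symlinks.
    return ["rsync", "-a", source, target]


def run_backup() -> None:
    """Copy every folder in FOLDERS; raises CalledProcessError if rsync fails."""
    BACKUP_ROOT.mkdir(parents=True, exist_ok=True)
    for folder in FOLDERS:
        subprocess.run(build_rsync_command(folder), check=True)


if __name__ == "__main__":
    run_backup()
```

Saved as, say, /usr/local/bin/backup_forum.py (again, a made-up name), a crontab entry like `0 3 * * * /usr/bin/python3 /usr/local/bin/backup_forum.py` would run it every night at 03:00. Because check=True raises on a non-zero exit, a failed rsync shows up in cron's mail or log output instead of passing silently.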