Can AI image models preserve identity across edits, follow complex instructions, and combine existing assets without visual collapse?