distributions of the real ADC and T2w images via W-distance minimization in unsupervised training, and (3) learning the distinguishable visual features of CS PCa via maximization of the auxiliary distance between CS and nonCS images in unsupervised training.