Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection