Running on Zero Vintern-1B-v3.5-Demo 🥶 Engage in image-based conversations with detailed text responses from two models