Kimi-VL-A3B is a multimodal LLM that can understand text, single-image, multi-image, and video inputs and generate replies. For the thinking version, try Kimi-VL-A3B-Thinking.
Note: you can upload at most 10 images at a time.