Chat with Kimi-VL-A3B-Instruct

Success

Kimi-VL-A3B is a multi-modal LLM that can understand text, single-image, multi-image, and video, and generate reply. For thinking version, please try Kimi-VL-A3B-Thinking.

Note: you can upload no more than 10 images once

0 1
0 1
512 8192
512 16384
1 32
28 896
Examples