Chat with Kimi-VL-A3B-Instruct

Success

Kimi-VL-A3B is a multi-modal LLM that can understand text, single-image, multi-image, and video, and generate reply. For thinking version, please try Kimi-VL-A3B-Thinking.

Chatbot

Note: you can upload no more than 10 images once

File

Gallery

Top-p

0 1

Temperature

0 1

Max Generation Tokens

512 8192

Max Context Length Tokens

512 16384

Max Number of Frames for Video

1 32

Long Edge of Video

28 896

Examples