CogVLM2 is built on Llama3-8B and supports high-resolution image inputs (up to 1344 × 1344 pixels). It performs well in multi-turn dialogue and on visually rich documents.
Choose CogVLM2 for general visual question answering (VQA), document Q&A, and GUI understanding, particularly when the task benefits from high-resolution inputs and multi-turn conversation.
