Vivim: a Video Vision Mamba for Medical Video Object Segmentation • Paper • 2401.14168 • Published Jan 25 • 2
Qwen2-VL • Collection • Vision-language model series based on Qwen2 • 15 items • Updated 9 days ago • 124
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis • Paper • 2403.03206 • Published Mar 5 • 56