
MMLab@NTU
@mmlabntu
Multimedia Laboratory @NTUsg, affiliated with S-Lab.
Computer Vision, Image Processing, Computer Graphics, Deep Learning
ID: 1394997810584428547
http://www.mmlab-ntu.com 19-05-2021 12:46:26
69 Tweet
1,1K Followers
18 Following




Congrats to Guangcong Wang Jianyi Wang and Zhaoxi Chen from MMLab@NTU!

Chase Lean Try StableSR, a diffusion model-based upscaler. We paid extra efforts to maintain fidelity. Code and model: github.com/IceClear/Stablโฆ.


๐ฅ๐ฅWe are excited to announce #Vchitect, an open-source project for video generative models Hugging Face ๐ฝ๏ธLaVie (Text2Video Model) - Code: github.com/Vchitect/LaVie - huggingface.co/spaces/Vchitecโฆ ๐ฝ๏ธSEINE (Image2Video Model) - Code: github.com/Vchitect/SEINE - huggingface.co/spaces/Vchitecโฆ



๐ฌ Our study introduces "Upscale-A-Video," a text-guided latent diffusion framework for video upscaling. It ensures temporal coherence locally & globally, balancing fidelity and quality. ๐ Project page: shangchenzhou.com/projects/upscaโฆ ๐ป GitHub: github.com/sczhou/Upscaleโฆ ๐ฅ Video:


The Upcoming AI talk: ๐LLaVA๐ฆ A Vision-and-Language Approach to Computer Vision in the Wild by Chunyuan Li Chunyuan Li More info: mailchi.mp/1242f078b2b1/aโฆ Subscribe us: mailchi.mp/4417dc2cde83/tโฆ




๐ Meet Harmon โ a unified model for both image generation and understanding! Trained with a shared masked autoregressive encoder, it sets new benchmarks on GenEval & MJHQ30K. ๐ผ๏ธ๐ฌ Try the live demo now on Hugging Face: ๐ huggingface.co/spaces/wusize/โฆ Paper: arxiv.org/abs/2503.21979


๐ฌ ๐๐ฉ๐ฃ๐ฅ ๐ฎ๐ฌ๐ฎ๐ฑ ๐ง๐๐๐ผ๐ฟ๐ถ๐ฎ๐น ๐๐ง๐ค๐ข ๐๐๐๐๐ค ๐๐๐ฃ๐๐ง๐๐ฉ๐๐ค๐ฃ ๐ฉ๐ค ๐๐ค๐ง๐ก๐ ๐๐ค๐๐๐ก ๐ Hosted by MMLab@NTU ร Kuaishou, etc ๐ June 11 | Nashville ๐ world-model-tutorial.github.io ๐ง Video is just the start. World modeling is the goal. #CVPR2025 #WorldModel

