Change the repository type filter
All
Repositories list
40 repositories
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
- OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation
Open-Sora-Plan
Public- [CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
- [Arxiv 2025] Implementation of "GS2E: Gaussian Splatting is an Effective Data Generator for Event Stream Generation"
- [TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
- [NeurIPS 2024 D&B Spotlight🔥] ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
AsFT
PublicLLaVA-CoT
PublicWF-VAE
PublicHoloTime
PublicHoloTime: Taming Video Diffusion Models for Panoramic 4D Scene GenerationUniLLM
PublicNeuralGS
PublicDreamDance
PublicEvaGaussians
PublicSwapAnyone
PublicReasoning-Attack
PublicCycle3D
PublicPiCO
Public[ICLR'25] PiCO: Peer Review in LLMs based on the Consistency Optimization, https://arxiv.org/pdf/2402.01830GPT-as-Language-Tree
PublicNext-Patch-Prediction
PublicN-LoRA
Public- Mixture-of-Experts for Large Vision-Language Models
Video-LLaVA
Public【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before ProjectionLLaVA-o1
PublicChat-UniVi
Public[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding