约 76,900,000 个结果
在新选项卡中打开链接
  1. Video-R1: Reinforcing Video Reasoning in MLLMs - GitHub

    2025年2月23日 · Video-R1 significantly outperforms previous models across most benchmarks. Notably, on VSI-Bench, which focuses on spatial reasoning in videos, Video-R1-7B achieves a …

  2. GitHub - k4yt3x/video2x: A machine learning-based video super ...

    A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018. - k4yt3x/video2x

  3. 【EMNLP 2024 】Video-LLaVA: Learning United Visual ... - GitHub

    Video-LLaVA: Learning United Visual Representation by Alignment Before Projection If you like our project, please give us a star ⭐ on GitHub for latest update. 💡 I also have other video …

  4. Wan: Open and Advanced Large-Scale Video Generative Models

    2025年7月28日 · Wan: Open and Advanced Large-Scale Video Generative Models We are excited to introduce Wan2.2, a major upgrade to our foundational video models. With Wan2.2, …

  5. DepthAnything/Video-Depth-Anything - GitHub

    2025年1月21日 · ByteDance †Corresponding author This work presents Video Depth Anything based on Depth Anything V2, which can be applied to arbitrarily long videos without …

  6. Video-3D LLM: Learning Position-Aware Video Representation for …

    We propose a novel generalist model, i.e., Video-3D LLM, for 3D scene understanding. By treating 3D scenes as dynamic videos and incorporating 3D position encoding into these …

  7. hao-ai-lab/FastVideo - GitHub

    FastVideo is a unified post-training and inference framework for accelerated video generation. FastVideo features an end-to-end unified pipeline for accelerating diffusion models, starting …

  8. GitHub - Lightricks/LTX-Video: Official repository for LTX-Video

    LTX-Video is the first DiT-based video generation model that can generate high-quality videos in real-time. It can generate 30 FPS videos at 1216×704 resolution, faster than it takes to watch …

  9. Wan: Open and Advanced Large-Scale Video Generative Models

    2025年2月25日 · Wan: Open and Advanced Large-Scale Video Generative Models In this repository, we present Wan2.1, a comprehensive and open suite of video foundation models …

  10. Generate Video Overviews in NotebookLM - Google Help

    Video Overviews, including voices and visuals, are AI-generated and may contain inaccuracies or audio glitches. NotebookLM may take a while to generate the Video Overview, feel free to …